Improvement of Blood-Brain Barrier Permeability Prediction Using Cosine Similarity [Published online J. Comput. Chem. Jpn. Int. Ed., 9, -, by J-STAGE]

[Published online in Journal of Computer Chemistry, Japan -International Edition, Vol. 9, -, by J-STAGE]
<Title:> Improvement of Blood-Brain Barrier Permeability Prediction Using Cosine Similarity
<Author(s):> Hiroshi SAKIYAMA, Ryushi MOTOKI, Takashi OKUNO, Jian-Qiang LIU
<Corresponding author E-Mail:> saki(at)sci.kj.yamagata-u.ac.jp
<Abstract:> Predicting the blood-brain barrier permeability of chemicals is one of the key issues in brain drug development. This study investigated whether training on data relatively similar to the test data improves the performance of machine learning methods in predicting blood-brain barrier permeability. The results showed that selecting training data with high cosine similarity to the test data improved prediction performance while using fewer training data. The best model in this study also showed improved scores on two external test sets used to examine generalization performance, outperforming excellent existing models. The cosine similarity method is expected to be effective for predicting the properties of compounds when the compounds are highly diverse and the available data are few.
<Keywords:> Blood-brain barrier permeability, Prediction, Cosine similarity, Machine learning, Chemicals
<URL:> https://www.jstage.jst.go.jp/article/jccjie/9/0/9_2023-0017/_html
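
The selection idea described in the abstract (keeping the training data with high cosine similarity to the test data) can be illustrated with a minimal sketch. The abstract does not state which molecular descriptors, aggregation rule, or cutoff the authors used, so everything below is an assumption made for illustration: the function names `cosine_similarity_matrix` and `select_similar_training` are hypothetical, each compound is represented by an arbitrary numeric feature vector, and training compounds are ranked by their mean cosine similarity to the test compounds.

```python
import numpy as np


def cosine_similarity_matrix(a, b):
    """Return the (len(a), len(b)) matrix of cosine similarities between rows."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    a_norm = np.linalg.norm(a, axis=1, keepdims=True)
    b_norm = np.linalg.norm(b, axis=1, keepdims=True)
    # Assumption: an all-zero feature vector gets similarity 0 to everything.
    a_unit = np.divide(a, a_norm, out=np.zeros_like(a), where=a_norm > 0)
    b_unit = np.divide(b, b_norm, out=np.zeros_like(b), where=b_norm > 0)
    return a_unit @ b_unit.T


def select_similar_training(train_x, test_x, n_keep):
    """Return indices of the n_keep training rows most similar to the test set.

    Assumption: "similar to the test data" is scored as the mean cosine
    similarity over all test rows; the paper may use a different rule.
    """
    sims = cosine_similarity_matrix(train_x, test_x)
    score = sims.mean(axis=1)
    order = np.argsort(-score, kind="stable")  # highest score first
    return order[:n_keep]


# Toy feature vectors (made up for illustration, not data from the paper).
train_x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
test_x = [[1.0, 0.1], [0.9, 0.0]]
selected = select_similar_training(train_x, test_x, n_keep=2)
print(selected.tolist())  # [0, 2]
```

A model would then be trained only on the rows in `selected`. Ranking by mean similarity is one of several reasonable choices; ranking by the maximum similarity to any test compound, or applying a fixed similarity threshold, would be equally plausible readings of the abstract.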