Artificial neural networks (ANNs) are trained using high-throughput screening (HTS) data to recover active compounds from a large data set. Improved classification performance was obtained on combining predictions made by multiple ANNs. The HTS data, acquired from a methionine aminopeptidases inhibition study, consisted of a library of 43,347 compounds, and the ratio of active to nonactive compounds, RA/N, was 0.0321. Back-propagation ANNs were trained and validated using principal components derived from the physicochemical features of the compounds. On selecting the training parameters carefully, an ANN recovers one-third of all active compounds from the validation set with a 3-fold gain in RA/N value. Further gains in R A/N values were obtained upon combining the predictions made by a number of ANNs. The generalization property of the back-propagation ANNs was used to train those ANNs with the same training samples, after being initialized with different sets of random weights. As a result, only 10% of all available compounds were needed for training and validation, and the rest of the data set was screened with more than a 10-fold gain of the original RA/N value. Thus, ANNs trained with limited HTS data might become useful in recovering active compounds from large data sets. © 2009 Society for Biomolecular Sciences.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Chakrabarti, S., Svojanovsky, S. R., Slavik, R., Georg, G. I., Wilson, G. S., & Smith, P. G. (2009). Artificial neural network-based analysis of high-throughput screening data for improved prediction of active compounds. Journal of Biomolecular Screening, 14(10), 1236–1244. https://doi.org/10.1177/1087057109351312