Machine learning methods in chemoinformatics. 2014

John B O Mitchell
School of Chemistry, University of St Andrews, St Andrews, UK.

Machine learning algorithms are generally developed in computer science or adjacent disciplines and find their way into chemical modeling by a process of diffusion. Though particular machine learning methods are popular in chemoinformatics and quantitative structure-activity relationships (QSAR), many others exist in the technical literature. This discussion is methods-based and focused on some algorithms that chemoinformatics researchers frequently use. It makes no claim to be exhaustive. We concentrate on methods for supervised learning, that is, predicting the unknown property values of a test set of instances, usually molecules, based on the known values for a training set. Particularly relevant approaches include Artificial Neural Networks, Random Forest, Support Vector Machine, k-Nearest Neighbors and naïve Bayes classifiers.
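
The supervised learning setup described in the abstract can be illustrated with a minimal sketch of k-Nearest Neighbors regression in plain Python. This is not taken from the article: the descriptor vectors, property values, the function name `knn_predict`, and the choice of Euclidean distance with an unweighted mean are all assumptions made for illustration, and the toy numbers are invented.

```python
from math import sqrt


def knn_predict(train_x, train_y, query, k=2):
    """Predict a property value as the mean over the k nearest training instances.

    Hypothetical helper: Euclidean distance and an unweighted mean are
    illustrative choices, not a method prescribed by the article.
    """
    # Pair each training instance's distance to the query with its known value
    scored = sorted(
        (sqrt(sum((a - b) ** 2 for a, b in zip(x, query))), y)
        for x, y in zip(train_x, train_y)
    )
    nearest = scored[:k]
    return sum(y for _, y in nearest) / len(nearest)


# Invented toy data: two made-up descriptors per "molecule" (training set)
train_x = [(1.0, 0.0), (1.1, 0.1), (0.9, 0.2), (5.0, 5.0), (5.2, 4.9)]
train_y = [2.0, 2.2, 1.8, 7.0, 7.4]  # known property values

# Test set: instances whose property values are treated as unknown
test_x = [(1.0, 0.1), (5.1, 5.0)]

predictions = [knn_predict(train_x, train_y, q, k=2) for q in test_x]
print(predictions)
```

Each test instance receives a prediction derived only from the known values of its closest training instances, which is the train-then-predict pattern that the other listed methods share.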

