Knowledge-based fragment binding prediction. 2014

Grace W Tang, and Russ B Altman
Department of Bioengineering, Stanford University, Stanford, California, United States of America.

Target-based drug discovery must assess many drug-like compounds for potential activity. Focusing on low-molecular-weight compounds (fragments) can dramatically reduce the chemical search space. However, approaches for determining protein-fragment interactions have limitations. Experimental assays are time-consuming, expensive, and not always applicable. At the same time, computational approaches using physics-based methods have limited accuracy. With increasing high-resolution structural data for protein-ligand complexes, there is now an opportunity for data-driven approaches to fragment binding prediction. We present FragFEATURE, a machine learning approach to predict small molecule fragments preferred by a target protein structure. We first create a knowledge base of protein structural environments annotated with the small molecule substructures they bind. These substructures have low-molecular weight and serve as a proxy for fragments. FragFEATURE then compares the structural environments within a target protein to those in the knowledge base to retrieve statistically preferred fragments. It merges information across diverse ligands with shared substructures to generate predictions. Our results demonstrate FragFEATURE's ability to rediscover fragments corresponding to the ligand bound with 74% precision and 82% recall on average. For many protein targets, it identifies high scoring fragments that are substructures of known inhibitors. FragFEATURE thus predicts fragments that can serve as inputs to fragment-based drug design or serve as refinement criteria for creating target-specific compound libraries for experimental or computational screening.

UI MeSH Term Description Entries
D008958 Models, Molecular Models used experimentally or theoretically to study molecular shape, electronic properties, or interactions; includes analogous molecules, computer-generated graphics, and mechanical structures. Molecular Models,Model, Molecular,Molecular Model
D011485 Protein Binding The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments. Plasma Protein Binding Capacity,Binding, Protein
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D019359 Knowledge The body of truths or facts accumulated in the course of time, the cumulated sum of information, its volume and nature, in any civilization, period, or country. Epistemology

Related Publications

Grace W Tang, and Russ B Altman
June 2010, Bioinformatics (Oxford, England),
Grace W Tang, and Russ B Altman
February 2016, IET systems biology,
Grace W Tang, and Russ B Altman
August 2004, Journal of medicinal chemistry,
Grace W Tang, and Russ B Altman
January 2005, Journal of chemical information and modeling,
Grace W Tang, and Russ B Altman
January 2012, PloS one,
Grace W Tang, and Russ B Altman
November 1990, Journal of theoretical biology,
Grace W Tang, and Russ B Altman
November 2016, Drug metabolism and disposition: the biological fate of chemicals,
Grace W Tang, and Russ B Altman
April 2019, Journal of chemical theory and computation,
Grace W Tang, and Russ B Altman
September 2011, Journal of computer-aided molecular design,
Copied contents to your clipboard!