Constructive induction and protein tertiary structure prediction. 1993

T R Ioerger, and L Rendell, and S Subramaniam
Department of Computer Science, Beckman Institute, University of Illinois, Urbana 61801, USA.

To date, the only methods that have been used successfully to predict protein structures have been based on identifying homologous proteins whose structures are known. However, such methods are limited by the fact that some proteins have similar structure but no significant sequence homology. We consider two ways of applying machine learning to facilitate protein structure prediction. We argue that a straightforward approach will not be able to improve the accuracy of classification achieved by clustering by alignment scores alone. In contrast, we present a novel constructive induction approach that learns better representations of amino acid sequences in terms of physical and chemical properties. Our learning method combines knowledge and search to shift the representation of sequences so that semantic similarity is more easily recognized by syntactic matching. Our approach promises not only to find new structural relationships among protein sequences, but also expands our understanding of the roles knowledge can play in learning via experience in this challenging domain.

UI MeSH Term Description Entries
D008432 Mathematical Computing Computer-assisted interpretation and analysis of various mathematical functions related to a particular problem. Statistical Computing,Computing, Statistical,Mathematic Computing,Statistical Programs, Computer Based,Computing, Mathematic,Computing, Mathematical,Computings, Mathematic,Computings, Mathematical,Computings, Statistical,Mathematic Computings,Mathematical Computings,Statistical Computings
D005544 Forecasting The prediction or projection of the nature of future problems or existing conditions based upon the extrapolation or interpretation of existing scientific data or by the application of scientific methodology. Futurology,Projections and Predictions,Future,Predictions and Projections
D001185 Artificial Intelligence Theory and development of COMPUTER SYSTEMS which perform tasks that normally require human intelligence. Such tasks may include speech recognition, LEARNING; VISUAL PERCEPTION; MATHEMATICAL COMPUTING; reasoning, PROBLEM SOLVING, DECISION-MAKING, and translation of language. AI (Artificial Intelligence),Computer Reasoning,Computer Vision Systems,Knowledge Acquisition (Computer),Knowledge Representation (Computer),Machine Intelligence,Computational Intelligence,Acquisition, Knowledge (Computer),Computer Vision System,Intelligence, Artificial,Intelligence, Computational,Intelligence, Machine,Knowledge Representations (Computer),Reasoning, Computer,Representation, Knowledge (Computer),System, Computer Vision,Systems, Computer Vision,Vision System, Computer,Vision Systems, Computer
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017386 Sequence Homology, Amino Acid The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species. Homologous Sequences, Amino Acid,Amino Acid Sequence Homology,Homologs, Amino Acid Sequence,Homologs, Protein Sequence,Homology, Protein Sequence,Protein Sequence Homologs,Protein Sequence Homology,Sequence Homology, Protein,Homolog, Protein Sequence,Homologies, Protein Sequence,Protein Sequence Homolog,Protein Sequence Homologies,Sequence Homolog, Protein,Sequence Homologies, Protein,Sequence Homologs, Protein
D017434 Protein Structure, Tertiary The level of protein structure in which combinations of secondary protein structures (ALPHA HELICES; BETA SHEETS; loop regions, and AMINO ACID MOTIFS) pack together to form folded shapes. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Tertiary Protein Structure,Protein Structures, Tertiary,Tertiary Protein Structures
D017510 Protein Folding Processes involved in the formation of TERTIARY PROTEIN STRUCTURE. Protein Folding, Globular,Folding, Globular Protein,Folding, Protein,Foldings, Globular Protein,Foldings, Protein,Globular Protein Folding,Globular Protein Foldings,Protein Foldings,Protein Foldings, Globular

Related Publications

T R Ioerger, and L Rendell, and S Subramaniam
May 2001, Current protocols in protein science,
T R Ioerger, and L Rendell, and S Subramaniam
July 2003, Nucleic acids research,
T R Ioerger, and L Rendell, and S Subramaniam
October 2003, Proceedings of the National Academy of Sciences of the United States of America,
T R Ioerger, and L Rendell, and S Subramaniam
January 2014, Methods in molecular biology (Clifton, N.J.),
T R Ioerger, and L Rendell, and S Subramaniam
April 1993, Bio/technology (Nature Publishing Company),
T R Ioerger, and L Rendell, and S Subramaniam
January 2003, Bioinformatics (Oxford, England),
T R Ioerger, and L Rendell, and S Subramaniam
February 2024, Molecules (Basel, Switzerland),
T R Ioerger, and L Rendell, and S Subramaniam
April 2018, Journal of bioinformatics and computational biology,
T R Ioerger, and L Rendell, and S Subramaniam
April 1999, Proteins,
T R Ioerger, and L Rendell, and S Subramaniam
January 2022, Proteins,
Copied contents to your clipboard!