New methods for accurate prediction of protein secondary structure. 1999

J M Chandonia, and M Karplus
Department of Cellular and Molecular Pharmacology, University of California at San Francisco, USA.

A primary and a secondary neural network are applied to secondary structure and structural class prediction for a database of 681 non-homologous protein chains. A new method of decoding the outputs of the secondary structure prediction network is used to produce an estimate of the probability of finding each type of secondary structure at every position in the sequence. In addition to providing a reliable estimate of the accuracy of the predictions, this method gives a more accurate Q3 (74.6%) than the cutoff method which is commonly used. Use of these predictions in jury methods improves the Q3 to 74.8%, the best available at present. On a database of 126 proteins commonly used for comparison of prediction methods, the jury predictions are 76.6% accurate. An estimate of the overall Q3 for a given sequence is made by averaging the estimated accuracy of the prediction over all residues in the sequence. As an example, the analysis is applied to the target beta-cryptogein, which was a difficult target for ab initio predictions in the CASP2 study; it shows that the prediction made with the present method (62% of residues correct) is close to the expected accuracy (66%) for this protein. The larger database and use of a new network training protocol also improve structural class prediction accuracy to 86%, relative to 80% obtained previously. Secondary structure content is predicted with accuracy comparable to that obtained with spectroscopic methods, such as vibrational or electronic circular dichroism and Fourier transform infrared spectroscopy.

UI MeSH Term Description Entries
D008722 Methods A series of steps taken in order to conduct research. Techniques,Methodological Studies,Methodological Study,Procedures,Studies, Methodological,Study, Methodological,Method,Procedure,Technique
D011336 Probability The study of chance processes or the relative frequency characterizing a chance process. Probabilities
D005656 Fungal Proteins Proteins found in any species of fungus. Fungal Gene Products,Fungal Gene Proteins,Fungal Peptides,Gene Products, Fungal,Yeast Proteins,Gene Proteins, Fungal,Peptides, Fungal,Proteins, Fungal
D016208 Databases, Factual Extensive collections, reputedly complete, of facts and data garnered from material of a specialized subject area and made available for analysis and application. The collection can be automated by various contemporary methods for retrieval. The concept should be differentiated from DATABASES, BIBLIOGRAPHIC which is restricted to collections of bibliographic references. Databanks, Factual,Data Banks, Factual,Data Bases, Factual,Data Bank, Factual,Data Base, Factual,Databank, Factual,Database, Factual,Factual Data Bank,Factual Data Banks,Factual Data Base,Factual Data Bases,Factual Databank,Factual Databanks,Factual Database,Factual Databases
D016571 Neural Networks, Computer A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming. Computational Neural Networks,Connectionist Models,Models, Neural Network,Neural Network Models,Neural Networks (Computer),Perceptrons,Computational Neural Network,Computer Neural Network,Computer Neural Networks,Connectionist Model,Model, Connectionist,Model, Neural Network,Models, Connectionist,Network Model, Neural,Network Models, Neural,Network, Computational Neural,Network, Computer Neural,Network, Neural (Computer),Networks, Computational Neural,Networks, Computer Neural,Networks, Neural (Computer),Neural Network (Computer),Neural Network Model,Neural Network, Computational,Neural Network, Computer,Neural Networks, Computational,Perceptron
D017433 Protein Structure, Secondary The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to ALPHA-HELICES; BETA-STRANDS (which align to form BETA-SHEETS), or other types of coils. This is the first folding level of protein conformation. Secondary Protein Structure,Protein Structures, Secondary,Secondary Protein Structures,Structure, Secondary Protein,Structures, Secondary Protein
D020418 Algal Proteins Proteins found in any species of algae. Algal Gene Product,Algal Gene Products,Algal Gene Protein,Algal Gene Proteins,Algal Peptide,Algal Peptides,Algal Protein,Gene Products, Algal,Gene Product, Algal,Gene Protein, Algal,Gene Proteins, Algal,Peptide, Algal,Peptides, Algal,Product, Algal Gene,Protein, Algal,Protein, Algal Gene,Proteins, Algal

Related Publications

J M Chandonia, and M Karplus
April 2005, Bioinformatics (Oxford, England),
J M Chandonia, and M Karplus
November 2004, Bioinformatics (Oxford, England),
J M Chandonia, and M Karplus
August 1993, Journal of molecular biology,
J M Chandonia, and M Karplus
January 2019, Biochemical and biophysical research communications,
J M Chandonia, and M Karplus
November 2000, Current protein & peptide science,
J M Chandonia, and M Karplus
May 2016, Wiley interdisciplinary reviews. RNA,
J M Chandonia, and M Karplus
April 2001, Journal of protein chemistry,
J M Chandonia, and M Karplus
May 2001, Current protocols in protein science,
J M Chandonia, and M Karplus
January 2007, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme,
Copied contents to your clipboard!