Protein secondary structure prediction by the analysis of variation and conservation in multiple alignments. 1995

D S Tuckwell, and M J Humphries, and A Brass
School of Biological Sciences, University of Manchester, UK. mbdxt@seqnet.dl.ac.uk

A number of methods exist for the prediction of protein secondary structure from primary sequence. One method identifies variable charged and conserved hydrophobic residues within large multiple alignments as a means of indicating outside and inside sites respectively in the protein structure. These sites are then manually fitted to secondary structure templates to generate a secondary structure prediction. Using the existing theoretical bases of this method, we present an algorithm (STAMA) which automatically carries out the initial variation/conservation analysis of the alignment. We also test the accuracy of complete predictions carried out by manual fitting of the STAMA-derived assignments to structure templates, using five large multiple alignments each including a protein of known structure. The method was found on average to predict only 57% of residues in the correct secondary structure, and was only as accurate as predictions carried out using the established and automated method of Garnier, Osguthorpe and Robson (1978) applied to a single sequence. When used in conjunction with other secondary structure prediction methods, however, the resulting consensus predictions were found to be very accurate, with 78% of the elements (alpha helices or beta strands) for which a consensus could be obtained being predicted correctly. The algorithm presented here, plus the assessment of the accuracy of prediction generated by this method, should enable this predictive approach to receive informed general use.

UI MeSH Term Description Entries
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D004563 Electrochemistry The study of chemical changes resulting from electrical action and electrical activity resulting from chemical changes. Electrochemistries
D005069 Evaluation Studies as Topic Works about studies that determine the effectiveness or value of processes, personnel, and equipment, or the material on conducting such studies. Critique,Evaluation Indexes,Evaluation Methodology,Evaluation Report,Evaluation Research,Methodology, Evaluation,Pre-Post Tests,Qualitative Evaluation,Quantitative Evaluation,Theoretical Effectiveness,Use-Effectiveness,Critiques,Effectiveness, Theoretical,Evaluation Methodologies,Evaluation Reports,Evaluation, Qualitative,Evaluation, Quantitative,Evaluations, Qualitative,Evaluations, Quantitative,Indexes, Evaluation,Methodologies, Evaluation,Pre Post Tests,Pre-Post Test,Qualitative Evaluations,Quantitative Evaluations,Report, Evaluation,Reports, Evaluation,Research, Evaluation,Test, Pre-Post,Tests, Pre-Post,Use Effectiveness
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D014644 Genetic Variation Genotypic differences observed among individuals in a population. Genetic Diversity,Variation, Genetic,Diversity, Genetic,Diversities, Genetic,Genetic Diversities,Genetic Variations,Variations, Genetic
D015394 Molecular Structure The location of the atoms, groups or ions relative to one another in a molecule, as well as the number, type and location of covalent bonds. Structure, Molecular,Molecular Structures,Structures, Molecular
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017386 Sequence Homology, Amino Acid The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species. Homologous Sequences, Amino Acid,Amino Acid Sequence Homology,Homologs, Amino Acid Sequence,Homologs, Protein Sequence,Homology, Protein Sequence,Protein Sequence Homologs,Protein Sequence Homology,Sequence Homology, Protein,Homolog, Protein Sequence,Homologies, Protein Sequence,Protein Sequence Homolog,Protein Sequence Homologies,Sequence Homolog, Protein,Sequence Homologies, Protein,Sequence Homologs, Protein
D017433 Protein Structure, Secondary The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to ALPHA-HELICES; BETA-STRANDS (which align to form BETA-SHEETS), or other types of coils. This is the first folding level of protein conformation. Secondary Protein Structure,Protein Structures, Secondary,Secondary Protein Structures,Structure, Secondary Protein,Structures, Secondary Protein

Related Publications

D S Tuckwell, and M J Humphries, and A Brass
December 1995, Computer applications in the biosciences : CABIOS,
D S Tuckwell, and M J Humphries, and A Brass
March 1995, Journal of molecular biology,
D S Tuckwell, and M J Humphries, and A Brass
November 2000, Current protein & peptide science,
D S Tuckwell, and M J Humphries, and A Brass
April 1997, Journal of molecular biology,
D S Tuckwell, and M J Humphries, and A Brass
November 1993, Protein engineering,
D S Tuckwell, and M J Humphries, and A Brass
January 2020, Bioinformatics (Oxford, England),
D S Tuckwell, and M J Humphries, and A Brass
January 1996, Journal of computational biology : a journal of computational molecular cell biology,
D S Tuckwell, and M J Humphries, and A Brass
January 2014, Methods in molecular biology (Clifton, N.J.),
D S Tuckwell, and M J Humphries, and A Brass
February 2002, Proteins,
D S Tuckwell, and M J Humphries, and A Brass
December 2004, Computational biology and chemistry,
Copied contents to your clipboard!