Quantification of secondary structure prediction improvement using multiple alignments. 1993

J M Levin, S Pascarella, P Argos, and J Garnier
Unité d'Ingénierie des Protéines, Biotechnologies, INRA, Jouy-en-Josas, France.

The use of multiple sequence alignments for secondary structure prediction is analysed. Seven different protein families, containing only sequences of known structure, were considered to provide a range of alignment and prediction conditions. Using alignments obtained by spatial superposition of main chain atoms in known tertiary protein structures gave a mean improvement of 8% in secondary structure prediction accuracy compared with predictions obtained from the individual sequences. Replacing these alignments with those produced directly by an automated sequence alignment algorithm gave variations in prediction accuracy that correlated with the quality of the multiple alignments and with the distance between the primary sequences. Secondary structure predictions can be reliably improved using alignments from an automatic alignment procedure, with a mean increase of 6.8% giving an overall prediction accuracy of 68.5%, provided there is a minimum of 25% sequence identity between all sequences in a family.
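The two quantities the abstract relies on, the minimum pairwise sequence identity within a family and the per-residue prediction accuracy, can be illustrated with a minimal Python sketch. It is not the authors' procedure. The function names, the toy sequences, and the choice to count identity only over columns where neither sequence has a gap are assumptions made for illustration. The abstract does not say how identity or accuracy was computed. Only the 25% threshold comes from the text.

```python
from itertools import combinations

# Threshold quoted in the abstract; everything else here is illustrative.
MIN_IDENTITY_PERCENT = 25.0


def percent_identity(a, b, gap="-"):
    """Percent identity of two aligned rows.

    Assumption: only columns where neither row has a gap are counted.
    Other conventions (e.g. dividing by the shorter sequence length) exist.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def min_pairwise_identity(alignment):
    """Lowest percent identity over all pairs of rows in the alignment."""
    return min(percent_identity(a, b) for a, b in combinations(alignment, 2))


def per_residue_accuracy(predicted, observed):
    """Percentage of residues whose predicted state matches the observed one."""
    if len(predicted) != len(observed):
        raise ValueError("state strings must have equal length")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)


# Hypothetical toy family of three aligned sequences.
family = ["ACDEFGHIKL", "ACDEFGHIKV", "ACD-FGHIKL"]
lowest = min_pairwise_identity(family)
meets_threshold = lowest >= MIN_IDENTITY_PERCENT

# Hypothetical predicted vs. observed states (H = helix, E = strand, C = coil).
accuracy = per_residue_accuracy("HHHHCCEEEC", "HHHCCCEEEE")
```

Under these assumptions, a family would qualify for the reported 6.8% mean gain only if `meets_threshold` is true for the whole alignment, since the abstract states the condition over all sequences in a family rather than on average.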

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D015203 Reproducibility of Results The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results. Reliability and Validity,Reliability of Result,Reproducibility Of Result,Reproducibility of Finding,Validity of Result,Validity of Results,Face Validity,Reliability (Epidemiology),Reliability of Results,Reproducibility of Findings,Test-Retest Reliability,Validity (Epidemiology),Finding Reproducibilities,Finding Reproducibility,Of Result, Reproducibility,Of Results, Reproducibility,Reliabilities, Test-Retest,Reliability, Test-Retest,Result Reliabilities,Result Reliability,Result Validities,Result Validity,Result, Reproducibility Of,Results, Reproducibility Of,Test Retest Reliability,Validity and Reliability,Validity, Face
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017433 Protein Structure, Secondary The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to ALPHA-HELICES; BETA-STRANDS (which align to form BETA-SHEETS), or other types of coils. This is the first folding level of protein conformation. Secondary Protein Structure,Protein Structures, Secondary,Secondary Protein Structures,Structure, Secondary Protein,Structures, Secondary Protein
