Multiple alignment through protein secondary-structure information. 2005

Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
Department of Electrical and Electronic Engineering, University of Cagliari, Cagliari I-09123, Italy. armano@diee.unica.it

It is well known that protein secondary-structure information can help the process of performing multiple alignment, in particular when the amount of similarity among the involved sequences moves toward the "twilight zone" (less than 30% of pairwise similarity). In this paper, a multiple alignment algorithm is presented, explicitly designed for exploiting any available secondary-structure information. A layered architecture with two interacting levels has been defined for dealing with both primary- and secondary-structure information of target sequences. Secondary structure (either available or predicted by resorting to a technique based on multiple experts) is used to calculate an initial alignment at the secondary level, to be arranged by locally scoped operators devised to refine the alignment at the primary level. Aimed at evaluating the impact of secondary information on the quality of alignments, in particular alignments with a low degree of similarity, the technique has been implemented and assessed on relevant test cases.

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D011487 Protein Conformation The characteristic 3-dimensional shape of a protein, including the secondary, supersecondary (motifs), tertiary (domains) and quaternary structure of the peptide chain. PROTEIN STRUCTURE, QUATERNARY describes the conformation assumed by multimeric proteins (aggregates of more than one polypeptide chain). Conformation, Protein,Conformations, Protein,Protein Conformations
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017386 Sequence Homology, Amino Acid The degree of similarity between sequences of amino acids. This information is useful for the analyzing genetic relatedness of proteins and species. Homologous Sequences, Amino Acid,Amino Acid Sequence Homology,Homologs, Amino Acid Sequence,Homologs, Protein Sequence,Homology, Protein Sequence,Protein Sequence Homologs,Protein Sequence Homology,Sequence Homology, Protein,Homolog, Protein Sequence,Homologies, Protein Sequence,Protein Sequence Homolog,Protein Sequence Homologies,Sequence Homolog, Protein,Sequence Homologies, Protein,Sequence Homologs, Protein
D017433 Protein Structure, Secondary The level of protein structure in which regular hydrogen-bond interactions within contiguous stretches of polypeptide chain give rise to ALPHA-HELICES; BETA-STRANDS (which align to form BETA-SHEETS), or other types of coils. This is the first folding level of protein conformation. Secondary Protein Structure,Protein Structures, Secondary,Secondary Protein Structures,Structure, Secondary Protein,Structures, Secondary Protein
D020539 Sequence Analysis, Protein A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence. Amino Acid Sequence Analysis,Peptide Sequence Analysis,Protein Sequence Analysis,Sequence Determination, Protein,Amino Acid Sequence Analyses,Amino Acid Sequence Determination,Amino Acid Sequence Determinations,Amino Acid Sequencing,Peptide Sequence Determination,Protein Sequencing,Sequence Analyses, Amino Acid,Sequence Analysis, Amino Acid,Sequence Analysis, Peptide,Sequence Determination, Amino Acid,Sequence Determinations, Amino Acid,Acid Sequencing, Amino,Analyses, Peptide Sequence,Analyses, Protein Sequence,Analysis, Peptide Sequence,Analysis, Protein Sequence,Peptide Sequence Analyses,Peptide Sequence Determinations,Protein Sequence Analyses,Protein Sequence Determination,Protein Sequence Determinations,Sequence Analyses, Peptide,Sequence Analyses, Protein,Sequence Determination, Peptide,Sequence Determinations, Peptide,Sequence Determinations, Protein,Sequencing, Amino Acid,Sequencing, Protein

Related Publications

Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
December 2006, Journal of computational biology : a journal of computational molecular cell biology,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
May 2017, Bioinformatics (Oxford, England),
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
July 2010, Nucleic acids research,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
August 2004, Current protein & peptide science,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
October 1994, Protein science : a publication of the Protein Society,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
January 2006, Molekuliarnaia biologiia,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
January 2008, Methods in molecular biology (Clifton, N.J.),
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
July 2005, Nucleic acids research,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
August 2000, Proteins,
Giuliano Armano, and Luciano Milanesi, and Alessandro Orro
January 1996, Methods in enzymology,
Copied contents to your clipboard!