Pro-Frame: similarity-based gene recognition in eukaryotic DNA sequences with errors. 2001

A A Mironov, and P S Novichkov, and M S Gelfand
State Scientific Center for Biotechnology NIIGenetika, Moscow, 113545, Russia.

Performance of existing algorithms for similarity-based gene recognition in eukaryotes drops when the genomic DNA has been sequenced with errors. A modification of the spliced alignment algorithm allows for gene recognition in sequences with errors, in particular frameshifts. It tolerates up to 5% of sequencing errors without considerable drop of prediction reliability when a sufficiently close homologous protein is available (normalized evolutionary distance similarity score 50% or higher).

UI MeSH Term Description Entries
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017422 Sequence Analysis, DNA A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis. DNA Sequence Analysis,Sequence Determination, DNA,Analysis, DNA Sequence,DNA Sequence Determination,DNA Sequence Determinations,DNA Sequencing,Determination, DNA Sequence,Determinations, DNA Sequence,Sequence Determinations, DNA,Analyses, DNA Sequence,DNA Sequence Analyses,Sequence Analyses, DNA,Sequencing, DNA
D018965 Frameshifting, Ribosomal A directed change in translational READING FRAMES that allows the production of a single protein from two or more OVERLAPPING GENES. The process is programmed by the nucleotide sequence of the MRNA and is sometimes also affected by the secondary or tertiary mRNA structure. It has been described mainly in VIRUSES (especially RETROVIRUSES); RETROTRANSPOSONS; and bacterial insertion elements but also in some cellular genes. Frameshifting, Translational,Ribosomal Frameshifting,Ribosomal Frame Shift,Ribosomal Frame Shifting,Ribosomal Frameshift,Frame Shift, Ribosomal,Frame Shifting, Ribosomal,Frame Shifts, Ribosomal,Frameshift, Ribosomal,Frameshifts, Ribosomal,Ribosomal Frame Shifts,Ribosomal Frameshifts,Translational Frameshifting
D019295 Computational Biology A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets. Bioinformatics,Molecular Biology, Computational,Bio-Informatics,Biology, Computational,Computational Molecular Biology,Bio Informatics,Bio-Informatic,Bioinformatic,Biologies, Computational Molecular,Biology, Computational Molecular,Computational Molecular Biologies,Molecular Biologies, Computational

Related Publications

A A Mironov, and P S Novichkov, and M S Gelfand
October 1980, Nucleic acids research,
A A Mironov, and P S Novichkov, and M S Gelfand
May 1979, Perception & psychophysics,
A A Mironov, and P S Novichkov, and M S Gelfand
May 1992, Proceedings of the National Academy of Sciences of the United States of America,
A A Mironov, and P S Novichkov, and M S Gelfand
January 1988, BioTechniques,
A A Mironov, and P S Novichkov, and M S Gelfand
January 1986, Giornale di batteriologia, virologia ed immunologia,
A A Mironov, and P S Novichkov, and M S Gelfand
February 1981, FEBS letters,
A A Mironov, and P S Novichkov, and M S Gelfand
March 2011, Journal of computational chemistry,
A A Mironov, and P S Novichkov, and M S Gelfand
January 2015, BMC genomics,
A A Mironov, and P S Novichkov, and M S Gelfand
July 2003, Nucleic acids research,
A A Mironov, and P S Novichkov, and M S Gelfand
November 1980, Proceedings of the National Academy of Sciences of the United States of America,
Copied contents to your clipboard!