[Plan for finding homologies in nucleotide sequence databases using preliminarily calculated sequence samples]. 1995

V B Filatov, and E I Golovanov, and A A Aleksandrov

A scheme of fast similarity search of nucleotide sequences is suggested based on sequence imaging, which results in chunks of information much less than original sequence but more specialized for comparison. Three methods were developed using three different imaging functions. The first is based on identity of local sites of up to twelve nucleotides, the second is based on statistical homology of local 42 nucleotide fragments, and the third is based on the homology of 100-150 nucleotide fragments and models the comparison of restriction maps. Each of them requires the library of sequence images. The total size of such a library is less than the size of sequences stored in compressed form. The sequences are aligned allowing local homology searches. The method reduces total time for a similarity search about 100-fold. The programs can be easily included in any software, which allows user to define his own set of sequences. One of the programs is implemented within DNA-SUN software and is used in Institute of Molecular Genetics and Institute of Molecular Biology.

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA
D012689 Sequence Homology, Nucleic Acid The sequential correspondence of nucleotides in one nucleic acid molecule with those of another nucleic acid molecule. Sequence homology is an indication of the genetic relatedness of different organisms and gene function. Base Sequence Homology,Homologous Sequences, Nucleic Acid,Homologs, Nucleic Acid Sequence,Homology, Base Sequence,Homology, Nucleic Acid Sequence,Nucleic Acid Sequence Homologs,Nucleic Acid Sequence Homology,Sequence Homology, Base,Base Sequence Homologies,Homologies, Base Sequence,Sequence Homologies, Base
D016208 Databases, Factual Extensive collections, reputedly complete, of facts and data garnered from material of a specialized subject area and made available for analysis and application. The collection can be automated by various contemporary methods for retrieval. The concept should be differentiated from DATABASES, BIBLIOGRAPHIC which is restricted to collections of bibliographic references. Databanks, Factual,Data Banks, Factual,Data Bases, Factual,Data Bank, Factual,Data Base, Factual,Databank, Factual,Database, Factual,Factual Data Bank,Factual Data Banks,Factual Data Base,Factual Data Bases,Factual Databank,Factual Databanks,Factual Database,Factual Databases

Related Publications

V B Filatov, and E I Golovanov, and A A Aleksandrov
January 1990, Methods in enzymology,
V B Filatov, and E I Golovanov, and A A Aleksandrov
July 1999, FEBS letters,
V B Filatov, and E I Golovanov, and A A Aleksandrov
March 1968, Plant physiology,
V B Filatov, and E I Golovanov, and A A Aleksandrov
July 1988, Nucleic acids research,
V B Filatov, and E I Golovanov, and A A Aleksandrov
January 1981, Molecular & general genetics : MGG,
V B Filatov, and E I Golovanov, and A A Aleksandrov
January 1986, Nucleic acids research,
V B Filatov, and E I Golovanov, and A A Aleksandrov
January 1975, Molecular & general genetics : MGG,
V B Filatov, and E I Golovanov, and A A Aleksandrov
April 1976, Virology,
V B Filatov, and E I Golovanov, and A A Aleksandrov
January 1987, Gene,
V B Filatov, and E I Golovanov, and A A Aleksandrov
July 1999, Trends in biochemical sciences,
Copied contents to your clipboard!