Comparative statistics for DNA and protein sequences: multiple sequence analysis. 1985

S Karlin and G Ghandour

Concepts and methods [Karlin, S. & Ghandour, G. (1985) Proc. Natl. Acad. Sci. USA 82, 5800-5804] for the analysis of patterns and relationships are extended to multiple DNA and protein sequences. Functionals include multiple sequence common word occurrence distributions, characterizations of high frequency shared words, and ascertainment of long block identities. Various comparisons of sequences using natural alphabets obtained from grouping nucleotides or amino acids by their chemical and functional characteristics are described. Specific applications are given to globin genes, mitochondrial genomes, and a variety of mammalian viruses.
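The three kinds of functionals named in the abstract (words common to several sequences, long block identities, and comparisons in reduced alphabets) can be illustrated with a minimal Python sketch. This is not the authors' procedure: the function names are hypothetical, the search is brute force, and the purine/pyrimidine grouping (A, G versus C, T) is only one assumed example of an alphabet obtained by grouping nucleotides. The input list is assumed to be non-empty.

```python
from collections import Counter

# Assumed two-letter nucleotide alphabet: purines (A, G) -> R, pyrimidines (C, T) -> Y.
PURINE_PYRIMIDINE = {"A": "R", "G": "R", "C": "Y", "T": "Y"}


def recode(seq, mapping):
    """Rewrite a sequence in a reduced alphabet; unmapped symbols are kept as-is."""
    return "".join(mapping.get(ch, ch) for ch in seq)


def word_counts(seq, k):
    """Count every overlapping word of length k in one sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def common_words(seqs, k):
    """Map each k-word present in every sequence to its per-sequence counts."""
    counts = [word_counts(s, k) for s in seqs]
    shared = set(counts[0])
    for c in counts[1:]:
        shared &= set(c)
    return {w: [c[w] for c in counts] for w in shared}


def longest_common_block(seqs):
    """Length of the longest word occurring in every sequence (brute force).

    A shared word of length k + 1 implies a shared word of length k,
    so the length can be grown one step at a time until no shared word remains.
    """
    k = 0
    while k < min(len(s) for s in seqs) and common_words(seqs, k + 1):
        k += 1
    return k
```

For example, `common_words(["GATTACAGATT", "CCGATTAC", "TTGATTAGG"], 4)` keeps only the words `GATT` and `ATTA`, each paired with its count in the three sequences. Applying `recode` to every sequence before calling the other functions repeats the same comparison in the reduced alphabet.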

UI MeSH Term Description Entries
D007145 Immunoglobulin kappa-Chains One of the types of light chains of the immunoglobulins with a molecular weight of approximately 22 kDa. Ig kappa Chains,Immunoglobulins, kappa-Chain,kappa-Immunoglobulin Light Chains,Immunoglobulin kappa-Chain,kappa-Chain Immunoglobulins,kappa-Immunoglobulin Light Chain,kappa-Immunoglobulin Subgroup VK-12,kappa-Immunoglobulin Subgroup VK-21,Chains, Ig kappa,Immunoglobulin kappa Chain,Immunoglobulin kappa Chains,Immunoglobulins, kappa Chain,Light Chain, kappa-Immunoglobulin,Light Chains, kappa-Immunoglobulin,kappa Chain Immunoglobulins,kappa Chains, Ig,kappa Immunoglobulin Light Chain,kappa Immunoglobulin Light Chains,kappa Immunoglobulin Subgroup VK 12,kappa Immunoglobulin Subgroup VK 21,kappa-Chain, Immunoglobulin,kappa-Chains, Immunoglobulin
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D004272 DNA, Mitochondrial Double-stranded DNA of MITOCHONDRIA. In eukaryotes, the mitochondrial GENOME is circular and codes for ribosomal RNAs, transfer RNAs, and about 10 proteins. Mitochondrial DNA,mtDNA
D004279 DNA, Viral Deoxyribonucleic acid that makes up the genetic material of viruses. Viral DNA
D005914 Globins A superfamily of proteins containing the globin fold which is composed of 6-8 alpha helices arranged in a characteristic HEME-enclosing structure. Globin
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D000596 Amino Acids Organic compounds that generally contain an amino (-NH2) and a carboxyl (-COOH) group. Twenty alpha-amino acids are the subunits which are polymerized to form proteins. Amino Acid,Acid, Amino,Acids, Amino
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
