On the statistical significance of nucleic acid similarities. 1984

D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman

When evaluating sequence similarities among nucleic acids by the usual methods, statistical significance is often found when the biological significance of the similarity is dubious. We demonstrate that the known statistical properties of nucleic acid sequences strongly affect the statistical distribution of similarity values when calculated by standard procedures. We propose a series of models which account for some of these known statistical properties. The utility of the method is demonstrated in evaluating high relative similarity scores in four specific cases in which there is little biological context by which to judge the similarities. In two of the cases we identify the statistical properties which are responsible for the apparent similarity. In the other two cases the statistical significance of the similarity persists even when the known statistical properties of sequences are modelled. For one of these cases biological significance is likely while the other case remains an enigma.

UI MeSH Term Description Entries
D009696 Nucleic Acids High molecular weight polymers containing a mixture of purine and pyrimidine nucleotides chained together by ribose or deoxyribose linkages. Nucleic Acid,Acid, Nucleic,Acids, Nucleic
D005075 Biological Evolution The process of cumulative change over successive generations through which organisms acquire their distinguishing morphological and physiological characteristics. Evolution, Biological
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA
D013223 Statistics as Topic Works about the science and art of collecting, summarizing, and analyzing data that are subject to random variation. Area Analysis,Estimation Technics,Estimation Techniques,Indirect Estimation Technics,Indirect Estimation Techniques,Multiple Classification Analysis,Service Statistics,Statistical Study,Statistics, Service,Tables and Charts as Topic,Analyses, Area,Analyses, Multiple Classification,Area Analyses,Classification Analyses, Multiple,Classification Analysis, Multiple,Estimation Technic, Indirect,Estimation Technics, Indirect,Estimation Technique,Estimation Technique, Indirect,Estimation Techniques, Indirect,Indirect Estimation Technic,Indirect Estimation Technique,Multiple Classification Analyses,Statistical Studies,Studies, Statistical,Study, Statistical,Technic, Indirect Estimation,Technics, Estimation,Technics, Indirect Estimation,Technique, Estimation,Technique, Indirect Estimation,Techniques, Estimation,Techniques, Indirect Estimation

Related Publications

D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
January 1985, Nucleic acids research,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
June 1968, Evolution; international journal of organic evolution,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
December 1980, FEBS letters,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
February 1952, Biochimica et biophysica acta,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
March 2003, Journal of molecular biology,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
December 1990, Protein engineering,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
August 1965, Life sciences,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
August 1995, Journal of biomolecular structure & dynamics,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
April 1983, Nucleic acids research,
D J Lipman, and W J Wilbur, and T F Smith, and M S Waterman
December 2012, Current opinion in chemical biology,
Copied contents to your clipboard!