The frequency of matching sequences in DNA. 1984

G P Moore, and A R Moore, and L I Grossman

Equations are presented which allow prediction of the number of direct or indirect matching sequences in DNA. Predicted match frequencies can be calculated for any match length, DNA strand length and DNA base composition, assuming only that the DNA sequence is random. The effect of varying these parameters is described, and match frequency is related to the total frequency of repeats. Equations were verified by computer search of randomly generated DNA sequences. A group of published DNA sequences was searched for matches and the results compared to the calculated predictions for random DNA. In general, natural DNA was found to be similar to random DNA with respect to frequency of matching sequences.

UI MeSH Term Description Entries
D008433 Mathematics The deductive study of shape, quantity, and dependence. (From McGraw-Hill Dictionary of Scientific and Technical Terms, 6th ed) Mathematic
D008957 Models, Genetic Theoretical representations that simulate the behavior or activity of genetic processes or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Genetic Models,Genetic Model,Model, Genetic
D012091 Repetitive Sequences, Nucleic Acid Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES). DNA Repetitious Region,Direct Repeat,Genes, Selfish,Nucleic Acid Repetitive Sequences,Repetitive Region,Selfish DNA,Selfish Genes,DNA, Selfish,Repetitious Region, DNA,Repetitive Sequence,DNA Repetitious Regions,DNAs, Selfish,Direct Repeats,Gene, Selfish,Repeat, Direct,Repeats, Direct,Repetitious Regions, DNA,Repetitive Regions,Repetitive Sequences,Selfish DNAs,Selfish Gene
D003201 Computers Programmable electronic devices designed to accept data, perform prescribed mathematical and logical operations at high speed, and display the results of these operations. Calculators, Programmable,Computer Hardware,Computers, Digital,Hardware, Computer,Calculator, Programmable,Computer,Computer, Digital,Digital Computer,Digital Computers,Programmable Calculator,Programmable Calculators
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D001482 Base Composition The relative amounts of the PURINES and PYRIMIDINES in a nucleic acid. Base Ratio,G+C Composition,Guanine + Cytosine Composition,G+C Content,GC Composition,GC Content,Guanine + Cytosine Content,Base Compositions,Base Ratios,Composition, Base,Composition, G+C,Composition, GC,Compositions, Base,Compositions, G+C,Compositions, GC,Content, G+C,Content, GC,Contents, G+C,Contents, GC,G+C Compositions,G+C Contents,GC Compositions,GC Contents,Ratio, Base,Ratios, Base

Related Publications

G P Moore, and A R Moore, and L I Grossman
January 2004, Proceedings. IEEE Computational Systems Bioinformatics Conference,
G P Moore, and A R Moore, and L I Grossman
April 2005, Journal of bioinformatics and computational biology,
G P Moore, and A R Moore, and L I Grossman
March 2007, Bioinformatics (Oxford, England),
G P Moore, and A R Moore, and L I Grossman
September 2021, Computers in biology and medicine,
G P Moore, and A R Moore, and L I Grossman
November 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics,
G P Moore, and A R Moore, and L I Grossman
February 2015, Sao Paulo medical journal = Revista paulista de medicina,
G P Moore, and A R Moore, and L I Grossman
January 2011, International journal of bioinformatics research and applications,
G P Moore, and A R Moore, and L I Grossman
May 1985, Journal of bacteriology,
G P Moore, and A R Moore, and L I Grossman
December 1997, BioTechniques,
G P Moore, and A R Moore, and L I Grossman
August 2007, Journal of medical systems,
Copied contents to your clipboard!