Analysis of the largest tandemly repeated DNA families in the human genome. 2008

Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
Deptartment of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, NY 10029, USA. peter.warburton@mssm.edu

BACKGROUND Tandemly Repeated DNA represents a large portion of the human genome, and accounts for a significant amount of copy number variation. Here we present a genome wide analysis of the largest tandem repeats found in the human genome sequence. RESULTS Using Tandem Repeats Finder (TRF), tandem repeat arrays greater than 10 kb in total size were identified, and classified into simple sequence e.g. GAATG, classical satellites e.g. alpha satellite DNA, and locus specific VNTR arrays. Analysis of these large sequenced regions revealed that several "simple sequence" arrays actually showed complex domain and/or higher order repeat organization. Using additional methods, we further identified a total of 96 additional arrays with tandem repeat units greater than 2 kb (the detection limit of TRF), 53 of which contained genes or repeated exons. The overall size of an array of tandem 12 kb repeats which spanned a gap on chromosome 8 was found to be 600 kb to 1.7 Mbp in size, representing one of the largest non-centromeric arrays characterized. Several novel megasatellite tandem DNA families were observed that are characterized by repeating patterns of interspersed transposable elements that have expanded presumably by unequal crossing over. One of these families is found on 11 different chromosomes in >25 arrays, and represents one of the largest most widespread megasatellite DNA families. CONCLUSIONS This study represents the most comprehensive genome wide analysis of large tandem repeats in the human genome, and will serve as an important resource towards understanding the organization and copy number variation of these complex DNA families.

UI MeSH Term Description Entries
D002877 Chromosomes, Human Very long DNA molecules and associated proteins, HISTONES, and non-histone chromosomal proteins (CHROMOSOMAL PROTEINS, NON-HISTONE). Normally 46 chromosomes, including two sex chromosomes are found in the nucleus of human cells. They carry the hereditary information of the individual. Chromosome, Human,Human Chromosome,Human Chromosomes
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D004276 DNA, Satellite Highly repetitive DNA sequences found in HETEROCHROMATIN, mainly near centromeres. They are composed of simple sequences (very short) (see MINISATELLITE REPEATS) repeated in tandem many times to form large blocks of sequence. Additionally, following the accumulation of mutations, these blocks of repeats have been repeated in tandem themselves. The degree of repetition is on the order of 1000 to 10 million at each locus. Loci are few, usually one or two per chromosome. They were called satellites since in density gradients, they often sediment as distinct, satellite bands separate from the bulk of genomic DNA owing to a distinct BASE COMPOSITION. Satellite DNA,Satellite I DNA,DNA, Satellite I,DNAs, Satellite,DNAs, Satellite I,I DNA, Satellite,I DNAs, Satellite,Satellite DNAs,Satellite I DNAs
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D014644 Genetic Variation Genotypic differences observed among individuals in a population. Genetic Diversity,Variation, Genetic,Diversity, Genetic,Diversities, Genetic,Genetic Diversities,Genetic Variations,Variations, Genetic
D015894 Genome, Human The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs. Human Genome,Genomes, Human,Human Genomes
D017404 In Situ Hybridization, Fluorescence A type of IN SITU HYBRIDIZATION in which target sequences are stained with fluorescent dye so their location and size can be determined using fluorescence microscopy. This staining is sufficiently distinct that the hybridization signal can be seen both in metaphase spreads and in interphase nuclei. FISH Technique,Fluorescent in Situ Hybridization,Hybridization in Situ, Fluorescence,FISH Technic,Hybridization in Situ, Fluorescent,In Situ Hybridization, Fluorescent,FISH Technics,FISH Techniques,Technic, FISH,Technics, FISH,Technique, FISH,Techniques, FISH
D018626 Retroelements Elements that are transcribed into RNA, reverse-transcribed into DNA and then inserted into a new site in the genome. Long terminal repeats (LTRs) similar to those from retroviruses are contained in retrotransposons and retrovirus-like elements. Retroposons, such as LONG INTERSPERSED NUCLEOTIDE ELEMENTS and SHORT INTERSPERSED NUCLEOTIDE ELEMENTS do not contain LTRs. MDG1 Retrotransposons,Mobile Dispersed Genetic Elements,Retroposons,Retrotransposons,Retrovirus-like Elements,Ty1 Transposon,Element, Retrovirus-like,Elements, Retrovirus-like,MDG1 Retrotransposon,Retroelement,Retroposon,Retrotransposon,Retrotransposon, MDG1,Retrotransposons, MDG1,Retrovirus like Elements,Retrovirus-like Element,Transposon, Ty1,Transposons, Ty1,Ty1 Transposons
D018628 Gene Dosage The number of copies of a given gene present in the cell of an organism. An increase in gene dosage (by GENE DUPLICATION for example) can result in higher levels of gene product formation. GENE DOSAGE COMPENSATION mechanisms result in adjustments to the level GENE EXPRESSION when there are changes or differences in gene dosage. Gene Copy Number,Copy Number, Gene,Copy Numbers, Gene,Dosage, Gene,Dosages, Gene,Gene Copy Numbers,Gene Dosages,Number, Gene Copy,Numbers, Gene Copy
D020080 Tandem Repeat Sequences Copies of DNA sequences which lie adjacent to each other in the same orientation (direct tandem repeats) or in the opposite direction to each other (INVERTED TANDEM REPEATS). Direct Tandem Repeats,Tandem Repeat,Tandem Repeats,Direct Tandem Repeat,Repeat Sequence, Tandem,Repeat Sequences, Tandem,Repeat, Direct Tandem,Repeat, Tandem,Repeats, Direct Tandem,Repeats, Tandem,Sequence, Tandem Repeat,Sequences, Tandem Repeat,Tandem Repeat Sequence,Tandem Repeat, Direct,Tandem Repeats, Direct

Related Publications

Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
October 2011, BMC genomics,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
October 1984, Biochemical and biophysical research communications,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
December 1995, Genome,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
March 1985, Proceedings of the National Academy of Sciences of the United States of America,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
June 2006, Genome,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
January 2006, Acta biochimica Polonica,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
August 1994, Nucleic acids research,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
August 1984, Molecular and cellular biology,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
February 1993, Gene,
Peter E Warburton, and Dan Hasson, and Flavia Guillem, and Chloe Lescale, and Xiaoping Jin, and Gyorgy Abrusan
June 2006, Mutation research,
Copied contents to your clipboard!