Complete nucleotide sequence of the rabbit beta-like globin gene cluster. Analysis of intergenic sequences and comparison with the human beta-like globin gene cluster. 1989

J B Margot, and G W Demers, and R C Hardison
Department of Molecular and Cell Biology, Paul M. Althouse Laboratory, Pennsylvania State University, University Park 16802.

The nucleotide sequence of the entire beta-like globin gene cluster of rabbits has been determined. This sequence of a continuous stretch of 44.5 x 10(3) base-pairs (bp) starts about 6 x 10(3) bp upstream from epsilon (the 5'-most gene) and ends about 12 x 10(3) bp downstream from beta (the 3'-most gene). Analysis of the sequence reveals that: (1) the sequence is relatively A + T rich (about 60%); (2) regions with high G + C content are associated with OcC repeats, a short interspersed repeated DNA in rabbits; (3) the distribution of polypurines, polypyrimidines and alternating purine/pyrimidine tracts is not random within the cluster; (4) most open reading frames are associated with known globin coding regions, OcC repeats or long interspersed repeats (L1 repeats); (5) the most prominent open reading frames are found in the L1 repeats; (6) different strand asymmetries in base composition are associated with embyronic and adult genes as well as the tandem L1 repeats at the 3' end of the cluster; and (7) essentially all the repeats appear to have been inserted by a transposon mechanism. A comparison of the sequence with itself by a dot-plot analysis has revealed nine new members of the OcC family of repeats in addition to the six previously reported. The OcC repeats tend to be clustered, particularly in the epsilon-gamma and gamma-psi delta intergenic regions. Dot-plot comparisons between the rabbit and the human clusters have revealed extensive sequence matches. Homology starts about 6 x 10(3) bp 5' to epsilon or as far upstream as the rabbit sequence is available. It continues throughout the entire cluster and stops about 0.7 x 10(3) bp 3' to beta, at which point several repeats have inserted in both rabbits and humans. Throughout the gene cluster, the homology is interrupted mainly by insertions or deletions in either the rabbit or the human genome. Almost all of the insertions are of known short or long repeated DNAs. The positions of the insertions are different in the two gene clusters, which indicates that both short and long repeats have been transposing throughout the genome for the time since the mammalian radiation. An alignment of rabbit and human sequences allows the calculation of the substitution rate around epsilon. Sequences far removed from the gene are evolving at a rate equivalent to the pseudogene rate, although some short regions show an apparently higher rate.(ABSTRACT TRUNCATED AT 400 WORDS)

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D011685 Purine Nucleotides Purines attached to a RIBOSE and a phosphate that can polymerize to form DNA and RNA. Nucleotides, Purine
D011687 Purines A series of heterocyclic compounds that are variously substituted in nature and are known also as purine bases. They include ADENINE and GUANINE, constituents of nucleic acids, as well as many alkaloids such as CAFFEINE and THEOPHYLLINE. Uric acid is the metabolic end product of purine metabolism.
D011742 Pyrimidine Nucleotides Pyrimidines with a RIBOSE and phosphate attached that can polymerize to form DNA and RNA. Nucleotides, Pyrimidine
D011743 Pyrimidines A family of 6-membered heterocyclic compounds occurring in nature in a wide variety of forms. They include several nucleic acid constituents (CYTOSINE; THYMINE; and URACIL) and form the basic structure of the barbiturates.
D011817 Rabbits A burrowing plant-eating mammal with hind limbs that are longer than its fore limbs. It belongs to the family Leporidae of the order Lagomorpha, and in contrast to hares, possesses 22 instead of 24 pairs of chromosomes. Belgian Hare,New Zealand Rabbit,New Zealand Rabbits,New Zealand White Rabbit,Rabbit,Rabbit, Domestic,Chinchilla Rabbits,NZW Rabbits,New Zealand White Rabbits,Oryctolagus cuniculus,Chinchilla Rabbit,Domestic Rabbit,Domestic Rabbits,Hare, Belgian,NZW Rabbit,Rabbit, Chinchilla,Rabbit, NZW,Rabbit, New Zealand,Rabbits, Chinchilla,Rabbits, Domestic,Rabbits, NZW,Rabbits, New Zealand,Zealand Rabbit, New,Zealand Rabbits, New,cuniculus, Oryctolagus
D012091 Repetitive Sequences, Nucleic Acid Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES). DNA Repetitious Region,Direct Repeat,Genes, Selfish,Nucleic Acid Repetitive Sequences,Repetitive Region,Selfish DNA,Selfish Genes,DNA, Selfish,Repetitious Region, DNA,Repetitive Sequence,DNA Repetitious Regions,DNAs, Selfish,Direct Repeats,Gene, Selfish,Repeat, Direct,Repeats, Direct,Repetitious Regions, DNA,Repetitive Regions,Repetitive Sequences,Selfish DNAs,Selfish Gene
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D005810 Multigene Family A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed) Gene Clusters,Genes, Reiterated,Cluster, Gene,Clusters, Gene,Families, Multigene,Family, Multigene,Gene Cluster,Gene, Reiterated,Multigene Families,Reiterated Gene,Reiterated Genes
D005914 Globins A superfamily of proteins containing the globin fold which is composed of 6-8 alpha helices arranged in a characterstic HEME enclosing structure. Globin

Related Publications

J B Margot, and G W Demers, and R C Hardison
October 2001, Molecular and cellular biology,
J B Margot, and G W Demers, and R C Hardison
November 1981, The Journal of biological chemistry,
J B Margot, and G W Demers, and R C Hardison
October 1980, Cell,
J B Margot, and G W Demers, and R C Hardison
July 1983, The Journal of biological chemistry,
J B Margot, and G W Demers, and R C Hardison
October 1980, Cell,
J B Margot, and G W Demers, and R C Hardison
April 1977, Cell,
J B Margot, and G W Demers, and R C Hardison
November 2005, Molecular and cellular biology,
J B Margot, and G W Demers, and R C Hardison
February 1980, Cell,
J B Margot, and G W Demers, and R C Hardison
December 1980, Proceedings of the National Academy of Sciences of the United States of America,
Copied contents to your clipboard!