Structural features of the 5' upstream regulatory region of the gene encoding rat amyloid precursor protein. 1993

J M Chernak
Molecular Neurobiology Unit, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224.

The 5' upstream regulatory region of the gene encoding the rat amyloid precursor protein (APP) was cloned and sequenced. It lacks both a TATA box and a CAAT box, has a high G + C content (68%), is 89% homologous to the corresponding region of the mouse APP gene, and 82% homologous to the corresponding region of the human APP gene. This region contains putative regulatory elements both 5' and 3' to the probable transcription start point (tsp). There are consensus DNA sites for the binding of SP1, AP2, AP4 and GC factor (GCF) proteins, and two GC boxes with the consensus sequence, 5'-GGGYGCRG. Potential regulatory sites with only a single mismatch to the consensus sequences include three SP1, one AP1, five AP2, and two GCF sites, as well as one GC box. There are also six potential stem-loop secondary structures (SSS) near the probable tsp. A consecutive series of elements, consisting of a GC box, AP2 site, three SSS, two SP1 sites, and AP4, AP1 and GCF sites just upstream from the probable tsp, are well-conserved between the rat, mouse and human sequences. An additional AP2 site, two GC boxes, and two additional SSS appear to be conserved between species. However, two possible rat SP1 sites, three possible rat AP2 sites, and two possible rat GCF sites are lacking in the human. On the other hand, the rat sequence is missing four potential SP1 sites, four potential AP2 sites, and nine potential GC boxes which are found in the human sequence.(ABSTRACT TRUNCATED AT 250 WORDS)

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D012045 Regulatory Sequences, Nucleic Acid Nucleic acid sequences involved in regulating the expression of genes. Nucleic Acid Regulatory Sequences,Regulatory Regions, Nucleic Acid (Genetics),Region, Regulatory,Regions, Regulatory,Regulator Regions, Nucleic Acid,Regulatory Region,Regulatory Regions
D003001 Cloning, Molecular The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells. Molecular Cloning
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D001482 Base Composition The relative amounts of the PURINES and PYRIMIDINES in a nucleic acid. Base Ratio,G+C Composition,Guanine + Cytosine Composition,G+C Content,GC Composition,GC Content,Guanine + Cytosine Content,Base Compositions,Base Ratios,Composition, Base,Composition, G+C,Composition, GC,Compositions, Base,Compositions, G+C,Compositions, GC,Content, G+C,Content, GC,Contents, G+C,Contents, GC,G+C Compositions,G+C Contents,GC Compositions,GC Contents,Ratio, Base,Ratios, Base
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA
D012689 Sequence Homology, Nucleic Acid The sequential correspondence of nucleotides in one nucleic acid molecule with those of another nucleic acid molecule. Sequence homology is an indication of the genetic relatedness of different organisms and gene function. Base Sequence Homology,Homologous Sequences, Nucleic Acid,Homologs, Nucleic Acid Sequence,Homology, Base Sequence,Homology, Nucleic Acid Sequence,Nucleic Acid Sequence Homologs,Nucleic Acid Sequence Homology,Sequence Homology, Base,Base Sequence Homologies,Homologies, Base Sequence,Sequence Homologies, Base
D016385 TATA Box A conserved A-T rich sequence which is contained in promoters for RNA polymerase II. The segment is seven base pairs long and the nucleotides most commonly found are TATAAAA. Hogness Box,Box, Hogness,Box, TATA

Related Publications

J M Chernak
December 1995, Indian journal of biochemistry & biophysics,
J M Chernak
April 1989, Proceedings of the National Academy of Sciences of the United States of America,
Copied contents to your clipboard!