Analysis of overrepresented motifs in human core promoters reveals dual regulatory roles of YY1. 2007

Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA.

A set of 723 high-quality human core promoter sequences were compiled and analyzed for overrepresented motifs. Beside the two well-characterized core promoter motifs (TATA and Inr), several known motifs (YY1, Sp1, NRF-1, NRF-2, CAAT, and CREB) and one potentially new motif (motif8) were found. Interestingly, YY1 and motif8 mostly reside immediately downstream from the TSS. In particular, the YY1 motif occurs primarily in genes with 5'-UTRs shorter than 40 base pairs (bp) and its locations coincide with the translation start site. We verified that the YY1 motif is bound by YY1 in vitro. We then performed detailed analysis on YY1 chromatin immunoprecipitation data with a whole-genome human promoter microarray (ChIP-chip) and revealed that the thus identified promoters in HeLa cells were highly enriched with the YY1 motif. Moreover, the motif overlapped with the translation start sites on the plus strand of a group of genes, many with short 5'-UTRs, and with the transcription start sites on the minus strand of another distinct group of genes; together, the two groups of genes accounted for the majority of the YY1-bound promoters in the ChIP-chip data. Furthermore, the first group of genes was highly enriched in the functional categories of ribosomal proteins and nuclear-encoded mitochondria proteins. We suggest that the YY1 motif plays a dual role in both transcription and translation initiation of these genes. We also discuss the evolutionary advantages of housing a transcriptional element inside the transcript in terms of the migration of these genes in the human genome.

UI MeSH Term Description Entries
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D012269 Ribosomal Proteins Proteins found in ribosomes. They are believed to have a catalytic function in reconstituting biologically active ribosomal subunits. Proteins, Ribosomal,Ribosomal Protein,Protein, Ribosomal
D014158 Transcription, Genetic The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of DNA from an RNA template is called REVERSE TRANSCRIPTION. Genetic Transcription
D015894 Genome, Human The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs. Human Genome,Genomes, Human,Human Genomes
D016385 TATA Box A conserved A-T rich sequence which is contained in promoters for RNA polymerase II. The segment is seven base pairs long and the nucleotides most commonly found are TATAAAA. Hogness Box,Box, Hogness,Box, TATA
D019143 Evolution, Molecular The process of cumulative change at the level of DNA; RNA; and PROTEINS, over successive generations. Molecular Evolution,Genetic Evolution,Evolution, Genetic
D020121 5' Untranslated Regions The sequence at the 5' end of the messenger RNA that does not code for product. This sequence contains the ribosome binding site and other transcription and translation regulating sequences. 5'UTR,Leader Sequences, mRNA,Leader mRNA Sequences,mRNA Leader Sequences,5' UTR,5' UTRs,5' Untranslated Region,5'UTRs,Leader Sequence, mRNA,Leader mRNA Sequence,Region, 5' Untranslated,Regions, 5' Untranslated,Sequence, Leader mRNA,Sequence, mRNA Leader,Sequences, Leader mRNA,Sequences, mRNA Leader,UTR, 5',UTRs, 5',Untranslated Region, 5',Untranslated Regions, 5',mRNA Leader Sequence,mRNA Sequence, Leader,mRNA Sequences, Leader
D020218 Response Elements Nucleotide sequences, usually upstream, which are recognized by specific regulatory transcription factors, thereby causing gene response to various regulatory agents. These elements may be found in both promoter and enhancer regions. Element, Response,Elements, Response,Response Element
D020411 Oligonucleotide Array Sequence Analysis Hybridization of a nucleic acid sample to a very large set of OLIGONUCLEOTIDE PROBES, which have been attached individually in columns and rows to a solid support, to determine a BASE SEQUENCE, or to detect variations in a gene sequence, GENE EXPRESSION, or for GENE MAPPING. DNA Microarrays,Gene Expression Microarray Analysis,Oligonucleotide Arrays,cDNA Microarrays,DNA Arrays,DNA Chips,DNA Microchips,Gene Chips,Oligodeoxyribonucleotide Array Sequence Analysis,Oligonucleotide Microarrays,Sequence Analysis, Oligonucleotide Array,cDNA Arrays,Array, DNA,Array, Oligonucleotide,Array, cDNA,Arrays, DNA,Arrays, Oligonucleotide,Arrays, cDNA,Chip, DNA,Chip, Gene,Chips, DNA,Chips, Gene,DNA Array,DNA Chip,DNA Microarray,DNA Microchip,Gene Chip,Microarray, DNA,Microarray, Oligonucleotide,Microarray, cDNA,Microarrays, DNA,Microarrays, Oligonucleotide,Microarrays, cDNA,Microchip, DNA,Microchips, DNA,Oligonucleotide Array,Oligonucleotide Microarray,cDNA Array,cDNA Microarray
D020869 Gene Expression Profiling The determination of the pattern of genes expressed at the level of GENETIC TRANSCRIPTION, under specific circumstances or in a specific cell. Gene Expression Analysis,Gene Expression Pattern Analysis,Transcript Expression Analysis,Transcriptome Profiling,Transcriptomics,mRNA Differential Display,Gene Expression Monitoring,Transcriptome Analysis,Analyses, Gene Expression,Analyses, Transcript Expression,Analyses, Transcriptome,Analysis, Gene Expression,Analysis, Transcript Expression,Analysis, Transcriptome,Differential Display, mRNA,Differential Displays, mRNA,Expression Analyses, Gene,Expression Analysis, Gene,Gene Expression Analyses,Gene Expression Monitorings,Gene Expression Profilings,Monitoring, Gene Expression,Monitorings, Gene Expression,Profiling, Gene Expression,Profiling, Transcriptome,Profilings, Gene Expression,Profilings, Transcriptome,Transcript Expression Analyses,Transcriptome Analyses,Transcriptome Profilings,mRNA Differential Displays

Related Publications

Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
March 2008, Genome research,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
December 2008, Computational biology and chemistry,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
November 2006, Cytotechnology,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
January 2002, BMC bioinformatics,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
March 2005, Nature,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
February 2018, PLoS computational biology,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
February 2012, BMC bioinformatics,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
November 2018, Epigenetics & chromatin,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
July 2007, BMC genomics,
Hualin Xi, and Yong Yu, and Yutao Fu, and Jonathan Foley, and Anason Halees, and Zhiping Weng
December 2007, BMC genomics,
Copied contents to your clipboard!