Genome-wide analysis of regions similar to promoters of histone genes. 2010

Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
Department of Statistics, Harvard University, Cambridge, MA 02138, USA. chowdhary@stat.harvard.edu

BACKGROUND The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii/ identify in this way genes that have high likelihood of being coregulated with the histone genes. RESULTS We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters. CONCLUSIONS This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that participate in regulation of histone and histone-coregulated gene transcription initiation. While these hypotheses still remain to be verified, we believe that these form a useful resource for researchers to further explore regulation of human histone genes and human genome. It is worthwhile to note that the regulatory regions of the human genome remain largely un-annotated even today and this study is an attempt to supplement our understanding of histone regulatory regions.

UI MeSH Term Description Entries
D011401 Promoter Regions, Genetic DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes. rRNA Promoter,Early Promoters, Genetic,Late Promoters, Genetic,Middle Promoters, Genetic,Promoter Regions,Promoter, Genetic,Promotor Regions,Promotor, Genetic,Pseudopromoter, Genetic,Early Promoter, Genetic,Genetic Late Promoter,Genetic Middle Promoters,Genetic Promoter,Genetic Promoter Region,Genetic Promoter Regions,Genetic Promoters,Genetic Promotor,Genetic Promotors,Genetic Pseudopromoter,Genetic Pseudopromoters,Late Promoter, Genetic,Middle Promoter, Genetic,Promoter Region,Promoter Region, Genetic,Promoter, Genetic Early,Promoter, rRNA,Promoters, Genetic,Promoters, Genetic Middle,Promoters, rRNA,Promotor Region,Promotors, Genetic,Pseudopromoters, Genetic,Region, Genetic Promoter,Region, Promoter,Region, Promotor,Regions, Genetic Promoter,Regions, Promoter,Regions, Promotor,rRNA Promoters
D006657 Histones Small chromosomal proteins (approx 12-20 kD) possessing an open, unfolded structure and attached to the DNA in cell nuclei by ionic linkages. Classification into the various types (designated histone I, histone II, etc.) is based on the relative amounts of arginine and lysine in each. Histone,Histone H1,Histone H1(s),Histone H2a,Histone H2b,Histone H3,Histone H3.3,Histone H4,Histone H5,Histone H7
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D001499 Bayes Theorem A theorem in probability theory named for Thomas Bayes (1702-1761). In epidemiology, it is used to obtain the probability of disease in a group of people with some characteristic on the basis of the overall rate of that disease and of the likelihood of that characteristic in healthy and diseased individuals. The most familiar application is in clinical decision analysis where it is used for estimating the probability of a particular diagnosis given the appearance of some symptoms or test result. Bayesian Analysis,Bayesian Estimation,Bayesian Forecast,Bayesian Method,Bayesian Prediction,Analysis, Bayesian,Bayesian Approach,Approach, Bayesian,Approachs, Bayesian,Bayesian Approachs,Estimation, Bayesian,Forecast, Bayesian,Method, Bayesian,Prediction, Bayesian,Theorem, Bayes
D015894 Genome, Human The complete genetic complement contained in the DNA of a set of CHROMOSOMES in a HUMAN. The length of the human genome is about 3 billion base pairs. Human Genome,Genomes, Human,Human Genomes
D023281 Genomics The systematic study of the complete DNA sequences (GENOME) of organisms. Included is construction of complete genetic, physical, and transcript maps, and the analysis of this structural genomic information on a global scale such as in GENOME WIDE ASSOCIATION STUDIES. Functional Genomics,Structural Genomics,Comparative Genomics,Genomics, Comparative,Genomics, Functional,Genomics, Structural

Related Publications

Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
February 2005, BMC genomics,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
June 2012, Genome research,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
December 2013, Proceedings of the National Academy of Sciences of the United States of America,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
August 2005, Circulation research,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
July 2008, BMC genomics,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
January 2003, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
May 2015, Oncology reports,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
January 2007, Research in microbiology,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
May 2022, Genes,
Rajesh Chowdhary, and Vladimir B Bajic, and Difeng Dong, and Limsoon Wong, and Jun S Liu
October 2005, Cell,
Copied contents to your clipboard!