Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data. 2011

Roger Pique-Regi, Jacob F Degner, Athma A Pai, Daniel J Gaffney, Yoav Gilad, and Jonathan K Pritchard
Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA. rpique@uchicago.edu

Accurate functional annotation of regulatory elements is essential for understanding global gene regulation. Here, we report a genome-wide map of 827,000 transcription factor binding sites in human lymphoblastoid cell lines, comprising sites that correspond to 239 position weight matrices of known transcription factor binding motifs and 49 novel sequence motifs. To generate this map, we developed a probabilistic framework that integrates cell- or tissue-specific experimental data, such as histone modifications and DNase I cleavage patterns, with genomic information, such as gene annotation and evolutionary conservation. Comparison to empirical ChIP-seq data suggests that our method is highly accurate, yet has the advantage of targeting many factors in a single assay. We anticipate that this approach will be a valuable tool for genome-wide studies of gene regulation in a wide variety of cell types or tissues under diverse conditions.
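
The idea of combining sequence-level prior information with cell-specific experimental data can be illustrated with a minimal sketch. This is not the authors' model: it assumes a logistic prior on a motif (position weight matrix) score and a conservation score, and a simple Poisson likelihood for DNase I cut counts around the candidate site. All function names, weights, and rates below are hypothetical values chosen for illustration only.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def log_poisson(k, rate):
    """Log probability of observing k events under a Poisson(rate) model."""
    return k * math.log(rate) - rate - math.lgamma(k + 1)


def prior_log_odds(pwm_score, conservation, w0=-4.0, w_pwm=0.3, w_cons=2.0):
    """Prior log-odds of binding from genomic information.

    The weights are hypothetical; in practice they would be fitted to data.
    """
    return w0 + w_pwm * pwm_score + w_cons * conservation


def posterior_bound(dnase_cuts, pwm_score, conservation,
                    rate_bound=20.0, rate_unbound=2.0):
    """Posterior probability that a candidate motif site is bound.

    Combines the prior log-odds with the log-likelihood ratio of the
    observed DNase I cut count under a 'bound' versus an 'unbound'
    Poisson rate (both rates are assumed, not estimated).
    """
    log_lr = (log_poisson(dnase_cuts, rate_bound)
              - log_poisson(dnase_cuts, rate_unbound))
    return sigmoid(prior_log_odds(pwm_score, conservation) + log_lr)


if __name__ == "__main__":
    # Same motif match and conservation, different accessibility.
    print(posterior_bound(dnase_cuts=25, pwm_score=10.0, conservation=0.8))
    print(posterior_bound(dnase_cuts=1, pwm_score=10.0, conservation=0.8))
```

In this sketch, two sites with identical motif scores receive very different posterior probabilities depending on the experimental signal, which is the general behavior a framework of this kind is meant to capture.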

UI | MeSH Term | Description | Entries
D011485 | Protein Binding | The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments. | Plasma Protein Binding Capacity; Binding, Protein
D012045 | Regulatory Sequences, Nucleic Acid | Nucleic acid sequences involved in regulating the expression of genes. | Nucleic Acid Regulatory Sequences; Regulatory Regions, Nucleic Acid (Genetics); Region, Regulatory; Regions, Regulatory; Regulator Regions, Nucleic Acid; Regulatory Region; Regulatory Regions
D002478 | Cells, Cultured | Cells propagated in vitro in special media conducive to their growth. Cultured cells are used to study developmental, morphologic, metabolic, physiologic, and genetic processes, among others. | Cultured Cells; Cell, Cultured; Cultured Cell
D002843 | Chromatin | The material of CHROMOSOMES. It is a complex of DNA; HISTONES; and nonhistone proteins (CHROMOSOMAL PROTEINS, NON-HISTONE) found within the nucleus of a cell. | Chromatins
D006657 | Histones | Small chromosomal proteins (approx 12-20 kD) possessing an open, unfolded structure and attached to the DNA in cell nuclei by ionic linkages. Classification into the various types (designated histone I, histone II, etc.) is based on the relative amounts of arginine and lysine in each. | Histone; Histone H1; Histone H1(s); Histone H2a; Histone H2b; Histone H3; Histone H3.3; Histone H4; Histone H5; Histone H7
D006801 | Humans | Members of the species Homo sapiens. | Homo sapiens; Man (Taxonomy); Human; Man, Modern; Modern Man
D001665 | Binding Sites | The parts of a macromolecule that directly participate in its specific combination with another molecule. | Combining Site; Binding Site; Combining Sites; Site, Binding; Site, Combining; Sites, Binding; Sites, Combining
D014157 | Transcription Factors | Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process. | Transcription Factor; Factor, Transcription; Factors, Transcription
D014158 | Transcription, Genetic | The biosynthesis of RNA carried out on a template of DNA. The biosynthesis of DNA from an RNA template is called REVERSE TRANSCRIPTION. | Genetic Transcription
D016678 | Genome | The genetic complement of an organism, including all of its GENES, as represented in its DNA, or in some cases, its RNA. | Genomes
