Which Genetics Variants in DNase-Seq Footprints Are More Likely to Alter Binding? 2016

Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
Center for Molecular Medicine and Genetics, Wayne State University, Detroit, Michigan, United States of America.

Large experimental efforts are characterizing the regulatory genome, yet we are still missing a systematic definition of functional and silent genetic variants in non-coding regions. Here, we integrated DNaseI footprinting data with sequence-based transcription factor (TF) motif models to predict the impact of a genetic variant on TF binding across 153 tissues and 1,372 TF motifs. Each annotation we derived is specific for a cell-type condition or assay and is locally motif-driven. We found 5.8 million genetic variants in footprints, 66% of which are predicted by our model to affect TF binding. Comprehensive examination using allele-specific hypersensitivity (ASH) reveals that only the latter group consistently shows evidence for ASH (3,217 SNPs at 20% FDR), suggesting that most (97%) genetic variants in footprinted regulatory regions are indeed silent. Combining this information with GWAS data reveals that our annotation helps in computationally fine-mapping 86 SNPs in GWAS hit regions with at least a 2-fold increase in the posterior odds of picking the causal SNP. The rich meta information provided by the tissue-specificity and the identity of the putative TF binding site being affected also helps in identifying the underlying mechanism supporting the association. As an example, the enrichment for LDL level-associated SNPs is 9.1-fold higher among SNPs predicted to affect HNF4 binding sites than in a background model already including tissue-specific annotation.

UI MeSH Term Description Entries
D011485 Protein Binding The process in which substances, either endogenous or exogenous, bind to proteins, peptides, enzymes, protein precursors, or allied compounds. Specific protein-binding measures are often used as assays in diagnostic assessments. Plasma Protein Binding Capacity,Binding, Protein
D012045 Regulatory Sequences, Nucleic Acid Nucleic acid sequences involved in regulating the expression of genes. Nucleic Acid Regulatory Sequences,Regulatory Regions, Nucleic Acid (Genetics),Region, Regulatory,Regions, Regulatory,Regulator Regions, Nucleic Acid,Regulatory Region,Regulatory Regions
D003851 Deoxyribonucleases Enzymes which catalyze the hydrolases of ester bonds within DNA. EC 3.1.-. DNAase,DNase,Deoxyribonuclease,Desoxyribonuclease,Desoxyribonucleases,Nucleases, DNA,Acid DNase,Alkaline DNase,DNA Nucleases,DNase, Acid,DNase, Alkaline
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000483 Alleles Variant forms of the same gene, occupying the same locus on homologous CHROMOSOMES, and governing the variants in production of the same gene product. Allelomorphs,Allele,Allelomorph
D001665 Binding Sites The parts of a macromolecule that directly participate in its specific combination with another molecule. Combining Site,Binding Site,Combining Sites,Site, Binding,Site, Combining,Sites, Binding,Sites, Combining
D014157 Transcription Factors Endogenous substances, usually proteins, which are effective in the initiation, stimulation, or termination of the genetic transcription process. Transcription Factor,Factor, Transcription,Factors, Transcription
D015203 Reproducibility of Results The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results. Reliability and Validity,Reliability of Result,Reproducibility Of Result,Reproducibility of Finding,Validity of Result,Validity of Results,Face Validity,Reliability (Epidemiology),Reliability of Results,Reproducibility of Findings,Test-Retest Reliability,Validity (Epidemiology),Finding Reproducibilities,Finding Reproducibility,Of Result, Reproducibility,Of Results, Reproducibility,Reliabilities, Test-Retest,Reliability, Test-Retest,Result Reliabilities,Result Reliability,Result Validities,Result Validity,Result, Reproducibility Of,Results, Reproducibility Of,Test Retest Reliability,Validity and Reliability,Validity, Face
D055106 Genome-Wide Association Study An analysis comparing the allele frequencies of all available (or a whole GENOME representative set of) polymorphic markers to identify gene candidates or quantitative trait loci associated with a specific organism trait or specific disease or condition. Genome Wide Association Analysis,Genome Wide Association Study,GWA Study,Genome Wide Association Scan,Genome Wide Association Studies,Whole Genome Association Analysis,Whole Genome Association Study,Association Studies, Genome-Wide,Association Study, Genome-Wide,GWA Studies,Genome-Wide Association Studies,Studies, GWA,Studies, Genome-Wide Association,Study, GWA,Study, Genome-Wide Association
D058977 Molecular Sequence Annotation The addition of descriptive information about the function or structure of a molecular sequence to its MOLECULAR SEQUENCE DATA record. Gene Annotation,Protein Annotation,Annotation, Gene,Annotation, Molecular Sequence,Annotation, Protein,Annotations, Gene,Annotations, Molecular Sequence,Annotations, Protein,Gene Annotations,Molecular Sequence Annotations,Protein Annotations,Sequence Annotation, Molecular,Sequence Annotations, Molecular

Related Publications

Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
January 2012, Frontiers in genetics,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
January 2012, Clinical transplantation,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
January 2017, Human molecular genetics,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
March 2024, Sleep health,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
November 2020, International journal of medical informatics,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
October 2014, Knee surgery, sports traumatology, arthroscopy : official journal of the ESSKA,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
February 2019, Genome biology,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
January 2017, PloS one,
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
August 2013, Quarterly journal of experimental psychology (2006),
Gregory A Moyerbrailean, and Cynthia A Kalita, and Chris T Harvey, and Xiaoquan Wen, and Francesca Luca, and Roger Pique-Regi
January 2008, Genome biology,
Copied contents to your clipboard!