Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins. 2003

Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
EMBL--European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK. suej@ebi.ac.uk

A method to detect DNA-binding sites on the surface of a protein structure is important for functional annotation. This work describes the analysis of residue patches on the surface of DNA-binding proteins and the development of a method of predicting DNA-binding sites using a single feature of these surface patches. Surface patches and the DNA-binding sites were initially analysed for accessibility, electrostatic potential, residue propensity, hydrophobicity and residue conservation. From this, it was observed that the DNA-binding sites were, in general, amongst the top 10% of patches with the largest positive electrostatic scores. This knowledge led to the development of a prediction method in which patches of surface residues were selected such that they excluded residues with negative electrostatic scores. This method was used to make predictions for a data set of 56 non-homologous DNA-binding proteins. Correct predictions made for 68% of the data set.

UI MeSH Term Description Entries
D004247 DNA A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). DNA, Double-Stranded,Deoxyribonucleic Acid,ds-DNA,DNA, Double Stranded,Double-Stranded DNA,ds DNA
D004268 DNA-Binding Proteins Proteins which bind to DNA. The family includes proteins which bind to both double- and single-stranded DNA and also includes specific DNA binding proteins in serum which can be used as markers for malignant diseases. DNA Helix Destabilizing Proteins,DNA-Binding Protein,Single-Stranded DNA Binding Proteins,DNA Binding Protein,DNA Single-Stranded Binding Protein,SS DNA BP,Single-Stranded DNA-Binding Protein,Binding Protein, DNA,DNA Binding Proteins,DNA Single Stranded Binding Protein,DNA-Binding Protein, Single-Stranded,Protein, DNA-Binding,Single Stranded DNA Binding Protein,Single Stranded DNA Binding Proteins
D001665 Binding Sites The parts of a macromolecule that directly participate in its specific combination with another molecule. Combining Site,Binding Site,Combining Sites,Site, Binding,Site, Combining,Sites, Binding,Sites, Combining
D017124 Conserved Sequence A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences. Conserved Sequences,Sequence, Conserved,Sequences, Conserved
D055672 Static Electricity The accumulation of an electric charge on a object Electrostatic,Electrostatics,Static Charge,Charge, Static,Charges, Static,Electricity, Static,Static Charges
D057927 Hydrophobic and Hydrophilic Interactions The thermodynamic interaction between a substance and WATER. Hydrophilic Interactions,Hydrophilic and Hydrophobic Interactions,Hydrophilicity,Hydrophobic Interactions,Hydrophobicity,Hydrophilic Interaction,Hydrophilicities,Hydrophobic Interaction,Hydrophobicities,Interaction, Hydrophilic,Interaction, Hydrophobic,Interactions, Hydrophilic,Interactions, Hydrophobic
D019295 Computational Biology A field of biology concerned with the development of techniques for the collection and manipulation of biological data, and the use of such data to make biological discoveries or predictions. This field encompasses all computational methods and theories for solving biological problems including manipulation of models and datasets. Bioinformatics,Molecular Biology, Computational,Bio-Informatics,Biology, Computational,Computational Molecular Biology,Bio Informatics,Bio-Informatic,Bioinformatic,Biologies, Computational Molecular,Biology, Computational Molecular,Computational Molecular Biologies,Molecular Biologies, Computational
D030541 Databases, Genetic Databases devoted to knowledge about specific genes and gene products. Genetic Databases,Genetic Sequence Databases,OMIM,Online Mendelian Inheritance In Man,Genetic Data Banks,Genetic Data Bases,Genetic Databanks,Genetic Information Databases,Bank, Genetic Data,Banks, Genetic Data,Data Bank, Genetic,Data Banks, Genetic,Data Base, Genetic,Data Bases, Genetic,Databank, Genetic,Databanks, Genetic,Database, Genetic,Database, Genetic Information,Database, Genetic Sequence,Databases, Genetic Information,Databases, Genetic Sequence,Genetic Data Bank,Genetic Data Base,Genetic Databank,Genetic Database,Genetic Information Database,Genetic Sequence Database,Information Database, Genetic,Information Databases, Genetic,Sequence Database, Genetic,Sequence Databases, Genetic

Related Publications

Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
July 2006, Proteins,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
March 2008, Current protocols in bioinformatics,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
April 2009, Journal of molecular biology,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
December 1976, Nucleic acids research,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
December 2011, Journal of bioinformatics and computational biology,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
July 2018, Proteins,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
November 2010, PLoS computational biology,
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
June 2012, Bioinformatics (Oxford, England),
Susan Jones, and Hugh P Shanahan, and Helen M Berman, and Janet M Thornton
August 1970, Journal of the American Chemical Society,
Copied contents to your clipboard!