DNA-binding residues and binding mode prediction with binding-mechanism concerned models. 2009

Yu-Feng Huang, Chun-Chin Huang, Yu-Cheng Liu, Yen-Jen Oyang, and Chien-Kang Huang
Department of Computer Science and Information Engineering, National Taiwan University, Taipei, 106, Taiwan, Republic of China. yfhuang@csie.ntu.edu.tw

BACKGROUND Protein-DNA interactions are essential for fundamental biological activities, including DNA transcription, replication, packaging, repair and rearrangement. Proteins that interact with DNA can be classified by binding mechanism into two categories: sequence-specific and non-specific binding. Sequence-specific binding provides a mechanism for recognizing the correct nucleotide base pairs. Non-specific binding is a sequence-independent interaction with the DNA backbone that accelerates target location. Both sequence-specific and non-specific binding residues contribute to the interaction.
RESULTS The proposed framework consists of two prediction stages: DNA-binding residue prediction and binding mode prediction. In the first stage, the predictor for sequence-specific binding residues achieves 96.45% accuracy with 50.14% sensitivity, 99.31% specificity, 81.70% precision, and 62.15% F-measure. The predictor for non-specific binding residues achieves 89.14% accuracy with 53.06% sensitivity, 95.25% specificity, 65.47% precision, and 58.62% F-measure. When the predictions for sequence-specific and non-specific binding residues are combined with an OR operation, the predictor achieves 89.26% accuracy with 56.86% sensitivity, 95.63% specificity, 71.92% precision, and 63.51% F-measure. In the second stage, protein-DNA binding mode prediction achieves 75.83% accuracy using a multi-class support vector machine.
CONCLUSIONS This article presents the design of a sequence-based predictor that identifies sequence-specific and non-specific binding residues in a transcription factor while taking the DNA binding mechanism into account. Protein-DNA binding mode prediction was introduced to help improve DNA-binding residue prediction. In addition, the results of this study will help with the design of binding-mechanism concerned predictors for other families of proteins that interact with DNA.
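
The first-stage figures above can be related to one another with a short sketch. The Python below is illustrative only and is not the authors' implementation: the function names (`or_combine`, `residue_metrics`, `f_measure`) and the toy labels are assumptions made for this example, and the F-measure is assumed to be the usual harmonic mean of precision and sensitivity. It shows how two per-residue predictions can be merged with an OR operation and how the five reported metrics follow from confusion-matrix counts.

```python
def or_combine(specific_pred, nonspecific_pred):
    """Mark a residue as DNA-binding if either predictor flags it (OR operation)."""
    return [int(bool(s) or bool(n)) for s, n in zip(specific_pred, nonspecific_pred)]


def f_measure(precision, sensitivity):
    """Harmonic mean of precision and sensitivity (assumed definition of F-measure)."""
    total = precision + sensitivity
    return 2 * precision * sensitivity / total if total else 0.0


def residue_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity, specificity, precision and F-measure
    from binary per-residue labels (1 = binding, 0 = non-binding)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    def ratio(num, den):
        # Guard against empty classes in small toy inputs.
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)
    sensitivity = ratio(tp, tp + fn)
    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": sensitivity,
        "specificity": ratio(tn, tn + fp),
        "precision": precision,
        "f_measure": f_measure(precision, sensitivity),
    }


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the paper.
    true_labels = [1, 1, 0, 0, 1, 0, 0, 0]
    specific = [1, 0, 0, 0, 0, 0, 0, 0]
    nonspecific = [0, 1, 0, 1, 0, 0, 0, 0]
    combined = or_combine(specific, nonspecific)
    print(residue_metrics(true_labels, combined))

    # Recompute the F-measure from the reported precision and sensitivity (in %).
    for name, p, r in [("specific", 81.70, 50.14),
                       ("non-specific", 65.47, 53.06),
                       ("combined (OR)", 71.92, 56.86)]:
        print(name, round(f_measure(p, r), 2))
```

Under the harmonic-mean assumption, the F-measures recomputed from the reported precision and sensitivity agree with the reported 62.15%, 58.62% and 63.51% to within rounding. The OR rule can only add predicted binding residues, which is consistent with the combined predictor having higher sensitivity than either single predictor.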

UI | MeSH Term | Description | Entries
D008958 | Models, Molecular | Models used experimentally or theoretically to study molecular shape, electronic properties, or interactions; includes analogous molecules, computer-generated graphics, and mechanical structures. | Molecular Models; Model, Molecular; Molecular Model
D009690 | Nucleic Acid Conformation | The spatial arrangement of the atoms of a nucleic acid or polynucleotide that results in its characteristic 3-dimensional shape. | DNA Conformation; RNA Conformation; Conformation, DNA; Conformation, Nucleic Acid; Conformation, RNA; Conformations, DNA; Conformations, Nucleic Acid; Conformations, RNA; DNA Conformations; Nucleic Acid Conformations; RNA Conformations
D004247 | DNA | A deoxyribonucleotide polymer that is the primary genetic material of all cells. Eukaryotic and prokaryotic organisms normally contain DNA in a double-stranded state, yet several important biological processes transiently involve single-stranded regions. DNA, which consists of a polysugar-phosphate backbone possessing projections of purines (adenine and guanine) and pyrimidines (thymine and cytosine), forms a double helix that is held together by hydrogen bonds between these purines and pyrimidines (adenine to thymine and guanine to cytosine). | DNA, Double-Stranded; Deoxyribonucleic Acid; ds-DNA; DNA, Double Stranded; Double-Stranded DNA; ds DNA
D004268 | DNA-Binding Proteins | Proteins which bind to DNA. The family includes proteins which bind to both double- and single-stranded DNA and also includes specific DNA binding proteins in serum which can be used as markers for malignant diseases. | DNA Helix Destabilizing Proteins; DNA-Binding Protein; Single-Stranded DNA Binding Proteins; DNA Binding Protein; DNA Single-Stranded Binding Protein; SS DNA BP; Single-Stranded DNA-Binding Protein; Binding Protein, DNA; DNA Binding Proteins; DNA Single Stranded Binding Protein; DNA-Binding Protein, Single-Stranded; Protein, DNA-Binding; Single Stranded DNA Binding Protein; Single Stranded DNA Binding Proteins
D001665 | Binding Sites | The parts of a macromolecule that directly participate in its specific combination with another molecule. | Combining Site; Binding Site; Combining Sites; Site, Binding; Site, Combining; Sites, Binding; Sites, Combining
D017434 | Protein Structure, Tertiary | The level of protein structure in which combinations of secondary protein structures (ALPHA HELICES; BETA SHEETS; loop regions, and AMINO ACID MOTIFS) pack together to form folded shapes. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. | Tertiary Protein Structure; Protein Structures, Tertiary; Tertiary Protein Structures
D020539 | Sequence Analysis, Protein | A process that includes the determination of AMINO ACID SEQUENCE of a protein (or peptide, oligopeptide or peptide fragment) and the information analysis of the sequence. | Amino Acid Sequence Analysis; Peptide Sequence Analysis; Protein Sequence Analysis; Sequence Determination, Protein; Amino Acid Sequence Analyses; Amino Acid Sequence Determination; Amino Acid Sequence Determinations; Amino Acid Sequencing; Peptide Sequence Determination; Protein Sequencing; Sequence Analyses, Amino Acid; Sequence Analysis, Amino Acid; Sequence Analysis, Peptide; Sequence Determination, Amino Acid; Sequence Determinations, Amino Acid; Acid Sequencing, Amino; Analyses, Peptide Sequence; Analyses, Protein Sequence; Analysis, Peptide Sequence; Analysis, Protein Sequence; Peptide Sequence Analyses; Peptide Sequence Determinations; Protein Sequence Analyses; Protein Sequence Determination; Protein Sequence Determinations; Sequence Analyses, Peptide; Sequence Analyses, Protein; Sequence Determination, Peptide; Sequence Determinations, Peptide; Sequence Determinations, Protein; Sequencing, Amino Acid; Sequencing, Protein
