Physical-chemical determinants of coil conformations in globular proteins. 2010

Lauren L Perskie, and George D Rose
T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, USA.

We present a method with the potential to generate a library of coil segments from first principles. Proteins are built from alpha-helices and/or beta-strands interconnected by these coil segments. Here, we investigate the conformational determinants of short coil segments, with particular emphasis on chain turns. Toward this goal, we extracted a comprehensive set of two-, three-, and four-residue turns from X-ray-elucidated proteins and classified them by conformation. A remarkably small number of unique conformers account for most of this experimentally determined set, whereas remaining members span a large number of rare conformers, many occurring only once in the entire protein database. Factors determining conformation were identified via Metropolis Monte Carlo simulations devised to test the effectiveness of various energy terms. Simulated structures were validated by comparison to experimental counterparts. After filtering rare conformers, we found that 98% of the remaining experimentally determined turn population could be reproduced by applying a hydrogen bond energy term to an exhaustively generated ensemble of clash-free conformers in which no backbone polar group lacks a hydrogen-bond partner. Further, at least 90% of longer coil segments, ranging from 5- to 20 residues, were found to be structural composites of these shorter primitives. These results are pertinent to protein structure prediction, where approaches can be divided into either empirical or ab initio methods. Empirical methods use database-derived information; ab initio methods rely on physical-chemical principles exclusively. Replacing the database-derived coil library with one generated from first principles would transform any empirically based method into its corresponding ab initio homologue.

UI MeSH Term Description Entries
D009010 Monte Carlo Method In statistics, a technique for numerically approximating the solution of a mathematical problem by studying the distribution of some random variable, often generated by a computer. The name alludes to the randomness characteristic of the games of chance played at the gambling casinos in Monte Carlo. (From Random House Unabridged Dictionary, 2d ed, 1993) Method, Monte Carlo
D011506 Proteins Linear POLYPEPTIDES that are synthesized on RIBOSOMES and may be further modified, crosslinked, cleaved, or assembled into complex proteins with several subunits. The specific sequence of AMINO ACIDS determines the shape the polypeptide will take, during PROTEIN FOLDING, and the function of the protein. Gene Products, Protein,Gene Proteins,Protein,Protein Gene Products,Proteins, Gene
D003198 Computer Simulation Computer-based representation of physical systems and phenomena such as chemical processes. Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D017434 Protein Structure, Tertiary The level of protein structure in which combinations of secondary protein structures (ALPHA HELICES; BETA SHEETS; loop regions, and AMINO ACID MOTIFS) pack together to form folded shapes. Disulfide bridges between cysteines in two different parts of the polypeptide chain along with other interactions between the chains play a role in the formation and stabilization of tertiary structure. Tertiary Protein Structure,Protein Structures, Tertiary,Tertiary Protein Structures
D017510 Protein Folding Processes involved in the formation of TERTIARY PROTEIN STRUCTURE. Protein Folding, Globular,Folding, Globular Protein,Folding, Protein,Foldings, Globular Protein,Foldings, Protein,Globular Protein Folding,Globular Protein Foldings,Protein Foldings,Protein Foldings, Globular
D020816 Amino Acid Motifs Three-dimensional protein structural elements that are composed of a combination of secondary structures. They include HELIX-LOOP-HELIX MOTIFS and ZINC FINGERS. Motifs are typically the most conserved regions of PROTEIN DOMAINS and are critical for domain function. However, the same motif may occur in proteins or enzymes with different functions. AA Motifs,Motifs, Amino Acid,Protein Motifs,Protein Structure, Supersecondary,Supersecondary Protein Structure,AA Motif,Amino Acid Motif,Motif, AA,Motif, Amino Acid,Motif, Protein,Motifs, AA,Motifs, Protein,Protein Motif,Protein Structures, Supersecondary,Supersecondary Protein Structures
D030562 Databases, Protein Databases containing information about PROTEINS such as AMINO ACID SEQUENCE; PROTEIN CONFORMATION; and other properties. Amino Acid Sequence Databases,Databases, Amino Acid Sequence,Protein Databases,Protein Sequence Databases,SWISS-PROT,Protein Structure Databases,SwissProt,Database, Protein,Database, Protein Sequence,Database, Protein Structure,Databases, Protein Sequence,Databases, Protein Structure,Protein Database,Protein Sequence Database,Protein Structure Database,SWISS PROT,Sequence Database, Protein,Sequence Databases, Protein,Structure Database, Protein,Structure Databases, Protein

Related Publications

Lauren L Perskie, and George D Rose
December 1967, The Journal of biological chemistry,
Lauren L Perskie, and George D Rose
January 1963, Biokhimiia (Moscow, Russia),
Lauren L Perskie, and George D Rose
January 1981, Molekuliarnaia biologiia,
Lauren L Perskie, and George D Rose
January 1989, Proteins,
Lauren L Perskie, and George D Rose
November 1997, Acta crystallographica. Section D, Biological crystallography,
Lauren L Perskie, and George D Rose
November 2019, Chemical research in toxicology,
Lauren L Perskie, and George D Rose
March 2022, ACS macro letters,
Copied contents to your clipboard!