Conserved amino acid networks involved in antibody variable domain interactions. 2009

Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
Biogen Idec, San Diego, California 92122, USA.

Engineered antibodies are a large and growing class of protein therapeutics comprising both marketed products and many molecules in clinical trials in various disease indications. We investigated naturally conserved networks of amino acids that support antibody V(H) and V(L) function, with the goal of generating information to assist in the engineering of robust antibody or antibody-like therapeutics. We generated a large and diverse sequence alignment of V-class Ig-folds, of which V(H) and V(L) domains are family members. To identify conserved amino acid networks, covariations between residues at all possible position pairs were quantified as correlation coefficients (phi-values). We provide rosters of the key conserved amino acid pairs in antibody V(H) and V(L) domains, for reference and use by the antibody research community. The majority of the most strongly conserved amino acid pairs in V(H) and V(L) are at or adjacent to the V(H)-V(L) interface suggesting that the ability to heterodimerize is a constraining feature of antibody evolution. For the V(H) domain, but not the V(L) domain, residue pairs at the variable-constant domain interface (V(H)-C(H)1 interface) are also strongly conserved. The same network of conserved V(H) positions involved in interactions with both the V(L) and C(H)1 domains is found in camelid V(HH) domains, which have evolved to lack interactions with V(L) and C(H)1 domains in their mature structures; however, the amino acids at these positions are different, reflecting their different function. Overall, the data describe naturally occurring amino acid networks in antibody Fv regions that can be referenced when designing antibodies or antibody-like fragments with the goal of improving their biophysical properties.

UI MeSH Term Description Entries
D007135 Immunoglobulin Variable Region That region of the immunoglobulin molecule that varies in its amino acid sequence and composition, and comprises the binding site for a specific antigen. It is located at the N-terminus of the Fab fragment of the immunoglobulin. It includes hypervariable regions (COMPLEMENTARITY DETERMINING REGIONS) and framework regions. Variable Region, Ig,Variable Region, Immunoglobulin,Framework Region, Immunoglobulin,Fv Antibody Fragments,Fv Fragments,Ig Framework Region,Ig Variable Region,Immunoglobulin Framework Region,Immunoglobulin Fv Fragments,Immunoglobulin V,Antibody Fragment, Fv,Antibody Fragments, Fv,Fragment, Fv,Fragment, Fv Antibody,Fragment, Immunoglobulin Fv,Fragments, Fv,Fragments, Fv Antibody,Fragments, Immunoglobulin Fv,Framework Region, Ig,Framework Regions, Ig,Framework Regions, Immunoglobulin,Fv Antibody Fragment,Fv Fragment,Fv Fragment, Immunoglobulin,Fv Fragments, Immunoglobulin,Ig Framework Regions,Ig Variable Regions,Immunoglobulin Framework Regions,Immunoglobulin Fv Fragment,Immunoglobulin Variable Regions,Regions, Immunoglobulin Variable,Variable Regions, Ig,Variable Regions, Immunoglobulin
D007143 Immunoglobulin Heavy Chains The largest of polypeptide chains comprising immunoglobulins. They contain 450 to 600 amino acid residues per chain, and have molecular weights of 51-72 kDa. Immunoglobulins, Heavy-Chain,Heavy-Chain Immunoglobulins,Ig Heavy Chains,Immunoglobulin Heavy Chain,Immunoglobulin Heavy Chain Subgroup VH-I,Immunoglobulin Heavy Chain Subgroup VH-III,Heavy Chain Immunoglobulins,Heavy Chain, Immunoglobulin,Heavy Chains, Ig,Heavy Chains, Immunoglobulin,Immunoglobulin Heavy Chain Subgroup VH I,Immunoglobulin Heavy Chain Subgroup VH III,Immunoglobulins, Heavy Chain
D007147 Immunoglobulin Light Chains Polypeptide chains, consisting of 211 to 217 amino acid residues and having a molecular weight of approximately 22 kDa. There are two major types of light chains, kappa and lambda. Two Ig light chains and two Ig heavy chains (IMMUNOGLOBULIN HEAVY CHAINS) make one immunoglobulin molecule. Ig Light Chains,Immunoglobulins, Light-Chain,Immunoglobulin Light Chain,Immunoglobulin Light-Chain,Light-Chain Immunoglobulins,Chains, Ig Light,Chains, Immunoglobulin Light,Immunoglobulins, Light Chain,Light Chain Immunoglobulins,Light Chain, Immunoglobulin,Light Chains, Ig,Light Chains, Immunoglobulin,Light-Chain, Immunoglobulin
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D000906 Antibodies Immunoglobulin molecules having a specific amino acid sequence by virtue of which they interact only with the ANTIGEN (or a very similar shape) that induced their synthesis in cells of the lymphoid series (especially PLASMA CELLS).
D015202 Protein Engineering Procedures by which protein structure and function are changed or created in vitro by altering existing or synthesizing new structural genes that direct the synthesis of proteins with sought-after properties. Such procedures may include the design of MOLECULAR MODELS of proteins using COMPUTER GRAPHICS or other molecular modeling techniques; site-specific mutagenesis (MUTAGENESIS, SITE-SPECIFIC) of existing genes; and DIRECTED MOLECULAR EVOLUTION techniques to create new genes. Genetic Engineering of Proteins,Genetic Engineering, Protein,Proteins, Genetic Engineering,Engineering, Protein,Engineering, Protein Genetic,Protein Genetic Engineering
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017124 Conserved Sequence A sequence of amino acids in a polypeptide or of nucleotides in DNA or RNA that is similar across multiple species. A known set of conserved sequences is represented by a CONSENSUS SEQUENCE. AMINO ACID MOTIFS are often composed of conserved sequences. Conserved Sequences,Sequence, Conserved,Sequences, Conserved
D017510 Protein Folding Processes involved in the formation of TERTIARY PROTEIN STRUCTURE. Protein Folding, Globular,Folding, Globular Protein,Folding, Protein,Foldings, Globular Protein,Foldings, Protein,Globular Protein Folding,Globular Protein Foldings,Protein Foldings,Protein Foldings, Globular

Related Publications

Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
January 1991, Molekuliarnaia biologiia,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
February 2010, Journal of bacteriology,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
June 2017, Scientific reports,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
February 2005, Amino acids,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
September 2005, Bioinformatics (Oxford, England),
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
February 2020, Biophysical journal,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
February 2000, Hepatology (Baltimore, Md.),
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
January 2004, Journal of experimental botany,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
May 1989, Planta,
Norman Wang, and William F Smith, and Brian R Miller, and Dikran Aivazian, and Alexey A Lugovskoy, and Mitchell E Reff, and Scott M Glaser, and Lisa J Croner, and Stephen J Demarest
July 2007, BMC bioinformatics,
Copied contents to your clipboard!