Effects of rare codon clusters on high-level expression of heterologous proteins in Escherichia coli. 1995

J F Kane
SmithKline Beecham Pharmaceuticals, King of Prussia, USA.

Within Escherichia coli and other species, a clear codon bias exists among the 61 amino acid codons found within the population of mRNA molecules, and the level of cognate tRNA appears directly proportional to the frequency of codon usage. Given this situation, one would predict translational problems with an abundant mRNA species containing an excess of rare low tRNA codons. Such a situation might arise after the initiation of transcription of a cloned heterologous gene in the E. coli host. Recent studies suggest clusters of AGG/AGA, CUA, AUA, CGA or CCC codons can reduce both the quantity and quality of the synthesized protein. In addition, it is likely that an excess of any of these codons, even without clusters, could create translational problems.

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D009154 Mutation Any detectable and heritable change in the genetic material that causes a change in the GENOTYPE and which is transmitted to daughter cells and to succeeding generations. Mutations
D011994 Recombinant Proteins Proteins prepared by recombinant DNA technology. Biosynthetic Protein,Biosynthetic Proteins,DNA Recombinant Proteins,Recombinant Protein,Proteins, Biosynthetic,Proteins, Recombinant DNA,DNA Proteins, Recombinant,Protein, Biosynthetic,Protein, Recombinant,Proteins, DNA Recombinant,Proteins, Recombinant,Recombinant DNA Proteins,Recombinant Proteins, DNA
D003062 Codon A set of three nucleotides in a protein coding sequence that specifies individual amino acids or a termination signal (CODON, TERMINATOR). Most codons are universal, but some organisms do not produce the transfer RNAs (RNA, TRANSFER) complementary to all codons. These codons are referred to as unassigned codons (CODONS, NONSENSE). Codon, Sense,Sense Codon,Codons,Codons, Sense,Sense Codons
D004926 Escherichia coli A species of gram-negative, facultatively anaerobic, rod-shaped bacteria (GRAM-NEGATIVE FACULTATIVELY ANAEROBIC RODS) commonly found in the lower part of the intestine of warm-blooded animals. It is usually nonpathogenic, but some strains are known to produce DIARRHEA and pyogenic infections. Pathogenic strains (virotypes) are classified by their specific pathogenic mechanisms such as toxins (ENTEROTOXIGENIC ESCHERICHIA COLI), etc. Alkalescens-Dispar Group,Bacillus coli,Bacterium coli,Bacterium coli commune,Diffusely Adherent Escherichia coli,E coli,EAggEC,Enteroaggregative Escherichia coli,Enterococcus coli,Diffusely Adherent E. coli,Enteroaggregative E. coli,Enteroinvasive E. coli,Enteroinvasive Escherichia coli
D005810 Multigene Family A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed) Gene Clusters,Genes, Reiterated,Cluster, Gene,Clusters, Gene,Families, Multigene,Family, Multigene,Gene Cluster,Gene, Reiterated,Multigene Families,Reiterated Gene,Reiterated Genes
D001120 Arginine An essential amino acid that is physiologically active in the L-form. Arginine Hydrochloride,Arginine, L-Isomer,DL-Arginine Acetate, Monohydrate,L-Arginine,Arginine, L Isomer,DL Arginine Acetate, Monohydrate,Hydrochloride, Arginine,L Arginine,L-Isomer Arginine,Monohydrate DL-Arginine Acetate
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA
D012270 Ribosomes Multicomponent ribonucleoprotein structures found in the CYTOPLASM of all cells, and in MITOCHONDRIA, and PLASTIDS. They function in PROTEIN BIOSYNTHESIS via GENETIC TRANSLATION. Ribosome
D014176 Protein Biosynthesis The biosynthesis of PEPTIDES and PROTEINS on RIBOSOMES, directed by MESSENGER RNA, via TRANSFER RNA that is charged with standard proteinogenic AMINO ACIDS. Genetic Translation,Peptide Biosynthesis, Ribosomal,Protein Translation,Translation, Genetic,Protein Biosynthesis, Ribosomal,Protein Synthesis, Ribosomal,Ribosomal Peptide Biosynthesis,mRNA Translation,Biosynthesis, Protein,Biosynthesis, Ribosomal Peptide,Biosynthesis, Ribosomal Protein,Genetic Translations,Ribosomal Protein Biosynthesis,Ribosomal Protein Synthesis,Synthesis, Ribosomal Protein,Translation, Protein,Translation, mRNA,mRNA Translations
Copied contents to your clipboard!