Cloning and DNA sequence of the gene coding for Clostridium thermocellum cellulase Ss (CelS), a major cellulosome component. 1993

W K Wang, and K Kruus, and J H Wu
Department of Chemical Engineering, University of Rochester, New York 14627-0166.

Clostridium thermocellum ATCC 27405 produces an extracellular cellulase system capable of hydrolyzing crystalline cellulose. The enzyme system involves a multicomponent protein aggregate (the cellulosome) with a total molecular weight in the millions, impeding mechanistic studies. However, two major components of the aggregate, SS (M(r) = 82,000) and SL (M(r) = 250,000), which act synergistically to hydrolyze crystalline cellulose, have been identified (J. H. D. Wu, W. H. Orme-Johnson, and A. L. Demain, Biochemistry 27:1703-1709, 1988). To further study this synergism, we cloned and sequenced the gene (celS) coding for the SS (CelS) protein by using a degenerate, inosine-containing oligonucleotide probe whose sequence was derived from the N-terminal amino acid sequence of the CelS protein. The open reading frame of celS consisted of 2,241 bp encoding 741 amino acid residues. It encoded the N-terminal amino acid sequence and two internal peptide sequences determined for the native CelS protein. A putative ribosome binding site was identified at the 5' end of the gene. A putative signal peptide of 27 amino acid residues was adjacent to the N terminus of the CelS protein. The predicted molecular weight of the secreted protein was 80,670. The celS gene contained a conserved reiterated sequence encoding 24 amino acid residues found in proteins encoded by many other clostridial cel or xyn genes. A palindromic structure was found downstream from the open reading frame. The celS gene is unique among the known cel genes of C. thermocellum. However, it is highly homologous to the partial open reading frame found in C. cellulolyticum and in Caldocellum saccharolyticum, indicating that these genes belong to a new family of cel genes.

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D012091 Repetitive Sequences, Nucleic Acid Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES). DNA Repetitious Region,Direct Repeat,Genes, Selfish,Nucleic Acid Repetitive Sequences,Repetitive Region,Selfish DNA,Selfish Genes,DNA, Selfish,Repetitious Region, DNA,Repetitive Sequence,DNA Repetitious Regions,DNAs, Selfish,Direct Repeats,Gene, Selfish,Repeat, Direct,Repeats, Direct,Repetitious Regions, DNA,Repetitive Regions,Repetitive Sequences,Selfish DNAs,Selfish Gene
D002480 Cellulase An endocellulase with specificity for the hydrolysis of 1,4-beta-glucosidic linkages in CELLULOSE, lichenin, and cereal beta-glucans. Endo-1,4-beta-Glucanase,Cellulysin,Endoglucanase,Endoglucanase A,Endoglucanase C,Endoglucanase E,Endoglucanase IV,Endoglucanase Y,beta-1,4-Glucan-4-Glucanohydrolase,Endo 1,4 beta Glucanase,beta 1,4 Glucan 4 Glucanohydrolase
D003001 Cloning, Molecular The insertion of recombinant DNA molecules from prokaryotic and/or eukaryotic sources into a replicating vehicle, such as a plasmid or virus vector, and the introduction of the resultant hybrid molecules into recipient cells without altering the viability of those cells. Molecular Cloning
D003013 Clostridium A genus of motile or nonmotile gram-positive bacteria of the family Clostridiaceae. Many species have been identified with some being pathogenic. They occur in water, soil, and in the intestinal tract of humans and lower animals.
D003062 Codon A set of three nucleotides in a protein coding sequence that specifies individual amino acids or a termination signal (CODON, TERMINATOR). Most codons are universal, but some organisms do not produce the transfer RNAs (RNA, TRANSFER) complementary to all codons. These codons are referred to as unassigned codons (CODONS, NONSENSE). Codon, Sense,Sense Codon,Codons,Codons, Sense,Sense Codons
D004269 DNA, Bacterial Deoxyribonucleic acid that makes up the genetic material of bacteria. Bacterial DNA
D005798 Genes, Bacterial The functional hereditary units of BACTERIA. Bacterial Gene,Bacterial Genes,Gene, Bacterial
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D001482 Base Composition The relative amounts of the PURINES and PYRIMIDINES in a nucleic acid. Base Ratio,G+C Composition,Guanine + Cytosine Composition,G+C Content,GC Composition,GC Content,Guanine + Cytosine Content,Base Compositions,Base Ratios,Composition, Base,Composition, G+C,Composition, GC,Compositions, Base,Compositions, G+C,Compositions, GC,Content, G+C,Content, GC,Contents, G+C,Contents, GC,G+C Compositions,G+C Contents,GC Compositions,GC Contents,Ratio, Base,Ratios, Base

Related Publications

W K Wang, and K Kruus, and J H Wu
November 1993, Applied biochemistry and biotechnology,
W K Wang, and K Kruus, and J H Wu
February 1986, Nucleic acids research,
W K Wang, and K Kruus, and J H Wu
January 1993, Applied biochemistry and biotechnology,
W K Wang, and K Kruus, and J H Wu
July 1997, Journal of bacteriology,
W K Wang, and K Kruus, and J H Wu
December 2001, Applied microbiology and biotechnology,
W K Wang, and K Kruus, and J H Wu
November 1991, Applied biochemistry and biotechnology,
W K Wang, and K Kruus, and J H Wu
April 1985, Journal of bacteriology,
Copied contents to your clipboard!