Sequence divergence in a family of variant surface glycoprotein genes from trypanosomes: coding region hypervariability and downstream recombinogenic repeats. 1996

M C Field, and J C Boothroyd
Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford University, CA 94305, USA.

The surface of the parasitic protozoan Trypanosoma brucei spp. is covered with a dense coat consisting of a single type of glycoprotein molecule, the variant surface glycoprotein (VSG). There may be as many as 1,000 genes for VSG within the genome of T. brucei, and the switch of expression from one to another is the phenomenon of antigenic variation. As an approach to understanding the evolution of VSG genes we have determined the genomic DNA sequences of the eight genes encoding the variant surface glycoprotein 117 (VSG) family. From these data we have observed a number of features concerning the relationships between these genes: (1) there is a region of high variability confined to the N-terminus of the coding sequence, and comparison of the sequences with the available X-ray diffraction crystal structures suggests that two of the most variable stretches within the N-terminal domain are present on surface-exposed loops, indicating a role for epitope selection in evolution of these genes; (2) the 29 nucleotides surrounding the splice acceptor site are absolutely conserved in all eight 117 VSG genes; (3) numerous insertion/deletion mutations are located within or immediately downstream of the C-terminal protein-coding sequences: (4) within 500 bp downstream of the insertion/deletion mutations are one or two copies of a repeat motif highly homologous to the recombinogenic 76-bp repeat sequences present upstream of many VSG basic copy genes and the expression-linked copy.

UI MeSH Term Description Entries
D008969 Molecular Sequence Data Descriptions of specific amino acid, carbohydrate, or nucleotide sequences which have appeared in the published literature and/or are deposited in and maintained by databanks such as GENBANK, European Molecular Biology Laboratory (EMBL), National Biomedical Research Foundation (NBRF), or other sequence repositories. Sequence Data, Molecular,Molecular Sequencing Data,Data, Molecular Sequence,Data, Molecular Sequencing,Sequencing Data, Molecular
D011995 Recombination, Genetic Production of new arrangements of DNA by various mechanisms such as assortment and segregation, CROSSING OVER; GENE CONVERSION; GENETIC TRANSFORMATION; GENETIC CONJUGATION; GENETIC TRANSDUCTION; or mixed infection of viruses. Genetic Recombination,Recombination,Genetic Recombinations,Recombinations,Recombinations, Genetic
D012091 Repetitive Sequences, Nucleic Acid Sequences of DNA or RNA that occur in multiple copies. There are several types: INTERSPERSED REPETITIVE SEQUENCES are copies of transposable elements (DNA TRANSPOSABLE ELEMENTS or RETROELEMENTS) dispersed throughout the genome. TERMINAL REPEAT SEQUENCES flank both ends of another sequence, for example, the long terminal repeats (LTRs) on RETROVIRUSES. Variations may be direct repeats, those occurring in the same direction, or inverted repeats, those opposite to each other in direction. TANDEM REPEAT SEQUENCES are copies which lie adjacent to each other, direct or inverted (INVERTED REPEAT SEQUENCES). DNA Repetitious Region,Direct Repeat,Genes, Selfish,Nucleic Acid Repetitive Sequences,Repetitive Region,Selfish DNA,Selfish Genes,DNA, Selfish,Repetitious Region, DNA,Repetitive Sequence,DNA Repetitious Regions,DNAs, Selfish,Direct Repeats,Gene, Selfish,Repeat, Direct,Repeats, Direct,Repetitious Regions, DNA,Repetitive Regions,Repetitive Sequences,Selfish DNAs,Selfish Gene
D005810 Multigene Family A set of genes descended by duplication and variation from some ancestral gene. Such genes may be clustered together on the same chromosome or dispersed on different chromosomes. Examples of multigene families include those that encode the hemoglobins, immunoglobulins, histocompatibility antigens, actins, tubulins, keratins, collagens, heat shock proteins, salivary glue proteins, chorion proteins, cuticle proteins, yolk proteins, and phaseolins, as well as histones, ribosomal RNA, and transfer RNA genes. The latter three are examples of reiterated genes, where hundreds of identical genes are present in a tandem array. (King & Stanfield, A Dictionary of Genetics, 4th ed) Gene Clusters,Genes, Reiterated,Cluster, Gene,Clusters, Gene,Families, Multigene,Family, Multigene,Gene Cluster,Gene, Reiterated,Multigene Families,Reiterated Gene,Reiterated Genes
D000595 Amino Acid Sequence The order of amino acids as they occur in a polypeptide chain. This is referred to as the primary structure of proteins. It is of fundamental importance in determining PROTEIN CONFORMATION. Protein Structure, Primary,Amino Acid Sequences,Sequence, Amino Acid,Sequences, Amino Acid,Primary Protein Structure,Primary Protein Structures,Protein Structures, Primary,Structure, Primary Protein,Structures, Primary Protein
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D000940 Antigenic Variation Change in the surface ANTIGEN of a microorganism. There are two different types. One is a phenomenon, especially associated with INFLUENZA VIRUSES, where they undergo spontaneous variation both as slow antigenic drift and sudden emergence of new strains (antigenic shift). The second type is when certain PARASITES, especially trypanosomes, PLASMODIUM, and BORRELIA, survive the immune response of the host by changing the surface coat (antigen switching). (From Herbert et al., The Dictionary of Immunology, 4th ed) Antigen Switching,Antigenic Diversity,Variation, Antigenic,Antigen Variation,Antigenic Switching,Antigenic Variability,Switching, Antigenic,Diversity, Antigenic,Switching, Antigen,Variability, Antigenic,Variation, Antigen
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA
D012326 RNA Splicing The ultimate exclusion of nonsense sequences or intervening sequences (introns) before the final RNA transcript is sent to the cytoplasm. RNA, Messenger, Splicing,Splicing, RNA,RNA Splicings,Splicings, RNA
D014346 Trypanosoma brucei brucei A hemoflagellate subspecies of parasitic protozoa that causes nagana in domestic and game animals in Africa. It apparently does not infect humans. It is transmitted by bites of tsetse flies (Glossina). Trypanosoma brucei,Trypanosoma brucei bruceus,Trypanosoma bruceus,brucei brucei, Trypanosoma,brucei, Trypanosoma brucei,bruceus, Trypanosoma,bruceus, Trypanosoma brucei

Related Publications

M C Field, and J C Boothroyd
June 1992, Journal of molecular biology,
M C Field, and J C Boothroyd
January 2022, Trends in parasitology,
M C Field, and J C Boothroyd
January 1984, Oxford surveys on eukaryotic genes,
M C Field, and J C Boothroyd
November 2005, Biochemical Society transactions,
Copied contents to your clipboard!