TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis. 2015

Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa. sarah@sanbi.ac.za.

BACKGROUND Transcription initiation regulation is mediated by sequence-specific interactions between DNA-binding proteins (transcription factors) and cis-elements, where BRE, TATA, INR, DPE and MTE motifs constitute canonical core motifs for basal transcription initiation of genes. Accurate identification of transcription start site (TSS) and their corresponding promoter regions is critical for delineation of these motifs. To this end, the genome scale analysis of core promoter architecture in insects has been confined to Drosophila. The recently sequenced Tsetse fly genome provides a unique opportunity to analyze transcription initiation regulation machinery in blood-feeding insects. RESULTS A computational method for identification of TSS in newly sequenced Tsetse fly genome was evaluated, using TSS seq tags sampled from two developmental stages namely; larvae and pupae. There were 3134 tag clusters among which 45.4% (1424) of the tag clusters mapped to first coding exons or their proximal predicted 5'UTR regions and 1.0% (31) tag clusters mapping to transposons, within a threshold of 100 tags per cluster. These 1393 non transposon-derived core promoters had propensity for AT nucleotides. The -1/+1 and 1/+1 positions in D. melanogaster, and G. m. morsitans had propensity for CA and AA dinucleotides respectively. The 1393 tag clusters comprised narrow promoters (5%), broad with peak promoters (23%) and broad without peak promoters (72%). Two-way motif co-occurrence analysis showed that the MTE-DPE pair is over-represented in broad core promoters. The frequently occurring triplet motifs in all promoter classes are the INR-MTE-DPE, TATA-MTE-DPE and TATA-INR-DPE. Promoters without the TATA motif had higher frequency of the MTE and INR motifs than those observed in Drosophila, where the DPE motif occur more frequently in promoters without TATA motif. Gene ontology terms associated with developmental processes were overrepresented in the narrow and broad with peak promoters. CONCLUSIONS The study has identified different motif combinations associated with broad promoters in a blood-feeding insect. In the case of TATA-less core promoters, G.m. morsitans uses the MTE to compensate for the lack of a TATA motif. The increasing availability of TSS seq data allows for revision of existing gene annotation datasets with the potential of identifying new transcriptional units.

UI MeSH Term Description Entries
D007303 Insect Vectors Insects that transmit infective organisms from one host to another or from an inanimate reservoir to an animate host. Insect Vector,Vector, Insect,Vectors, Insect
D011401 Promoter Regions, Genetic DNA sequences which are recognized (directly or indirectly) and bound by a DNA-dependent RNA polymerase during the initiation of transcription. Highly conserved sequences within the promoter include the Pribnow box in bacteria and the TATA BOX in eukaryotes. rRNA Promoter,Early Promoters, Genetic,Late Promoters, Genetic,Middle Promoters, Genetic,Promoter Regions,Promoter, Genetic,Promotor Regions,Promotor, Genetic,Pseudopromoter, Genetic,Early Promoter, Genetic,Genetic Late Promoter,Genetic Middle Promoters,Genetic Promoter,Genetic Promoter Region,Genetic Promoter Regions,Genetic Promoters,Genetic Promotor,Genetic Promotors,Genetic Pseudopromoter,Genetic Pseudopromoters,Late Promoter, Genetic,Middle Promoter, Genetic,Promoter Region,Promoter Region, Genetic,Promoter, Genetic Early,Promoter, rRNA,Promoters, Genetic,Promoters, Genetic Middle,Promoters, rRNA,Promotor Region,Promotors, Genetic,Pseudopromoters, Genetic,Region, Genetic Promoter,Region, Promoter,Region, Promotor,Regions, Genetic Promoter,Regions, Promoter,Regions, Promotor,rRNA Promoters
D000818 Animals Unicellular or multicellular, heterotrophic organisms, that have sensation and the power of voluntary movement. Under the older five kingdom paradigm, Animalia was one of the kingdoms. Under the modern three domain model, Animalia represents one of the many groups in the domain EUKARYOTA. Animal,Metazoa,Animalia
D001482 Base Composition The relative amounts of the PURINES and PYRIMIDINES in a nucleic acid. Base Ratio,G+C Composition,Guanine + Cytosine Composition,G+C Content,GC Composition,GC Content,Guanine + Cytosine Content,Base Compositions,Base Ratios,Composition, Base,Composition, G+C,Composition, GC,Compositions, Base,Compositions, G+C,Compositions, GC,Content, G+C,Content, GC,Contents, G+C,Contents, GC,G+C Compositions,G+C Contents,GC Compositions,GC Contents,Ratio, Base,Ratios, Base
D014345 Trypanosoma A genus of flagellate protozoans found in the BLOOD and LYMPH of vertebrates and invertebrates, both hosts being required to complete the life cycle. Nannomonas,Trypanosomes,Nannomona,Trypanosome
D014352 Trypanosomiasis Infection with protozoa of the genus TRYPANOSOMA. Trypanosomiases
D014370 Tsetse Flies Bloodsucking flies of the genus Glossina, found primarily in equatorial Africa. Several species are intermediate hosts of trypanosomes. Glossina,Flies, Tsetse,Fly, Tsetse,Glossinas,Tsetse Fly
D016000 Cluster Analysis A set of statistical methods used to group variables or observations into strongly inter-related subgroups. In epidemiology, it may be used to analyze a closely grouped series of events or cases of disease or other health-related phenomenon with well-defined distribution patterns in relation to time or place or both. Clustering,Analyses, Cluster,Analysis, Cluster,Cluster Analyses,Clusterings
D049750 Genome, Insect The genetic complement of an insect (INSECTS) as represented in its DNA. Insect Genome,Genomes, Insect,Insect Genomes
D058977 Molecular Sequence Annotation The addition of descriptive information about the function or structure of a molecular sequence to its MOLECULAR SEQUENCE DATA record. Gene Annotation,Protein Annotation,Annotation, Gene,Annotation, Molecular Sequence,Annotation, Protein,Annotations, Gene,Annotations, Molecular Sequence,Annotations, Protein,Gene Annotations,Molecular Sequence Annotations,Protein Annotations,Sequence Annotation, Molecular,Sequence Annotations, Molecular

Related Publications

Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
April 2014, Science (New York, N.Y.),
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
January 1991, Comparative biochemistry and physiology. B, Comparative biochemistry,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
April 1975, The Journal of the Acoustical Society of America,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
June 1975, Experientia,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
January 1973, Chromosoma,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
January 1978, Comparative biochemistry and physiology. B, Comparative biochemistry,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
March 1975, Nature,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
January 1985, Comparative biochemistry and physiology. C, Comparative pharmacology and toxicology,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
July 2018, Parasites & vectors,
Sarah Mwangi, and Geoffrey Attardo, and Yutaka Suzuki, and Serap Aksoy, and Alan Christoffels
October 2005, Insect molecular biology,
Copied contents to your clipboard!