Transposome: a toolkit for annotation of transposable element families from unassembled sequence reads. 2015

S Evan Staton, and John M Burke
Department of Genetics and Department of Plant Biology, University of Georgia, Athens, GA 30602, USA.

BACKGROUND Transposable elements (TEs) can be found in virtually all eukaryotic genomes and have the potential to produce evolutionary novelty. Despite the broad taxonomic distribution of TEs, the evolutionary history of these sequences is largely unknown for many taxa due to a lack of genomic resources and identification methods. Given that most TE annotation methods are designed to work on genome assemblies, we sought to develop a method to provide a fine-grained classification of TEs from DNA sequence reads. Here, we present a toolkit for the efficient annotation of TE families from low-coverage whole-genome shotgun (WGS) data, enabling the rapid identification of TEs in a large number of taxa. We compared our software, Transposome, with other approaches for annotating repeats from WGS data, and we show that it offers significant improvements in run time and produces more precise estimates of genomic repeat abundance. Transposome may also be used as a general toolkit for working with Next Generation Sequencing (NGS) data, and for constructing custom genome analysis pipelines. METHODS The source code for Transposome is freely available (http://sestaton.github.io/Transposome), implemented in Perl and is supported on Linux.

UI MeSH Term Description Entries
D003313 Zea mays A plant species of the family POACEAE. It is a tall grass grown for its EDIBLE GRAIN, corn, used as food and animal FODDER. Corn,Indian Corn,Maize,Teosinte,Zea,Corn, Indian
D004251 DNA Transposable Elements Discrete segments of DNA which can excise and reintegrate to another site in the genome. Most are inactive, i.e., have not been found to exist outside the integrated state. DNA transposable elements include bacterial IS (insertion sequence) elements, Tn elements, the maize controlling elements Ac and Ds, Drosophila P, gypsy, and pogo elements, the human Tigger elements and the Tc and mariner elements which are found throughout the animal kingdom. DNA Insertion Elements,DNA Transposons,IS Elements,Insertion Sequence Elements,Tn Elements,Transposable Elements,Elements, Insertion Sequence,Sequence Elements, Insertion,DNA Insertion Element,DNA Transposable Element,DNA Transposon,Element, DNA Insertion,Element, DNA Transposable,Element, IS,Element, Insertion Sequence,Element, Tn,Element, Transposable,Elements, DNA Insertion,Elements, DNA Transposable,Elements, IS,Elements, Tn,Elements, Transposable,IS Element,Insertion Element, DNA,Insertion Elements, DNA,Insertion Sequence Element,Sequence Element, Insertion,Tn Element,Transposable Element,Transposable Element, DNA,Transposable Elements, DNA,Transposon, DNA,Transposons, DNA
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D017422 Sequence Analysis, DNA A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis. DNA Sequence Analysis,Sequence Determination, DNA,Analysis, DNA Sequence,DNA Sequence Determination,DNA Sequence Determinations,DNA Sequencing,Determination, DNA Sequence,Determinations, DNA Sequence,Sequence Determinations, DNA,Analyses, DNA Sequence,DNA Sequence Analyses,Sequence Analyses, DNA,Sequencing, DNA
D058977 Molecular Sequence Annotation The addition of descriptive information about the function or structure of a molecular sequence to its MOLECULAR SEQUENCE DATA record. Gene Annotation,Protein Annotation,Annotation, Gene,Annotation, Molecular Sequence,Annotation, Protein,Annotations, Gene,Annotations, Molecular Sequence,Annotations, Protein,Gene Annotations,Molecular Sequence Annotations,Protein Annotations,Sequence Annotation, Molecular,Sequence Annotations, Molecular
D059014 High-Throughput Nucleotide Sequencing Techniques of nucleotide sequence analysis that increase the range, complexity, sensitivity, and accuracy of results by greatly increasing the scale of operations and thus the number of nucleotides, and the number of copies of each nucleotide sequenced. The sequencing may be done by analysis of the synthesis or ligation products, hybridization to preexisting sequences, etc. High-Throughput Sequencing,Illumina Sequencing,Ion Proton Sequencing,Ion Torrent Sequencing,Next-Generation Sequencing,Deep Sequencing,High-Throughput DNA Sequencing,High-Throughput RNA Sequencing,Massively-Parallel Sequencing,Pyrosequencing,DNA Sequencing, High-Throughput,High Throughput DNA Sequencing,High Throughput Nucleotide Sequencing,High Throughput RNA Sequencing,High Throughput Sequencing,Massively Parallel Sequencing,Next Generation Sequencing,Nucleotide Sequencing, High-Throughput,RNA Sequencing, High-Throughput,Sequencing, Deep,Sequencing, High-Throughput,Sequencing, High-Throughput DNA,Sequencing, High-Throughput Nucleotide,Sequencing, High-Throughput RNA,Sequencing, Illumina,Sequencing, Ion Proton,Sequencing, Ion Torrent,Sequencing, Massively-Parallel,Sequencing, Next-Generation
D018745 Genome, Plant The genetic complement of a plant (PLANTS) as represented in its DNA. Plant Genome,Genomes, Plant,Plant Genomes
D023281 Genomics The systematic study of the complete DNA sequences (GENOME) of organisms. Included is construction of complete genetic, physical, and transcript maps, and the analysis of this structural genomic information on a global scale such as in GENOME WIDE ASSOCIATION STUDIES. Functional Genomics,Structural Genomics,Comparative Genomics,Genomics, Comparative,Genomics, Functional,Genomics, Structural

Related Publications

S Evan Staton, and John M Burke
February 1986, Genetics,
S Evan Staton, and John M Burke
November 2020, Nature protocols,
S Evan Staton, and John M Burke
January 2015, Mobile DNA,
S Evan Staton, and John M Burke
January 2004, Bioinformatics (Oxford, England),
S Evan Staton, and John M Burke
January 2021, Mobile DNA,
S Evan Staton, and John M Burke
July 2013, Trends in plant science,
Copied contents to your clipboard!