Arioc: High-concurrency short-read alignment on multiple GPUs. 2020

Richard Wilton, and Alexander S Szalay
Department of Physics and Astronomy, Johns Hopkins University, Baltimore, Maryland, United States of America.

In large DNA sequence repositories, archival data storage is often coupled with computers that provide 40 or more CPU threads and multiple GPU (general-purpose graphics processing unit) devices. This presents an opportunity for DNA sequence alignment software to exploit high-concurrency hardware to generate short-read alignments at high speed. Arioc, a GPU-accelerated short-read aligner, can compute WGS (whole-genome sequencing) alignments ten times faster than comparable CPU-only alignment software. When two or more GPUs are available, Arioc's speed increases proportionately because the software executes concurrently on each available GPU device. We have adapted Arioc to recent multi-GPU hardware architectures that support high-bandwidth peer-to-peer memory accesses among multiple GPUs. By modifying Arioc's implementation to exploit this GPU memory architecture we obtained a further 1.8x-2.9x increase in overall alignment speeds. With this additional acceleration, Arioc computes two million short-read alignments per second in a four-GPU system; it can align the reads from a human WGS sequencer run-over 500 million 150nt paired-end reads-in less than 15 minutes. As WGS data accumulates exponentially and high-concurrency computational resources become widespread, Arioc addresses a growing need for timely computation in the short-read data analysis toolchain.

UI MeSH Term Description Entries
D003196 Computer Graphics The process of pictorial communication, between human and computers, in which the computer input and output have the form of charts, drawings, or other appropriate pictorial representation. Computer Graphic,Graphic, Computer,Graphics, Computer
D003201 Computers Programmable electronic devices designed to accept data, perform prescribed mathematical and logical operations at high speed, and display the results of these operations. Calculators, Programmable,Computer Hardware,Computers, Digital,Hardware, Computer,Calculator, Programmable,Computer,Computer, Digital,Digital Computer,Digital Computers,Programmable Calculator,Programmable Calculators
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000073336 Whole Genome Sequencing Techniques to determine the entire sequence of the GENOME of an organism or individual. Complete Genome Sequencing,Genome Sequencing, Complete,Genome Sequencing, Whole,Sequencing, Complete Genome,Sequencing, Whole Genome
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D001483 Base Sequence The sequence of PURINES and PYRIMIDINES in nucleic acids and polynucleotides. It is also called nucleotide sequence. DNA Sequence,Nucleotide Sequence,RNA Sequence,DNA Sequences,Base Sequences,Nucleotide Sequences,RNA Sequences,Sequence, Base,Sequence, DNA,Sequence, Nucleotide,Sequence, RNA,Sequences, Base,Sequences, DNA,Sequences, Nucleotide,Sequences, RNA
D012984 Software Sequential operating programs and data which instruct the functioning of a digital computer. Computer Programs,Computer Software,Open Source Software,Software Engineering,Software Tools,Computer Applications Software,Computer Programs and Programming,Computer Software Applications,Application, Computer Software,Applications Software, Computer,Applications Softwares, Computer,Applications, Computer Software,Computer Applications Softwares,Computer Program,Computer Software Application,Engineering, Software,Open Source Softwares,Program, Computer,Programs, Computer,Software Application, Computer,Software Applications, Computer,Software Tool,Software, Computer,Software, Computer Applications,Software, Open Source,Softwares, Computer Applications,Softwares, Open Source,Source Software, Open,Source Softwares, Open,Tool, Software,Tools, Software
D016247 Information Storage and Retrieval Organized activities related to the storage, location, search, and retrieval of information. Information Retrieval,Data Files,Data Linkage,Data Retrieval,Data Storage,Data Storage and Retrieval,Information Extraction,Information Storage,Machine-Readable Data Files,Data File,Data File, Machine-Readable,Data Files, Machine-Readable,Extraction, Information,Files, Machine-Readable Data,Information Extractions,Machine Readable Data Files,Machine-Readable Data File,Retrieval, Data,Storage, Data
D016415 Sequence Alignment The arrangement of two or more amino acid or base sequences from an organism or organisms in such a way as to align areas of the sequences sharing common properties. The degree of relatedness or homology between the sequences is predicted computationally or statistically based on weights assigned to the elements aligned between the sequences. This in turn can serve as a potential indicator of the genetic relatedness between the organisms. Sequence Homology Determination,Determination, Sequence Homology,Alignment, Sequence,Alignments, Sequence,Determinations, Sequence Homology,Sequence Alignments,Sequence Homology Determinations
D017422 Sequence Analysis, DNA A multistage process that includes cloning, physical mapping, subcloning, determination of the DNA SEQUENCE, and information analysis. DNA Sequence Analysis,Sequence Determination, DNA,Analysis, DNA Sequence,DNA Sequence Determination,DNA Sequence Determinations,DNA Sequencing,Determination, DNA Sequence,Determinations, DNA Sequence,Sequence Determinations, DNA,Analyses, DNA Sequence,DNA Sequence Analyses,Sequence Analyses, DNA,Sequencing, DNA

Related Publications

Richard Wilton, and Alexander S Szalay
January 2023, Journal of biotechnology and biomedicine,
Richard Wilton, and Alexander S Szalay
December 2023, Bioinformatics (Oxford, England),
Richard Wilton, and Alexander S Szalay
January 2012, IEEE/ACM transactions on computational biology and bioinformatics,
Richard Wilton, and Alexander S Szalay
January 2016, Methods in molecular biology (Clifton, N.J.),
Richard Wilton, and Alexander S Szalay
May 2011, BMC bioinformatics,
Richard Wilton, and Alexander S Szalay
January 2013, PloS one,
Richard Wilton, and Alexander S Szalay
June 2023, ICS ... : proceedings of the ... ACM International Conference on Supercomputing. International Conference on Supercomputing,
Richard Wilton, and Alexander S Szalay
January 2012, PloS one,
Richard Wilton, and Alexander S Szalay
January 2020, Frontiers in molecular biosciences,
Richard Wilton, and Alexander S Szalay
January 2017, BMC bioinformatics,
Copied contents to your clipboard!