skip to main content
10.1109/ISCA.2018.00017acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
research-article

Genax: a genome sequencing accelerator

Published: 02 June 2018 Publication History

Abstract

Genomics can transform health-care through precision medicine. Plummeting sequencing costs would soon make genome testing affordable to the masses. Compute efficiency, however, has to improve by orders of magnitude to sequence and analyze the raw genome data. Sequencing software used today can take several hundreds to thousands of CPU hours to align reads to a reference sequence.
This paper presents GenAx, an accelerator for read alignment, a time-consuming step in genome sequencing. It consists of a seeding and seed-extension accelerator. The latter is based on an innovative automata design that was designed from the ground-up to enable hardware acceleration. Unlike conventional Levenshtein automata, it is string independent and scales quadratically with edit distance, instead of string length. It supports critical features commonly used in sequencing such as affine gap scoring and traceback.
GenAx provides a throughput of 4,058K reads/s for Illumina 101 bp reads. GenAx achieves 31.7x speedup over the standard BWA-MEM sequence aligner running on a 56--thread dualsocket 14-core Xeon E5 server processor, while reducing power consumption by 12 x and area by 5.6 x.

References

[1]
E. S. Lander, L. M. Linton, B. Birren, C. Nusbaum, M. C. Zody, J. Baldwin, K. Devon, K. Dewar, M. Doyle, W. FitzHugh et al., "Initial sequencing and analysis of the human genome," Nature, vol. 409, no. 6822, pp. 860--921, 2001.
[2]
"Illumina," https://www.forbes.com/sites/matthewherper/2017/01/09/illumina-promises-to-sequence- human-genome-for-100-but-not-quite-yet.
[3]
M. A. Hamburg and F. S. Collins, "The path to personalized medicine," N Engl J Med, vol. 2010, no. 363, pp. 301--304, 2010.
[4]
E. D. Pleasance, R. K. Cheetham, P. J. Stephens, D. J. McBride, S. J. Humphray, C. D. Greenman, I. Varela, M.-L. Lin, G. R. Ordóñez, G. R. Bignell et al., "A comprehensive catalogue of somatic mutations from a human cancer genome," Nature, vol. 463, no. 7278, pp. 191--196, 2010.
[5]
A. Lacour, A. Espinosa, E. Louwersheimer, S. Heilmann, I. Hernández, S. Wolfsgruber, V. Fernández, H. Wagner, M. Rosende-Roca, A. Mauleón et al., "Genome-wide significant risk factors for alzheimers disease: role in progression to dementia due to alzheimer's disease among subjects with mild cognitive impairment," Molecular psychiatry, vol. 22, no. 1, pp. 153--160, 2017.
[6]
Y. Cho, C.-H. Lee, E.-G. Jeong, M.-H. Kim, J. H. Hong, Y. Ko, B. Lee, G. Yun, B. J. Kim, J. Jung et al., "Prevalence of rare genetic variations and their implications in ngs-data interpretation," Scientific Reports, vol. 7, no. 1, p. 9810, 2017.
[7]
E. Hanna, C. Rémuzat, P. Auquier, and M. Toumi, "Gene therapies development: slow progress and promising prospect," Journal of Market Access & Health Policy, vol. 5, no. 1, p. 1265293, 2017.
[8]
Z. D. Stephens, S. Y. Lee, F. Faghri, R. H. Campbell, C. Zhai, M. J. Efron, R. Iyer, M. C. Schatz, S. Sinha, and G. E. Robinson, "Big data: astronomical or genomical?" PLoS biology, vol. 13, no. 7, p. e1002195, 2015.
[9]
"Facebook data," https://code.facebook.com/posts/229861827208629/scaling-the-facebook-data-warehouse-to-300-pb.
[10]
S. Kumar, K. K. Krishnani, B. Bhushan, and M. P. Brahmane, "Metagenomics: retrospect and prospects in high throughput age," Biotechnology research international, vol. 2015, 2015.
[11]
"Oxford nanopore minion." https://nanoporetech.com/products/minion.
[12]
H. Li, "Aligning sequence reads, clone sequences and assembly contigs with bwa-mem," arXiv preprint arXiv:1303.3997, 2013.
[13]
T. F. Smith and M. S. Waterman, "Identification of common molecular subsequences," Journal of molecular biology, vol. 147, no. 1, pp. 195--197, 1981.
[14]
M. Farrar, "Striped smith-waterman speeds database searches six times over other simd implementations," Bioinformatics, vol. 23, no. 2, pp. 156--161, 2006.
[15]
G. Myers, "A fast bit-vector algorithm for approximate string matching based on dynamic programming," Journal of the ACM (JACM), vol. 46, no. 3, pp. 395--415, 1999.
[16]
Y.-T. Chen, J. Cong, J. Lei, and P. Wei, "A novel high-throughput acceleration engine for read alignment," in Field-Programmable Custom Computing Machines (FCCM), 2015 IEEE 23rd Annual International Symposium on. IEEE, 2015, pp. 199--202.
[17]
R. McMillen and M. Ruehle, "Bioinformatics systems, apparatuses, and methods executed on an integrated circuit processing platform," https://www.google.com/patents/US9014989, Apr. 21 2015, uS Patent 9,014,989.
[18]
I. Roy and S. Aluru, "Discovering motifs in biological sequences using the micron automata processor," IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), vol. 13, no. 1, pp. 99--111, 2016.
[19]
I. Tommy Tracy, M. Stan, N. Brunelle, J. Wadden, K. Wang, K. Skadron, and G. Robins, "Nondeterministic finite automata in hardware-the case of the levenshtein automaton."
[20]
A. Subramaniyan, J. Wang, E. R. M. Balasubramanian, D. Blaauw, D. Sylvester, and R. Das, "Cache automaton," in Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, ser. MICRO-50 '17. New York, NY, USA: ACM, 2017, pp. 259--272.
[21]
O. Gotoh, "Optimal sequence alignment allowing for long gaps," Bulletin of mathematical biology, vol. 52, no. 3, pp. 359--373, 1990.
[22]
P. Chen, C. Wang, X. Li, and X. Zhou, "Accelerating the next generation long read mapping with the fpga-based system," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, no. 5, pp. 840--852, 2014.
[23]
J. J. Tithi, N. C. Crago, and J. S. Emer, "Exploiting spatial architectures for edit distance algorithms," in 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), March 2014, pp. 23--34.
[24]
M. Šošić and M. Šikić, "Edlib: a c/c++ library for fast, exact sequence alignment using edit distance," Bioinformatics, vol. 33, no. 9, pp. 1394--1395, 2017.
[25]
M. Zaharia, W. J. Bolosky, K. Curtis, A. Fox, D. Patterson, S. Shenker, I. Stoica, R. M. Karp, and T. Sittler, "Faster and more accurate sequence alignment with snap," arXiv preprint arXiv:1111.5572, 2011.
[26]
Z. Zhang, S. Schwartz, L. Wagner, and W. Miller, "A greedy algorithm for aligning dna sequences," Journal of Computational biology, vol. 7, no. 1--2, pp. 203--214, 2000.
[27]
B. Harris, A. C. Jacob, J. M. Lancaster, J. Buhler, and R. D. Chamberlain, "A banded smith-waterman fpga accelerator for mercury blastp," in 2007 International Conference on Field Programmable Logic and Applications, Aug 2007, pp. 765--769.
[28]
P. Dlugosch, D. Brown, P. Glendenning, M. Leventhal, and H. Noyes, "An efficient and scalable semiconductor architecture for parallel automata processing," IEEE Transactions on Parallel and Distributed Systems, vol. 25, no. 12, pp. 3088--3098, 2014.
[29]
V. Gogte, A. Kolli, M. J. Cafarella, L. D'Antoni, and T. F. Wenisch, "Hare: Hardware accelerator for regular expressions," in Microarchitecture (MICRO), 2016 49th Annual IEEE/ACM International Symposium on. IEEE, 2016, pp. 1--12.
[30]
Y. Fang, C. Zou, A. J. Elmore, and A. A. Chien, "Udp: A programmable accelerator for extract-transform-load workloads and more," in Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, ser. MICRO-50 '17. New York, NY, USA: ACM, 2017, pp. 55--68. {Online}. Available
[31]
P. Mitankin, "Universal levenshtein automata. building and properties," Sofia University St. Kliment Ohridski, 2005.
[32]
H. Li, "Exploring single-sample snp and indel calling with whole-genome de novo assembly," Bioinformatics, vol. 28, no. 14, pp. 1838--1844, 2012.
[33]
P. Ferragina and G. Manzini, "Opportunistic data structures with applications," in Foundations of Computer Science, 2000. Proceedings. 41st Annual Symposium on. IEEE, 2000, pp. 390--398.
[34]
R. S. Harris, Improved pairwise alignment of genomic DNA. The Pennsylvania State University, 2007.
[35]
"Uscs genome browser." https://genome.ucsc.edu/.
[36]
M. A. Eberle, E. Fritzilas, P. Krusche, M. Kallberg, B. L. Moore, M. A. Bekritsky, Z. Iqbal, H.-Y. Chuang, S. J. Humphray, A. L. Halpern et al., "A reference data set of 5.4 million phased human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree," Genome research, vol. 27, no. 1, pp. 157--164, 2017.
[37]
Y. Liu and B. Schmidt, "Cushaw2-gpu: empowering faster gapped short-read alignment using gpu computing," IEEE Design & Test, vol. 31, no. 1, pp. 31--39, 2014.
[38]
A. Döring, D. Weese, T. Rausch, and K. Reinert, "Seqan an efficient, generic c++ library for sequence analysis," BMC bioinformatics, vol. 9, no. 1, p. 11, 2008.
[39]
M. Korpar and M. Sikic, "Sw#-gpu-enabled exact alignments on genome scale," Bioinformatics, vol. 29, no. 19, pp. 2494--2495, 2013.
[40]
Y. Kim, W. Yang, and O. Mutlu, "Ramulator: A fast and extensible dram simulator," IEEE Computer Architecture Letters, vol. 15, no. 1, pp. 45--49, 2016.
[41]
D. S. Hirschberg, "A linear space algorithm for computing maximal common subsequences," Commun. ACM, vol. 18, no. 6, pp. 341--343, Jun. 1975.
[42]
E. Kopylova, L. Noé, and H. Touzet, "Sortmerna: fast and accurate filtering of ribosomal rnas in metatranscriptomic data," Bioinformatics, vol. 28, no. 24, pp. 3211--3217, 2012.
[43]
E. B. Fernandez, W. A. Najjar, S. Lonardi, and J. Villarreal, "Multithreaded fpga acceleration of dna sequence mapping," in 2012 IEEE Conference on High Performance Extreme Computing, Sept 2012, pp. 1--6.
[44]
H. M. Waidyasooriya, M. Hariyama, and M. Kameyama, "Implementation of a custom hardware-accelerator for short-read mapping using burrows-wheeler alignment," in 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), July 2013, pp. 651--654.
[45]
H. M. Waidyasooriya and M. Hariyama, "Hardware-acceleration of short-read alignment based on the burrows-wheeler transform," IEEE Transactions on Parallel and Distributed Systems, vol. 27, no. 5, pp. 1358--1372, May 2016.
[46]
R. Luethy and C. Hoover, "Hardware and software systems for accelerating common bioinformatics sequence analysis algorithms," Drug Discovery Today: BIOSILICO, vol. 2, no. 1, pp. 12--17, 2004.
[47]
S. M. Kiełbasa, R. Wan, K. Sato, P. Horton, and M. C. Frith, "Adaptive seeds tame genomic sequence comparison," Genome research, vol. 21, no. 3, pp. 487--493, 2011.
[48]
M.-C. F. Chang, Y.-T. Chen, J. Cong, P.-T. Huang, C.-L. Kuo, and C. H. Yu, "The smem seeding acceleration for dna sequence alignment," in Field-Programmable Custom Computing Machines (FCCM), 2016 IEEE 24th Annual International Symposium on. IEEE, 2016, pp. 32--39.
[49]
Y.-C. Wu, C.-H. Chang, J.-H. Hung, and C.-H. Yang, "A 135-mw fully integrated data processor for next-generation sequencing," IEEE Transactions on Biomedical Circuits and Systems, 2017.
[50]
F. Hach, F. Hormozdiari, C. Alkan, F. Hormozdiari, I. Birol, E. E. Eichler, and S. C. Sahinalp, "mrsfast: a cache-oblivious algorithm for short-read mapping," Nature methods, vol. 7, no. 8, p. 576, 2010.
[51]
J. M. Abuín, J. C. Pichel, T. F. Pena, and J. Amigo, "Sparkbwa: speeding up the alignment of high-throughput dna sequencing data," PloS one, vol. 11, no. 5, p. e0155461, 2016.
[52]
A. Roy, Y. Diao, U. Evani, A. Abhyankar, C. Howarth, R. Le Priol, and T. Bloom, "Massively parallel processing of whole genome sequence data: An in-depth performance study," in Proceedings of the 2017 ACM International Conference on Management of Data. ACM, 2017, pp. 187--202.
[53]
S. Byma, S. Whitlock, L. Flueratoru, E. Tseng, C. Kozyrakis, E. Bugnion, and J. Larus, "Persona: A high-performance bioinformatics framework," in 2017 USENIX Annual Technical Conference (USENIX ATC 17). Santa Clara, CA: USENIX Association, 2017, pp. 153--165. {Online}. Available: https://www.usenix.org/conference/atc17/technical-sessions/presentation/byma

Cited By

View all
  • (2024)Harp: Leveraging Quasi-Sequential Characteristics to Accelerate Sequence-to-Graph Mapping of Long ReadsProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3620666.3651331(512-527)Online publication date: 27-Apr-2024
  • (2024)GenArchBenchFuture Generation Computer Systems10.1016/j.future.2024.03.050157:C(313-329)Online publication date: 18-Jul-2024
  • (2023)Abakus: Accelerating k-mer Counting with Storage TechnologyACM Transactions on Architecture and Code Optimization10.1145/363295221:1(1-26)Online publication date: 21-Nov-2023
  • Show More Cited By

Index Terms

  1. Genax: a genome sequencing accelerator
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        ISCA '18: Proceedings of the 45th Annual International Symposium on Computer Architecture
        June 2018
        884 pages
        ISBN:9781538659847

        Publisher

        IEEE Press

        Publication History

        Published: 02 June 2018

        Check for updates

        Author Tags

        1. accelerator
        2. automaton
        3. sequence alignment

        Qualifiers

        • Research-article

        Conference

        ISCA '18

        Acceptance Rates

        Overall Acceptance Rate 543 of 3,203 submissions, 17%

        Upcoming Conference

        ISCA '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)14
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 07 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Harp: Leveraging Quasi-Sequential Characteristics to Accelerate Sequence-to-Graph Mapping of Long ReadsProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3620666.3651331(512-527)Online publication date: 27-Apr-2024
        • (2024)GenArchBenchFuture Generation Computer Systems10.1016/j.future.2024.03.050157:C(313-329)Online publication date: 18-Jul-2024
        • (2023)Abakus: Accelerating k-mer Counting with Storage TechnologyACM Transactions on Architecture and Code Optimization10.1145/363295221:1(1-26)Online publication date: 21-Nov-2023
        • (2023)CASA: An Energy-Efficient and High-Speed CAM-based SMEM Seeding Accelerator for Genome AlignmentProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3614313(1423-1436)Online publication date: 28-Oct-2023
        • (2023)GMX: Instruction Set Extensions for Fast, Scalable, and Efficient Genome Sequence AlignmentProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3614306(1466-1480)Online publication date: 28-Oct-2023
        • (2023)WFAsic: A High-Performance ASIC Accelerator for DNA Sequence Alignment on a RISC-V SoCProceedings of the 52nd International Conference on Parallel Processing10.1145/3605573.3605651(392-401)Online publication date: 7-Aug-2023
        • (2023)An FPGA Accelerator for Genome Variant CallingACM Transactions on Reconfigurable Technology and Systems10.1145/359529716:4(1-29)Online publication date: 1-Sep-2023
        • (2023)GenDP: A Framework of Dynamic Programming Acceleration for Genome Sequencing AnalysisProceedings of the 50th Annual International Symposium on Computer Architecture10.1145/3579371.3589060(1-15)Online publication date: 17-Jun-2023
        • (2023)hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGAProceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays10.1145/3543622.3573190(185-196)Online publication date: 12-Feb-2023
        • (2022)BWA-MEM-SCALE: Accelerating Genome Sequence Mapping on Commodity ServersProceedings of the 51st International Conference on Parallel Processing10.1145/3545008.3545033(1-12)Online publication date: 29-Aug-2022
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media