• DocumentCode
    2481315
  • Title

    Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA

  • Author

    Shi, Haixiang ; Schmidt, Bertil ; Liu, Weiguo ; Müller-Wittig, Wolfgang

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, produced reads are significantly shorter and more error-prone compared to the traditional Sanger shotgun sequencing method. This poses challenges for de-novo DNA fragment assembly algorithms in terms of both accuracy (to deal with short, error-prone reads) and scalability (to deal with very large input data sets). In this paper we present a scalable parallel algorithm for correcting sequencing errors in high-throughput short-read data. It is based on spectral alignment and uses the CUDA programming model. Our computational experiments on a GTX 280 GPU show runtime savings between 10 and 19 times (for different error-rates using simulated datasets as well as real Solexa/Illumina datasets).
  • Keywords
    DNA; biology computing; genomics; molecular biophysics; parallel algorithms; CUDA programming model; GTX 280 GPU; error correction; genome sequencing; high-throughput short-read DNA sequencing data; scalable parallel algorithm; Acceleration; Assembly; Bioinformatics; DNA; Error correction; Genomics; Parallel algorithms; Runtime; Scalability; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5160924
  • Filename
    5160924