DocumentCode :
640164
Title :
PREMIER — PRobabilistic error-correction using Markov inference in errored reads
Author :
Xin Yin ; Zhao Song ; Dorman, Karin ; Ramamoorthy, Aditya
Author_Institution :
Dept. of Stat., Iowa State Univ., Ames, IA, USA
fYear :
2013
fDate :
7-12 July 2013
Firstpage :
1626
Lastpage :
1630
Abstract :
In this work we present a flexible, probabilistic and reference-free method of error correction for high throughput DNA sequencing data. The key is to exploit the high coverage of sequencing data and model short sequence outputs as independent realizations of a Hidden Markov Model (HMM). We pose the problem of error correction of reads as one of maximum likelihood sequence detection over this HMM. While time and memory considerations rule out an implementation of the optimal Baum-Welch algorithm (for parameter estimation) and the optimal Viterbi algorithm (for error correction), we propose low-complexity approximate versions of both. Specifically, we propose an approximate Viterbi and a sequential decoding based algorithm for the error correction. Our results show that when compared with Reptile, a state-of-the-art error correction method, our methods consistently achieve superior performances on both simulated and real data sets.
Keywords :
DNA; Viterbi decoding; error correction codes; hidden Markov models; maximum likelihood detection; probability; sequential decoding; HMM; Markov inference; PREMIER; approximate Viterbi decoding based algorithm; errored reads; flexible probabilistic method; hidden Markov model; high throughput DNA sequencing data; maximum likelihood sequence detection; optimal Baum-Welch algorithm; probabilistic error-correction codes; reference-free method; sequential decoding based algorithm; DNA; Error correction; Genomics; Hidden Markov models; Maximum likelihood decoding; Sequential analysis; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory Proceedings (ISIT), 2013 IEEE International Symposium on
Conference_Location :
Istanbul
ISSN :
2157-8095
Type :
conf
DOI :
10.1109/ISIT.2013.6620502
Filename :
6620502
Link To Document :
بازگشت