Title :
PREMIER Turbo: Probabilistic error-correction using Markov inference in errored reads using the turbo principle
Author :
Xin Yin ; Zhao Song ; Karin Dorman ; Aditya Ramamoorthy
Author_Institution :
Dept. of Stat., Iowa State Univ., Ames, IA, USA
Abstract :
We present a probabilistic algorithm for error correction of high-throughput DNA sequencing data. Our approach builds on our prior algorithm, PREMIER, in which sequencer outputs are modeled as independent realizations of a Hidden Markov Model (HMM) and error correction is posed as maximum likelihood sequence detection over this HMM. In this work we propose an algorithm called PREMIER Turbo, which can be viewed as an iterative application of the PREMIER approach. Specifically, we apply error correction in both the forward and the backward directions of a given read. We also present a heuristic, inspired by turbo-equalization, that incorporates the prior belief on a nucleotide position returned by the Baum-Welch algorithm into the error correction steps. Our approach significantly improves the correction of nucleotides at the beginning of the read. Test results on real C. elegans and E. coli datasets show that PREMIER Turbo achieves significantly better error correction performance than competing methods.
Keywords :
DNA; bioinformatics; error correction; hidden Markov models; inference mechanisms; maximum likelihood detection; probability; Baum-Welch algorithm; E. coli datasets; HMM; Markov inference; PREMIER Turbo; PREMIER algorithm; Turbo principle; backward direction error correction; C. elegans datasets; errored reads; forward direction error correction; hidden Markov model; high throughput DNA sequencing data; maximum likelihood sequence detection; nucleotide position; probabilistic error-correction algorithm; turbo-equalization; Benchmark testing; Bioinformatics; Computational modeling; Error correction; Genomics; Hidden Markov models; Viterbi algorithm; DNA sequencing
Conference_Title :
Global Conference on Signal and Information Processing (GlobalSIP), 2013 IEEE
Conference_Location :
Austin, TX
DOI :
10.1109/GlobalSIP.2013.6736816