DocumentCode
2198870
Title
A probabilistic approach for long read-length DNA sequence analysis
Author
Molina, Chrigtophe G. ; Mullikin, Jim
Author_Institution
Sanger Centre, Wellcome Trust Genome Campus, Cambridge, MA, USA
fYear
2002
fDate
2002
Firstpage
45
Lastpage
56
Abstract
This paper introduces a new algorithm for DNA sequence analysis, based on the use of a reference DNA sequence for the estimation of base positions, and a probabilistic modelling of trace peaks. The new algorithm has been applied to long read-length DNA sequences and its performance has been compared to the base-calling program Phred. The results reported in this paper, after cross-matching with a finished consensus, show a significant improvement by the new algorithm in the final sequence read-length and in the number of correct bases extracted from DNA traces.
Keywords
DNA; molecular biophysics; probability; DNA traces; algorithm; base-calling program Phred; correct bases extracted number; cross-matching; final sequence read-length; finished consensus; long read-length DNA sequence analysis; probabilistic approach; trace peaks; Algorithm design and analysis; Bioinformatics; DNA; Genomics; Humans; Image sequence analysis; Libraries; Phase estimation; Signal analysis; Signal processing algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks for Signal Processing, 2002. Proceedings of the 2002 12th IEEE Workshop on
Print_ISBN
0-7803-7616-1
Type
conf
DOI
10.1109/NNSP.2002.1030016
Filename
1030016
Link To Document