• DocumentCode
    2198870
  • Title

    A probabilistic approach for long read-length DNA sequence analysis

  • Author

    Molina, Christophe G. ; Mullikin, Jim

  • Author_Institution
    Sanger Centre, Wellcome Trust Genome Campus, Cambridge, UK
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    45
  • Lastpage
    56
  • Abstract
    This paper introduces a new algorithm for DNA sequence analysis, based on the use of a reference DNA sequence for the estimation of base positions, and a probabilistic modelling of trace peaks. The new algorithm has been applied to long read-length DNA sequences and its performance has been compared to the base-calling program Phred. The results reported in this paper, after cross-matching with a finished consensus, show a significant improvement by the new algorithm in the final sequence read-length and in the number of correct bases extracted from DNA traces.
  • Keywords
    DNA; molecular biophysics; probability; DNA traces; algorithm; base-calling program Phred; correct bases extracted number; cross-matching; final sequence read-length; finished consensus; long read-length DNA sequence analysis; probabilistic approach; trace peaks; Algorithm design and analysis; Bioinformatics; DNA; Genomics; Humans; Image sequence analysis; Libraries; Phase estimation; Signal analysis; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks for Signal Processing, 2002. Proceedings of the 2002 12th IEEE Workshop on
  • Print_ISBN
    0-7803-7616-1
  • Type

    conf

  • DOI
    10.1109/NNSP.2002.1030016
  • Filename
    1030016