• Title of article

    Basecalling using hidden Markov models

  • Author/Authors

    Boufounos، نويسنده , , Petros and El-Difrawy، نويسنده , , Sameh and Ehrlich، نويسنده , , Dan، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2004
  • Pages
    14
  • From page
    23
  • To page
    36
  • Abstract
    In this paper we propose hidden Markov models to model electropherograms from DNA sequencing equipment and perform basecalling. We model the state emission densities using artificial neural networks, and modify the Baum–Welch reestimation procedure to perform training. Moreover, we develop a method that exploits consensus sequences to label training data, thus minimizing the need for hand labeling. We propose the same method for locating an electropherogram in a longer DNA sequence. We also perform a careful study of the basecalling errors and propose alternative HMM topologies that might further improve performance. Our results demonstrate the potential of these models. Based on these results, we conclude by suggesting further research directions.
  • Keywords
    Hidden Markov Models , Basecalling , DNA sequencing , PHRED
  • Journal title
    Journal of the Franklin Institute
  • Serial Year
    2004
  • Journal title
    Journal of the Franklin Institute
  • Record number

    1542770