Title :
Accurate estimation of the signal baseline in DNA chromatograms
Author :
Andrade, Lucio ; Manolakos, Elias S.
Author_Institution :
Center for Res. & Graduate Studies, Northeastern Univ., Boston, MA, USA
Abstract :
Estimating accurately the varying baseline level in different parts of a DNA chromatogram is a challenging and important problem for accurate base-calling. We are formulating the problem in a statistical learning framework and propose an Expectation-Maximization algorithm for its solution. In addition we also present a faster, iterative histogram based method for estimating the background of the signal in small size windows. The two methods can be combined with regression techniques to correct the baseline in all regions of the chromatogram and are shown to work well even in areas of low SNR. By improving the separation of clusters, baseline correction actions reduce the classification errors when using the BEM base-caller developed in our group.
Keywords :
DNA; biological techniques; chromatography; optimisation; statistical analysis; DNA chromatogram; DNA fragments; accurate base-calling; accurate signal baseline estimation; capillary electrophoresis; classification errors reduction; clusters separation improvement; expectation-maximization algorithm; faster iterative histogram; gel electrophoresis; statistical learning framework; Bayesian methods; DNA computing; Data mining; Digital signal processing; Expectation-maximization algorithms; Histograms; Iterative methods; Signal resolution; Signal to noise ratio; Statistical learning;
Conference_Titel :
Neural Networks for Signal Processing, 2002. Proceedings of the 2002 12th IEEE Workshop on
Print_ISBN :
0-7803-7616-1
DOI :
10.1109/NNSP.2002.1030015