Title :
An improved maximum likelihood formulation for accurate genome assembly
Author :
Varma, Aditya ; Ranade, Abhiram ; Aluru, Srinivas
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Bombay, Mumbai, India
Abstract :
We present improvements to the recently proposed maximum likelihood method for genome assembly. We formulate the problem as one of direct convex optimization and achieve the following improvements. First, our method does not require identical read lengths and can handle reads of varying lengths. Second, we eliminate the requirement to know a stringent estimate of the genome length a priori, as well as the need to use further expectation minimization to predict the most likely length. Instead, we explicitly incorporate the uncertainty in the length estimate through a range parameter, without affecting the convexity required for the optimization. Results indicate that our method generates accurate estimates of repeat counts and produces fewer and much longer contigs. These results mark a further advancement of maximum likelihood genome assembly and show the potential of this approach for building future genome assemblers.
Keywords :
biology computing; genomics; molecular biophysics; optimisation; direct convex optimization; genome assembly; Assembly; Bioinformatics; DNA; Equations; Mathematical model; Maximum likelihood detection; maximum likelihood; next-gen sequencing
Conference_Title :
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-851-8
DOI :
10.1109/ICCABS.2011.5729873