DocumentCode :
2736778
Title :
An improved maximum likelihood formulation for accurate genome assembly
Author :
Varma, Aditya ; Ranade, Abhiram ; Aluru, Srinivas
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Bombay, Mumbai, India
fYear :
2011
fDate :
3-5 Feb. 2011
Firstpage :
165
Lastpage :
170
Abstract :
We present improvements to the recently proposed maximum likelihood method for genome assembly. We formulate the problem as one of direct convex optimization, and achieve the following improvements: Our method does not require identical read lengths and can deal with reads of varying lengths. We eliminate the requirement to a priori know a stringent estimate of the length of the genome or the need to use further expectation minimization to predict the most likely length. Instead, we explicitly incorporate the uncertainty in the length estimate by a range parameter without affecting the convexity required for the optimization. Results indicate that our method can generate accurate estimates of repeat counts and produces fewer and much longer contigs. These results mark a further advancement of maximum likelihood genome assembly and the potential of this approach in building future genome assemblers.
Keywords :
biology computing; genomics; molecular biophysics; optimisation; direct convex optimization; genome assembly; Assembly; Bioinformatics; DNA; Equations; Genomics; Mathematical model; Maximum likelihood detection; genome assembly; maximum likelihood; next-gen sequencing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-851-8
Type :
conf
DOI :
10.1109/ICCABS.2011.5729873
Filename :
5729873
Link To Document :
بازگشت