DocumentCode :
2737801
Title :
Workshop: Efficient sequential and parallel algorithms for sequence assembly
Author :
Rajasekaran, Sanguthevar ; Dinh, Hieu ; Kundeti, Vamsi
Author_Institution :
Comput. Sci. & Eng. Dept., Univ. of Connecticut, Storrs, CT, USA
fYear :
2011
fDate :
3-5 Feb. 2011
Firstpage :
274
Lastpage :
274
Abstract :
Next Generation Sequence (NGS) assemblers are challenged with the problem of handling massive number of reads. Bi-directed de Bruijn graph is the most fundamental data structure on which numerous NGS assemblers have been built (e.g. Velvet, ABySS). Most of these assemblers only differ in the heuristics which they employ to operate on this de Bruijn graph. These heuristics are composed of several fundamental operations such as construction, compaction and pruning of the underlying bi-directed de Bruijn graph. Unfortunately the current algorithms to accomplish these fundamental operations on the de Bruijn graph are computationally inefficient and have become a bottleneck to scale the NGS assemblers. In this talk, some of the recent results which provide computationally efficient algorithms to these fundamental bi-directed de Bruijn graph operations are discussed. The algorithms are based on sorting and efficient in sequential, out of-core, and parallel settings.
Keywords :
biology computing; data structures; directed graphs; parallel algorithms; sorting; bidirected de Bruijn graph; data structure; next generation sequence assemblers; parallel algorithm; sequence assembly; sequential algorithm; sorting; Assembly; Bioinformatics; Communities; Compaction; Genomics; Parallel algorithms; Scalability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-851-8
Type :
conf
DOI :
10.1109/ICCABS.2011.5729928
Filename :
5729928
Link To Document :
بازگشت