DocumentCode :
2286893
Title :
An optimized algorithm of high spatial-temporal efficiency for Megablast
Author :
Tan, Guangming ; Xu, Lin ; Jiao, Yishan ; Feng, Shengzhong ; Bu, Dongbo ; Ninghui Sun
Author_Institution :
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
Volume :
2
fYear :
2005
fDate :
20-22 July 2005
Firstpage :
703
Abstract :
BLAST (basic local alignment search tool), as a heuristic algorithm, is one of the most widely used sequence similarity search tools. MegaBlast, as an improved version of BLAST, speeds up the searches and improves the total throughput owing to greedy algorithm and batch processing. However, MegaBlast consumes a great deal of memory, which is proportional to the product of the size of the query file and database file. This paper proposes an optimized MegaBlast algorithm based on MegaBlast. The new algorithm exchanges the query and subject sequences, and builds a hash table based on new subject sequences. The optimized algorithm overlaps I/O with computation, further decreases the overall time and the cost of memory, which is only proportional to the size of the database file. The optimized algorithm is suitable to be parallelized on cluster systems. As our experiments shown, the parallel program, which is implemented with MPI, achieves high speedup.
Keywords :
application program interfaces; file organisation; greedy algorithms; message passing; parallel databases; parallel programming; query processing; search problems; spatiotemporal phenomena; MegaBlast algorithm; basic local alignment search tool; file organisation; hash table; heuristic algorithm; message passing interface; parallel databases; parallel program; query processing; Clustering algorithms; Computers; Cost function; Databases; Dynamic programming; Greedy algorithms; Heuristic algorithms; Parallel algorithms; Sun; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
ISSN :
1521-9097
Print_ISBN :
0-7695-2281-5
Type :
conf
DOI :
10.1109/ICPADS.2005.92
Filename :
1524405
Link To Document :
بازگشت