• DocumentCode
    2286893
  • Title

    An optimized algorithm of high spatial-temporal efficiency for Megablast

  • Author

    Tan, Guangming ; Xu, Lin ; Jiao, Yishan ; Feng, Shengzhong ; Bu, Dongbo ; Ninghui Sun

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
  • Volume
    2
  • fYear
    2005
  • fDate
    20-22 July 2005
  • Firstpage
    703
  • Abstract
    BLAST (basic local alignment search tool), as a heuristic algorithm, is one of the most widely used sequence similarity search tools. MegaBlast, as an improved version of BLAST, speeds up the searches and improves the total throughput owing to greedy algorithm and batch processing. However, MegaBlast consumes a great deal of memory, which is proportional to the product of the size of the query file and database file. This paper proposes an optimized MegaBlast algorithm based on MegaBlast. The new algorithm exchanges the query and subject sequences, and builds a hash table based on new subject sequences. The optimized algorithm overlaps I/O with computation, further decreases the overall time and the cost of memory, which is only proportional to the size of the database file. The optimized algorithm is suitable to be parallelized on cluster systems. As our experiments shown, the parallel program, which is implemented with MPI, achieves high speedup.
  • Keywords
    application program interfaces; file organisation; greedy algorithms; message passing; parallel databases; parallel programming; query processing; search problems; spatiotemporal phenomena; MegaBlast algorithm; basic local alignment search tool; file organisation; hash table; heuristic algorithm; message passing interface; parallel databases; parallel program; query processing; Clustering algorithms; Computers; Cost function; Databases; Dynamic programming; Greedy algorithms; Heuristic algorithms; Parallel algorithms; Sun; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
  • ISSN
    1521-9097
  • Print_ISBN
    0-7695-2281-5
  • Type

    conf

  • DOI
    10.1109/ICPADS.2005.92
  • Filename
    1524405