• DocumentCode
    2719467
  • Title

    Parallel pattern identification in biological sequences on clusters

  • Author

    Huang, Chun-Hsi ; Biswas, Ratnabali

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Connecticut Univ., Storrs, CT, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    127
  • Lastpage
    134
  • Abstract
    This paper presents a low communication overhead parallel algorithm for pattern matching in biological sequences. Given such a sequence of length n and a pattern of length m, we conclude an algorithm with five computation/communication phases, each requiring O(n) computation time and only O(p) message units. The low communication overhead of the algorithm is essential to achieving reasonable speedups on clusters, where the interprocessor communication latency is usually higher Previous parallel implementations use straightforward domain decomposition based on existing sequential algorithms and rely on parallel machines with low-latency interconnection network and fast hardware support for processor synchronization.
  • Keywords
    biology computing; communication complexity; parallel algorithms; sequences; string matching; workstation clusters; biological sequences; clusters; computation time; computation/communication phases; interprocessor communication latency; low communication overhead parallel algorithm; message units; parallel pattern identification; pattern matching; speedups; Biology computing; Clustering algorithms; DNA; Delay; Hardware; Multiprocessor interconnection networks; Parallel algorithms; Pattern matching; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-2066-9
  • Type

    conf

  • DOI
    10.1109/CLUSTR.2002.1137737
  • Filename
    1137737