• DocumentCode
    3723949
  • Title

    Efficient agglomerative hierarchical clustering for biological sequence analysis

  • Author

    Thuy-Diem Nguyen;Chee-Keong Kwoh

  • Author_Institution
    School of Computer Engineering, Nanyang Technological University, Singapore
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    3
  • Abstract
    Cluster analysis is an important data mining technique widely used for pattern recognition and information retrieval. In the literature, over a hundred clustering algorithms have been developed to target input datasets with different characteristics. Among these algorithms, the hierarchical clustering method is particularly useful for analyzing genetic datasets in evolutionary biology studies because of the inherent hierarchical relationships amongst the genetic sequences extracted from related organisms. However, this algorithm is computational expensive in terms of both execution time and particularly memory usage. This paper summarizes our experience in using parallel computing technologies with new algorithms to perform hierarchical sequence clustering in a more effective way without compromising the accuracy of the results.
  • Keywords
    Genetics
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2015 - 2015 IEEE Region 10 Conference
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-8639-2
  • Electronic_ISBN
    2159-3450
  • Type

    conf

  • DOI
    10.1109/TENCON.2015.7373194
  • Filename
    7373194