• DocumentCode
    1241423
  • Title
    A Decentralized Method for Scaling Up Genome Similarity Search Services
  • Author
    Zhou, Bing Bing ; Wang, Chen ; Zomaya, Albert Y.
  • Author_Institution
    CSIRO ICT Center, Epping, NSW
  • Volume
    20
  • Issue
    3
  • fYear
    2009
  • fDate
    3/1/2009 12:00:00 AM
  • Firstpage
    303
  • Lastpage
    315
  • Abstract
    As genome sequence databases grow in size, the accuracy and speed of sequence similarity detection become more important. An increasing number of methods are being used to detect sequence similarity, and the demand for genome sequence search and alignment services is also increasing. Scaling up the computer systems that host these methods and serve requests to them in a timely manner is a challenge. Traditional clusters, which are used in most scientific centers, cannot cope with this challenge. This paper tackles the problem in a novel way: it treats sequence search requests as content requests to both genome databases and similarity detection methods, so that scaling up the computer systems that serve this content becomes a process of constructing a content distribution network. The paper gives a decentralized method to dynamically construct content distribution networks for a variety of genome sequence similarity detection services. It also provides a scheduling algorithm for using content nodes efficiently. Our simulation study shows that scalability and high content node utilization can be achieved in such a system while the cost of achieving them remains reasonable.
  • Keywords
    bioinformatics; genetics; randomised algorithms; scheduling; scientific information systems; search problems; content distribution network; decentralized method; genome sequence database; genome similarity search service; scheduling algorithm; Data models; Distributed applications; Distributed architectures; Distributed networks; Hash-table representations; Optimization; Performance Analysis and Design Aids; Simulation;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1045-9219
  • Type
    jour
  • DOI
    10.1109/TPDS.2008.95
  • Filename
    4538213