DocumentCode
1241423
Title
A Decentralized Method for Scaling Up Genome Similarity Search Services
Author
Zhou, Bing Bing ; Wang, Chen ; Zomaya, Albert Y.
Author_Institution
CSIRO ICT Center, Epping, NSW
Volume
20
Issue
3
fYear
2009
fDate
3/1/2009 12:00:00 AM
Firstpage
303
Lastpage
315
Abstract
As genome sequence databases grow in size, the accuracy and speed of sequence similarity detection become more important. An increasing number of methods are being used to detect sequence similarity, and the demand for genome sequence search and alignment services is also increasing. It is a challenge to scale up the computer systems that host these methods and serve requests to them in a timely manner. Traditional clusters, which are used in most scientific centers, cannot cope with this challenge. This paper tackles the problem in a novel way: it treats sequence search requests as content requests to both genome databases and similarity detection methods, so scaling up the computer systems that serve this content becomes a process of constructing a content distribution network. The paper gives a decentralized method to dynamically construct content distribution networks for a variety of genome sequence similarity detection services. It also provides a scheduling algorithm for using content nodes efficiently. Our simulation study shows that scalability and high content node utilization can be achieved in such a system while the cost of achieving them remains reasonable.
Keywords
bioinformatics; genetics; randomised algorithms; scheduling; scientific information systems; search problems; content distribution network; decentralized method; genome sequence database; genome similarity search service; scheduling algorithm; Data models; Distributed applications; Distributed architectures; Distributed networks; Hash-table representations; Optimization; Performance Analysis and Design Aids; Simulation;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
IEEE
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2008.95
Filename
4538213