DocumentCode :
3503563
Title :
Grid Deployment of Legacy Bioinformatics Applications with Transparent Data Access
Author :
Blanchet, Christophe ; Mollon, Rémi ; Thain, Douglas ; Deléage, Gilbert
Author_Institution :
Inst. de Biol. et Chimie des Proteines, Univ. Lyon 1
fYear :
2006
fDate :
28-29 Sept. 2006
Firstpage :
120
Lastpage :
127
Abstract :
Although grid computing offers great potential for executing large-scale bioinformatics applications, practical deployment is constrained by legacy interfaces. Most widely deployed bioinformatics applications were designed long before grid computing arose, and thus were created, tested, and validated in the familiar environment of a workstation. Most perform simple local I/O and have no facility for interfacing with a distributed system. Because of these limitations, users of bioinformatics applications are generally constrained to creating large local clustered systems in order to perform data analysis. In order to deploy these applications in wide-area grid systems, users require a transparent mechanism for attaching legacy interfaces to grid I/O systems. We have explored this problem by deploying several bioinformatics databases and programs for protein sequence analysis on the European EGEE grid. Using tools for transparent adaptation, we have connected legacy applications to the logical namespace provided by a replica manager, and compared the performance of remote access versus file staging. For common bioinformatics applications, we find that remote access has performance equal to or better than simple file staging, with the added advantage that users are freed from stating the data needs of applications in advance.
Keywords :
biology computing; data analysis; file organisation; grid computing; proteins; European EGEE grid; bioinformatics databases; data access; data analysis; distributed system; file staging; grid I/O system; grid computing; grid deployment; legacy bioinformatics; legacy interface; protein sequence analysis; replica manager; wide-area grid system; Application software; Bioinformatics; Biological information theory; Biology computing; Computer science; Distributed databases; Genomics; Grid computing; Joining processes; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Grid Computing, 7th IEEE/ACM International Conference on
Conference_Location :
Barcelona
Print_ISBN :
1-4244-0343-X
Electronic_ISBN :
1-4244-0344-8
Type :
conf
DOI :
10.1109/ICGRID.2006.311006
Filename :
4100463