Title :
Short fragment sequence alignment on the HP-SEE infrastructure
Author :
Kozlovszky, Miklos ; Windisch, Gergely ; Balaskó, Ákos
Author_Institution :
Lab. of Parallel & Distrib. Comput., MTA SZTAKI, Budapest, Hungary
Abstract :
Recently adopted deep sequencing techniques pose a new data processing challenge: mapping short fragment reads to open-access eukaryotic genomes at the scale of several hundred thousand. This problem can be solved with BLAST, BWA and similar sequence alignment tools. BLAST is one of the most frequently used tools in bioinformatics, and BWA is a relatively new, fast, lightweight tool that aligns short sequences efficiently. Local installations of these tools typically cannot handle large problem sizes, so the sequence alignment process runs slowly, while web-based implementations cannot accept a high number of queries. The HP-SEE infrastructure provides access to massively parallel supercomputing resources. With gUSE/WS-PGRADE we have successfully created an online Bioinformatics eScience Gateway, which is capable of serving the short fragment sequence alignment demand of the regional bioinformatics communities within the SEE region. Using workflows, we have ported the BLAST and BWA algorithms to the massively parallel HP-SEE infrastructure. In this paper we describe the Bioinformatics eScience Gateway and show, as a case study, how we implemented the ported BLAST workflow using a parameter study. With our online service, researchers can run high-throughput sequence alignments against eukaryotic genomes on HP-SEE's supercomputing infrastructure to search for regulatory mechanisms controlled by short fragments.
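The parameter study approach mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' workflow: the function names (`parse_fasta`, `split_reads`, `blast_command`), the round-robin splitting strategy, and the choice of the BLAST+ `blastn` program with tabular output are all assumptions made for illustration. The idea is that a generator step splits the short reads into independent chunks, each chunk is aligned as a separate job, and the per-chunk results are collected afterwards.

```python
from typing import Iterator, List, Tuple

Record = Tuple[str, str]  # (header, sequence)


def parse_fasta(text: str) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header = None
    seq: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(seq)
            header, seq = line[1:], []
        elif header is not None:
            seq.append(line)
    if header is not None:
        yield header, "".join(seq)


def split_reads(records: List[Record], n_chunks: int) -> List[List[Record]]:
    """Generator step (assumed): deal reads round-robin into at most n_chunks chunks."""
    if n_chunks < 1:
        raise ValueError("n_chunks must be at least 1")
    chunks: List[List[Record]] = [[] for _ in range(n_chunks)]
    for i, record in enumerate(records):
        chunks[i % n_chunks].append(record)
    # Drop empty chunks when there are fewer reads than requested chunks.
    return [chunk for chunk in chunks if chunk]


def blast_command(query_file: str, database: str, out_file: str) -> List[str]:
    """Build (but do not run) one alignment job; blastn and -outfmt 6 are assumptions."""
    return ["blastn", "-query", query_file, "-db", database,
            "-out", out_file, "-outfmt", "6"]


if __name__ == "__main__":
    reads = ">r1\nACGT\n>r2\nGGCC\nTTAA\n>r3\nTTTT\n"
    chunks = split_reads(list(parse_fasta(reads)), 2)
    for idx, chunk in enumerate(chunks):
        # File and database names here are hypothetical placeholders.
        cmd = blast_command(f"chunk_{idx}.fa", "genome_db", f"hits_{idx}.tsv")
        print(len(chunk), "reads:", " ".join(cmd))
```

Because each chunk is independent, the jobs can run concurrently on separate nodes, and a final collector step only needs to concatenate the per-chunk output files.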
Keywords :
Internet; bioinformatics; network servers; sequences; BLAST; BWA; HP-SEE infrastructure; Web based implementations; data processing challenge; deep sequencing techniques; eukaryotic genomes; gUSE/WS-PGRADE; online bioinformatics escience gateway; open-access eukaryotic genomes; parallel HP-SEE infrastructure; short fragment sequence alignment demand; Bioinformatics; Communities; Educational institutions; Europe; Graphical user interfaces; Logic gates; Portals; Application porting; HP-SEE; gUSE; sequence alignment workflow;
Conference_Title :
MIPRO, 2012 Proceedings of the 35th International Convention
Conference_Location :
Opatija
Print_ISBN :
978-1-4673-2577-6