DocumentCode :
593743
Title :
Efficient processing of models for large-scale shotgun proteomics data
Author :
Grover, H. ; Gopalakrishnan, V.
Author_Institution :
Dept. of Biomed. Inf., Univ. of Pittsburgh, Pittsburgh, PA, USA
fYear :
2012
fDate :
14-17 Oct. 2012
Firstpage :
591
Lastpage :
596
Abstract :
Mass-spectrometry (MS) based proteomics has become a key enabling technology for the systems approach to biology, providing insights into the protein complement of an organism. Bioinformatics analyses play a critical role in interpretation of large, and often replicated, MS datasets generated across laboratories and institutions. A significant amount of computational effort in the workflow is spent on the identification of protein and peptide components of complex biological samples, and consists of a series of steps relying on large database searches and intricate scoring algorithms. In this work, we share our efforts and experience in efficient handling of these large MS datasets through database indexing and parallelization based on multiprocessor architectures. We also identify important challenges and opportunities that are relevant specifically to the task of peptide and protein identification, and more generally to similar multi-step problems that are inherently parallelizable.
Keywords :
bioinformatics; data handling; database indexing; multiprocessing systems; parallel architectures; parallel databases; proteins; proteomics; replicated databases; MS-based proteomics; bioinformatics analysis; biological samples; database indexing; database parallelization; intricate scoring algorithm; large MS dataset handling; large database search; large-scale shotgun proteomics data; mass spectrometry-based proteomics; model processing; multiprocessor architectures; multistep problems; peptide component identification; protein identification; replicated MS datasets; Computational modeling; Databases; Hidden Markov models; Peptides; Proteins; Synchronization; Bioinformatics; High-throughput Proteomics; Indexing; Multiprocessing; Parallelization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2012 8th International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
978-1-4673-2740-4
Type :
conf
Filename :
6450956
Link To Document :
بازگشت