Title :
A study of genomic data provenance in NoSQL document-oriented database systems
Author :
Valeria Guimar?es;Fernanda Hondo;Rodrigo Almeida;Harley Vera;Maristela Holanda;Aleteia Araujo;Maria Emilia Walter;Sergio Lifschitz
Author_Institution :
Department of Computer Science, University of Bras?lia, Brazil
Abstract :
This work considers a scientific experiment as a computational workflow. Provenance models store details of each workflow execution, including produced data, computational tools parameters and their versions, among others. This way, scientists can review details of a particular workflow execution, compare information generated among different executions and plan new ones efficiently. In the bioinformatics domain, particularly in the presence of large volumes of data, persistency of those data generated during the workflow execution is still a research challenge. In this article, we consider a study on provenance data storage for bioinformatics in a document-oriented NoSQL database system. We present data modeling issues and discuss an actual implementation into MongoDB.
Keywords :
"Bioinformatics","Genomics","Biological system modeling","Databases","Yttrium","Programming","Kidney"
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
DOI :
10.1109/BIBM.2015.7359902