DocumentCode :
3717443
Title :
SQL-like big data environments: Case study in clinical trial analytics
Author :
Akshay Grover;Jay Gholap;Vandana P. Janeja;Yelena Yesha;Raghu Chintalapati;Harsh Marwaha;Kunal Modi
Author_Institution :
Computer Science and Electrical Engineering University of Maryland, Baltimore County
fYear :
2015
Firstpage :
2680
Lastpage :
2689
Abstract :
Big Data deals with enormous volumes of complex and exponentially growing data sets from multiple sources. With rapid growth in technology, we are now able to generate immense amount of data in almost any field imaginable including physical, biological and biomedical sciences. With the diversity and amount of data in health care industry there is an increasing need to evaluate the components in big data frameworks and gauge their adaptability to analytics techniques. However, a key step in adapting big data tools is the portability of relational databases to big data environment. Since SQL is considered to be the de-facto language for interactive queries, in this paper, we evaluate the performance of SQL-like big data solutions for the portability of existing relational databases. Our work focuses on benchmarking multiple SQL-like big data technologies over Hadoop based distributed file system (HDFS) for Study Data Tabulation Model (SDTM) used in clinical trial databases for improving the efficiency of research in clinical trials. We use publically available clinical trial data (from National Institute on Drug Abuse (NIDA)), which follows SDTM, as a test bed to measure key parameters like usability, adaptability, modularity, robustness and efficiency of these solutions. With the intention to demonstrate how current clinical trial functionality can be replicated on a big data backend with high SQL-like functionality, we evaluate several types of ad-hoc SQL queries.
Keywords :
"Big data","Clinical trials","Data mining","Facebook","Distributed databases","Data models","Engines"
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7364068
Filename :
7364068
Link To Document :
بازگشت