Title :
MDDM: A Method to Improve Multiple Dimension Data Management Performance in HBase
Author :
Zhuang Wei;Qu JunMei;Liu Liang;Zhu ChaoQiang;Yin WenJun
Author_Institution :
IBM China Res. Lab., Beijing, China
Abstract :
Big data is the term applied to a new generation of software, applications and storage systems designed to derive business value. The big data phenomenon requires a revolutionary approach to the technologies deployed, to ensure that results are delivered in time to create value. However, state-of-the-art techniques for querying multiple-dimension big data face problems as the data expand and user access patterns change. In this paper, we propose an optimized storage model and index scheme that provide efficient queries over big multiple-dimension data under multiple query patterns. We implement our scheme on HBase by introducing four components in its master node. Using pollutant concentration data from the "Green Horizon" project as test data, we conduct numerous experiments. Experimental results show that the proposed storage model and index clearly improve performance across multiple query patterns over big multiple-dimension data, and that they scale well as the data expand.
Keywords :
"Time series analysis","Big data","Data models","Monitoring","Software","Diamonds","Buildings"
Conference_Title :
High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conference on Embedded Software and Systems (ICESS), 2015 IEEE 17th International Conference on
DOI :
10.1109/HPCC-CSS-ICESS.2015.102