Title :
Data Vitalization: A New Paradigm for Large-Scale Dataset Analysis
Author :
Xiong, Zhang ; Luo, Wuman ; Chen, Lei ; Ni, Lionel M.
Author_Institution :
Sch. of Comput. Sci. & Technol., Beihang Univ., Beijing, China
Abstract :
Nowadays, datasets grow enormously both in size and complexity. One of the key issues confronted by large-scale dataset analysis is how to adapt systems to new, unprecedented query loads. Existing systems nail down the data organization scheme once and for all at the beginning of the system design, thus inevitably will see the performance goes down when user requirements change. In this paper, we propose a new paradigm, Data Vitalization, for large-scale dataset analysis. Our goal is to enable high flexibility such that the system is adaptive to complex analytical applications. Specifically, data are organized into a group of vitalized cells, each of which is a collection of data coupled with computing power. As user requirements change over time, cells evolve spontaneously to meet the potential new query loads. Besides basic functionality of Data Vitalization, we also explore an envisioned architecture of Data Vitalization including possible approaches for query processing, data evolution, as well as its tight-coupled mechanism for data storage and computing.
Keywords :
data acquisition; data analysis; query processing; storage management; complex analytical application; data collection; data computing; data evolution; data organization; data storage; data vitalization; high flexibility; large-scale dataset analysis; query load; query processing; system design; tight-coupled mechanism; user requirement; data analysis; data vitalization; large-scale dataset; vitalized data cell;
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2010 IEEE 16th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9727-0
Electronic_ISBN :
1521-9097
DOI :
10.1109/ICPADS.2010.102