Title :
Scalable prediction of energy consumption using incremental time series clustering
Author :
Simmhan, Yogesh ; Noor, Muhammad Usman
Author_Institution :
Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Time series datasets are a canonical form of high velocity Big Data, and often generated by pervasive sensors, such as found in smart infrastructure. Performing predictive analytics on time series data can be computationally complex, and requires approximation techniques. In this paper, we motivate this problem using a real application from the smart grid domain. We propose an incremental clustering technique, along with a novel affinity score for determining cluster similarity, which help reduce the prediction error for cumulative time series within a cluster. We evaluate this technique, along with optimizations, using real datasets from smart meters, totaling ~700,000 data points, and show the efficacy of our techniques in improving the prediction error of time series data within polynomial time.
Keywords :
energy consumption; optimisation; pattern clustering; power engineering computing; sensors; smart meters; smart power grids; time series; approximation techniques; cluster similarity; cumulative time series; high velocity big data; incremental clustering technique; incremental time series clustering; optimizations; pervasive sensors; scalable energy consumption prediction; smart grid domain; smart infrastructure; smart meters; time series datasets; Clustering algorithms; Data handling; Data storage systems; Energy consumption; Information management; Predictive models; Time series analysis; clustering; predictive analytics; smart power grids; time series; velocity;
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
DOI :
10.1109/BigData.2013.6691774