Title :
Mining group correlations over data streams
Author :
Zhijie Wang ; Jiangbo Qian ; Maochun Zhou ; Yihong Dong ; Huahui Chen
Author_Institution :
Coll. of Inf. Sci. & Eng, Ningbo Univ., Ningbo, China
Abstract :
Mining correlation over steams attracts a lot of attentions recently. However, group correlation analysis over data streams is relatively few. Moreover, existing literatures are mainly focused on a single time window, with large space and time complexity. This paper proposes an online canonical correlation analysis algorithm called MGDS (Mining Group Data Streams). Based on base-window, the MGDS algorithm dynamically maintains a few statistics from raw data to calculate correlation. The mining range is not limited in a single window, but can be changed according to queries. Theoretical analysis and experimental results show that the algorithm is accurate and efficient with space and time overhead reduced greatly.
Keywords :
computational complexity; data mining; group correlation analysis; group correlation mining; mining group data streams; online canonical correlation analysis algorithm; space complexity; time complexity; Algorithm design and analysis; Approximation methods; Complexity theory; Correlation; Covariance matrix; Data mining; Queueing analysis; data mining; group correlation analysis; multidimensional data streams;
Conference_Titel :
Computer Science & Education (ICCSE), 2011 6th International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-9717-1
DOI :
10.1109/ICCSE.2011.6028794