DocumentCode
2518820
Title
A novel approach for mining multiple data streams based on lag correlation
Author
Zhang, Tiancheng ; Yue, Dejun ; Wang, Yanqiu ; Yu, Ge
Author_Institution
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
fYear
2011
fDate
23-25 May 2011
Firstpage
2377
Lastpage
2382
Abstract
Correlation analysis is a key problem for data stream analysis. In this paper, we propose a correlation analysis method for multiple dimensional data streams, which is based on the Boolean lag representation and the PCA (Principal Component Analysis). Firstly, the raw stream sequence is transformed into the Boolean sequence. By the correlation analysis of Boolean sequences, we can easily find the sequence pairs with lag correlations by means of simple bit operations. Secondly, we compute the lag time and synchronize the multiple dimensional data stream. Thirdly, the PCA method is deployed to reduce the multiple data streams, and we can reconstruct the data streams by a few principal components. The experimental evaluations show that the method has high computation performance with high accuracy.
Keywords
Boolean functions; correlation methods; data mining; principal component analysis; Boolean lag representation; Boolean sequence; PCA; lag correlation; multiple data streams mining; principal component analysis; Algorithm design and analysis; Complexity theory; Correlation; Educational institutions; Principal component analysis; Synchronization; Transforms; Boolean; PCA; data stream; lag correlation;
fLanguage
English
Publisher
ieee
Conference_Titel
Control and Decision Conference (CCDC), 2011 Chinese
Conference_Location
Mianyang
Print_ISBN
978-1-4244-8737-0
Type
conf
DOI
10.1109/CCDC.2011.5968606
Filename
5968606
Link To Document