• DocumentCode
    2518820
  • Title

    A novel approach for mining multiple data streams based on lag correlation

  • Author

    Zhang, Tiancheng ; Yue, Dejun ; Wang, Yanqiu ; Yu, Ge

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
  • fYear
    2011
  • fDate
    23-25 May 2011
  • Firstpage
    2377
  • Lastpage
    2382
  • Abstract
    Correlation analysis is a key problem for data stream analysis. In this paper, we propose a correlation analysis method for multiple dimensional data streams, which is based on the Boolean lag representation and the PCA (Principal Component Analysis). Firstly, the raw stream sequence is transformed into the Boolean sequence. By the correlation analysis of Boolean sequences, we can easily find the sequence pairs with lag correlations by means of simple bit operations. Secondly, we compute the lag time and synchronize the multiple dimensional data stream. Thirdly, the PCA method is deployed to reduce the multiple data streams, and we can reconstruct the data streams by a few principal components. The experimental evaluations show that the method has high computation performance with high accuracy.
  • Keywords
    Boolean functions; correlation methods; data mining; principal component analysis; Boolean lag representation; Boolean sequence; PCA; lag correlation; multiple data streams mining; principal component analysis; Algorithm design and analysis; Complexity theory; Correlation; Educational institutions; Principal component analysis; Synchronization; Transforms; Boolean; PCA; data stream; lag correlation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control and Decision Conference (CCDC), 2011 Chinese
  • Conference_Location
    Mianyang
  • Print_ISBN
    978-1-4244-8737-0
  • Type

    conf

  • DOI
    10.1109/CCDC.2011.5968606
  • Filename
    5968606