• DocumentCode
    2579439
  • Title

    Imputation for the analysis of missing values and prediction of time series data

  • Author

    Sridevi, S. ; Rajaram, S. ; Parthiban, C. ; SibiArasan, S. ; Swadhikar, C.

  • Author_Institution
    Dept. of CSE, Thiagarajar Coll. of Eng., Madurai, India
  • fYear
    2011
  • fDate
    3-5 June 2011
  • Firstpage
    1158
  • Lastpage
    1163
  • Abstract
    Data preprocessing plays an important and critical role in the data mining process. Data preprocessing is required in order to improve the efficiency of an algorithm. This paper focuses on missing value estimation and prediction of time series data based on the historical values. A number of algorithms have been developed to solve this problem, but they have several limitations. Most existing algorithms like KNNimpute (K-Nearest Neighbours imputation), BPCA (Bayesian Principal Component Analysis) and SVDimpute (Singular Value Decomposition imputation) are not able to deal with the situation where a particular time point (column) of the data is missing entirely. This paper focuses on autoregressive-model-based missing value estimation method (ARLSimpute) which is effective for the situation where a particular time point contains many missing values or where the entire time point is missing. Data preprocessing output is given to the input of the prediction techniques namely linear prediction and quadratic prediction. These techniques are used to predict the future values based on the historical values. The performance of the algorithm is measured by performance metrics like precision and recall. Experimental results on real-life datasets demonstrate that the proposed algorithm is effective and efficient to reveal future time series data.
  • Keywords
    autoregressive moving average processes; data mining; estimation theory; ARLSimpute; autoregressive-model-based missing value estimation method; data mining process; data preprocessing; missing value analysis; time series data prediction; Algorithm design and analysis; Data mining; Databases; Estimation; Prediction algorithms; Predictive models; Time series analysis; Auto-Regressive (AR) model; Prediction; Temporal Databases; time series analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
  • Conference_Location
    Chennai, Tamil Nadu
  • Print_ISBN
    978-1-4577-0588-5
  • Type

    conf

  • DOI
    10.1109/ICRTIT.2011.5972466
  • Filename
    5972466