DocumentCode
2579439
Title
Imputation for the analysis of missing values and prediction of time series data
Author
Sridevi, S. ; Rajaram, S. ; Parthiban, C. ; SibiArasan, S. ; Swadhikar, C.
Author_Institution
Dept. of CSE, Thiagarajar Coll. of Eng., Madurai, India
fYear
2011
fDate
3-5 June 2011
Firstpage
1158
Lastpage
1163
Abstract
Data preprocessing plays a critical role in the data mining process and is required to improve the efficiency of an algorithm. This paper focuses on missing value estimation and prediction of time series data based on historical values. A number of algorithms have been developed to solve this problem, but they have several limitations. Most existing algorithms, such as KNNimpute (K-Nearest Neighbours imputation), BPCA (Bayesian Principal Component Analysis) and SVDimpute (Singular Value Decomposition imputation), cannot handle the situation where a particular time point (column) of the data is missing entirely. This paper focuses on an autoregressive-model-based missing value estimation method (ARLSimpute), which is effective where a particular time point contains many missing values or where the entire time point is missing. The output of data preprocessing is passed as input to two prediction techniques, namely linear prediction and quadratic prediction, which predict future values based on the historical values. The performance of the algorithm is measured using metrics such as precision and recall. Experimental results on real-life datasets demonstrate that the proposed algorithm is effective and efficient at predicting future time series data.
Keywords
autoregressive moving average processes; data mining; estimation theory; ARLSimpute; autoregressive-model-based missing value estimation method; data mining process; data preprocessing; missing value analysis; time series data prediction; Algorithm design and analysis; Data mining; Databases; Estimation; Prediction algorithms; Predictive models; Time series analysis; Auto-Regressive (AR) model; Prediction; Temporal Databases; time series analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
Conference_Location
Chennai, Tamil Nadu
Print_ISBN
978-1-4577-0588-5
Type
conf
DOI
10.1109/ICRTIT.2011.5972466
Filename
5972466