Title :
A Comparison Study of Missing Value Processing Methods in Time Series Data Mining
Author :
Jiang, Yi ; Lan, Tuo ; Wu, LiHua
Author_Institution :
Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
Abstract :
There are many methods for dealing with missing value on time series mining. The regression model is better than other methods when the variables of the data are correlative. This paper uses the method of mean interpolation and one variable linear regression, multivariate linear regression and iterative regression method of regression interpolation to deal with missing value of hydrological time series database and compares them under different Pearson correlation coefficient. The study shows that the one variable linear regression is simple and intuitive, and has a higher accuracy when data gaps exist variables that have a great relevance with missing variable values, and it shows that multivariate linear regression and multiple regression iteration have better results when the data doesn ´t.
Keywords :
data mining; iterative methods; regression analysis; time series; Pearson correlation coefficient; data mining; iterative regression; missing value processing; multivariate linear regression; regression model; time series; variable linear regression; Clustering algorithms; Computer science; Data mining; Databases; Decision trees; Finance; Interpolation; Iterative methods; Linear regression; Multivariate regression;
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
DOI :
10.1109/CISE.2009.5365076