DocumentCode :
2845899
Title :
A Comparison Study of Missing Value Processing Methods in Time Series Data Mining
Author :
Jiang, Yi ; Lan, Tuo ; Wu, LiHua
Author_Institution :
Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
fYear :
2009
fDate :
11-13 Dec. 2009
Firstpage :
1
Lastpage :
4
Abstract :
There are many methods for dealing with missing value on time series mining. The regression model is better than other methods when the variables of the data are correlative. This paper uses the method of mean interpolation and one variable linear regression, multivariate linear regression and iterative regression method of regression interpolation to deal with missing value of hydrological time series database and compares them under different Pearson correlation coefficient. The study shows that the one variable linear regression is simple and intuitive, and has a higher accuracy when data gaps exist variables that have a great relevance with missing variable values, and it shows that multivariate linear regression and multiple regression iteration have better results when the data doesn ´t.
Keywords :
data mining; iterative methods; regression analysis; time series; Pearson correlation coefficient; data mining; iterative regression; missing value processing; multivariate linear regression; regression model; time series; variable linear regression; Clustering algorithms; Computer science; Data mining; Databases; Decision trees; Finance; Interpolation; Iterative methods; Linear regression; Multivariate regression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4507-3
Electronic_ISBN :
978-1-4244-4507-3
Type :
conf
DOI :
10.1109/CISE.2009.5365076
Filename :
5365076
Link To Document :
بازگشت