DocumentCode
397647
Title
A generic neural network approach for filling missing data in data mining
Author
Wei, Wei ; Tang, Ying
Author_Institution
Dept. of Comput. Sci., New Jersey Inst. of Technol., Newark, NJ, USA
Volume
1
fYear
2003
fDate
5-8 Oct. 2003
Firstpage
862
Abstract
The recent advances in data mining have produced algorithms for extracting hidden and potentially useful knowledge in large data sets, which are assumed to be complete and reliable. However, data suitable for mining comes from various sources, has different formats, and can have missing or incorrect values. Incomplete data sets significantly distort mining results. Therefore, data preparation to taking care of missing or out-of-range values is very critical to knowledge discovery. This paper proposes a generic framework for missing data imputation using neural networks, where two-stage filling algorithms are implemented. An empirical evaluation of this method through a large credit card data set is performed.
Keywords
credit transactions; data mining; data preparation; neural nets; very large databases; credit card data set; data mining; data preparation; filling algorithms; knowledge discovery; knowledge extraction; large data sets; missing data; neural network; Computer science; Data analysis; Data engineering; Data mining; Delta modulation; Filling; Intelligent networks; Knowledge engineering; Neural networks; Reliability engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN
1062-922X
Print_ISBN
0-7803-7952-7
Type
conf
DOI
10.1109/ICSMC.2003.1243923
Filename
1243923
Link To Document