DocumentCode :
408318
Title :
A new reparation method for incomplete data in the context of supervised learning
Author :
Magnani, Matteo ; Montesi, Danilo
Author_Institution :
Dept. of Comput. Sci., Bologna Univ., Italy
Volume :
1
fYear :
2004
fDate :
5-7 April 2004
Firstpage :
471
Abstract :
Real-world data is often incomplete. There exist many statistical methods to deal with missing items. However, they assume data distributions which are difficult to justify in the context of supervised learning. In this paper we propose a new method of repairing incomplete data. This technique is a variation of a general strategy, here called local imputation. It repairs incomplete records, only when this is reasonable. It is able to identify wrong tuples. It is more general than other similar methods, because of a parametric similarity function. Finally, it also works with noisy data sets.
Keywords :
data handling; data integrity; data mining; learning (artificial intelligence); statistical analysis; data distributions; incomplete data repairing; incomplete records repairing; local imputation; noisy data sets; real-world data; reparation method; similarity function; statistical methods; supervised learning; wrong tuple identification; Computer science; Data analysis; Databases; Filling; Informatics; Lips; Mathematics; Statistical analysis; Supervised learning; Technological innovation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN :
0-7695-2108-8
Type :
conf
DOI :
10.1109/ITCC.2004.1286501
Filename :
1286501
Link To Document :
بازگشت