DocumentCode :
3640095
Title :
A Comparison of Techniques for Handling Incomplete Input Data with a Focus on Attribute Relevance Influence
Author :
M. Millan Giraldo;J.S. Sanchez;V.J. Traver
Author_Institution :
Dept. Llenguatges i Sistemes Inf., Univ. Jaume I, Castelló
fYear :
2010
Firstpage :
819
Lastpage :
822
Abstract :
This work presents a new approach based on support vector regression to deal with incomplete input (unseen) data and compares it to other existing techniques. The empirical analysis has been done over 18 real data sets and using five different classifiers, with the aim of foreseeing which technique can be deemed as more suitable for each classifier. Also, this study tries to devise how the relevance of the missing attribute affects the performance of each pair (handling algorithm, classifier). Experimental results demonstrate that no technique is absolutely better than the others for all classifiers. However, combining the proposed strategy with the nearest neighbor classifier appears as the best choice to face the problem of missing attribute values in the input data.
Keywords :
"Training","Support vector machines","Data models","Noise","Nearest neighbor searches","Libraries","Kernel"
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Print_ISBN :
978-1-4244-9211-4
Type :
conf
DOI :
10.1109/ICMLA.2010.126
Filename :
5708949
Link To Document :
بازگشت