Title :
Cleaning Training-Datasets with Noise-Aware Algorithms
Author :
Escalante, H. Jair
Author_Institution :
Dept. of Comput. Sci., Instituto Nacional de Astrofisica Optica y Electronica, Puebla
Abstract :
We introduce a novel learning algorithm for noise elimination. Our algorithm is based on the re-measurement idea for the correction of erroneous observations and is able to discriminate between noisy and noiseless observations by using kernel methods. We apply our noise-aware algorithms to several domains including: astronomy, face recognition and ten machine learning benchmark datasets. Experimental results adding noise and useful anomalies to the data show that our algorithm improves data quality, without having to eliminate any observation from the original dataset
Keywords :
data integrity; learning (artificial intelligence); astronomy; data quality; face recognition; kernel methods; learning algorithm; machine learning benchmark datasets; noise elimination; noise-aware algorithms; training dataset cleaning; Cleaning; Computer science; Error correction; Face recognition; Humans; Investments; Kernel; Machine learning; Machine learning algorithms; Optical noise;
Conference_Titel :
Computer Science, 2006. ENC '06. Seventh Mexican International Conference on
Conference_Location :
San Luis Potosi
Print_ISBN :
0-7695-2666-7