DocumentCode :
2331802
Title :
Outlier Detection in Benchmark Classification Tasks
Author :
Li, Hongyu ; Niranjan, Mahesan
Author_Institution :
Dept. of Comput. Sci., Sheffield Univ.
Volume :
5
fYear :
2006
fDate :
14-19 May 2006
Abstract :
We present a new outlier detection method which is appropriate for classification problems. It combines estimating the overall probability density and sequential ranking of the data according to observed changes in performance on validation sets. The method has been implemented on ten widely used benchmark datasets and a spam email filtering application. Evaluated by six popular machine learning methods, classification performances are shown to improve after removing outliers in comparison to removing the same number of examples at random from the datasets
Keywords :
information filtering; learning (artificial intelligence); probability; benchmark classification tasks; machine learning methods; outlier detection; overall probability density; sequential ranking; spam email filtering application; Additive noise; Biological system modeling; Computer science; Electronic mail; Instruments; Labeling; Noise robustness; Performance evaluation; Predictive models; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1661336
Filename :
1661336
Link To Document :
بازگشت