DocumentCode :
2724178
Title :
Query-sensitive Feature Selection for Lazy Learners
Author :
Tong, Xin ; Gu, Mingyang
Author_Institution :
Dept. of Comput. & Inf. Sci., Norwegian Univ. of Sci. & Technol., Trondheim
fYear :
2007
fDate :
March 1 2007-April 5 2007
Firstpage :
59
Lastpage :
65
Abstract :
Feature selection contributes to increasing many learners´ accuracy by identifying and removing irrelevant features in multidimensional datasets. Conventional feature selection methods determine the optimal feature subset independently from and prior to the introduction of a new query. In general, some features will be relevant only in certain tasks. We argue that a query, as an indicator of the attention focus and current task, is a major part of the context and should be involved in the determination of the final feature subset. In this paper we attempt to propose a query-sensitive feature selection model, present two algorithms for applying such a feature selection method, and test their effectiveness by comparing their performances to those of the conventional selection algorithms. Our experiments are executed under a nearest neighbor classification environment and the results show a consistent improvement in the classification performance when a query-sensitive feature subset is selected and used for measuring similarities between the query and other instances. The results suggest that the performance of a lazy learner has the potential to increase through query-sensitive feature selection
Keywords :
data mining; learning (artificial intelligence); pattern classification; query processing; lazy learners; multidimensional datasets; nearest neighbor classification environment; optimal feature subset; query-sensitive feature selection; query-sensitive feature subset; Computational intelligence; Concrete; Data mining; Degradation; Filters; Nearest neighbor searches; Performance evaluation; System performance; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
Type :
conf
DOI :
10.1109/CIDM.2007.368853
Filename :
4221277
Link To Document :
بازگشت