DocumentCode
2774054
Title
Transferred Feature Selection
Author
Bi, Wei ; Shi, Yuan ; Lan, Zhenzhong
Author_Institution
Dept. of Comput. Sci., Sun Yat-sen Univ., Guangzhou, China
fYear
2009
fDate
6-6 Dec. 2009
Firstpage
416
Lastpage
421
Abstract
Traditional feature selection algorithms require a large number of labeled training instances to find the most informative subset of features. However, in many real-world applications, labeled data are difficult, expensive, or time-consuming to obtain. Recently, several semi-supervised feature selection algorithms have been proposed that perform feature selection with the help of unlabeled data, but such methods assume that the labeled and unlabeled data follow the same distribution. In this paper, we propose a new framework named transferred feature selection (TFS), which uses out-of-domain labeled data to alleviate the lack of same-distribution labeled training data. The out-of-domain data are labeled but follow a different distribution from the same-distribution data, so most supervised or semi-supervised feature selection algorithms fail to work well with them. The key idea of TFS is to transfer knowledge from the out-of-domain instances to select a feature subset that can yield high prediction accuracy. The framework is then implemented using the k-NN method. Analysis and experiments show that TFS can effectively exploit the out-of-domain instances to improve the performance of feature selection.
Keywords
knowledge based systems; pattern recognition; k-NN method; knowledge transfer; labeled training; semi-supervised feature selection; transferred feature selection; Application software; Bismuth; Computer science; Conferences; Data mining; Labeling; Machine learning; Sun; Testing; Training data;
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on
Conference_Location
Miami, FL
Print_ISBN
978-1-4244-5384-9
Electronic_ISBN
978-0-7695-3902-7
Type
conf
DOI
10.1109/ICDMW.2009.102
Filename
5360441