DocumentCode
3307226
Title
Bag-level active multi-instance learning
Author
Jian Fu ; Jian Yin
Author_Institution
Sch. of Inf. Sci. & Technol., SUN YAT-SEN Univ., Guangzhou, China
Volume
2
fYear
2011
fDate
26-28 July 2011
Firstpage
1307
Lastpage
1311
Abstract
Multi-Instance Learning (MIL) is a special scheme in machine learning. In recent research it is successfully applied in text classification problem. However, MIL is naturally semi-supervised since the instances labels are unknown for positive bags, which would cut down the accuracy of predictors, or require more computational cost to reduce uncertainty, or to guess such labels at a high probability. In this paper, we attempt to tackle MIL problem by introducing active learning, which is another learning scheme attracted much research interests. Active learning relies on an oracle that can give ground truth labels as required. The proposed method is based on query for bags and it adopts a Fisher Information Matrix (FIM) based method to construct the criteria of query for oracle. We launch experiment on a famous text classification data set - 20 group news. Compared to the randomly selected query strategy as a baseline method and recent methods, the proposed method is of higher accuracy and outperforms others.
Keywords
learning (artificial intelligence); matrix algebra; query processing; text analysis; active learning; bag-level active multi-instance learning; fisher information matrix based method; ground truth labels; machine learning; oracle; positive bags; query strategy; text classification data set; Bismuth; Machine learning; Measurement; Silicon; Text categorization; Training; Uncertainty; active learning; fisher information matrix; multi-instance learning; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-61284-180-9
Type
conf
DOI
10.1109/FSKD.2011.6019682
Filename
6019682
Link To Document