DocumentCode
3547440
Title
Autonomous learning of visual concept models
Author
Song, Xiaodan ; Lin, Ching-Yung ; Sun, Ming-Ting
Author_Institution
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fYear
2005
fDate
23-26 May 2005
Firstpage
4598
Abstract
As the amount of video data increases, organizing and retrieving video data based on their semantics is becoming increasingly important. Traditionally, supervised learning is used to build models for detecting semantic concepts. However, in order to obtain a substantial amount of training data, extensive labeling work is needed with the supervised learning schemes. In this paper, we propose a novel autonomous learning framework in which imperfect labelling automatically extracted from cross-modality information is used for training. This completely avoids the manual labeling process. In our proposed framework, imperfect labels without user involvement are first obtained from cross-modality information. Then, based on our proposed new schemes, "generalized multiple-instance learning" and "uncertain labeling density", the system conjectures relevance scores of visual concepts. From these scores, support vector regression is used to build generic visual models. In preliminary experiments, we use the proposed system to learn 20 visual concepts in 6 hours of video. Compare with two concept models that were trained by two supervised algorithms, this novel autonomous learning framework achieves better system average precisions. Other concept models also show promising results.
Keywords
content-based retrieval; image retrieval; learning (artificial intelligence); multimedia databases; regression analysis; support vector machines; video databases; autonomous learning; cross-modality information; generalized multiple-instance learning; generic visual models; imperfect labelling; semantics; support vector regression; training; uncertain labeling density; video data retrieval; visual concept models; Content based retrieval; Feedback; Image retrieval; Information retrieval; Labeling; Learning systems; Sun; Supervised learning; Training data; Video sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN
0-7803-8834-8
Type
conf
DOI
10.1109/ISCAS.2005.1465656
Filename
1465656
Link To Document