DocumentCode :
3547440
Title :
Autonomous learning of visual concept models
Author :
Song, Xiaodan ; Lin, Ching-Yung ; Sun, Ming-Ting
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fYear :
2005
fDate :
23-26 May 2005
Firstpage :
4598
Abstract :
As the amount of video data increases, organizing and retrieving video data based on their semantics is becoming increasingly important. Traditionally, supervised learning is used to build models for detecting semantic concepts. However, in order to obtain a substantial amount of training data, extensive labeling work is needed with the supervised learning schemes. In this paper, we propose a novel autonomous learning framework in which imperfect labelling automatically extracted from cross-modality information is used for training. This completely avoids the manual labeling process. In our proposed framework, imperfect labels without user involvement are first obtained from cross-modality information. Then, based on our proposed new schemes, "generalized multiple-instance learning" and "uncertain labeling density", the system conjectures relevance scores of visual concepts. From these scores, support vector regression is used to build generic visual models. In preliminary experiments, we use the proposed system to learn 20 visual concepts in 6 hours of video. Compare with two concept models that were trained by two supervised algorithms, this novel autonomous learning framework achieves better system average precisions. Other concept models also show promising results.
Keywords :
content-based retrieval; image retrieval; learning (artificial intelligence); multimedia databases; regression analysis; support vector machines; video databases; autonomous learning; cross-modality information; generalized multiple-instance learning; generic visual models; imperfect labelling; semantics; support vector regression; training; uncertain labeling density; video data retrieval; visual concept models; Content based retrieval; Feedback; Image retrieval; Information retrieval; Labeling; Learning systems; Sun; Supervised learning; Training data; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN :
0-7803-8834-8
Type :
conf
DOI :
10.1109/ISCAS.2005.1465656
Filename :
1465656
Link To Document :
بازگشت