Autonomous learning of visual concept models

Author

Song, Xiaodan ; Lin, Ching-Yung ; Sun, Ming-Ting

Author_Institution

Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA

fYear

2005

fDate

23-26 May 2005

Firstpage

4598

Abstract

As the amount of video data increases, organizing and retrieving video data based on their semantics is becoming increasingly important. Traditionally, supervised learning is used to build models for detecting semantic concepts. However, in order to obtain a substantial amount of training data, extensive labeling work is needed with the supervised learning schemes. In this paper, we propose a novel autonomous learning framework in which imperfect labelling automatically extracted from cross-modality information is used for training. This completely avoids the manual labeling process. In our proposed framework, imperfect labels without user involvement are first obtained from cross-modality information. Then, based on our proposed new schemes, "generalized multiple-instance learning" and "uncertain labeling density", the system conjectures relevance scores of visual concepts. From these scores, support vector regression is used to build generic visual models. In preliminary experiments, we use the proposed system to learn 20 visual concepts in 6 hours of video. Compare with two concept models that were trained by two supervised algorithms, this novel autonomous learning framework achieves better system average precisions. Other concept models also show promising results.

Keywords

content-based retrieval; image retrieval; learning (artificial intelligence); multimedia databases; regression analysis; support vector machines; video databases; autonomous learning; cross-modality information; generalized multiple-instance learning; generic visual models; imperfect labelling; semantics; support vector regression; training; uncertain labeling density; video data retrieval; visual concept models; Content based retrieval; Feedback; Image retrieval; Information retrieval; Labeling; Learning systems; Sun; Supervised learning; Training data; Video sequences;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on

Print_ISBN

0-7803-8834-8

Type

conf

DOI

10.1109/ISCAS.2005.1465656

Filename

1465656