Regional Information Center for Science and Technology (RICeST) - Autonomous learning of visual concept models

DocumentCode :

3547440

Title :

Autonomous learning of visual concept models

Author :

Song, Xiaodan ; Lin, Ching-Yung ; Sun, Ming-Ting

Author_Institution :

Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA

fYear :

2005

fDate :

23-26 May 2005

Firstpage :

4598

Abstract :

As the amount of video data increases, organizing and retrieving video data based on their semantics is becoming increasingly important. Traditionally, supervised learning is used to build models for detecting semantic concepts. However, in order to obtain a substantial amount of training data, extensive labeling work is needed with the supervised learning schemes. In this paper, we propose a novel autonomous learning framework in which imperfect labeling automatically extracted from cross-modality information is used for training. This completely avoids the manual labeling process. In our proposed framework, imperfect labels without user involvement are first obtained from cross-modality information. Then, based on our proposed new schemes, "generalized multiple-instance learning" and "uncertain labeling density", the system conjectures relevance scores of visual concepts. From these scores, support vector regression is used to build generic visual models. In preliminary experiments, we use the proposed system to learn 20 visual concepts in 6 hours of video. Compared with two concept models that were trained by two supervised algorithms, this novel autonomous learning framework achieves better system average precisions. Other concept models also show promising results.

Keywords :

content-based retrieval; image retrieval; learning (artificial intelligence); multimedia databases; regression analysis; support vector machines; video databases; autonomous learning; cross-modality information; generalized multiple-instance learning; generic visual models; imperfect labelling; semantics; support vector regression; training; uncertain labeling density; video data retrieval; visual concept models; Content based retrieval; Feedback; Image retrieval; Information retrieval; Labeling; Learning systems; Sun; Supervised learning; Training data; Video sequences;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on

Print_ISBN :

0-7803-8834-8

Type :

conf

DOI :

10.1109/ISCAS.2005.1465656

Filename :

1465656

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3547440