Regional Information Center for Science and Technology (RICeST) - Reusing Speech Techniques for Video Semantic Indexing [Applications Corner]

DocumentCode :

56812

Title :

Reusing Speech Techniques for Video Semantic Indexing [Applications Corner]

Author :

Shinoda, Koichi ; Inoue, Nakamasa

Author_Institution :

Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan

Volume :

Issue :

fYear :

2013

fDate :

March 2013

Firstpage :

118

Lastpage :

122

Abstract :

Many techniques developed in speech research have been successfully employed in other fields, such as automatic video semantic indexing. In this application, a user submits a textual query for a desired object or scene to a search system, which returns video shots that include the object or scene. Recently, a new method using Gaussian-mixture model (GMM) supervectors and support vector machines (SVMs) was shown to be very effective. In this method, speech technologies such as speaker verification and speaker adaptation techniques play very important roles.
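The abstract names GMM supervectors without describing how one is built, and this record does not include the article's own formulation. As a rough illustration only, the sketch below assumes the mean-only MAP adaptation commonly used in speaker verification: a shared background GMM is adapted to the features of one video shot, and the adapted component means are stacked into a single vector. The function name `gmm_supervector`, the diagonal-covariance model, and the relevance factor of 16 are assumptions made for this example, not details taken from the article.

```python
import numpy as np


def gmm_supervector(X, weights, means, variances, relevance=16.0):
    """MAP-adapt background GMM means to X and stack them into one vector.

    X: (T, D) features from one video shot (hypothetical input).
    weights: (K,), means: (K, D), variances: (K, D) diagonal GMM parameters.
    relevance: assumed relevance factor controlling how far the means move.
    """
    # Log-density of every frame under every diagonal Gaussian component.
    diff = X[:, None, :] - means[None, :, :]                           # (T, K, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (K,)
    log_pdf = log_norm - 0.5 * np.sum(diff ** 2 / variances, axis=2)   # (T, K)

    # Component posteriors per frame, normalized in a numerically stable way.
    log_post = np.log(weights) + log_pdf
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    # Soft counts and posterior-weighted feature means per component.
    n = post.sum(axis=0)                                               # (K,)
    ex = (post.T @ X) / np.maximum(n, 1e-10)[:, None]                  # (K, D)

    # Interpolate between the data mean and the background mean.
    alpha = (n / (n + relevance))[:, None]
    adapted = alpha * ex + (1.0 - alpha) * means
    return adapted.ravel()


# Toy usage with random data standing in for real shot features.
rng = np.random.default_rng(0)
K, D = 4, 3
weights = np.full(K, 1.0 / K)
means = rng.normal(size=(K, D))
variances = np.ones((K, D))
shot = rng.normal(size=(50, D))
sv = gmm_supervector(shot, weights, means, variances)
print(sv.shape)  # (12,)
```

In a full system, one such vector per shot would serve as the input to an SVM trained separately for each semantic concept. Practical pipelines often also scale each component's means by its weight and variance before classification, a step omitted here for brevity.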

Keywords :

Gaussian processes; indexing; speaker recognition; support vector machines; GMM supervector; Gaussian-mixture model supervector; SVM; automatic video semantic indexing; speaker adaptation technique; speaker verification technique; speech technique; support vector machine; textual input query; video shot return; Feature extraction; Gaussian processes; Indexing; Semantics; Speech processing; Support vector machines; Video signal processing;

fLanguage :

English

Journal_Title :

Signal Processing Magazine, IEEE

Publisher :

IEEE

ISSN :

1053-5888

Type :

jour

DOI :

10.1109/MSP.2012.2230520

Filename :

6461614

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=56812