مرکز منطقه ای اطلاع رساني علوم و فناوري - High-Level Feature Extraction Using SIFT GMMs and Audio Models

DocumentCode :

2502299

Title :

High-Level Feature Extraction Using SIFT GMMs and Audio Models

Author :

Inoue, Nakamasa ; Saito, Tatsuhiko ; Shinoda, Koichi ; Furui, Sadaoki

Author_Institution :

Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo, Japan

fYear :

2010

fDate :

23-26 Aug. 2010

Firstpage :

3220

Lastpage :

3223

Abstract :

We propose a statistical framework for high-level feature extraction that uses SIFT Gaussian mixture models (GMMs) and audio models. SIFT features were extracted from all the image frames and modeled by a GMM. In addition, we used mel-frequency cepstral coefficients and ergodic hidden Markov models to detect high-level features in audio streams. The best result obtained by using SIFT GMMs in terms of mean average precision on the TRECVID 2009 corpus was 0.150 and was improved to 0.164 by using audio information.

Keywords :

Gaussian processes; audio signal processing; audio streaming; cepstral analysis; feature extraction; hidden Markov models; image processing; SIFT Gaussian mixture model; SIFT feature extraction; audio model; audio stream; ergodic hidden Markov model; high-level feature detection; high-level feature extraction; image frame; mean average precision; mel-frequency cepstral coefficient; statistical framework; Computational modeling; Data mining; Detectors; Feature extraction; Hidden Markov models; Streaming media; Visualization; HMM; MFCC; SIFT GMM;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Pattern Recognition (ICPR), 2010 20th International Conference on

Conference_Location :

Istanbul

ISSN :

1051-4651

Print_ISBN :

978-1-4244-7542-1

Type :

conf

DOI :

10.1109/ICPR.2010.787

Filename :

5597163

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2502299