A multimedia content identification system using audio and visual integrated features

Author

Chih-Chang Chen ; Liu, Chia-Hsiung ; Tsao, Yan-Cheng ; Tien, Pai-Yu ; Yuan-Bin Chen ; Chen, Oscal T C

Author_Institution

Dept of Electr. Eng., Nat. Chung Cheng Univ., Chiayi, Taiwan

fYear

2005

fDate

7-10 Aug. 2005

Firstpage

1855

Abstract

With an overflow of multimedia information around us and an urgent need to identify data accurately, an audio and visual identification system with a high accuracy rate is developed to meet the demand. Classification and feature extraction are performed separately on audio and visual signals. Pending on the temporal correlation of the feature vectors of objects and speakers, indexes of all objects included in an audio/visual sequence are listed in a time sequence. In integrating the audio/visual features, every object or character of the key frames has a set of feature vectors; the user can select and search specific characters that have the audio and visual features from the entire index set. Due to integrating the audio/visual identification results in the time order, the proposed identification system can increase the accuracy about 4% and 6% in our experiments, comparing with the results using the audio features and visual features separately, respectively.

Keywords

audio signal processing; classification; feature extraction; image sequences; multimedia systems; audio identification; audio sequence; feature extraction; multimedia content identification; time sequence; visual identification; visual sequence; Facial features; Feature extraction; Histograms; Image retrieval; Image segmentation; Multimedia systems; Multiple signal classification; Shape; Signal processing; Speech;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuits and Systems, 2005. 48th Midwest Symposium on

Print_ISBN

0-7803-9197-7

Type

conf

DOI

10.1109/MWSCAS.2005.1594485

Filename

1594485