DocumentCode
3257490
Title
A multimedia content identification system using audio and visual integrated features
Author
Chih-Chang Chen ; Liu, Chia-Hsiung ; Tsao, Yan-Cheng ; Tien, Pai-Yu ; Yuan-Bin Chen ; Chen, Oscal T C
Author_Institution
Dept of Electr. Eng., Nat. Chung Cheng Univ., Chiayi, Taiwan
fYear
2005
fDate
7-10 Aug. 2005
Firstpage
1855
Abstract
With multimedia information overflowing around us and an urgent need to identify data accurately, an audio and visual identification system with a high accuracy rate is developed. Classification and feature extraction are performed separately on the audio and visual signals. Depending on the temporal correlation of the feature vectors of objects and speakers, indexes of all objects included in an audio/visual sequence are listed in time order. When the audio/visual features are integrated, every object or character in the key frames has a set of feature vectors; the user can select and search for specific characters that have audio and visual features from the entire index set. By integrating the audio/visual identification results in time order, the proposed identification system increases accuracy in our experiments by about 4% and 6% compared with the results obtained using the audio features alone and the visual features alone, respectively.
Keywords
audio signal processing; classification; feature extraction; image sequences; multimedia systems; audio identification; audio sequence; feature extraction; multimedia content identification; time sequence; visual identification; visual sequence; Facial features; Feature extraction; Histograms; Image retrieval; Image segmentation; Multimedia systems; Multiple signal classification; Shape; Signal processing; Speech;
fLanguage
English
Publisher
IEEE
Conference_Titel
Circuits and Systems, 2005. 48th Midwest Symposium on
Print_ISBN
0-7803-9197-7
Type
conf
DOI
10.1109/MWSCAS.2005.1594485
Filename
1594485