DocumentCode :
1672994
Title :
Automatic multi-modal dialogue scene indexing
Author :
Alatan, A. Aydin
Author_Institution :
Dept. of Electr. & Electron. Eng., Middle East Tech. Univ., Ankara, Turkey
Volume :
3
fYear :
2001
fDate :
6/23/1905 12:00:00 AM
Firstpage :
374
Abstract :
An automatic algorithm for indexing dialogue scenes in multimedia content is proposed. The content is segmented into dialogue scenes using the state transitions of a hidden Markov model (HMM). Each shot is classified using both audio and visual information to determine the state/scene transitions for this model. Face detection and silence/speech/music classification are the basic tools which are utilized to index the scenes. While face information is extracted after applying some heuristics to skin-colored regions, audio analysis is achieved by examining signal energy, periodicity and zero crossing rate (ZCR) of the audio waveform. The simulation results show the possibility of automatically indexing the dialogues using the proposed algorithm
Keywords :
audio signal processing; content-based retrieval; face recognition; hidden Markov models; indexing; multimedia databases; speech processing; HMM; audio analysis; audio waveform; automatic algorithm; dialogue scene indexing; face detection; hidden Markov model; multi-modal indexing; multimedia content; multimedia management; music classification; periodicity; signal energy; silence classification; skin-colored regions; speech classification; state transitions; state/scene transitions; zero crossing rate; Data mining; Face detection; Hidden Markov models; Image analysis; Indexing; Information analysis; Layout; Motion pictures; Natural languages; Signal analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2001. Proceedings. 2001 International Conference on
Conference_Location :
Thessaloniki
Print_ISBN :
0-7803-6725-1
Type :
conf
DOI :
10.1109/ICIP.2001.958129
Filename :
958129
Link To Document :
بازگشت