DocumentCode :
2139425
Title :
Towards the detection and the characterization of conversational speech zones in audiovisual documents
Author :
Bigot, Benjamin ; Ferrane, Isabelle ; Ibrahim, Zein Al Abidin
Author_Institution :
IRIT - Paul Sabatier Univ., Toulouse
fYear :
2008
fDate :
18-20 June 2008
Firstpage :
162
Lastpage :
169
Abstract :
Giving access to the semantically rich content of large amounts of digital audiovisual data using an automatic and generic method is still an important challenge. The aim of our work is to address this issue while focusing on temporal aspects. Our approach is based on a method previously developed for analyzing temporal relations from a data mining point of view. This method is used to detect zones of a document in which two characteristics are active. These characteristics can result from low-level segmentations of the audio or video components, or from more semantic processings. Once ldquoactivity zonesrdquo have been detected, we propose to compute a set of additional descriptors in order to better characterize them. The method is applied in the scope of the EPAC project that focuses on the detection and the characterization of conversational speech.
Keywords :
audio-visual systems; data mining; document handling; speech recognition; audio component segmentation; audiovisual documents; data mining; digital audiovisual data; semantic processings; speech detection; video component segmentation; Aggregates; Content based retrieval; Data mining; Face detection; Image color analysis; Image segmentation; Indexing; Information retrieval; Speech analysis; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
Conference_Location :
London
Print_ISBN :
978-1-4244-2043-8
Electronic_ISBN :
978-1-4244-2044-5
Type :
conf
DOI :
10.1109/CBMI.2008.4564942
Filename :
4564942
Link To Document :
بازگشت