DocumentCode :
3269006
Title :
Exploiting speaker segmentations for automatic role detection. An application to broadcast news documents
Author :
Bigot, Benjamin ; Ferrane, Isabelle ; Pinquier, Julien
Author_Institution :
IRIT, Univ. de Toulouse, Toulouse, France
fYear :
2010
fDate :
23-25 June 2010
Firstpage :
1
Lastpage :
6
Abstract :
In the field of automatic audiovisual content-based indexing and structuring, finding events like interviews, debates, reports, or live commentaries requires to bridge the gap between low-level feature extraction and such high-level event detection. In our work, we consider that detecting speaker role to enrich interaction sequences between speakers is a first step to reach this goal. The generic method we propose follows a data mining approach. We assume that speaker roles can emerge from parameters extracted from speaker segmentations without taking any prior information into account. Each speaker is then represented by a feature vector carrying temporal, signal and prosodic information. In this paper, we study how methods for dimensionality reduction and classification can help to recognize speaker roles. This method is applied to the corpus of the ESTER2 evaluation campaign and our best result reaches about 72% of well recognized roles that corresponds to nearly 79% of speech time.
Keywords :
data mining; document handling; feature extraction; speaker recognition; audiovisual content-based indexing; automatic role detection; broadcast news documents; data mining approach; dimensionality classification method; dimensionality reduction method; feature vector; high-level event detection; low-level feature extraction; speaker segmentation; Bridges; Broadcasting; Content based retrieval; Data mining; Event detection; Feature extraction; Indexing; Information retrieval; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Content-Based Multimedia Indexing (CBMI), 2010 International Workshop on
Conference_Location :
Grenoble
ISSN :
1949-3983
Print_ISBN :
978-1-4244-8028-9
Electronic_ISBN :
1949-3983
Type :
conf
DOI :
10.1109/CBMI.2010.5529900
Filename :
5529900
Link To Document :
بازگشت