DocumentCode :
2697126
Title :
Speaker Diarization: About whom the Speaker is Talking ?
Author :
Mauclair, J. ; Meignier, S. ; Estève, Y.
Author_Institution :
LIUM, Maine Univ., Le Mans
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
1
Lastpage :
6
Abstract :
The automatic speaker diarization consists in splitting the signal into homogeneous segments and clustering them by speakers. However the speaker segments are specified with anonymous labels. This paper suggests a solution to identify those speakers by extracting their full names pronounced in French broadcast news. A semantic classification tree is automatically built on a training corpus and associate the full names detected in the transcription of a segment to this segment or to one of its neighbors. Then, a merging method permits to associate a full name to a speaker cluster instead of an anonymous label provided by the diarization. The experiments are carried out over French broadcast news records from the ESTER 2005 evaluation campaign. About 70% show duration is correctly processed for both development and evaluation corpora. On the evaluation corpus, 18.2% show duration is wrongly named and no decision is taken for 11.9% show duration
Keywords :
natural languages; signal classification; speaker recognition; trees (mathematics); ESTER 2005 evaluation campaign; French broadcast news; automatic speaker diarization; merging method; semantic classification tree; speaker identification; speaker segmentation; training corpus; Audio recording; Broadcasting; Classification tree analysis; Costs; Error analysis; Indexing; Loudspeakers; Merging; NIST; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
Type :
conf
DOI :
10.1109/ODYSSEY.2006.248114
Filename :
4013531
Link To Document :
بازگشت