DocumentCode :
2701324
Title :
Finding Speaker Identities with a Conditional Maximum Entropy Model
Author :
Chengyuan Ma ; Nguyen, P. ; Mahajan, Monika
Author_Institution :
Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
In this paper, we address the task of identifying the speakers by name in audio content. Identification of speakers by name helps to improve the readability of the transcript and also provides additional meta-data which can help in finding the audio content of interest. We present a conditional maximum entropy (maxent) framework for this problem which yields superior performance and lends itself well to incorporating different types of information. We take advantage of this property of maxent to explore new features for this task. We show that supplementing standard lexical triggers with information such as speaker gender and position of speaker name mentions afford us large gains in performance. At 95% precision, we increase the recall to 67% from the trigger baseline of 38%.
Keywords :
maximum entropy methods; speaker recognition; audio content; conditional maximum entropy model; speaker gender; speaker identities; speaker position; speakers identification; Acoustic testing; Acoustical engineering; Automatic speech recognition; Broadcasting; Entropy; Loudspeakers; Performance gain; Speaker recognition; System testing; Training data; Maximum entropy methods; Speaker recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366899
Filename :
4218087
Link To Document :
بازگشت