Title :
Finding Speaker Identities with a Conditional Maximum Entropy Model
Author :
Chengyuan Ma ; Nguyen, P. ; Mahajan, Monika
Author_Institution :
Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
In this paper, we address the task of identifying the speakers by name in audio content. Identification of speakers by name helps to improve the readability of the transcript and also provides additional meta-data which can help in finding the audio content of interest. We present a conditional maximum entropy (maxent) framework for this problem which yields superior performance and lends itself well to incorporating different types of information. We take advantage of this property of maxent to explore new features for this task. We show that supplementing standard lexical triggers with information such as speaker gender and position of speaker name mentions afford us large gains in performance. At 95% precision, we increase the recall to 67% from the trigger baseline of 38%.
Keywords :
maximum entropy methods; speaker recognition; audio content; conditional maximum entropy model; speaker gender; speaker identities; speaker position; speakers identification; Acoustic testing; Acoustical engineering; Automatic speech recognition; Broadcasting; Entropy; Loudspeakers; Performance gain; Speaker recognition; System testing; Training data; Maximum entropy methods; Speaker recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366899