مرکز منطقه ای اطلاع رساني علوم و فناوري - Finding Speaker Identities with a Conditional Maximum Entropy Model

DocumentCode :

2701324

Title :

Finding Speaker Identities with a Conditional Maximum Entropy Model

Author :

Chengyuan Ma ; Nguyen, P. ; Mahajan, Monika

Author_Institution :

Sch. of Electr. Eng., Georgia Inst. of Technol., Atlanta, GA, USA

Volume :

fYear :

2007

fDate :

15-20 April 2007

Abstract :

In this paper, we address the task of identifying the speakers by name in audio content. Identification of speakers by name helps to improve the readability of the transcript and also provides additional meta-data which can help in finding the audio content of interest. We present a conditional maximum entropy (maxent) framework for this problem which yields superior performance and lends itself well to incorporating different types of information. We take advantage of this property of maxent to explore new features for this task. We show that supplementing standard lexical triggers with information such as speaker gender and position of speaker name mentions afford us large gains in performance. At 95% precision, we increase the recall to 67% from the trigger baseline of 38%.

Keywords :

maximum entropy methods; speaker recognition; audio content; conditional maximum entropy model; speaker gender; speaker identities; speaker position; speakers identification; Acoustic testing; Acoustical engineering; Automatic speech recognition; Broadcasting; Entropy; Loudspeakers; Performance gain; Speaker recognition; System testing; Training data; Maximum entropy methods; Speaker recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location :

Honolulu, HI

ISSN :

1520-6149

Print_ISBN :

1-4244-0727-3

Type :

conf

DOI :

10.1109/ICASSP.2007.366899

Filename :

4218087

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2701324