DocumentCode
1097151
Title
NIST Speaker Recognition Evaluations Utilizing the Mixer Corpora—2004, 2005, 2006
Author
Przybocki, Mark A. ; Martin, Alvin F. ; Le, Audrey N.
Author_Institution
Nat. Inst. of Stand. & Technol., Gaithersburg
Volume
15
Issue
7
fYear
2007
Firstpage
1951
Lastpage
1959
Abstract
NIST has coordinated annual evaluations of text-independent speaker recognition from 1996 to 2006. This paper discusses the last three of these, which utilized conversational speech data from the Mixer Corpora recently collected by the Linguistic Data Consortium. We review the evaluation procedures, the matrix of test conditions included, and the performance trends observed. While most of the data is collected over telephone channels, one multichannel test condition utilizes a subset of Mixer conversations recorded simultaneously over multiple microphone channels and a telephone line. The corpus also includes some non-English conversations involving bilingual speakers, allowing an examination of the effect of language on performance results. On the various test conditions involving English language conversational telephone data, considerable performance gains are observed over the past three years.
Keywords
linguistics; speaker recognition; English language conversational telephone data; Linguistic Data Consortium; Mixer Corpora; NIST speaker recognition evaluations; bilingual speakers; text-independent speaker recognition; Cost function; Microphones; NIST; Natural languages; Performance gain; Speaker recognition; Speech analysis; Strontium; System testing; Telephony; Cross-channel evaluation; Mixer Corpora; NIST evaluations; decision error tradeoff (DET) curves; speaker recognition evaluation;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2007.902489
Filename
4291612
Link To Document