DocumentCode
755682
Title
Large population speaker identification using clean and telephone speech
Author
Reynolds, Douglas A.
Author_Institution
Lincoln Lab., MIT, Cambridge, MA, USA
Volume
2
Issue
3
fYear
1995
fDate
3/1/1995 12:00:00 AM
Firstpage
46
Lastpage
48
Abstract
This paper presents text-independent speaker identification results for varying speaker population sizes up to 630 speakers for both clean, wideband speech and telephone speech. A system based on Gaussian mixture speaker models is used for speaker identification, and experiments are conducted on the TIMIT and NTIMIT databases. The TIMIT results show large population performance under near-ideal conditions, and the NTIMIT results show the corresponding accuracy loss due to telephone transmission. These are believed to be the first speaker identification experiments on the complete 630-speaker TIMIT and NTIMIT databases and the largest text-independent speaker identification task reported to date. Identification accuracies of 99.5% and 60.7% were achieved on the TIMIT and NTIMIT databases, respectively.
Keywords
Gaussian processes; speaker recognition; telephony; Gaussian mixture speaker models; NTIMIT database; TIMIT database; clean speech; large population speaker identification; near-ideal conditions; telephone speech; telephone transmission; text-independent speaker identification; wideband speech; Additive noise; Databases; Degradation; Loudspeakers; Performance loss; Propagation losses; Speech analysis; Telephony; Wideband
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
IEEE
ISSN
1070-9908
Type
jour
DOI
10.1109/97.372913
Filename
372913