DocumentCode :
2909663
Title :
The effects of telephone transmission degradations on speaker recognition performance
Author :
Reynolds, D.A. ; Zissman, M.A. ; Quatieri, T.F. ; O´Leary, G.C. ; Carlson, B.A.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
329
Abstract :
The two largest factors affecting automatic speaker identification performance are the size of the population and the degradations introduced by noisy communication channels (e.g., telephone transmission). To examine experimentally these two factors, this paper presents text-independent speaker identification results for varying speaker population sizes up to 630 speakers for both clean, wideband speech and telephone speech. A system based on Gaussian mixture speaker models is used for speaker identification and experiments are conducted on the TIMIT and NTIMIT databases. This is believed to be the first speaker identification experiments on the complete 630 speaker TIMIT and NTIMIT databases and the largest text-independent speaker identification task reported to date. Identification accuracies of 99.5% and 60.7% are achieved on the TIMIT and NTIMIT databases, respectively. This paper also presents experiments which examine and attempt to quantify the performance loss associated with various telephone degradations by systematically degrading the TIMIT speech in a manner consistent with measured NTIMIT degradations and measuring the performance loss at each step. It is found that the standard degradations of filtering and additive noise do not account for all of the performance gap between the TIMIT and NTIMIT data. Measurements of nonlinear microphone distortions are also described which may explain the additional performance loss
Keywords :
Gaussian processes; microphones; speaker recognition; telecommunication channels; telephone interference; telephony; Gaussian mixture speaker models; NTIMIT database; TIMIT database; additive noise; automatic speaker identification performance; filtering; noisy communication channels; nonlinear microphone distortions; population size; speaker recognition performance; telephone speech; telephone transmission degradations; text-independent speaker identification; wideband speech; Communication channels; Databases; Degradation; Distortion measurement; Filtering; Loss measurement; Performance loss; Speech; Telephony; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479540
Filename :
479540
Link To Document :
بازگشت