DocumentCode :
3495155
Title :
Modeling prosodic dynamics for speaker recognition
Author :
Adami, Andre G. ; Mihaescu, Radu ; Reynolds, Douglas A. ; Godfrey, John J.
Volume :
4
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Most current state-of-the-art automatic speaker recognition systems extract speaker-dependent features by looking at short-term spectral information. This approach ignores long-term information that can convey supra-segmental information, such as prosodics and speaking style. We propose two approaches that use the fundamental frequency and energy trajectories to capture long-term information. The first approach uses bigram models to model the dynamics of the fundamental frequency and energy trajectories for each speaker. The second approach uses the fundamental frequency trajectories of a predefined set of words as the speaker templates and then, using dynamic time warping, computes the distance between the templates and the words from the test message. The results presented in this work are on Switchboard I using the NIST Extended Data evaluation design. We show that these approaches can achieve an equal error rate of 3.7%, which is a 77% relative improvement over a system based on short-term pitch and energy features alone.
Keywords :
error statistics; natural languages; speaker recognition; spectral analysis; speech processing; NIST Extended Data evaluation design; Switchboard I; automatic speaker recognition; bigram models; dynamic time warping; energy trajectories; equal error rate; fundamental frequency; prosodic dynamics; short-term spectral information; speaking style; Data mining; Energy capture; Feature extraction; Frequency; Laboratories; NIST; Speaker recognition; Speech; Statistical distributions; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202761
Filename :
1202761
Link To Document :
بازگشت