DocumentCode
284778
Title
Augmented phonetic map for voice verification
Author
Chang, Harry M.
Author_Institution
NYNEX Science & Technology Inc., White Plains, NY, USA
Volume
2
fYear
1992
fDate
23-26 Mar 1992
Firstpage
169
Abstract
A perceptually based model for speaker identity verification (SIV) using derivative of phase spectrum (DPS) as the primary identity-bearing feature to model individual speakers´ vocal tract dynamics is presented. The basic technique used to model a speaker is to create a two-dimensional trajectory of changing vocal tract based on formant movement and pitch information. The map is further augmented with both instantaneous and dynamic feature parameters of DPS as well as with conventional energy-based acoustic features. A series of verification experiments was conducted, using a three-layer artificial neural network as a classifier, with an isolated digit database recorded over 11 different telephone handsets. The preliminary testing results suggest that this system performs significantly better than a baseline system using a standard cepstrum front-end
Keywords
neural nets; speech recognition; derivative of phase spectrum; energy-based acoustic features; formant movement; isolated digit database; pitch information; speaker identity verification; telephone handsets; three-layer artificial neural network; vocal tract dynamics; voice verification; Cepstral analysis; Cepstrum; Filtering algorithms; Linear predictive coding; Loudspeakers; Robustness; Spatial databases; Speech analysis; Telephony; Time domain analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.226093
Filename
226093
Link To Document