Title :
On using formants to improve SCHMM speaker adaptation
Author :
Yang, Tae-Young ; Shin, Won-Ho ; Kim, Weon-Goo ; Youn, Dae Hee ; Cha, Il-Whan
Author_Institution :
Dept. of Electron. Eng., Yonsei Univ., Seoul, South Korea
fDate :
3/1/1999 12:00:00 AM
Abstract :
A speaker adaptation algorithm using formant frequencies is proposed. The formants extracted from the cepstral means in the reference codebook are iteratively shifted toward the formants of a test speaker. The number of cepstral means selected at each iteration decreases as the iteration increases. The decision of the number of selected cepstral means and a formant based distance measure are formulated. The proposed algorithm was implemented in two schemes and evaluated by speaker-independent, male speaker dependent, and female speaker-dependent recognition experiments. A combined scheme with the Bayesian adaptation obtained 9.7% enhancement for the average recognition accuracy in speaker-independent experiments and 52.6% in speaker-dependent recognition experiments
Keywords :
cepstral analysis; hidden Markov models; iterative methods; speech recognition; Bayesian adaptation; SCHMM speaker adaptation; average recognition accuracy; cepstral means; distance measure; female speaker-dependent recognition; formants; iteration; male speaker dependent recognition; semicontinuous hidden Markov model based speech recognition; speaker-independent recognition; Acoustic testing; Bayesian methods; Cepstral analysis; Character recognition; Frequency; Hidden Markov models; Iterative algorithms; Loudspeakers; Shape; Speech recognition;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on