Title :
Vector-field-smoothed Bayesian learning for incremental speaker adaptation
Author :
Takahashi, Jun-ichi ; Sagayama, Shigeki
Author_Institution :
NTT Human Interface Labs., Kanagawa, Japan
Abstract :
The paper presents a fast and incremental speaker adaptation method called MAP/VFS, which combines maximum a posteriori (MAP) estimation, or in other words Bayesian learning, with vector field smoothing (VFS). The point is that MAP is an intra-class training scheme while VFS is an inter-class smoothing technique. This is a basic technique for on-line adaptation which will be important in constructing a practical speech recognition system. Speaker adaptation speed of the incremental MAP is experimentally shown to be significantly accelerated by the use of VFS in word-by-word adaptation. The recognition performance of MAP is consistently improved and stabilized by VFS. The word error reduction rate achieved in incrementally adapting a few words of sample data is about 22%
Keywords :
Bayes methods; maximum likelihood estimation; smoothing methods; speech recognition; MAP/VFS; incremental speaker adaptation; inter-class smoothing technique; intra-class training scheme; maximum a posteriori estimation; on-line adaptation; practical speech recognition system; recognition performance; vector-field-smoothed Bayesian learning; word error reduction rate; word-by-word adaptation; Acceleration; Bayesian methods; Data acquisition; Hidden Markov models; Humans; Laboratories; Smoothing methods; Speech recognition; Telephony; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479789