Title :
Speaker adaptation using semi-continuous hidden Markov models
Author :
Rieck, S. ; Schukat-Talamazzini, E.G. ; Niemann, H.
Author_Institution :
Lehrstuhl fur Inf. 5, Friedrich-Alexander-Univ., Erlangen, Germany
fDate :
30 Aug-3 Sep 1992
Abstract :
Presents a new approach to speaker adaptation based on semi-continuous hidden Markov models (SCHMM). The authors introduce a modification of the semi-continuous codebook updating which allows rapid speaker adaptation. The approach is based on the idea that phonetic information already incorporated in a trained model should be used to update the codebook. Thus the different acoustic representation of a new speaker is learned while the connection between codebook entries and model states remains the same. Several experiments were carried out with a small speech sample. It is possible to demonstrate that the new codebook updating performs better than conventional SCHMM codebook updating and that using a speech sample comprising about 40 seconds of adaptation speech is enough to achieve 50 percent of the difference in performance between full speaker-dependent training and no adaptation at all
Keywords :
encoding; hidden Markov models; speech recognition; acoustic representation; encoding; phonetic information; semi-continuous codebook updating; semi-continuous hidden Markov models; speaker adaptation; speech recognition; Acoustic emission; Books; Hidden Markov models; Interpolation; Loudspeakers; Maximum likelihood estimation; Prototypes; Smoothing methods; Speech recognition; State estimation;
Conference_Titel :
Pattern Recognition, 1992. Vol.III. Conference C: Image, Speech and Signal Analysis, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2920-7
DOI :
10.1109/ICPR.1992.202044