Online unsupervised learning of hidden Markov models for adaptive speech recognition

Author

Chien, J.-T.

Author_Institution

Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan

Volume

148

Issue

5

fYear

2001

fDate

10/1/2001 12:00:00 AM

Firstpage

315

Lastpage

324

Abstract

A novel framework of an online unsupervised learning algorithm is presented to flexibly adapt the existing speaker-independent hidden Markov models (HMMs) to nonstationary environments induced by varying speakers, transmission channels, ambient noises, etc. The quasi-Bayes (QB) estimate is applied to incrementally obtain word sequence and adaptation parameters for adjusting HMMs when a block of unlabelled data is enrolled. The underlying statistics of a nonstationary environment can be successively traced according to the newest enrolment data. To improve the QB estimate, the adaptive initial hyperparameters are employed in the beginning session of online learning. These hyperparameters are estimated from a cluster of training speakers closest to the test environment. Additionally, a selection process is developed to select reliable parameters from a list of candidates for unsupervised learning. A set of reliability assessment criteria is explored for selection. In a series of speaker adaptation experiments, the effectiveness of the proposed method is confirmed and it is found that using the adaptive initial hyperparameters in online learning and the multiple assessments in parameter selection can improve the recognition performance

Keywords

Bayes methods; adaptive signal processing; online operation; parameter estimation; speech recognition; unsupervised learning; adaptive initial hyperparameters; adaptive speech recognition; ambient noise; automatic speech recognition; enrolment data; hidden Markov models; nonstationary environments; online unsupervised learning; parameter selection; quasi-Bayes estimate; recognition performance; reliability assessment criteria; speaker adaptation experiments; speaker-independent HMM; statistics; test environment; transmission channels; unlabelled data; word sequence parameters;

fLanguage

English

Journal_Title

Vision, Image and Signal Processing, IEE Proceedings -

Publisher

iet

ISSN

1350-245X

Type

jour

DOI

10.1049/ip-vis:20010560

Filename

974391