Title :
On-line adaptation and Bayesian detection of environmental changes based on a macroscopic time evolution system
Author :
Watanabe, Shinji ; Nakamura, Atsushi
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp.
Abstract :
Acoustic characteristics are often changed over time as a result of various factors including changes of speakers, speaking styles, and noise sources. Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to such time-variant acoustic characteristics. Recently we proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. This paper proposes fast incremental adaptation based on a macroscopic time evolution system that realizes an utterance-by-utterance update by approximating the posterior distributions. This adaptation was used to perform on-line adaptation of Japanese broadcast news for very large vocabulary continuous speech recognition (700k vocabulary size) in real time. The word accuracy was improved from 73.9% to 85.1%. In addition, by incorporating a Bayesian model selection approach, we realized the simultaneous on-line adaptation and detection of environmental changes.
Keywords :
Bayes methods; natural language processing; speech recognition; Bayesian detection; environmental changes; incremental adaptation techniques; macroscopic time evolution system; noise sources; posterior distributions; speaking styles; speech recognition; time-variant characteristics; Bayesian methods; Speech recognition; acoustic model; macroscopic time evolution system; model selection; on-line adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960598