Title :
Structured modeling based on generalized variable parameter HMMs and speaker adaptation
Author :
Yang Li ; Xunying Liu ; Lan Wang
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Abstract :
It is a challenging task that to handle ambient variable acoustic factors in automatic speech recognition (ASR) system. The ambient variable noise and the distinct acoustic factors among speakers are two key issues for recognition task. To solve these problems, we present a new framework for robust speech recognition based on structured modeling, using generalized variable parameter HMMs (GVP-HMMs) and unsupervised speaker adaptation (SA) to compensate the mismatch from environment and speaker variability. GVP-HMMs can explicitly approximate the continuous trajectory of Gaussian component mean, variance and linear transformation parameter with a polynomial function against the varying noise level. In recognition stage, MLLR transform captures general relationship between the original model set and the current speaker, which could help in removing the effects of unwanted speaker factors. The effectiveness of the proposed approach is confirmed by evaluation experiment on a medium vocabulary Mandarin recognition task.
Keywords :
Gaussian processes; hidden Markov models; natural language processing; polynomials; speech recognition; ASR system; GVP-HMM; Gaussian component mean; MLLR transform; SA; acoustic factors; automatic speech recognition; generalized variable parameter HMM; linear transformation parameter; medium vocabulary Mandarin recognition task; polynomial function; robust speech recognition; structured modeling; unsupervised speaker adaptation; variance transformation parameter; Adaptation models; Hidden Markov models; Mathematical model; Signal to noise ratio; Trajectory; Transforms; generalized variable parameter; robust speech recognition; speaker adaptation; variable noise;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423526