DocumentCode :
3125122
Title :
Structured modeling based on generalized variable parameter HMMs and speaker adaptation
Author :
Yang Li ; Xunying Liu ; Lan Wang
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
136
Lastpage :
140
Abstract :
It is a challenging task that to handle ambient variable acoustic factors in automatic speech recognition (ASR) system. The ambient variable noise and the distinct acoustic factors among speakers are two key issues for recognition task. To solve these problems, we present a new framework for robust speech recognition based on structured modeling, using generalized variable parameter HMMs (GVP-HMMs) and unsupervised speaker adaptation (SA) to compensate the mismatch from environment and speaker variability. GVP-HMMs can explicitly approximate the continuous trajectory of Gaussian component mean, variance and linear transformation parameter with a polynomial function against the varying noise level. In recognition stage, MLLR transform captures general relationship between the original model set and the current speaker, which could help in removing the effects of unwanted speaker factors. The effectiveness of the proposed approach is confirmed by evaluation experiment on a medium vocabulary Mandarin recognition task.
Keywords :
Gaussian processes; hidden Markov models; natural language processing; polynomials; speech recognition; ASR system; GVP-HMM; Gaussian component mean; MLLR transform; SA; acoustic factors; automatic speech recognition; generalized variable parameter HMM; linear transformation parameter; medium vocabulary Mandarin recognition task; polynomial function; robust speech recognition; structured modeling; unsupervised speaker adaptation; variance transformation parameter; Adaptation models; Hidden Markov models; Mathematical model; Signal to noise ratio; Trajectory; Transforms; generalized variable parameter; robust speech recognition; speaker adaptation; variable noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423526
Filename :
6423526
Link To Document :
بازگشت