مرکز منطقه ای اطلاع رساني علوم و فناوري - Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

DocumentCode :

1522411

Title :

Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation

Author :

Wang, Shaojun ; Zhao, Yunxin

Volume :

Issue :

fYear :

2001

fDate :

9/1/2001 12:00:00 AM

Firstpage :

663

Lastpage :

677

Abstract :

This paper presents a new recursive Bayesian learning approach for transformation parameter estimation in speaker adaptation. Our goal is to incrementally transform or adapt a set of hidden Markov model (HMM) parameters for a new speaker and gain large performance improvement from a small amount of adaptation data. By constructing a clustering tree of HMM Gaussian mixture components, the linear regression (LR) or affine transformation parameters for HMM Gaussian mixture components are dynamically searched. An online Bayesian learning technique is proposed for recursive maximum a posteriori (MAP) estimation of LR and affine transformation parameters. This technique has the advantages of being able to accommodate flexible forms of transformation functions as well as a priori probability density functions (PDFs). To balance between model complexity and goodness of fit to adaptation data, a dynamic programming algorithm is developed for selecting models using a Bayesian variant of the “minimum description length” (MDL) principle. Speaker adaptation experiments with a 26-letter English alphabet vocabulary were conducted, and the results confirmed effectiveness of the online learning framework

Keywords :

Gaussian processes; dynamic programming; hidden Markov models; learning (artificial intelligence); online operation; parameter estimation; probability; recursive estimation; speech recognition; statistical analysis; trees (mathematics); English alphabet vocabulary; HMM Gaussian mixture components; HMM parameters; MAP estimation; adaptation data; affine transformation parameters; automatic speech recognition; clustering tree; dynamic programming algorithm; hidden Markov model; linear regression parameters; minimum description length; model complexity; online Bayesian learning; online Bayesian tree-structured transformation; optimal model selection; probability density functions; recursive Bayesian learning; recursive maximum a posteriori estimation; speaker adaptation experiments; transformation functions; transformation parameter estimation; Bayesian methods; Dynamic programming; Heuristic algorithms; Hidden Markov models; Linear regression; Parameter estimation; Performance gain; Probability density function; Recursive estimation; Vocabulary;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.943344

Filename :

943344

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1522411