مرکز منطقه ای اطلاع رساني علوم و فناوري - Bayesian latent variable models for speech recognition

DocumentCode :

1690377

Title :

Bayesian latent variable models for speech recognition

Author :

Jen-Tzung Chien ; Peng Liu

Author_Institution :

Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan

fYear :

2013

Firstpage :

7393

Lastpage :

7397

Abstract :

We present a Bayesian framework to learn prior and posterior distributions for latent variable models. Our goal is to deal with model regularization and achieve desirable prediction using heterogeneous speech data. A variational Bayesian expectation-maximization algorithm is developed to establish a latent variable model based on the exponential family distributions. This algorithm does not only estimate model parameters but also their hyperparameters which reflect the model uncertainties. The uncertainty is compensated to construct a variety of regularized models. We realize this full Bayesian framework for uncertainty decoding of speech signals. Compared to maximum likelihood method and Bayesian approach with heuristically-selected hyperparameters, the proposed method achieves higher speech recognition accuracy especially in case of sparse and noisy training data.

Keywords :

belief networks; decoding; expectation-maximisation algorithm; learning (artificial intelligence); speech recognition; Bayesian approach; Bayesian framework; Bayesian latent variable models; exponential family distributions; heterogeneous speech data; heuristically-selected hyperparameters; maximum likelihood method; model parameter estimation; model uncertainties; noisy training data; posterior distributions; prior distributions; sparse training data; speech recognition; speech signals; uncertainty decoding; variational Bayesian expectation-maximization algorithm; Bayes methods; Computational modeling; Hidden Markov models; Speech; Speech recognition; Training; Training data; Bayesian Learning; Exponential Family; Latent Variable Model; Speech Recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6639099

Filename :

6639099

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1690377