مرکز منطقه ای اطلاع رساني علوم و فناوري - Application of variational Bayesian PCA for speech feature extraction

DocumentCode :

542307

Title :

Application of variational Bayesian PCA for speech feature extraction

Author :

Kwon, Oh-Wook ; Lee, Te-Won ; Chan, Kwokleung

Author_Institution :

Institute for Neural Computation, University of California, San Diego, 8500 Oilman Drive, La Jolla, 92059-0523, USA

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

In a standard mel-frequency cepstral coefficient-based speech recognizer, it is common to use the same feature dimension and the number of Gaussian mixtures for all subunits. We proposed to use different transformations and different number of mixtures for each subunit. We obtained the transformations from mel-frequency band energies by using the variational Bayesian principal component analysis (PCA) method. In the method, hyperparameters of the Gaussian mixtures and the number of mixtures are automatically learned through maximization of a lower bound of the evidence instead of the likelihood in the conventional maximum likelihood paradigm. Analyzing the TIMIT speech data, we revealed intrinsic structures of vowels and consonants. We demonstrated the userfulness of the method for speech recognition by performing phoneme classification of /b/, /d/ and /g/ phonemes.

Keywords :

Bayesian methods; Feature extraction; Hidden Markov models; Principal component analysis; Speech; Speech recognition; Transforms;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743866

Filename :

5743866

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542307