مرکز منطقه ای اطلاع رساني علوم و فناوري - Factor analysis of acoustic features for streamed hidden Markov modeling

DocumentCode :

2768541

Title :

Factor analysis of acoustic features for streamed hidden Markov modeling

Author :

Ting, Chuan-Wei ; Chien, Jen-Tzung

Author_Institution :

Nat. Cheng Kung Univ., Tainan

fYear :

2007

fDate :

9-13 Dec. 2007

Firstpage :

Lastpage :

Abstract :

This paper presents a new streamed hidden Markov model (HMM) framework for speech recognition. The factor analysis (FA) is performed to discover the common factors of acoustic features. The streaming regularities are governed by the correlation between features, which is inherent in common factors. Those features corresponding to the same factor are generated by identical HMM state. Accordingly, we use multiple Markov chains to represent the variation trends in cepstral features. We develop a FA streamed HMM (FASHMM) and go beyond the conventional HMM assuming that all features at a speech frame conduct the same state emission. This streamed HMM is more delicate than the factorial HMM where the streaming was empirically determined. We also exploit a new decoding algorithm for FASHMM speech recognition. In this manner, we fulfill the flexible Markov chains for an input sequence of multivariate Gaussian mixture observations. In the experiments, the proposed method can reduce word error rate by 36% at most.

Keywords :

Gaussian processes; decoding; feature extraction; hidden Markov models; speech coding; speech recognition; Markov chains; acoustic features; decoding algorithm; factor analysis; multivariate Gaussian mixture; speech frame; speech recognition; streamed hidden Markov modeling; Cepstral analysis; Decoding; Hidden Markov models; Information analysis; Mel frequency cepstral coefficient; Performance analysis; Speech analysis; Speech recognition; Statistics; Topology; Markov chain; factor analysis; speech recognition; streamed HMM;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location :

Kyoto

Print_ISBN :

978-1-4244-1746-9

Electronic_ISBN :

978-1-4244-1746-9

Type :

conf

DOI :

10.1109/ASRU.2007.4430079

Filename :

4430079

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2768541