Regional Information Center for Science and Technology - All-phoneme ergodic hidden Markov network for unsupervised speaker adaptation

DocumentCode :

290079

Title :

All-phoneme ergodic hidden Markov network for unsupervised speaker adaptation

Author :

Miyazawa, Yasunaga ; Takami, Jun-Ichi ; Sagayama, Shigeki ; Matsunaga, Shoichi

Author_Institution :

ATR Interpreting Telephony Res. Labs., Kyoto, Japan

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

The paper proposes an unsupervised speaker adaptation method using an “all-phoneme ergodic hidden Markov network” that combines allophonic (context-dependent phone) acoustic models with stochastic language constraints. Hidden Markov networks (HMnets) for allophone modeling are combined with allophonic bigram probabilities derived from a large text database to yield a single large ergodic HMM that represents arbitrary speech signals in a particular language. Because this model covers any utterance, its parameters can be re-estimated with the Baum-Welch algorithm from speech samples whose text is unknown. Combined with the vector field smoothing (VFS) technique, this allows unsupervised speaker adaptation to be performed effectively. In experiments, the method performed better than the authors' previous unsupervised adaptation method, which used conventional phonetic HMMs and phoneme bigram probabilities.
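The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' HMnet construction: it assumes each unit is a plain left-to-right HMM without skips, described only by its per-state self-loop probabilities, and it assumes the bigram table is a row-stochastic matrix over units. The function name `build_ergodic_transitions` and both parameters are hypothetical, chosen for illustration. The exit probability of each unit's last state is spread over the first states of all units in proportion to the bigram probabilities, which yields one transition matrix over the whole state space.

```python
def build_ergodic_transitions(self_loops, bigram):
    """Merge per-unit left-to-right HMMs into one composite transition matrix.

    self_loops[i][k] is the (assumed) self-loop probability of state k in unit i.
    bigram[i][j] is the (assumed) probability that unit j follows unit i;
    each row of bigram is expected to sum to 1.
    """
    # Index of each unit's first state in the composite state space.
    offsets = []
    total = 0
    for loops in self_loops:
        offsets.append(total)
        total += len(loops)

    trans = [[0.0] * total for _ in range(total)]
    for i, loops in enumerate(self_loops):
        for k, stay in enumerate(loops):
            s = offsets[i] + k
            trans[s][s] += stay
            if k + 1 < len(loops):
                # Interior state: the remaining mass moves to the next state.
                trans[s][s + 1] += 1.0 - stay
            else:
                # Last state: the exit mass enters the first state of every
                # unit, weighted by the bigram probability of that unit.
                for j, p in enumerate(bigram[i]):
                    trans[s][offsets[j]] += (1.0 - stay) * p
    return trans


# Two toy units: one with two states, one with a single state.
toy = build_ergodic_transitions(
    [[0.6, 0.5], [0.7]],
    [[0.2, 0.8], [0.9, 0.1]],
)
for row in toy:
    print([round(x, 3) for x in row])
```

If every bigram entry is positive, any unit can follow any other, so the composite model can account for arbitrary unit sequences. That property is what lets Baum-Welch re-estimation run on speech without a transcript.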

Keywords :

hidden Markov models; neural nets; smoothing methods; speech recognition; unsupervised learning; Baum-Welch algorithm; Japanese phrases; all-phoneme ergodic hidden Markov network; allophonic acoustic models; bigram probabilities; context-dependent phone acoustic models; language; speech signals; stochastic language constraints; text database; text-unknown speech samples; unsupervised speaker adaptation; vector field smoothing; Context modeling; Feedback; Hidden Markov models; Loudspeakers; Markov random fields; Natural languages; Speech analysis; Stochastic processes; Stochastic systems; Training data;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389308

Filename :

389308

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290079