DocumentCode :
290080
Title :
Tree-structured speaker clustering for fast speaker adaptation
Author :
Kosaka, Tetsuo ; Sagayama, Shigeki
Author_Institution :
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
The paper proposes a tree-structured speaker clustering algorithm and discusses its application to fast speaker adaptation. By tracing the clustering tree from top to bottom, adaptation is performed step-by-step from global to local individuality of speech. This adaptation method employs successive branch selection in the speaker clustering tree rather than parameter training and hence achieves fast adaptation using only a small amount of training data. This speaker adaptation method was applied to a hidden Markov network (HMnet) and evaluated in Japanese phoneme and phrase recognition experiments, in which it significantly outperformed speaker-independent recognition methods. In the phrase recognition experiments, the method reduced the error rate by 26.6% using three phrase utterances (approximately 2.7 seconds)
Keywords :
hidden Markov models; speech recognition; tree data structures; Japanese; error rate; fast speaker adaptation; hidden Markov network; phoneme recognition; phrase recognition; speaker clustering tree; successive branch selection; three phrase utterances; training data; tree-structured speaker clustering algorithm; Clustering algorithms; Clustering methods; Error analysis; Hidden Markov models; Humans; Smoothing methods; Speech; Stochastic processes; Training data; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389309
Filename :
389309
Link To Document :
بازگشت