DocumentCode
290080
Title
Tree-structured speaker clustering for fast speaker adaptation
Author
Kosaka, Tetsuo ; Sagayama, Shigeki
Author_Institution
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
The paper proposes a tree-structured speaker clustering algorithm and discusses its application to fast speaker adaptation. By tracing the clustering tree from top to bottom, adaptation is performed step-by-step from global to local individuality of speech. This adaptation method employs successive branch selection in the speaker clustering tree rather than parameter training and hence achieves fast adaptation using only a small amount of training data. This speaker adaptation method was applied to a hidden Markov network (HMnet) and evaluated in Japanese phoneme and phrase recognition experiments, in which it significantly outperformed speaker-independent recognition methods. In the phrase recognition experiments, the method reduced the error rate by 26.6% using three phrase utterances (approximately 2.7 seconds)
Keywords
hidden Markov models; speech recognition; tree data structures; Japanese; error rate; fast speaker adaptation; hidden Markov network; phoneme recognition; phrase recognition; speaker clustering tree; successive branch selection; three phrase utterances; training data; tree-structured speaker clustering algorithm; Clustering algorithms; Clustering methods; Error analysis; Hidden Markov models; Humans; Smoothing methods; Speech; Stochastic processes; Training data; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389309
Filename
389309
Link To Document