مرکز منطقه ای اطلاع رساني علوم و فناوري - Tree-structured speaker clustering for fast speaker adaptation

DocumentCode :

290080

Title :

Tree-structured speaker clustering for fast speaker adaptation

Author :

Kosaka, Tetsuo ; Sagayama, Shigeki

Author_Institution :

ATR Interpreting Telephony Res. Labs., Kyoto, Japan

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

The paper proposes a tree-structured speaker clustering algorithm and discusses its application to fast speaker adaptation. By tracing the clustering tree from top to bottom, adaptation is performed step-by-step from global to local individuality of speech. This adaptation method employs successive branch selection in the speaker clustering tree rather than parameter training and hence achieves fast adaptation using only a small amount of training data. This speaker adaptation method was applied to a hidden Markov network (HMnet) and evaluated in Japanese phoneme and phrase recognition experiments, in which it significantly outperformed speaker-independent recognition methods. In the phrase recognition experiments, the method reduced the error rate by 26.6% using three phrase utterances (approximately 2.7 seconds)

Keywords :

hidden Markov models; speech recognition; tree data structures; Japanese; error rate; fast speaker adaptation; hidden Markov network; phoneme recognition; phrase recognition; speaker clustering tree; successive branch selection; three phrase utterances; training data; tree-structured speaker clustering algorithm; Clustering algorithms; Clustering methods; Error analysis; Hidden Markov models; Humans; Smoothing methods; Speech; Stochastic processes; Training data; Tree data structures;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389309

Filename :

389309

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290080