مرکز منطقه ای اطلاع رساني علوم و فناوري - Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

DocumentCode :

2022586

Title :

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

Author :

Kosaka, Tetsuo ; Takami, Junichi ; Sagayama, Shageki

Author_Institution :

ATR Interpreting Telephony Res. Lab., Soraku-gun, Kyoto, Japan

Volume :

fYear :

1993

fDate :

27-30 April 1993

Firstpage :

570

Abstract :

A speaker mixture principle that allows the creation of speaker-independent phone models is proposed. Speaker-tied training for rapid speaker adaptation using utterances shorter than one second is derived from this principle. The concept of speaker pruning is also introduced for reducing computational cost without degrading the speaker adaptation performance. The above principle is combined with context-dependent phone models, which have been automatically generated by the successive state splitting algorithm. In a Japanese phrase recognition experiment, speaker mixture allophone models achieved an error reduction of 29.0%, which is high in comparison with the conventional speaker-independent HMM (hidden Markov model)-LR method. Speaker adaptation by speaker-tied training attained an error reduction of 16.8% using a 0.6-s Japanese word utterance. Speaker pruning reduced the number of phone model mixtures by between 50% and 92% without lowering recognition performance.<>

Keywords :

adaptive systems; computational complexity; learning (artificial intelligence); speech recognition; Japanese phrase recognition; computational cost; context-dependent phone models; error reduction; speaker adaptation performance; speaker pruning; speaker-independent speech recognition; speaker-mixture allophone models; speaker-tied training; successive state splitting algorithm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on

Conference_Location :

Minneapolis, MN, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.1993.319371

Filename :

319371

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2022586