مرکز منطقه ای اطلاع رساني علوم و فناوري - Adaptive training using structured transforms

DocumentCode :

417161

Title :

Adaptive training using structured transforms

Author :

Yu, K. ; Gales, M.J.F.

Author_Institution :

Dept. of Eng., Cambridge Univ., UK

Volume :

fYear :

2004

fDate :

17-21 May 2004

Abstract :

Adaptive training is an important approach to training speech recognition systems on found, non-homogeneous data. The standard approach employs a single transform to represent unwanted acoustic variability. However, for found data there are commonly multiple acoustic factors affecting the speech signal. The paper investigates the use of multiple forms of transformations, structured transforms (ST), to represent the complex non-speech variabilities in an adaptive training framework. Two forms of transformation are considered, cluster mean interpolation and constrained MLLR; consequently, the canonical model here is a multi-cluster HMM model. Both ML and minimum phone error (MPE) reestimation formulae for the canonical model, are presented. This multi-cluster MPE training is also applicable to eigenvoice systems. Experiments to compare ST to standard adaptive training schemes were performed on a conversational telephone speech task. ST were found to reduce the word error rate significantly.

Keywords :

hidden Markov models; interpolation; learning (artificial intelligence); maximum likelihood estimation; natural languages; speech recognition; transforms; ML estimation; adaptive training; cluster mean interpolation; constrained MLLR; conversational telephone speech; eigenvoice systems; found data; minimum phone error estimation; nonhomogeneous data; speech recognition systems; structured transforms; unwanted acoustic variability; Acoustical engineering; Error analysis; Hidden Markov models; Interpolation; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Speech recognition; Telephony; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-8484-9

Type :

conf

DOI :

10.1109/ICASSP.2004.1325986

Filename :

1325986

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=417161