Regional Information Center for Science and Technology (RICeST) - Improving the performance of an LVCSR system through ensembles of acoustic models

DocumentCode :

394373

Title :

Improving the performance of an LVCSR system through ensembles of acoustic models

Author :

Zhang, Rong ; Rudnicky, Alexander I.

Author_Institution :

Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

fYear :

2003

fDate :

6-10 April 2003

Abstract :

This paper describes our work on applying ensembles of acoustic models to the problem of large vocabulary continuous speech recognition (LVCSR). We propose three algorithms for constructing ensembles. The first two have their roots in bagging; however, instead of randomly sampling examples, our algorithms construct training sets based on the word error rate. The third is a boosting-style algorithm. Unlike other boosting methods, which demand large computation and storage resources, our method offers a more efficient solution suited to acoustic model training. We also investigate a method that seeks an optimal combination of the models. We report experimental results on a large real-world corpus collected from the Carnegie Mellon Communicator dialog system. System performance improves significantly, with a relative reduction in word error rate of up to 15.56%.
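
The abstract relies on two quantities: the word error rate (WER) used to build training sets, and the relative WER reduction used to report results. The following is a minimal Python sketch of both, not the authors' implementation. The toy transcripts, the function names, and the threshold rule in `select_by_wer` are all assumptions made for illustration; the abstract does not say how the paper turns WER into a training set.

```python
def word_errors(reference, hypothesis):
    """Return (word-level edit distance, number of reference words)."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] is the edit distance between the reference prefix seen so far
    # and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # deletion
                curr[j - 1] + 1,         # insertion
                prev[j - 1] + (r != h),  # substitution, or match at no cost
            ))
        prev = curr
    return prev[-1], len(ref)


def corpus_wer(pairs):
    """WER over (reference, hypothesis) pairs: total errors / total reference words."""
    errors = words = 0
    for ref, hyp in pairs:
        e, n = word_errors(ref, hyp)
        errors += e
        words += n
    return errors / words


def relative_reduction(baseline_wer, new_wer):
    """Relative WER reduction, the kind of figure the abstract reports."""
    return (baseline_wer - new_wer) / baseline_wer


def select_by_wer(pairs, threshold):
    """Hypothetical selection rule: keep utterances whose own WER exceeds threshold."""
    selected = []
    for ref, hyp in pairs:
        e, n = word_errors(ref, hyp)
        if n > 0 and e / n > threshold:
            selected.append((ref, hyp))
    return selected


# Toy (reference, hypothesis) pairs, invented for this example only.
pairs = [
    ("i want a flight to boston", "i want a flight to boston"),
    ("show me fares to pittsburgh", "show me fairs to pittsburgh please"),
    ("leaving on monday morning", "leaving monday morning"),
]

print("corpus WER:", corpus_wer(pairs))
print("poorly recognized utterances:", len(select_by_wer(pairs, 0.0)))
print("relative reduction:", relative_reduction(0.20, 0.15))
```

The WER values passed to `relative_reduction` are placeholders, not results from the paper. Only the 15.56% relative figure comes from the abstract, which does not give the underlying absolute error rates.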

Keywords :

acoustic signal processing; learning (artificial intelligence); optimisation; signal classification; speech recognition; Carnegie Mellon Communicator dialog system; LVCSR system; acoustic model training; acoustic models; bagging algorithms; boosting style algorithm; classifiers; ensembles construction; large real world corpus; large vocabulary continuous speech recognition; optimal models; supervised learning; system performance; training sets; word error rate reduction; Bagging; Boosting; Computer science; Decoding; Error analysis; Probability distribution; Sampling methods; Speech recognition; System performance; Voting

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-7663-3

Type :

conf

DOI :

10.1109/ICASSP.2003.1198921

Filename :

1198921

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=394373