Regional Information Center for Science and Technology - Automatic generation and selection of multiple pronunciations for dynamic vocabularies

DocumentCode :

1749716

Title :

Automatic generation and selection of multiple pronunciations for dynamic vocabularies

Author :

Deligne, Sabine ; Maison, Benoit ; Gopinath, Ramesh

Author_Institution :

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

Volume :

fYear :

2001

fDate :

2001

Firstpage :

565

Abstract :

We present a scheme for the acoustic modeling of speech recognition applications requiring dynamic vocabularies. It applies especially to the acoustic modeling of out-of-vocabulary words that need to be added to a recognition lexicon based on the observation of a few (say one or two) speech utterances of these words. Standard approaches to this problem derive a single pronunciation from each speech utterance by combining acoustic and phone transition scores. In our scheme, multiple pronunciations are generated from each speech utterance of a word being enrolled by varying the relative weights assigned to the acoustic and phone transition models. In our experiments, the use of these multiple baseforms dramatically outperforms the standard approach, with a relative decrease of the word error rate ranging from 20% to 40% on all our test sets.
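The weighting idea in the abstract can be illustrated with a toy sketch. It is not the authors' decoder: the abstract does not describe the models, the search, or the weight values, so the frame-level Viterbi pass, the function names (`viterbi_phones`, `collapse`, `multiple_baseforms`), and the score form `acoustic + weight * transition` are all assumptions made for illustration. Each weight yields one phone string, and the distinct strings are kept as candidate baseforms.

```python
def viterbi_phones(acoustic_loglik, trans_logprob, weight):
    """Best per-frame phone path under score = acoustic + weight * transition.

    acoustic_loglik[t][j]: log-likelihood of phone j at frame t (assumed input).
    trans_logprob[i][j]:   log-probability of moving from phone i to phone j.
    """
    n_frames = len(acoustic_loglik)
    n_phones = len(acoustic_loglik[0])
    score = list(acoustic_loglik[0])
    back = []
    for t in range(1, n_frames):
        new_score, pointers = [], []
        for j in range(n_phones):
            # Best predecessor for phone j, with the transition model scaled by `weight`.
            best_i = max(range(n_phones),
                         key=lambda i: score[i] + weight * trans_logprob[i][j])
            new_score.append(score[best_i]
                             + weight * trans_logprob[best_i][j]
                             + acoustic_loglik[t][j])
            pointers.append(best_i)
        score = new_score
        back.append(pointers)
    # Trace back from the best final phone.
    state = max(range(n_phones), key=lambda j: score[j])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def collapse(path, phones):
    """Merge consecutive repeated frame labels into a phone string."""
    out = []
    for p in path:
        if not out or out[-1] != phones[p]:
            out.append(phones[p])
    return tuple(out)


def multiple_baseforms(acoustic_loglik, trans_logprob, phones, weights):
    """Decode once per weight and keep the distinct phone strings, in order."""
    seen = []
    for w in weights:
        baseform = collapse(viterbi_phones(acoustic_loglik, trans_logprob, w), phones)
        if baseform not in seen:
            seen.append(baseform)
    return seen
```

With a weight of zero the path follows the acoustic scores alone, and larger weights pull the result toward phone sequences the transition model prefers, so sweeping the weight can produce several different pronunciations from a single utterance.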

Keywords :

probability; speech processing; speech recognition; acoustic modeling; dynamic vocabularies; multiple pronunciations; out-of-vocabulary words; phone transition models; recognition lexicon; speech utterances; Acoustic applications; Engines; Error analysis; Keyboards; Robustness; Testing; Vocabulary

fLanguage :

English

Publisher :

IEEE

Conference_Title :

2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '01), Proceedings

Conference_Location :

Salt Lake City, UT

ISSN :

1520-6149

Print_ISBN :

0-7803-7041-4

Type :

conf

DOI :

10.1109/ICASSP.2001.940894

Filename :

940894

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1749716