مرکز منطقه ای اطلاع رساني علوم و فناوري - Large vocabulary continuous speech recognition of Wall Street Journal data

DocumentCode :

290364

Title :

Large vocabulary continuous speech recognition of Wall Street Journal data

Author :

Aubert, X. ; Dugast, C. ; Ney, H. ; Steinbiss, V.

Author_Institution :

Philips GmbH Res. Lab. Aachen, Germany

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

We report on recent developments of the Philips large vocabulary speech recognition system and on our experiments with the Wall Street Journal (WSJ) corpus. A two-pass decoding has been devised that allows an easy integration of more complex language models. First, a word lattice is produced using a time synchronous beam search with a bigram language model. Next, a higher-order language model is applied to the lattice at the phrase level. The conditions insuring the validity of this approach are explained and practical results for trigram demonstrate its usefulness. The main system development stages on WSJ data are presented and our final recognizers are evaluated on Nov. ´92 and Nov. ´93 test-data for both 5 K and 20 K vocabularies

Keywords :

decoding; dictation; speech recognition; vocabulary; Philips dictation system; WSJ data; Wall Street Journal data; bigram language model; higher-order language model; language models; large vocabulary continuous speech recognition; time synchronous beam search; two-pass decoding; word lattice; Acoustic beams; Acoustic testing; Decoding; Hidden Markov models; Laboratories; Lattices; Speech recognition; System testing; Vectors; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389702

Filename :

389702

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290364