Regional Information Center for Science and Technology (RICEST) - Comparative experiments on large vocabulary speech recognition

DocumentCode :

290013

Title :

Comparative experiments on large vocabulary speech recognition

Author :

Kubala, Francis ; Anastasakos, Anastasios ; Makhoul, John ; Nguyen, Long ; Schwartz, Richard ; Zavaliagkos, George

Author_Institution :

BBN Syst. & Technol., Cambridge, MA, USA

Volume :

Year :

1994

Date :

19-22 Apr 1994

Abstract :

We describe recent changes to the BYBLOS system's training and recognition algorithms and report on numerous experiments in large vocabulary speech recognition. In earlier work, we performed five key experiments designed to answer questions about different training scenarios. We investigated (1) the effect of varying the number of training speakers while the total amount of training data remains constant, (2) data pooling versus model averaging for generating speaker-independent (SI) HMMs, (3) the benefit of doubling the acoustic training data, (4) SI versus speaker-dependent (SD) performance when the SI training data is twelve times greater, and (5) the effect of cross-domain training for both the acoustic and language models. Our recent work focused on four specific problem areas that share a common thread: the test condition exposes the recognizer to phenomena not observed in the training data. Here we investigated (1) words outside the vocabulary, (2) spoken language effects due to subject variability and spontaneous dictation, (3) non-native dialects of the language, and (4) new microphones not used in training.

Keywords :

hidden Markov models; learning (artificial intelligence); natural languages; neural nets; speech recognition; speech recognition equipment; vocabulary; BYBLOS system; acoustic training data; comparative experiments; cross-domain training; data pooling; language models; large vocabulary speech recognition; microphones; model averaging; non-native dialects; recognition algorithms; sequential neural networks; speaker dependent performance; speaker-independent HMM; spoken language effects; spontaneous dictation; subject variability; training algorithms; Context modeling; Decoding; Hidden Markov models; Microphones; Natural languages; Performance evaluation; Speech recognition; Testing; Training data; Vocabulary;

Language :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389232

Filename :

389232

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290013