DocumentCode :
394353
Title :
Multiple task-domain acoustic models
Author :
Ljolje, Andrej
Author_Institution :
AT&T Labs.-Res., USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Many speech recognition applications require the recognizer to perform at peak recognition accuracy across many different domains. Examples of different domains are general English, digits, names, alphabet, etc. Here we show a way to preserve the simplicity of a single acoustic model while providing domain specific recognition speed and accuracy. This is achieved by employing an extended phoneme set that keeps a subset of phonemes specifically for a particular domain, and a context dependency specification that allows cross-word, cross-domain phonetic context dependencies. Testing on a names recognition task going from a wrong domain (general English) model to a multiple domain model (general English, alphabet, names) the error rate is reduced by more than 50%. Domain-specific model trained only on the names data further reduces the error rate by more than 50%.
Keywords :
error statistics; speech processing; speech recognition; accuracy; alphabet; context dependency specification; cross-domain dependencies; cross-word dependencies; domain specific recognition speed; error rate reduction; extended phoneme set; general English model; multiple task-domain acoustic models; names; names recognition task; phoneme subset; phonetic context dependencies; speech recognition applications; Acoustic testing; Error analysis; Hidden Markov models; Loudspeakers; Management training; Microphones; Natural languages; Speech recognition; Telephony; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198897
Filename :
1198897
Link To Document :
بازگشت