DocumentCode :
2229397
Title :
A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition
Author :
Aubert, Xavier ; Beyerlein, Peter ; Ullrich, Meinhard
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Volume :
1
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
14
Abstract :
Presents an extension of bottom-up state-tying towards improved handling of unseen triphones. As opposed to the usual backing-off to diphones and monophones, the current method aims at finding a triphone model that has proven to exhibit some similarity with the unseen triphone. It is based on a probabilistic mapping of unseen contexts to clusters of triphone states observed in the training data. This algorithm has been applied to dictation tasks for three languages with vocabulary sizes ranging from 20k to 64k. The results compare favorably with those obtained using standard back-off rules. This technique also offers an alternative to top-down decision-tree procedures which are frequently used, especially for their generalization capabilities
Keywords :
dictation; hidden Markov models; probability; speech recognition; vocabulary; back-off rules; backing-off; bottom-up state-tying; dictation tasks; generalization capabilities; languages; large-vocabulary continuous speech recognition; probabilistic mapping; similarity; top-down decision-tree procedures; training data; triphone state clusters; unseen triphones; vocabulary size; Buildings; Clustering algorithms; Context modeling; Databases; Decision trees; Decoding; Laboratories; Speech recognition; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.606918
Filename :
606918
Link To Document :
بازگشت