DocumentCode :
1467731
Title :
Predicting unseen triphones with senones
Author :
Hwang, Mei-Yuh ; Huang, Xuedong ; Alleva, Fileno A.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
4
Issue :
6
fYear :
1996
fDate :
11/1/1996 12:00:00 AM
Firstpage :
412
Lastpage :
419
Abstract :
In large-vocabulary speech recognition, we often encounter triphones that are not covered in the training data. These unseen triphones are usually backed off to their corresponding diphones or context-independent phones, which contain less context yet have plenty of training examples. We propose to use decision-tree-based senones to generate needed senonic baseforms for these unseen triphones. A decision tree is built for each Markov state of each base phone; the leaves of the trees constitute the senone pool. To find the senone associated with a Markov state of any triphone, the corresponding tree is traversed until a leaf node is reached. The effectiveness of the proposed approach was demonstrated in the ARPA 5000-word speaker-independent Wall Street Journal dictation task. The word error rate was reduced by 11% when unseen triphones were modeled by the decision-tree-based senones instead of context-independent phones. When there were more than five unseen triphones in each test utterance, the error rate reduction was more than 20%
Keywords :
Markov processes; dictation; speech recognition; trees (mathematics); ARPA 5000-word dictation task; Markov state; context-independent phones; decision-tree-based senones; diphones; large-vocabulary speech recognition; leaf node; senone pool; senonic baseforms; speaker-independent Wall Street Journal dictation; test utterance; training data; triphone; unseen triphones prediction; word error rate; Associate members; Classification tree analysis; Context modeling; Decision trees; Error analysis; Hidden Markov models; Predictive models; Speech recognition; Testing; Training data;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.544526
Filename :
544526
Link To Document :
بازگشت