DocumentCode
3486971
Title
Filler model based confidence measures for spoken dialog systems
Author
Akyol, Aydin ; Erdogan, Hakan
Author_Institution
Sabanci Universitesi, Istanbul, Turkey
fYear
2004
fDate
28-30 April 2004
Firstpage
552
Lastpage
555
Abstract
Because of the inadequate performance of speech recognition systems, an accurate confidence scoring mechanism should be employed to understand user requests correctly. To determine a confidence score for a hypothesis, certain confidence features are combined. The performance of filler model based confidence features are investigated. Five types of filler model networks were defined: triphone-network, phone-network, phone-class network, 5-state catch-all model and 3-state catch-all model. First, all the models were evaluated in a Turkish speech recognition task in terms of their ability to tag correctly (recognition-error or correct) recognition hypotheses. The best performance was obtained from the triphone recognition network. Then, the performance of reliable combinations of these models was investigated and it was observed that certain combinations of filler models could significantly improve the accuracy of the confidence annotation.
Keywords
interactive systems; natural language interfaces; speech recognition; 3-state catch-all model; 5-state catch-all model; Turkish speech recognition task; confidence score; filler model based confidence measures; phone-class network; phone-network; recognition hypotheses; spoken dialog systems; triphone-network; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
Print_ISBN
0-7803-8318-4
Type
conf
DOI
10.1109/SIU.2004.1338588
Filename
1338588
Link To Document