DocumentCode :
3486971
Title :
Filler model based confidence measures for spoken dialog systems
Author :
Akyol, Aydin ; Erdogan, Hakan
Author_Institution :
Sabanci Universitesi, Istanbul, Turkey
fYear :
2004
fDate :
28-30 April 2004
Firstpage :
552
Lastpage :
555
Abstract :
Because of the inadequate performance of speech recognition systems, an accurate confidence scoring mechanism should be employed to understand user requests correctly. To determine a confidence score for a hypothesis, certain confidence features are combined. The performance of filler model based confidence features are investigated. Five types of filler model networks were defined: triphone-network, phone-network, phone-class network, 5-state catch-all model and 3-state catch-all model. First, all the models were evaluated in a Turkish speech recognition task in terms of their ability to tag correctly (recognition-error or correct) recognition hypotheses. The best performance was obtained from the triphone recognition network. Then, the performance of reliable combinations of these models was investigated and it was observed that certain combinations of filler models could significantly improve the accuracy of the confidence annotation.
Keywords :
interactive systems; natural language interfaces; speech recognition; 3-state catch-all model; 5-state catch-all model; Turkish speech recognition task; confidence score; filler model based confidence measures; phone-class network; phone-network; recognition hypotheses; spoken dialog systems; triphone-network; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
Print_ISBN :
0-7803-8318-4
Type :
conf
DOI :
10.1109/SIU.2004.1338588
Filename :
1338588
Link To Document :
بازگشت