Title :
Filler model based confidence measures for spoken dialog systems
Author :
Akyol, Aydin ; Erdogan, Hakan
Author_Institution :
Sabanci Universitesi, Istanbul, Turkey
Abstract :
Because of the inadequate performance of speech recognition systems, an accurate confidence scoring mechanism should be employed to understand user requests correctly. To determine a confidence score for a hypothesis, certain confidence features are combined. The performance of filler model based confidence features are investigated. Five types of filler model networks were defined: triphone-network, phone-network, phone-class network, 5-state catch-all model and 3-state catch-all model. First, all the models were evaluated in a Turkish speech recognition task in terms of their ability to tag correctly (recognition-error or correct) recognition hypotheses. The best performance was obtained from the triphone recognition network. Then, the performance of reliable combinations of these models was investigated and it was observed that certain combinations of filler models could significantly improve the accuracy of the confidence annotation.
Keywords :
interactive systems; natural language interfaces; speech recognition; 3-state catch-all model; 5-state catch-all model; Turkish speech recognition task; confidence score; filler model based confidence measures; phone-class network; phone-network; recognition hypotheses; spoken dialog systems; triphone-network; Speech recognition;
Conference_Titel :
Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
Print_ISBN :
0-7803-8318-4
DOI :
10.1109/SIU.2004.1338588