DocumentCode :
2659599
Title :
Discriminative learning using linguistic features to rescore n-best speech hypotheses
Author :
Georgescul, Maria ; Rayner, Manny ; Bouillon, Pierrette ; Tsourakis, Nikos
Author_Institution :
ISSCO/TIM, ETI, Univ. of Geneva, Geneva
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
97
Lastpage :
100
Abstract :
We describe how we were able to improve the accuracy of a medium-vocabulary spoken dialog system by rescoring the list of n-best recognition hypotheses using a combination of acoustic, syntactic, semantic and discourse information. The non-acoustic features are extracted from different intermediate processing results produced by the natural language processing module, and automatically filtered. We apply discriminative support vector learning designed for re-ranking, using both word error rate and semantic error rate as ranking target value, and evaluating using five-fold cross-validation; to show robustness of our method, confidence intervals for word and semantic error rates are computed via bootstrap sampling. The reduction in semantic error rate, from 19% to 11%, is statistically significant at 0.01 level.
Keywords :
learning (artificial intelligence); natural language processing; speech recognition; support vector machines; discriminative support vector learning; linguistic features; medium-vocabulary spoken dialog system; n-best speech recognition hypotheses; natural language processing; Automatic speech recognition; Calendars; Data mining; Databases; Error analysis; Feature extraction; Natural languages; Power system modeling; Speech recognition; Vocabulary; natural language; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
Type :
conf
DOI :
10.1109/SLT.2008.4777849
Filename :
4777849
Link To Document :
بازگشت