DocumentCode :
2330733
Title :
Hypotheses selection for re-ranking semantic annotations
Author :
Dinarelli, Marco ; Moschitti, Alessandro ; Riccardi, Giuseppe
Author_Institution :
LIMSI, Univ. of Trento, Trento, Italy
fYear :
2010
fDate :
12-15 Dec. 2010
Firstpage :
407
Lastpage :
411
Abstract :
Discriminative reranking has been successfully used for several tasks of Natural Language Processing (NLP). Recently it has been applied also to Spoken Language Understanding, imrpoving state-of-the-art for some applications. However, such proposed models can be further improved by considering: (i) a better selection of the initial n-best hypotheses to be re-ranked and (ii) the use of a strategy that decides when the reranking model should be used, i.e. in some cases only the basic approach should be applied. In this paper, we apply a semantic inconsistency metric to select the n-best hypotheses from a large set generated by an SLU basic system. Then we apply a state-of-the-art re-ranker based on the Partial Tree Kernel (PTK), which encodes SLU hypotheses in Support Vector Machines (SVM) with complex structured features. Finally, we apply a decision model based on confidence values to select between the first hypothesis provided by the basic SLU model and the first hypothesis provided by the re-ranker. We show the effectiveness of our approach presenting comparative results obtained by reranking hypotheses generated by two very different models: a simple Stochastic Language Model encoded in Finite State Machines (FSM) and a Conditional Random Field (CRF) model. We evaluate our approach on the French MEDIA corpus and on an Italian corpus acquired in the European Project LUNA. The results show a significant improvement with respect to the current state-of-the-art and previous re-ranking models.
Keywords :
encoding; finite state machines; natural language processing; stochastic processes; support vector machines; trees (mathematics); European Project LUNA; French MEDIA corpus; Italian corpus; SLU hypotheses; conditional random field model; discriminative re-ranking; encoding; finite state machines; n-best hypotheses selection; natural language processing; partial tree kernel; semantic inconsistency metric; spoken language understanding; stochastic language model; support vector machines; Discriminative Reranking; Kernel Methods; Spoken Language Understanding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2010 IEEE
Conference_Location :
Berkeley, CA
Print_ISBN :
978-1-4244-7904-7
Electronic_ISBN :
978-1-4244-7902-3
Type :
conf
DOI :
10.1109/SLT.2010.5700887
Filename :
5700887
Link To Document :
بازگشت