مرکز منطقه ای اطلاع رساني علوم و فناوري - Hypotheses selection for re-ranking semantic annotations

DocumentCode :

2330733

Title :

Hypotheses selection for re-ranking semantic annotations

Author :

Dinarelli, Marco ; Moschitti, Alessandro ; Riccardi, Giuseppe

Author_Institution :

LIMSI, Univ. of Trento, Trento, Italy

fYear :

2010

fDate :

12-15 Dec. 2010

Firstpage :

407

Lastpage :

411

Abstract :

Discriminative reranking has been successfully used for several tasks of Natural Language Processing (NLP). Recently it has been applied also to Spoken Language Understanding, imrpoving state-of-the-art for some applications. However, such proposed models can be further improved by considering: (i) a better selection of the initial n-best hypotheses to be re-ranked and (ii) the use of a strategy that decides when the reranking model should be used, i.e. in some cases only the basic approach should be applied. In this paper, we apply a semantic inconsistency metric to select the n-best hypotheses from a large set generated by an SLU basic system. Then we apply a state-of-the-art re-ranker based on the Partial Tree Kernel (PTK), which encodes SLU hypotheses in Support Vector Machines (SVM) with complex structured features. Finally, we apply a decision model based on confidence values to select between the first hypothesis provided by the basic SLU model and the first hypothesis provided by the re-ranker. We show the effectiveness of our approach presenting comparative results obtained by reranking hypotheses generated by two very different models: a simple Stochastic Language Model encoded in Finite State Machines (FSM) and a Conditional Random Field (CRF) model. We evaluate our approach on the French MEDIA corpus and on an Italian corpus acquired in the European Project LUNA. The results show a significant improvement with respect to the current state-of-the-art and previous re-ranking models.

Keywords :

encoding; finite state machines; natural language processing; stochastic processes; support vector machines; trees (mathematics); European Project LUNA; French MEDIA corpus; Italian corpus; SLU hypotheses; conditional random field model; discriminative re-ranking; encoding; finite state machines; n-best hypotheses selection; natural language processing; partial tree kernel; semantic inconsistency metric; spoken language understanding; stochastic language model; support vector machines; Discriminative Reranking; Kernel Methods; Spoken Language Understanding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Spoken Language Technology Workshop (SLT), 2010 IEEE

Conference_Location :

Berkeley, CA

Print_ISBN :

978-1-4244-7904-7

Electronic_ISBN :

978-1-4244-7902-3

Type :

conf

DOI :

10.1109/SLT.2010.5700887

Filename :

5700887

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2330733