DocumentCode
1220078
Title
Discriminative Training of the Hidden Vector State Model for Semantic Parsing
Author
Zhou, Deyu ; He, Yulan
Author_Institution
Inf. Res. Centre, Univ. of Reading, Reading
Volume
21
Issue
1
fYear
2009
Firstpage
66
Lastpage
77
Abstract
In this paper, we discuss how discriminative training can be applied to the hidden vector state (HVS) model in different task domains. The HVS model is a discrete hidden Markov model (HMM) in which each HMM state represents the state of a push-down automaton with a finite stack size. In previous applications, maximum-likelihood estimation (MLE) is used to derive the parameters of the HVS model. However, MLE makes a number of assumptions and unfortunately some of these assumptions do not hold. Discriminative training, without making such assumptions, can improve the performance of the HVS model by discriminating the correct hypothesis from the competing hypotheses. Experiments have been conducted in two domains: the travel domain for the semantic parsing task using the DARPA Communicator data and the Air Travel Information Services (ATIS) data and the bioinformatics domain for the information extraction task using the GENIA corpus. The results demonstrate modest improvements of the performance of the HVS model using discriminative training. In the travel domain, discriminative training of the HVS model gives a relative error reduction rate of 31 percent in F-measure when compared with MLE on the DARPA Communicator data and 9 percent on the ATIS data. In the bioinformatics domain, a relative error reduction rate of 4 percent in F-measure is achieved on the GENIA corpus.
Keywords
grammars; hidden Markov models; information retrieval; learning (artificial intelligence); natural language processing; ATIS; Air Travel Information Services; DARPA Communicator data; GENIA corpus; HMM; HVS; MLE; discrete hidden Markov model; discriminative training; hidden vector state model; information extraction task; maximum-likelihood estimation; push-down automaton; semantic parsing; Language parsing and understanding; Machine learning; Parameter learning;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2008.95
Filename
4522548
Link To Document