DocumentCode :
1796958
Title :
Learning latent variable grammars from complementary perspectives
Author :
Dongchen Li ; Xiantao Zhang ; Xihong Wu
Author_Institution :
Key Lab. of Machine Perception & Intell., Peking Univ., Beijing, China
fYear :
2014
fDate :
9-13 July 2014
Firstpage :
124
Lastpage :
128
Abstract :
The corpus for training a parser consists of sentences of heterogeneous grammar usages. Previous parser domain adaptation work has concentrated on adaptation to the shifts in vocabulary rather than grammar usage. In this paper, we focus on exploiting the diversity of training date separately and then accumulates their advantages. We propose an approach that grammar is biased toward relevant syntactic style, and the complementary grammar usage are combined for inference. Multiple grammars with partly complementary points of strength are induced individually. They capture complementary data representation, and we accumulates their advantages in a joint model to assemble the complementary depicting powers. Despite its compatibility with many other methods, out product model achieves 85.20% F1 score on Penn Chinese Treebank, higher than previous systems.
Keywords :
grammars; learning (artificial intelligence); natural language processing; F1 score; Penn Chinese Treebank; complementary data representation; complementary grammar usage; complementary perspectives; heterogeneous grammar usage sentences; inference; joint model; latent variable grammar learning; parser training corpus; partly-complementary points; syntactic style; training date; Analytical models; Computational linguistics; Grammar; Mathematical model; Merging; Pragmatics; Syntactics; PCFGLA; Parsing; sparsity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
Type :
conf
DOI :
10.1109/ChinaSIP.2014.6889215
Filename :
6889215
Link To Document :
بازگشت