DocumentCode
2348886
Title
A reranking method for syntactic parsing with heterogeneous treebanks
Author
Ding, Haibo ; Zhu, Muhua ; Zhu, Jingbo
Author_Institution
Natural Language Process. Lab., Northeastern Univ., Shenyang, China
fYear
2010
fDate
21-23 Aug. 2010
Firstpage
1
Lastpage
4
Abstract
In the field of natural language processing (NLP), there often exist multiple corpora with different annotation standards for the same task. In this paper, we take syntactic parsing as a case study and propose a reranking method which is able to make direct use of disparate treebanks simultaneously without using techniques such as treebank conversion. The method proceeds in three steps: 1) build parsers on individual treebanks; 2) use parsers independently to generate n-best lists for each sentence in test set; 3) rerank individual n-best lists which correspond to the same sentence by using consensus information exchanged among these n-best lists. Experimental results on two open Chinese treebanks show that our method significantly outperforms the baseline system by 0.84% and 0.53% respectively.
Keywords
natural language processing; tree data structures; Chinese treebanks; annotation standards; disparate treebanks; heterogeneous treebanks; natural language processing; reranking method; syntactic parsing; treebank conversion; Accuracy; Artificial neural networks; Equations; Standards; Syntactic parsing; heterogeneous treebanks; reranking;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-6896-6
Type
conf
DOI
10.1109/NLPKE.2010.5587842
Filename
5587842
Link To Document