DocumentCode :
991213
Title :
Ambiguity resolution analysis in incremental parsing of natural language
Author :
Costa, Fabrizio ; Frasconi, Paolo ; Lombardo, Vincenzo ; Sturt, Patrick ; Soda, Giovanni
Author_Institution :
Dipt. di Sistemi e Informatica, Univ. di Firenze, Italy
Volume :
16
Issue :
4
fYear :
2005
fDate :
7/1/2005 12:00:00 AM
Firstpage :
959
Lastpage :
971
Abstract :
Incremental parsing gains its importance in natural language processing and psycholinguistics because of its cognitive plausibility. Modeling the associated cognitive data structures, and their dynamics, can lead to a better understanding of the human parser. In earlier work, we have introduced a recursive neural network (RNN) capable of performing syntactic ambiguity resolution in incremental parsing. In this paper, we report a systematic analysis of the behavior of the network that allows us to gain important insights about the kind of information that is exploited to resolve different forms of ambiguity. In attachment ambiguities, in which a new phrase can be attached at more than one point in the syntactic left context, we found that learning from examples allows us to predict the location of the attachment point with high accuracy, while the discrimination amongst alternative syntactic structures with the same attachment point is slightly better than making a decision purely based on frequencies. We also introduce several new ideas to enhance the architectural design, obtaining significant improvements of prediction accuracy, up to 25% error reduction on the same dataset used in previous work. Finally, we report large scale experiments on the entire Wall Street Journal section of the Penn Treebank. The best prediction accuracy of the model on this large dataset is 87.6%, a relative error reduction larger than 50% compared to previous results.
Keywords :
cognitive systems; data structures; grammars; incremental compilers; linguistics; natural languages; neural nets; very large databases; Penn Treebank; Wall Street Journal; ambiguity resolution analysis; associated cognitive data structures; cognitive plausibility; incremental parsing; large dataset; natural language processing; prediction accuracy; psycholinguistics; recursive neural network; Accuracy; Data structures; Frequency; Humans; Information analysis; Natural language processing; Natural languages; Neural networks; Psychology; Recurrent neural networks; First-pass attachment; incremental parsing; learning preferences; recursive neural networks (RNNs); structured data; Algorithms; Linguistics; Natural Language Processing; Neural Networks (Computer); Pattern Recognition, Automated; Vocabulary, Controlled;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2005.849837
Filename :
1461437
Link To Document :
بازگشت