Title :
Building deep dependency structure from partial parses
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
Abstract :
Increasing the domain of locality by using tree-adjoining-grammars (TAG) encourages some researchers to use it as a modeling formalism in their language application. But parsing with a rich grammar like TAG faces two main obstacles: low parsing speed and a lot of ambiguous syntactical parses. We uses an idea of the shallow parsing based on a statistical approach in TAG formalism, named supertagging, which enhanced the standard POS tags in order to employ the syntactical information about the sentence. In this paper, an error-driven method in order to approaching a full parse from the partial parses based on TAG formalism is presented. These partial parses are basically resulted from supertagger which is followed by a simple heuristic based light parser named light weight dependency analyzer (LDA). Like other error driven methods, the process of generation the deep parses can be divided into two different phases: error detection and error correction, which in each phase, different completion heuristics applied on the partial parses. The experiments on Penn Treebank show considerable improvements in the parsing time and disambiguation process.
Keywords :
grammars; natural language processing; text analysis; Penn Treebank; ambiguous syntactical parses; deep dependency structure; error correction; error detection; error-driven method; light weight dependency analyzer; partial parses; syntactical information; tree-adjoining-grammars; Application software; Buildings; Error correction; Information retrieval; Linear discriminant analysis; Natural languages; Phase detection; Speech; State-space methods; Technical Activities Guide -TAG;
Conference_Titel :
Computer Conference, 2009. CSICC 2009. 14th International CSI
Conference_Location :
Tehran
Print_ISBN :
978-1-4244-4261-4
Electronic_ISBN :
978-1-4244-4262-1
DOI :
10.1109/CSICC.2009.5349409