DocumentCode :
3335048
Title :
Dependency parsing for Indonesian
Author :
Kamayani, M. ; Purwarianti, Ayu
Author_Institution :
Sch. of Electr. Eng. & Inf., Bandung Inst. of Technol., Bandung, Indonesia
fYear :
2011
fDate :
17-19 July 2011
Firstpage :
1
Lastpage :
5
Abstract :
Dependency parsing has gained many focus lately for its many advantages over constituency-based. Dependency parsing uses dependency grammar. The observation which drives dependency grammar is a simple one: In a sentence, all but one word depends on other words. The one word that does not depend on any other is called the root of the sentence. A word depends on another either if it is a complement or a modifier of the latter. This paper tried to build dependency grammar for Indonesian. It presents a parsing algorithm of Covington, M. (1990) in Prolog using GULP, based on dependency grammar. It parses simple declarative sentences of Indonesian without handling subordinate and coordinate clause. GULP (Graph Unification Logic Programming) is an extension to Prolog that facilitates the implementation of unification-based grammar. Input of the parser is list of words (sentence) and the output is dependency tree. The system contains of lexicon, dependency rule and parsing algorithm. There are about 30 dependency rules that applied to this parser, some of them are adapted from Stanford Dependency Label because Indonesian has some similarities with English. Some Indonesian dependency rules have typical which is the root of the sentence is the main verb, and for verb phrase or noun phrase, the head locates in front of its dependents. The parser is tested with 20 sentences and resulted in valid dependency trees. It needs to be improved in parsing complex sentences with subordinate clauses and coordinate clauses. This limitation can be overcome by adding new rules to the dependency grammar and enriching feature structures.
Keywords :
grammars; logic programming; natural language processing; GULP; Indonesian dependency parsing; Prolog; declarative sentences; dependency grammar; graph unification logic programming; noun phrase; unification based grammar; verb phrase; Electrical engineering; Grammar; Inference algorithms; Informatics; Logic programming; Manuals; Speech; GULP; Indonesian parser; dependency parsing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical Engineering and Informatics (ICEEI), 2011 International Conference on
Conference_Location :
Bandung
ISSN :
2155-6822
Print_ISBN :
978-1-4577-0753-7
Type :
conf
DOI :
10.1109/ICEEI.2011.6021552
Filename :
6021552
Link To Document :
بازگشت