Title :
A Study of Parsing Process on Natural Language Processing in Bahasa Indonesia
Author :
Sibarani, Elisa Margareth ; Nadial, Mhd ; Panggabean, Evy ; Meryana, S.
Author_Institution :
Inf. Eng. Study Program, Del Inst. of Technol., Sitoluama, Indonesia
Abstract :
Research on Natural Language Processing (NLP) in Indonesian is still limited and the results of available research that can be used for further research are also limited. In a series of natural language processing, the initial step is parsing the sentence in a particular language based on the grammar in order to help understanding the meaning of a sentence. This research aims to produce a simulation of Indonesian parser by adapting the process which was conducted by using Collins Algorithm. The three main stages are: 1) preprocessing to generate corpus and events files, 2) lexical analysis to convert the corpus into tokens, and 3) syntax analysis to build parse tree that requires file events to calculate the probability of the grammar by count the occurrence frequency on file events to determine the best sentence trees. An evaluation was performed to the parser using 30 simple sentences and the outcomes were able to generate a corpus file, file events, parse-tree and probability calculations. Nevertheless some sentences could not be parsed completely true because of the limitations of the Tree bank file in Indonesian. Some future works are to develop complete and valid Tree bank and Lexicon files.
Keywords :
grammars; natural language processing; probability; trees (mathematics); Bahasa Indonesia; Collins algorithm; Indonesian parser; NLP; corpus file; file events; grammar; lexical analysis; natural language processing; parse tree; parse-tree; parsing process; probability calculation; sentence tree; syntax analysis; tree bank file; Adaptation models; Dictionaries; Grammar; Natural language processing; Probabilistic logic; Probability; Production; Lexical Analysis; Natural Language Processing (NLP); Parser; Preprocessing; Syntax Analysis;
Conference_Titel :
Computational Science and Engineering (CSE), 2013 IEEE 16th International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/CSE.2013.56