DocumentCode :
2909784
Title :
Natural Language Grammar Induction of Indonesian Language Corpora Using Genetic Algorithm
Author :
Hermawan, Arya Tandy ; Gunawan, G. ; Santoso, Joan
Author_Institution :
Dept. of Comput. Sci., Sekolah Tinggi Teknik Surabaya, Surabaya, Indonesia
fYear :
2011
fDate :
15-17 Nov. 2011
Firstpage :
15
Lastpage :
18
Abstract :
Grammar induction is a machine learning process for learning a grammar from corpora. This paper discusses grammar induction for Indonesian language corpora using a genetic algorithm. The grammar production rules are modeled as chromosomes, and the fitness function counts how many sentences can be parsed. The data are Indonesian fairy tales such as "Bawang Merah Bawang Putih" and "Malin Kundang". The paper explains in detail the steps of each process applied to the natural language grammar problem.
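Illustrative sketch (not taken from the paper): the abstract names only two design elements, production rules encoded as chromosomes and a fitness function that counts parsable sentences. The Python below shows one way those two pieces could fit together. Everything else is an assumption made for illustration: the tag set, the nonterminals, the restriction to binary rules, the CYK recognizer, and the selection, crossover, and mutation settings.

```python
import random

# Hypothetical part-of-speech tags and nonterminals; not from the paper.
TERMINALS = ["noun", "verb", "adj"]
NONTERMINALS = ["S", "A", "B"]
SYMBOLS = NONTERMINALS + TERMINALS


def random_rule(rng):
    """One binary production (head, left, right), read as head -> left right."""
    return (rng.choice(NONTERMINALS), rng.choice(SYMBOLS), rng.choice(SYMBOLS))


def random_chromosome(rng, n_rules=6):
    """A chromosome is a fixed-length list of production rules."""
    return [random_rule(rng) for _ in range(n_rules)]


def parses(rules, tags):
    """CYK-style recognition: can S derive the whole tag sequence?"""
    n = len(tags)
    if n == 0:
        return False
    # table[i][j] holds the symbols that derive tags[i : i + j + 1].
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, tag in enumerate(tags):
        table[i][0].add(tag)
    for span in range(2, n + 1):
        for start in range(n - span + 1):
            for split in range(1, span):
                left = table[start][split - 1]
                right = table[start + split][span - split - 1]
                for head, l, r in rules:
                    if l in left and r in right:
                        table[start][span - 1].add(head)
    return "S" in table[0][n - 1]


def fitness(rules, corpus):
    """Fitness as described in the abstract: number of sentences parsed."""
    return sum(parses(rules, sentence) for sentence in corpus)


def evolve(corpus, pop_size=30, generations=40, mutation_rate=0.1, seed=0):
    """Assumed GA loop: keep the better half, one-point crossover, mutation."""
    rng = random.Random(seed)
    population = [random_chromosome(rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda c: fitness(c, corpus), reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            mom, dad = rng.sample(parents, 2)
            cut = rng.randrange(1, len(mom))
            child = mom[:cut] + dad[cut:]
            # Replace each rule with a fresh random one at the mutation rate.
            child = [random_rule(rng) if rng.random() < mutation_rate else rule
                     for rule in child]
            children.append(child)
        population = parents + children
    return max(population, key=lambda c: fitness(c, corpus))
```

A call such as `evolve([["noun", "verb"], ["noun", "verb", "noun"]])` returns the best rule list found for a toy corpus of tag sequences. Real input would need a tagged version of the fairy tale corpora, which this sketch does not include.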
Keywords :
genetic algorithms; grammars; learning (artificial intelligence); natural language processing; Bawang Merah Bawang Putih; Indonesian fairy tales stories; Indonesian language corpora; Malin Kundang; chromosomes; genetic algorithm; grammar learning; grammar production rules; machine learning process; natural language grammar induction; Biological cells; Genetic algorithms; Grammar; Natural languages; Production; Testing; Training; Genetic Algorithm; Grammar Induction; Indonesian Language; Natural Language Processing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
Type :
conf
DOI :
10.1109/IALP.2011.58
Filename :
6121459