• DocumentCode
    2909784
  • Title

    Natural Language Grammar Induction of Indonesian Language Corpora Using Genetic Algorithm

  • Author

    Hermawan, Arya Tandy ; Gunawan, G. ; Santoso, Joan

  • Author_Institution
    Dept. of Comput. Sci., Sekolah Tinggi Teknik Surabaya, Surabaya, Indonesia
  • fYear
    2011
  • fDate
    15-17 Nov. 2011
  • Firstpage
    15
  • Lastpage
    18
  • Abstract
    Grammar Induction is a machine learning process for learning grammar from corpora. This paper will discuss the process of grammar induction for Indonesian language corpora using genetic algorithm. The Grammar production rules will be modeled in the form of chromosomes. The fitness function is used to count how many sentences can be parsed. The data used are Indonesian fairy tales stories such as "Bawang Merah Bawang Putih" and "Malin Kundang". This paper describes the detailed explanations about the steps of each process carried out for natural language grammar problems.
  • Keywords
    genetic algorithms; grammars; learning (artificial intelligence); natural language processing; Bawang Merah Bawang Putih; Indonesian fairy tales stories; Indonesian language corpora; Malin Kundang; chromosomes; genetic algorithm; grammar learning; grammar production rules; machine learning process; natural language grammar induction; Biological cells; Genetic algorithms; Grammar; Natural languages; Production; Testing; Training; Genetic Algorithm; Grammar Induction; Indonesian Language; Natural Language Processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2011 International Conference on
  • Conference_Location
    Penang
  • Print_ISBN
    978-1-4577-1733-8
  • Type

    conf

  • DOI
    10.1109/IALP.2011.58
  • Filename
    6121459