Title :
From Raw Text to Morphological Rules for Iban Morphological Analyser
Author :
Saee, Suhaila ; Lay-Ki Soon ; Tek-Yong Lim ; Ranaivo-Malancon, Bali ; Tang, Enya Kong
Author_Institution :
Fac. of Comput. & Inf., Multimedia Univ., Cyberjaya, Malaysia
Abstract :
To extend a complete workflow of automatic acquisition of morphological rules for morphological analyser, we propose a semi-automatic workflow for under-resourced language, which is Iban language. The workflow focuses in determining the rules to be used for building Iban morphological analyser without prior knowledge of language-specific morphological rules. This work introduces three main steps in acquiring the rules from the under-resourced language, which are morphological rules extraction, validation of the extracted rules and evaluation of the generated rules. From the proposed workflow, 25 rules were generated from 744 rules candidate. This work has achieved 76% of precision and 99% of recall. We believe the workflow will assist other researchers to build morphological analyser with the validated morphological rules for the under-resourced languages.
Keywords :
data acquisition; data analysis; natural language processing; Iban language; Iban morphological analyser; language-specific morphological rule; morphological rule acquisition; rule extraction; rule validation; under-resourced language; Availability; Buildings; Dictionaries; Educational institutions; Error analysis; Morphology; Pragmatics; morphological analyzer; morphological rules; rules extraction; under-resourced language;
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
DOI :
10.1109/IALP.2012.71