DocumentCode
600213
Title
From Raw Text to Morphological Rules for Iban Morphological Analyser
Author
Saee, Suhaila ; Lay-Ki Soon ; Tek-Yong Lim ; Ranaivo-Malancon, Bali ; Tang, Enya Kong
Author_Institution
Fac. of Comput. & Inf., Multimedia Univ., Cyberjaya, Malaysia
fYear
2012
fDate
13-15 Nov. 2012
Firstpage
21
Lastpage
24
Abstract
To extend a complete workflow of automatic acquisition of morphological rules for morphological analyser, we propose a semi-automatic workflow for under-resourced language, which is Iban language. The workflow focuses in determining the rules to be used for building Iban morphological analyser without prior knowledge of language-specific morphological rules. This work introduces three main steps in acquiring the rules from the under-resourced language, which are morphological rules extraction, validation of the extracted rules and evaluation of the generated rules. From the proposed workflow, 25 rules were generated from 744 rules candidate. This work has achieved 76% of precision and 99% of recall. We believe the workflow will assist other researchers to build morphological analyser with the validated morphological rules for the under-resourced languages.
Keywords
data acquisition; data analysis; natural language processing; Iban language; Iban morphological analyser; language-specific morphological rule; morphological rule acquisition; rule extraction; rule validation; under-resourced language; Availability; Buildings; Dictionaries; Educational institutions; Error analysis; Morphology; Pragmatics; morphological analyzer; morphological rules; rules extraction; under-resourced language;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location
Hanoi
Print_ISBN
978-1-4673-6113-2
Electronic_ISBN
978-0-7695-4886-9
Type
conf
DOI
10.1109/IALP.2012.71
Filename
6473686
Link To Document