DocumentCode :
3693954
Title :
Suffix sequences based morphological segmentation for Afaan Oromo
Author :
Getachew Mamo Wegari;Massimo Melucci;Solomon Teferra
Author_Institution :
IT Doctoral Program, Addis Ababa University, Addis Ababa, Ethiopia
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
This paper reports on a morphological segmentation model for Afaan Oromo based on suffix sequences approach. Understanding and identifying the suffix sequences of a language allow us to detect morpheme boundaries of many words of Afaan Oromo. Morphological segmentation models can be used in many Natural Language Processing applications such as machine translation, speech recognition, information retrieval and part-of-speech tagging. A divisive hierarchical clustering and frequency distribution were used to build a tree of candidate stems from which segmented suffix sequences can be modeled. The proposed morphological segmentation model was evaluated with test word-lists. The accuracy obtained by our morphological segmentation model is encouraging.
Keywords :
"Computational modeling","Training","Natural language processing","Algorithm design and analysis","Clustering algorithms","Pragmatics"
Publisher :
ieee
Conference_Titel :
AFRICON, 2015
Electronic_ISBN :
2153-0033
Type :
conf
DOI :
10.1109/AFRCON.2015.7331956
Filename :
7331956
Link To Document :
بازگشت