Title of article :
Building Morphological Analyzer for Nepali
Author/Authors :
Bhat, Shahid Mushtaq Central Institute of Indian Languages - Linguistic Data Consortium for Indian Languages, India , Rai, Rupesh Central Institute of Indian Languages - Linguistic Data Consortium for Indian Languages, India
From page :
45
To page :
58
Abstract :
Morphological analyzer is a fundamental tool in Natural Language Processing (NLP) that generates the morphological analyses of a given word-form. It can be used in enhancing the accuracy of POS-Tagging, Chunking, Syntactic Parsing, Word Sense Disambiguation (WSD), Information Retrieval (IR) Machine Translation (MT) Systems. This paper describes an ongoing effort to develop Nepali morphological analyzer, using an open source platform-Apertium (LT-Toolbox). Since, it is the initial stage of this project; we have confined our work to inflectional morphology. So far, we have covered all the possible categories, as per LDC-IL^1 POS tag-set of Nepali. Currently, the coverage of Nepali Morph-Analyzer is 20,000 words, classified into 219 paradigms.
Keywords :
Morphological analyzer , Word and paradigm model , Apertium , LT , Tool Box , Paradigm , Concatenative Morphology , Machine Translation , Devnagri , Transliteration
Journal title :
Journal of Modern Languages
Journal title :
Journal of Modern Languages
Record number :
2672943
Link To Document :
بازگشت