Title :
LALITHA: A light weight Malayalam stemmer using suffix stripping method
Author :
Prajitha, U. ; Sreejith, C. ; Reghu Raj, P.C.
Author_Institution :
Dept. of Comput. Sci. & Eng., Gov. Eng. Coll., Palakkad, India
Abstract :
Stemming is the process of removing the affixes from inflections and to return the root form. Malayalam is highly agglutinative in nature and hundreds of inflections are possible for each word. An effective stemmer in Malayalam is not yet implemented. This paper presents a lightweight stemmer for Malayalam, which conflates terms by suffix removal. The proposed stemmer is both computationally inexpensive and domain independent and will serve as a vital part in many areas of Malayalam Language Computing.
Keywords :
natural language processing; LALITHA; Malayalam language computing; affix removal; light weight Malayalam stemmer; suffix removal; suffix stripping method; Complexity theory; Computational linguistics; Dictionaries; Force; Natural language processing; Strips; Malayalam Computing; Natural Language Processing; Stemmer; Stemming; Suffix stripping;
Conference_Titel :
Control Communication and Computing (ICCC), 2013 International Conference on
Conference_Location :
Thiruvananthapuram
Print_ISBN :
978-1-4799-0573-7
DOI :
10.1109/ICCC.2013.6731658