DocumentCode
679809
Title
STHREE: Stemmer for Malayalam using three pass algorithm
Author
Pragisha, K. ; Reghuraj, P.C.
Author_Institution
Dept. of Comput. Sci. & Eng., Gov. Eng. Coll. Sreekrishnapuram, Palakkad, India
fYear
2013
fDate
13-15 Dec. 2013
Firstpage
149
Lastpage
152
Abstract
This paper reports the design of a three pass stemmer STHREE for Malayalam. The language is rich in morphological variations but poor in linguistic computational resources. The system returns the meaningful root word of the input word in 97% of the cases when tested with 1040 words. This is a significant improvement over the reported accuracy of SILPA system, the only known stemmer for Malayalam, with the same test data sets.
Keywords
computational linguistics; linguistics; natural language processing; Malayalam; SILPA system; STHREE; data sets; input word; linguistic computational resources; morphological variations; root word; three pass stemmer; Accuracy; Algorithm design and analysis; Computational linguistics; Computer science; Educational institutions; Knowledge discovery; Natural language processing; Stemmer; linguistic computational resources; morphological variation; root word;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Communication and Computing (ICCC), 2013 International Conference on
Conference_Location
Thiruvananthapuram
Print_ISBN
978-1-4799-0573-7
Type
conf
DOI
10.1109/ICCC.2013.6731640
Filename
6731640
Link To Document