مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

3402239

Title :

Rule based stemmer in Urdu

Author :

Gupta, V. ; Joshi, Niranjan ; Mathur, Iti

Author_Institution :

IES, IPS Acad., Indore, India

fYear :

2013

fDate :

20-22 Sept. 2013

Firstpage :

129

Lastpage :

132

Abstract :

Urdu is a combination of several languages like Arabic, Hindi, English, Turkish, Sanskrit etc. It has a complex and rich morphology. This is the reason why not much work has been done in Urdu language processing. Stemming is used to convert a word into its respective root form. In stemming, we separate the suffix and prefix from the word. It is useful in search engines, natural language processing and word processing, spell checkers, word parsing, word frequency and count studies. This paper presents a rule based stemmer for Urdu. The stemmer that we have discussed here is used in information retrieval. We have also evaluated our results by verifying it with a human expert.

Keywords :

information retrieval; knowledge based systems; natural language processing; Urdu language processing; count studies; information retrieval; natural language processing; prefix; rule based stemmer; search engines; spell checkers; stemming; suffix; word frequency; word parsing; word processing; Accuracy; Communications technology; Computers; Conferences; Educational institutions; Morphology; Complex and Rich morphology; Rule Based Stemmer; Stemming; Urdu;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer and Communication Technology (ICCCT), 2013 4th International Conference on

Conference_Location :

Allahabad

Print_ISBN :

978-1-4799-1569-9

Type :

conf

DOI :

10.1109/ICCCT.2013.6749615

Filename :

6749615

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3402239