Title of article :
Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey
Author/Authors :
Dinesh Kumar، نويسنده , , Gurpreet Singh Josan، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2010
Abstract :
The problem of tagging in natural language processing is to find a way to tag every word in a text as a particular part of speech, e.g., proper pronoun. POS tagging is a very important preprocessing task for language processing activities. This paper reports about the Part of Speech (POS) taggers proposed for various Indian Languages like Hindi, Punjabi, Malayalam, Bengali and Telugu. Various part of speech tagging approaches like Hidden Markov Model (HMM), Support Vector Model (SVM), Rule based approaches, Maximum Entropy (ME) and Conditional Random Field (CRF) have been used for POS tagging. Accuracy is the prime factor in evaluating any POS tagger so the accuracy of every proposed tagger is also discussed in this paper
Keywords :
Tagging , HMM , Finite state automata , Support vector machines , stochastic , Stemming , Maximum entropy , Corpora , tags , morphology , Suffix , Tagset , Prefix
Journal title :
International Journal of Computer Applications
Journal title :
International Journal of Computer Applications