Title :
POS Tagging of Assamese Language and Performance Analysis of CRF++ and fnTBL Approaches
Author :
Barman, A.K. ; Sarmah, J. ; Sarma, S.K.
Author_Institution :
Dept. of Inf. Technology, Gauhati Univ., Guwahati, India
Abstract :
Assamese is one of the regional languages of India, spoken by the people of Assam and other north-eastern states of India. Part-of-speech (POS) tagging, the automated assignment of a part-of-speech label to a word according to its context, is one of the most important research issues because it is a basic need for any Natural Language Processing (NLP) task. Assamese is among the less computationally aware languages of India. This paper presents our work on POS tagging of Assamese sentences using Conditional Random Fields (CRF) and Transformation-Based Learning (TBL). Trained on a manually tagged corpus, the TBL and CRF taggers obtain tagging accuracies of 87.17 and 67.73 percent, respectively.
Keywords :
learning (artificial intelligence); natural language processing; random processes; Assamese language; Assamese sentences; CRF++; NLP; POS label; POS tagging; computationally aware languages; conditional random field; fnTBL approaches; natural language processing; north eastern states; parts of speech tagging; performance analysis; regional languages; transformation based learning; Bismuth; Computational modeling; Computers; Assamese; CRF; POS tagging; TBL;
Conference_Title :
Computer Modelling and Simulation (UKSim), 2013 UKSim 15th International Conference on
Conference_Location :
Cambridge
Print_ISBN :
978-1-4673-6421-8
DOI :
10.1109/UKSim.2013.91