DocumentCode
605238
Title
POS Tagging of Assamese Language and Performance Analysis of CRF++ and fnTBL Approaches
Author
Barman, A.K. ; Sarmah, J. ; Sarma, S.K.
Author_Institution
Dept. of Inf. Technology, Gauhati Univ., Guwahati, India
fYear
2013
fDate
10-12 April 2013
Firstpage
476
Lastpage
479
Abstract
Assamese is one of the regional languages of India, spoken by the people of Assam and other north-eastern states of India. Parts of Speech (POS) tagging is one of the most important research issues, as it is a basic need for any Natural Language Processing (NLP) task. POS tagging is the automated assignment of a Parts of Speech label to a word in its context. Assamese is among the less computationally aware languages of India. This paper presents our work on POS tagging for Assamese sentences using Conditional Random Field (CRF) and Transformation Based Learning (TBL). We obtain 87.17 and 67.73 percent tagging accuracy for TBL and CRF respectively, both trained on a manually tagged corpus.
Keywords
learning (artificial intelligence); natural language processing; random processes; Assamese language; Assamese sentences; CRF++; NLP; POS label; POS tagging; computationally aware languages; conditional random field; fnTBL approaches; natural language processing; north eastern states; parts of speech tagging; performance analysis; regional languages; transformation based learning; Bismuth; Computational modeling; Computers; Assamese; CRF; POS tagging; TBL;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Modelling and Simulation (UKSim), 2013 UKSim 15th International Conference on
Conference_Location
Cambridge
Print_ISBN
978-1-4673-6421-8
Type
conf
DOI
10.1109/UKSim.2013.91
Filename
6527464