• DocumentCode
    605238
  • Title

    POS Tagging of Assamese Language and Performance Analysis of CRF++ and fnTBL Approaches

  • Author

    Barman, A.K. ; Sarmah, J. ; Sarma, S.K.

  • Author_Institution
    Dept. of Inf. Technology, Gauhati Univ., Guwahati, India
  • fYear
    2013
  • fDate
    10-12 April 2013
  • Firstpage
    476
  • Lastpage
    479
  • Abstract
    Assamese is one of the regional languages of India, spoken by the people of Assam and other north-eastern states of India. Parts of Speech (POS) tagging is one of the most important research issues, as it is a basic need for any Natural Language Processing (NLP) task. An automated way of assigning a Parts of Speech label to a word in its context is known as Parts of Speech tagging. Assamese is among the less computationally aware languages of India. This paper presents our work on POS tagging for Assamese sentences using Conditional Random Fields (CRF) and Transformation Based Learning (TBL). We obtain 87.17 and 67.73 percent tagging accuracy for TBL and CRF, respectively, both trained on a manually tagged corpus.
  • Keywords
    learning (artificial intelligence); natural language processing; random processes; Assamese language; Assamese sentences; CRF++; NLP; POS label; POS tagging; computationally aware languages; conditional random field; fnTBL approaches; natural language processing; north eastern states; parts of speech tagging; performance analysis; regional languages; transformation based learning; Bismuth; Computational modeling; Computers; Assamese; CRF; POS tagging; TBL;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Modelling and Simulation (UKSim), 2013 UKSim 15th International Conference on
  • Conference_Location
    Cambridge
  • Print_ISBN
    978-1-4673-6421-8
  • Type

    conf

  • DOI
    10.1109/UKSim.2013.91
  • Filename
    6527464