• DocumentCode
    1923202
  • Title
    Suffix stripping based NER in Assamese for location names
  • Author
    Sharma, Padmaja ; Sharma, Utpal ; Kalita, Jugal
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Tezpur Univ., Tezpur, India
  • fYear
    2012
  • fDate
    2-3 March 2012
  • Firstpage
    91
  • Lastpage
    94
  • Abstract
    Named Entity Recognition (NER) is the process of identifying proper nouns in text documents and classifying them into pre-defined classes such as person, location, and organization. It plays an important role in Natural Language Processing applications. NER in Indian languages is a challenging task that suffers from a scarcity of resources, but work on it has recently started to appear. In highly inflectional languages such as Assamese, NER requires identifying the root forms of the words that occur in a text. This work reports a suffix stripping approach for identifying the roots of words that are location named entities.
  • Keywords
    natural language processing; object recognition; pattern classification; text analysis; Assamese; Indian language; NER; location name; location name entity; named entity recognition; noun classification; suffix stripping approach; text document; word root form identification; Accuracy; Algorithm design and analysis; Computer science; Morphology; Organizations; Production
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Signal Processing (CISP), 2012 2nd National Conference on
  • Conference_Location
    Guwahati, Assam
  • Print_ISBN
    978-1-4577-0719-3
  • Type
    conf
  • DOI
    10.1109/NCCISP.2012.6189684
  • Filename
    6189684