Suffix stripping based NER in Assamese for location names

Author

Sharma, Padmaja ; Sharma, Utpal ; Kalita, Jugal

Author_Institution

Dept. of Comput. Sci. & Eng., Tezpur Univ., Tezpur, India

fYear

2012

fDate

2-3 March 2012

Firstpage

91

Lastpage

94

Abstract

Named Entity Recognition (NER) is the process of identifying proper nouns in text documents and classifying them into pre-defined classes such as person, location, and organization. It plays an important role in Natural Language Processing applications. NER in Indian languages is a challenging task that suffers from a scarcity of resources, but such work has recently started to appear. In highly inflectional languages such as Assamese, NER requires identifying the root forms of the words that occur in texts. This paper reports a suffix stripping approach that identifies the roots of words that are location named entities.
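
The general idea of suffix stripping for location names can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the abstract does not give the suffix list, the matching order, or the lexical resources used. The romanized suffixes in `SUFFIXES`, the small `GAZETTEER` of location roots, and the function name `strip_location_suffix` are all assumptions introduced here for the example.

```python
from typing import Optional

# Hypothetical, romanized case-marker suffixes (assumed for illustration only;
# the paper's actual suffix inventory is not given in the abstract).
SUFFIXES = ["loi", "t", "r"]

# Hypothetical gazetteer of location root forms (assumed for illustration only).
GAZETTEER = {"guwahati", "tezpur", "dispur"}


def strip_location_suffix(token: str) -> Optional[str]:
    """Return the location root of `token`, or None if no root is found.

    The token is first checked as-is, so a root that happens to end in a
    suffix-like string (e.g. "tezpur" ending in "r") is not over-stripped.
    Suffixes are then tried longest first.
    """
    word = token.lower()
    if word in GAZETTEER:
        return word
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix):
            stem = word[: -len(suffix)]
            if stem in GAZETTEER:
                return stem
    return None


if __name__ == "__main__":
    for tok in ["guwahatit", "guwahatiloi", "tezpur", "kitap"]:
        print(tok, "->", strip_location_suffix(tok))
```

Checking the unstripped token first is a design choice of this sketch: it avoids removing a final character that belongs to the root itself. A real system would also need a policy for inflected forms whose stem is absent from the gazetteer, which this sketch simply reports as `None`.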

Keywords

natural language processing; object recognition; pattern classification; text analysis; Assamese; Indian language; NER; location name; location name entity; named entity recognition; noun classification; suffix stripping approach; text document; word root form identification; Accuracy; Algorithm design and analysis; Computer science; Morphology; Organizations; Production

fLanguage

English

Publisher

IEEE

Conference_Title

Computational Intelligence and Signal Processing (CISP), 2012 2nd National Conference on

Conference_Location

Guwahati, Assam

Print_ISBN

978-1-4577-0719-3

Type

conf

DOI

10.1109/NCCISP.2012.6189684

Filename

6189684