DocumentCode :
3767549
Title :
Mongolian Named Entity Recognition using suffixes segmentation
Author :
Weihua Wang; Feilong Bao; Guanglai Gao
Author_Institution :
College of Computer Science, Inner Mongolia University, Hohhot, China, 010021
fYear :
2015
Firstpage :
169
Lastpage :
172
Abstract :
Mongolian is an agglutinative language with the complex morphological structures. Building an accurate Named Entity Recognition (NER) system for Mongolian is a challenging and meaningful work. This paper analyzes the characteristic of Mongolian suffixes using Narrow Non-Break Space and investigates Mongolian NER system under three methods in the Condition Random Field framework. The experiment shows that segmenting each suffix into an individual token achieves the best performance than both without segmenting and using the suffixes as a feature. Our approach obtains an F-measure = 82.71. It is appropriate for the Mongolian large scale vocabulary NER. This research also makes sense to other agglutinative languages NER systems.
Keywords :
"Artificial neural networks","Iron","Morphology","Instruments","Organizations","Pragmatics"
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2015 International Conference on
Print_ISBN :
978-1-4673-9595-3
Type :
conf
DOI :
10.1109/IALP.2015.7451558
Filename :
7451558
Link To Document :
بازگشت