DocumentCode :
3730560
Title :
Lao Named Entity Recognition based on conditional random fields with simple heuristic information
Author :
Mengjie Yang; Lanjiang Zhou; Zhengtao Yu; Shengxiang Gao; Jianyi Guo
Author_Institution :
Sch. of Inf. Eng. &
fYear :
2015
Firstpage :
1426
Lastpage :
1431
Abstract :
According to characteristics of Lao named entities, the paper proposes an approach of Lao Named Entity Recognition (NER) based on Conditional Random Fields (CRFs) with knowledge information. Firstly, we segment the text into word sequence and design three labels BIO1 for personal name and location name entity recognition. Secondly, some named entity features of Lao Language are selected for Conditional Random Fields (CRFs) model, such as the clue word feature, the predicate feature etc.. Then, candidate named entities are recognized. Thirdly, we extract simple personal name and location name features of Lao Language to build heuristic information, and use the heuristic information to determine candidate named entities. Finally, named entities which have not been discovered by Conditional Random Fields (CRFs) model are further recognized by using the named entities word list, and these final named entities are obtained. The experimental results show that the method proposed is effective, and it can improve the effect of named entity recognition by using machine learning method with heuristic information.
Keywords :
"Speech recognition","Character recognition","Speech","Context","Yttrium","Feature extraction","Learning systems"
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type :
conf
DOI :
10.1109/FSKD.2015.7382153
Filename :
7382153
Link To Document :
بازگشت