Title :
Early results for Chinese named entity recognition using conditional random fields model, HMM and maximum entropy
Author :
Feng, Yuanyong ; Sun, Le ; Zhang, Julin
Author_Institution :
Open Syst. & Chinese Inf. Process. Center, Chinese Acad. of Sci., Beijing, China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
Entity recognition (NER) is an important step for many natural language applications, such as information extraction, text summarization, and question answering. Chinese NER has some special characteristics that make this task difficult. In this paper, we present some NER experiments on the corpora used for Chinese 863 NER task in 2004 based on three models: maximum entropy, hidden Markov model (HMM) and the more recent conditional random fields (CRFs). The results show that CRFs model outperforms the other two models in the sense of best results and average performance, and model scalability among data sizes. In our experiments, CRFs model approach can achieve an overall Fl measure around 84.39/80.68 in simple/traditional Chinese NER respectively, with a gain of 2.01/10.50 over the best system in 863 competitions.
Keywords :
hidden Markov models; maximum entropy methods; natural languages; Chinese named entity recognition; HMM; conditional random field model; hidden Markov model; information extraction; maximum entropy; natural language; question answering; text summarization; Data mining; Entropy; Hidden Markov models; Information processing; Natural languages; Open systems; Scalability; Sun; Testing; Text recognition;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598798