DocumentCode :
2438623
Title :
The Application of CRFs in Part-of-Speech Tagging
Author :
Zhang, Xiaofei ; Huang, Heyan ; Liang, Zhang
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China
Volume :
2
fYear :
2009
fDate :
26-27 Aug. 2009
Firstpage :
347
Lastpage :
350
Abstract :
Conditional random fields (CRFs) for sequence labeling offer advantages over both generative models like hidden Markov model (HMM) and classifiers applied at each sequence position. First, the CRFs don´t force to adhere to the independence assumption and thus can depend on arbitrary, non-independent features, without accounting for the distribution of those dependencies. Since CRFs models are able to flexibly utilize a wide variety of features, the training data sparse problem can be efficiently resolved. Moreover, the parameter estimation for CRFs is global, which effectively resolve the label bias problem. In this paper, the CRFs with Gaussian prior smoothing is used for part-of-speech (POS) tagging. Experiments show that the POS tagging error rate is reduced by 55.17% in close test and 43.64% in open test over HMM-based baseline, and synchronously an accuracy of 98.05% in close test and 95.79% in open test are also achieved. These positive results confirm CRFs superior performance.
Keywords :
Gaussian processes; hidden Markov models; natural language processing; random processes; CRF; Gaussian prior smoothing; HMM; conditional random field; data sparse problem; generative model; hidden Markov model; label bias problem; natural language processing; parameter estimation; part-of-speech tagging; sequence labeling; sequence position; Automatic speech recognition; Hidden Markov models; Intelligent systems; Man machine systems; Maximum likelihood decoding; Natural language processing; Parameter estimation; Smoothing methods; Tagging; Testing; CRF; HMM; Natural Language Processing; POS tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Human-Machine Systems and Cybernetics, 2009. IHMSC '09. International Conference on
Conference_Location :
Hangzhou, Zhejiang
Print_ISBN :
978-0-7695-3752-8
Type :
conf
DOI :
10.1109/IHMSC.2009.210
Filename :
5335969
Link To Document :
بازگشت