DocumentCode
677176
Title
On the effect of the label bias problem in part-of-speech tagging
Author
Phuong Le-Hong ; Xuan-Hieu Phan ; The-Trung Tran
Author_Institution
Univ. of Sci., Hanoi, Vietnam
fYear
2013
fDate
10-13 Nov. 2013
Firstpage
103
Lastpage
108
Abstract
This paper investigates the effect of the label bias problem of maximum entropy Markov models for part-of-speech tagging, a typical sequence prediction task in natural language processing. This problem has been underexploited and underappreciated. The investigation reveals useful information about the entropy of local transition probability distributions of the tagging model which enables us to exploit and quantify the label bias effect of part-of-speech tagging. Experiments on a Vietnamese treebank and on a French treebank show a significant effect of the label bias problem in both of the languages.
Keywords
Markov processes; learning (artificial intelligence); maximum entropy methods; natural language processing; speech processing; statistical distributions; French treebank; Vietnamese treebank; label bias problem; local transition probability distribution entropy; maximum entropy Markov models; natural language processing; part-of-speech tagging; sequence prediction task; Accuracy; Context; Entropy; Hidden Markov models; Predictive models; Probability distribution; Tagging; CRF; French; MEMM; Vietnamese; label bias problem; machine learning; part-of-speech tagging; treebank;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2013 IEEE RIVF International Conference on
Conference_Location
Hanoi
Print_ISBN
978-1-4799-1349-7
Type
conf
DOI
10.1109/RIVF.2013.6719875
Filename
6719875
Link To Document