• DocumentCode
    677176
  • Title

    On the effect of the label bias problem in part-of-speech tagging

  • Author

    Phuong Le-Hong ; Xuan-Hieu Phan ; The-Trung Tran

  • Author_Institution
    Univ. of Sci., Hanoi, Vietnam
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    103
  • Lastpage
    108
  • Abstract
    This paper investigates the effect of the label bias problem of maximum entropy Markov models for part-of-speech tagging, a typical sequence prediction task in natural language processing. This problem has been underexploited and underappreciated. The investigation reveals useful information about the entropy of local transition probability distributions of the tagging model which enables us to exploit and quantify the label bias effect of part-of-speech tagging. Experiments on a Vietnamese treebank and on a French treebank show a significant effect of the label bias problem in both of the languages.
  • Keywords
    Markov processes; learning (artificial intelligence); maximum entropy methods; natural language processing; speech processing; statistical distributions; French treebank; Vietnamese treebank; label bias problem; local transition probability distribution entropy; maximum entropy Markov models; natural language processing; part-of-speech tagging; sequence prediction task; Accuracy; Context; Entropy; Hidden Markov models; Predictive models; Probability distribution; Tagging; CRF; French; MEMM; Vietnamese; label bias problem; machine learning; part-of-speech tagging; treebank;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2013 IEEE RIVF International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4799-1349-7
  • Type

    conf

  • DOI
    10.1109/RIVF.2013.6719875
  • Filename
    6719875