• DocumentCode
    2352363
  • Title

    Transformation Rule Learning without Rule Templates: A Case Study in Part of Speech Tagging

  • Author

    Ngo Xuan Bach ; Le Anh Cuong ; Nguyen Viet Ha ; Nguyen Ngoc Binh

  • Author_Institution
    Coll. of Technol., Vietnam Nat. Univ., Hanoi
  • fYear
    2008
  • fDate
    23-25 July 2008
  • Firstpage
    9
  • Lastpage
    14
  • Abstract
    Part of speech (POS) tagging is an important problem and is one of the first steps included in many tasks in natural language processing. It affects directly on the accuracy of many other problems such as Syntax Parsing, WordSense Disambiguation, and Machine Translation. Stochastic models solve this problem relatively well, but they still make mistakes. Transformation-based learning (TBL) is a solution which can be used to improve stochastic taggers by learning a set of transformation rules. However, its rule learning algorithm has the disadvantages that rule templates must be prepared by hand and only rules are instances of rule templates can be generated. In this paper, we propose a model to learn transformation rules without rule templates. This model considers the rule learning problem as a feature selection problem. Experiments on PennTree Bank showed that the proposal model reduces errors of stochastic taggers with some tags.
  • Keywords
    feature extraction; learning (artificial intelligence); natural language processing; stochastic processes; feature selection problem; natural language processing; part-of-speech tagging; stochastic tagger model; transformation rule-based learning; Books; Context modeling; Educational institutions; Information technology; Natural language processing; Natural languages; Proposals; Speech processing; Stochastic processes; Tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
  • Conference_Location
    Dalian Liaoning
  • Print_ISBN
    978-0-7695-3273-8
  • Type

    conf

  • DOI
    10.1109/ALPIT.2008.73
  • Filename
    4584333