DocumentCode
2352363
Title
Transformation Rule Learning without Rule Templates: A Case Study in Part of Speech Tagging
Author
Ngo Xuan Bach ; Le Anh Cuong ; Nguyen Viet Ha ; Nguyen Ngoc Binh
Author_Institution
Coll. of Technol., Vietnam Nat. Univ., Hanoi
fYear
2008
fDate
23-25 July 2008
Firstpage
9
Lastpage
14
Abstract
Part of speech (POS) tagging is an important problem and is one of the first steps included in many tasks in natural language processing. It affects directly on the accuracy of many other problems such as Syntax Parsing, WordSense Disambiguation, and Machine Translation. Stochastic models solve this problem relatively well, but they still make mistakes. Transformation-based learning (TBL) is a solution which can be used to improve stochastic taggers by learning a set of transformation rules. However, its rule learning algorithm has the disadvantages that rule templates must be prepared by hand and only rules are instances of rule templates can be generated. In this paper, we propose a model to learn transformation rules without rule templates. This model considers the rule learning problem as a feature selection problem. Experiments on PennTree Bank showed that the proposal model reduces errors of stochastic taggers with some tags.
Keywords
feature extraction; learning (artificial intelligence); natural language processing; stochastic processes; feature selection problem; natural language processing; part-of-speech tagging; stochastic tagger model; transformation rule-based learning; Books; Context modeling; Educational institutions; Information technology; Natural language processing; Natural languages; Proposals; Speech processing; Stochastic processes; Tagging;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
Conference_Location
Dalian Liaoning
Print_ISBN
978-0-7695-3273-8
Type
conf
DOI
10.1109/ALPIT.2008.73
Filename
4584333
Link To Document