DocumentCode :
2352363
Title :
Transformation Rule Learning without Rule Templates: A Case Study in Part of Speech Tagging
Author :
Ngo Xuan Bach ; Le Anh Cuong ; Nguyen Viet Ha ; Nguyen Ngoc Binh
Author_Institution :
Coll. of Technol., Vietnam Nat. Univ., Hanoi
fYear :
2008
fDate :
23-25 July 2008
Firstpage :
9
Lastpage :
14
Abstract :
Part of speech (POS) tagging is an important problem and is one of the first steps included in many tasks in natural language processing. It affects directly on the accuracy of many other problems such as Syntax Parsing, WordSense Disambiguation, and Machine Translation. Stochastic models solve this problem relatively well, but they still make mistakes. Transformation-based learning (TBL) is a solution which can be used to improve stochastic taggers by learning a set of transformation rules. However, its rule learning algorithm has the disadvantages that rule templates must be prepared by hand and only rules are instances of rule templates can be generated. In this paper, we propose a model to learn transformation rules without rule templates. This model considers the rule learning problem as a feature selection problem. Experiments on PennTree Bank showed that the proposal model reduces errors of stochastic taggers with some tags.
Keywords :
feature extraction; learning (artificial intelligence); natural language processing; stochastic processes; feature selection problem; natural language processing; part-of-speech tagging; stochastic tagger model; transformation rule-based learning; Books; Context modeling; Educational institutions; Information technology; Natural language processing; Natural languages; Proposals; Speech processing; Stochastic processes; Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
Conference_Location :
Dalian Liaoning
Print_ISBN :
978-0-7695-3273-8
Type :
conf
DOI :
10.1109/ALPIT.2008.73
Filename :
4584333
Link To Document :
بازگشت