DocumentCode :
3316763
Title :
Rules selection in word sense disambiguation using Adaboost
Author :
Qin, Ying ; Wang, Xiaojie
Author_Institution :
Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., China
fYear :
2005
fDate :
30 Oct.-1 Nov. 2005
Firstpage :
26
Lastpage :
29
Abstract :
Boosting algorithm is confirmed as a promising and practical machine learning method which has successfully been applied to some classification problems. Word sense disambiguation system using Boosting acquired the state-of-the-art performance. This paper explores the primary but unavoidable problem of rules selection in Adaboost applied to word sense disambiguation system, presenting the relations among rules selection, iteration number and the performance of the system on sparse data. The results show the increment of the iteration number in Adaboost trained on a small set of examples without noise is neither helpful nor harmful. The algorithm is sensitive to weak rules selection in two aspects: on one hand, some rules make the training error converge more quickly and have higher generalization ability simultaneously, on the other hand, conflictions may occur among weak rules built on different features causing trouble to the whole system.
Keywords :
computational linguistics; learning (artificial intelligence); natural languages; word processing; Adaboost; computational linguistics; iteration number; machine learning; natural languages; rules selection; sparse data; word sense disambiguation system; Boosting; Classification algorithms; Face detection; Face recognition; Handwriting recognition; Learning systems; Machine learning algorithms; Natural languages; Supervised learning; Tagging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
Type :
conf
DOI :
10.1109/NLPKE.2005.1598701
Filename :
1598701
Link To Document :
بازگشت