Title :
Symmetric Trends: Optimal Local Context Window in Chinese Word Sense Disambiguation
Author :
Li, Gang ; Kou, Guangzeng ; Zhou, Ercui ; Zhang, Ling
Author_Institution :
Sch. of Inf. Manage., Wuhan Univ., Wuhan, China
Abstract :
Word Sense Disambiguation (WSD) is a task of classification to identify sense of ambiguous word. Most systems choose symmetric local context window on empirical grounds, that is, the distance from the ambiguous word to both sides of the window is same, such as [-1, +1]. Why? In this paper, we do an experiment on Senseval-3 Chinese data set. Use cross-validation method to identify the optimal local context window for word features and part of speech (POS) features separately. The results show that in the same window scope, the smaller the gap, the lower the error, and the better the WSD performance. In other words, the window that tends to symmetric will get better performance.
Keywords :
word processing; Senseval-3 Chinese data set; cross-validation method; local context window; part of speech; word features; word sense disambiguation; Bayesian methods; Conference management; Decision making; Feature extraction; Hybrid intelligent systems; Information management; Machine learning; Natural language processing; Speech; Testing; Chinese; Word Sense Disambiguation; local context window; symmetric;
Conference_Titel :
Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-0-7695-3745-0
DOI :
10.1109/HIS.2009.244