DocumentCode :
3673290
Title :
An improved approach to word sense disambiguation
Author :
Pradeep Sachdeva;Surbhi Verma;Sandeep Kumar Singh
Author_Institution :
Department of Computer Science and Information Technology, JIIT, Noida, India
fYear :
2014
Firstpage :
235
Lastpage :
240
Abstract :
Words in the English language often correspond to different meanings in different contexts. Such words are referred to as polysemous words i.e. words having more than one sense. This paper presents a knowledge based algorithm for disambiguating polysemous words using computational linguistics tool, WordNet. The task of word sense disambiguation requires finding out the similarity between the target word (word to be disambiguated) and the nearby words (words surrounding the target word in input text). Algorithms in the past have calculated similarity either by finding out the number of common words (intersection) between the glosses (definitions/meanings) of the target and nearby words, or by finding out the exact occurrence of the nearby word´s sense in the hierarchy (hypernyms) of the target word´s senses. This paper proposes an algorithm which modifies the above two parameters by computing intersection using not only the glosses but also by including the related words. Also the intersection is computed for the entire hierarchy of the target and nearby words. It also incorporates a third parameter `distance´ (between target and nearby words). The proposed approach incorporates more parameters for calculating similarity, which has not been attempted by any of the previous approaches. It scores the senses based on the overall impact of three parameters i.e. intersection, hierarchy and distance and then chooses the sense with the highest score. The algorithm has been evaluated on SemCor which is the largest available sense-tagged corpus. The proposed algorithm achieves a precision of 53.12% for Top1 results, 59.91% for Top2 results and that of 62.13% for Top3 results which is better than other knowledge based approaches.
Keywords :
Instruments
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2014 IEEE International Symposium on
ISSN :
2162-7843
Type :
conf
DOI :
10.1109/ISSPIT.2014.7300594
Filename :
7300594
Link To Document :
بازگشت