DocumentCode
3673290
Title
An improved approach to word sense disambiguation
Author
Pradeep Sachdeva;Surbhi Verma;Sandeep Kumar Singh
Author_Institution
Department of Computer Science and Information Technology, JIIT, Noida, India
fYear
2014
Firstpage
235
Lastpage
240
Abstract
Words in the English language often correspond to different meanings in different contexts. Such words are referred to as polysemous words i.e. words having more than one sense. This paper presents a knowledge based algorithm for disambiguating polysemous words using computational linguistics tool, WordNet. The task of word sense disambiguation requires finding out the similarity between the target word (word to be disambiguated) and the nearby words (words surrounding the target word in input text). Algorithms in the past have calculated similarity either by finding out the number of common words (intersection) between the glosses (definitions/meanings) of the target and nearby words, or by finding out the exact occurrence of the nearby word´s sense in the hierarchy (hypernyms) of the target word´s senses. This paper proposes an algorithm which modifies the above two parameters by computing intersection using not only the glosses but also by including the related words. Also the intersection is computed for the entire hierarchy of the target and nearby words. It also incorporates a third parameter `distance´ (between target and nearby words). The proposed approach incorporates more parameters for calculating similarity, which has not been attempted by any of the previous approaches. It scores the senses based on the overall impact of three parameters i.e. intersection, hierarchy and distance and then chooses the sense with the highest score. The algorithm has been evaluated on SemCor which is the largest available sense-tagged corpus. The proposed algorithm achieves a precision of 53.12% for Top1 results, 59.91% for Top2 results and that of 62.13% for Top3 results which is better than other knowledge based approaches.
Keywords
Instruments
Publisher
ieee
Conference_Titel
Signal Processing and Information Technology (ISSPIT), 2014 IEEE International Symposium on
ISSN
2162-7843
Type
conf
DOI
10.1109/ISSPIT.2014.7300594
Filename
7300594
Link To Document