Title :
Correlation based Word Sense Disambiguation
Author :
Agarwal, Mohini ; Bajpai, Jyoti
Author_Institution :
Dept. of CEA, GLA Univ., Mathura, India
Abstract :
Today internet usage has seen tremendous growth. As English is the primary language, documents are mostly available in English language. In India, Hindi is the prevalent language and user wants to access data in Hindi. For the language processing we are required to get the exact sense of polysemous word interpreting the meaning in a particular context. To disambiguate the meaning of the polysemous word, the techniques used is Word Sense Disambiguation (WSD). It is a known problem in natural language processing referred as lexical semantic ambiguity. In this paper, correlation analysis of context in which the target word is used with the collocation vector of definition of target word derived from Hindi WordNet i.e. developed at IIT Bombay and the co-occurrence vector which is derived from Hindi Corpus is computed. The proposed approach uses collocation information, co-occurrence information of target word to assign weights to the different senses of ambiguous word. The evaluation is done on the 60 ambiguous words, precision obtained is 88.92%. The proposed experiment shows better efficiency.
Keywords :
natural language processing; English language; Hindi Corpus; Hindi WordNet; Hindi language; WSD; co-occurrence information; collocation information; collocation vector; correlation analysis; correlation based word sense disambiguation; lexical semantic ambiguity; natural language processing; polysemous word; Algorithm design and analysis; Context; Correlation; Dictionaries; Knowledge based systems; Semantics; Vectors; correlation analysis; hindi corpus; hindi wordnet; knowledge based approach; polysemous words;
Conference_Titel :
Contemporary Computing (IC3), 2014 Seventh International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-5172-7
DOI :
10.1109/IC3.2014.6897204