Title :
Web-Based Variant of the Lesk Approach to Word Sense Disambiguation
Author :
Gaona, Miguel Ángel Ríos ; Gelbukh, Alexander ; Bandyopadhyay, Sivaji
Author_Institution :
Center for Comput. Res., Nat. Polytech. Inst., Mexico City, Mexico
Abstract :
Word Sense Disambiguation (WSD) is the task of selecting the meaning of a word based on the context in which the word occurs. The principal statistical WSD approaches are supervised and unsupervised learning. The Lesk method is an example of unsupervised disambiguation. We present a measure for sense assignment useful for the simple Lesk algorithm. We use word co-occurrences of the gloss and the context, which is statistical information retrieved from the Web. In the SemCor data our method always gives an answer. On the Senseval 2 data, our variant of the Lesk method outperformed some other Lesk-based methods.
Keywords :
Internet; information retrieval; natural language processing; unsupervised learning; word processing; Lesk algorithm; Lesk method; SemCor data; World Wide Web; principal statistical WSD; sense assignment; statistical information retrieval; unsupervised disambiguation; unsupervised learning; word co-occurrences; word sense disambiguation; Artificial intelligence; Cities and towns; Computer science; Costs; Dictionaries; Information retrieval; Knowledge acquisition; Mice; Testing; Unsupervised learning; Natural Language Processing; Unsupervised disambiguation; Word Sense Disambiguation;
Conference_Titel :
Artificial Intelligence, 2009. MICAI 2009. Eighth Mexican International Conference on
Conference_Location :
Guanajuato
Print_ISBN :
978-0-7695-3933-1
DOI :
10.1109/MICAI.2009.41