Title :
Using big data to support automatic Word Sense Disambiguation
Author :
Simonini, Giovanni ; Guerra, Federico
Author_Institution :
DIEF, Univ. of Modena & Reggio Emilia, Modena, Italy
Abstract :
Word Sense Disambiguation (WSD) usually relies on data structures built upon the words to be disambiguated. This is a time-consuming process that requires a huge computational effort. In this paper, we propose an approach to automatically build a generic sense inventory (called iSC) to be used as a reference for disambiguation. The sense inventory is built extracting insight from Big Data exploiting a community detection algorithm. Since generate taking into account large corpora of data, the iSC is independent of the domain of application and of predefined target words.
Keywords :
Big Data; natural language processing; Big Data; WSD; automatic word sense disambiguation; community detection algorithm; data corpora; data structures; generic sense inventory; iSC; Big data; Communities; Computational linguistics; Context; Indexes; Natural language processing; Social network services; Big Data Analysis; Community Detection; Word Sense Disambiguation;
Conference_Titel :
High Performance Computing & Simulation (HPCS), 2014 International Conference on
Conference_Location :
Bologna
Print_ISBN :
978-1-4799-5312-7
DOI :
10.1109/HPCSim.2014.6903701