DocumentCode
244537
Title
Using big data to support automatic Word Sense Disambiguation
Author
Simonini, Giovanni ; Guerra, Federico
Author_Institution
DIEF, Univ. of Modena & Reggio Emilia, Modena, Italy
fYear
2014
fDate
21-25 July 2014
Firstpage
311
Lastpage
314
Abstract
Word Sense Disambiguation (WSD) usually relies on data structures built upon the words to be disambiguated. This is a time-consuming process that requires a huge computational effort. In this paper, we propose an approach to automatically build a generic sense inventory (called iSC) to be used as a reference for disambiguation. The sense inventory is built extracting insight from Big Data exploiting a community detection algorithm. Since generate taking into account large corpora of data, the iSC is independent of the domain of application and of predefined target words.
Keywords
Big Data; natural language processing; Big Data; WSD; automatic word sense disambiguation; community detection algorithm; data corpora; data structures; generic sense inventory; iSC; Big data; Communities; Computational linguistics; Context; Indexes; Natural language processing; Social network services; Big Data Analysis; Community Detection; Word Sense Disambiguation;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing & Simulation (HPCS), 2014 International Conference on
Conference_Location
Bologna
Print_ISBN
978-1-4799-5312-7
Type
conf
DOI
10.1109/HPCSim.2014.6903701
Filename
6903701
Link To Document