• DocumentCode
    244537
  • Title

    Using big data to support automatic Word Sense Disambiguation

  • Author

    Simonini, Giovanni ; Guerra, Federico

  • Author_Institution
    DIEF, Univ. of Modena & Reggio Emilia, Modena, Italy
  • fYear
    2014
  • fDate
    21-25 July 2014
  • Firstpage
    311
  • Lastpage
    314
  • Abstract
    Word Sense Disambiguation (WSD) usually relies on data structures built upon the words to be disambiguated. This is a time-consuming process that requires a huge computational effort. In this paper, we propose an approach to automatically build a generic sense inventory (called iSC) to be used as a reference for disambiguation. The sense inventory is built extracting insight from Big Data exploiting a community detection algorithm. Since generate taking into account large corpora of data, the iSC is independent of the domain of application and of predefined target words.
  • Keywords
    Big Data; natural language processing; Big Data; WSD; automatic word sense disambiguation; community detection algorithm; data corpora; data structures; generic sense inventory; iSC; Big data; Communities; Computational linguistics; Context; Indexes; Natural language processing; Social network services; Big Data Analysis; Community Detection; Word Sense Disambiguation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing & Simulation (HPCS), 2014 International Conference on
  • Conference_Location
    Bologna
  • Print_ISBN
    978-1-4799-5312-7
  • Type

    conf

  • DOI
    10.1109/HPCSim.2014.6903701
  • Filename
    6903701