• DocumentCode
    707254
  • Title

    Unsupervised Hindi word sense disambiguation based on network agglomeration

  • Author

    Jain, Amita ; Lobiyal, D.K.

  • Author_Institution
    Ambedkar Inst. of Adv. Commun. Technol. & Res., Jawaharlal Nehru Univ., Delhi, India
  • fYear
    2015
  • fDate
    11-13 March 2015
  • Firstpage
    195
  • Lastpage
    200
  • Abstract
    Word sense disambiguation (WSD) is an essential task in computational linguistics for language understanding applications such as information retrieval, question answering, machine translation, and text summarization. In this paper we propose an unsupervised WSD method for Hindi sentences based on network agglomeration. First, we create the sentence graph G for the given sentence; this graph collectively represents all interpretations of the sentence. From G we then create an interpretation graph G′ ⊆ G for each interpretation of the sentence. To identify the desired interpretation, we compute the network agglomeration of every interpretation graph and select the interpretation with the highest value. Results on a standard sense-tagged corpus show that the proposed method performs better than previous approaches.
  • Keywords
    computational linguistics; graph theory; natural language processing; interpretation graph; language understanding applications; network agglomeration; sentence graph; standard sense tagged corpus; unsupervised Hindi word sense disambiguation; unsupervised WSD method; Computers; Context; Information retrieval; Knowledge based systems; Knowledge discovery; Natural language processing; Standards; Hindi WordNet; Network Agglomeration; Word Sense Disambiguation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computing for Sustainable Global Development (INDIACom), 2015 2nd International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-9-3805-4415-1
  • Type

    conf

  • Filename
    7100244
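
The pipeline outlined in the abstract (build a sentence graph, derive one interpretation graph per combination of senses, score each, keep the best) can be illustrated with a minimal sketch. The abstract does not define how network agglomeration is computed, so the score below is a stand-in assumption (weighted edge density of the interpretation graph), not the authors' measure. The sense inventory, the edge weights, and the names `SENSES`, `EDGES`, `interpretation_graph`, `agglomeration_score`, and `disambiguate` are all hypothetical; a real system would presumably draw senses and relations from a resource such as Hindi WordNet.

```python
from itertools import product

# Hypothetical toy data: candidate senses per ambiguous word (not from the paper).
SENSES = {
    "bank": ["bank_river", "bank_finance"],
    "deposit": ["deposit_money", "deposit_sediment"],
    "interest": ["interest_money", "interest_curiosity"],
}

# Hypothetical sentence graph G: undirected weighted edges between related senses.
EDGES = {
    frozenset(("bank_finance", "deposit_money")): 1.0,
    frozenset(("bank_finance", "interest_money")): 1.0,
    frozenset(("deposit_money", "interest_money")): 1.0,
    frozenset(("bank_river", "deposit_sediment")): 1.0,
}


def interpretation_graph(senses, edges):
    """Return the edges of G induced by one chosen sense per word (a subgraph of G)."""
    chosen = set(senses)
    return {pair: weight for pair, weight in edges.items() if pair <= chosen}


def agglomeration_score(senses, edges):
    """Stand-in cohesion score (assumption): weighted edge density of the subgraph."""
    n = len(senses)
    possible_pairs = n * (n - 1) / 2
    if possible_pairs == 0:
        return 0.0
    return sum(interpretation_graph(senses, edges).values()) / possible_pairs


def disambiguate(sense_inventory, edges):
    """Enumerate every interpretation and keep the one with the highest score."""
    words = list(sense_inventory)
    best = max(
        product(*(sense_inventory[word] for word in words)),
        key=lambda combo: agglomeration_score(combo, edges),
    )
    return dict(zip(words, best))


if __name__ == "__main__":
    print(disambiguate(SENSES, EDGES))
```

Enumerating every sense combination grows exponentially with the number of ambiguous words, so this exhaustive search is only practical for short sentences; the abstract does not say how the authors handle longer ones.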