• DocumentCode
    3225976
  • Title

    Word sense disambiguation using lexical and semantic information within local syntactic relations

  • Author

    Kim, Young-Kil ; Hong, Mun-Pyo ; Kim, Chang-Hyun ; Park, Sang-Kyn

  • Author_Institution
    Speech/Language Inf. Res. Dept., Electron. & Telecommun. Res. Inst., Daejeon, South Korea
  • Volume
    3
  • fYear
    2004
  • fDate
    2-6 Nov. 2004
  • Firstpage
    3111
  • Abstract
    Until recently, most of WSD models have used adjacent words surrounding a target word as context information for word sense disambiguation. The difficulty of parsing Korean sentences and properly analyzing their structures restricted us to use syntactic relations to select proper senses of Korean words. In this paper, we propose the method to disambiguate Korean noun senses in unrestricted lexis using a statistical WSD model based on lexical and semantic information within syntactic relations. We disambiguate noun senses step by step in the range of local syntactic relations such as VP, NP and compound nouns. We classified the meaning of Korean nouns with about 400 semantic codes allowing for 8 levels in the semantic hierarchy and experimented for 1,838 homographs using the semantic information in consideration with syntactic relations. In the experiment, average senses of homographs that have different translations are 2.84 and the precision of word sense disambiguation is 86.2%.
  • Keywords
    computational linguistics; grammars; natural languages; statistical analysis; text analysis; word processing; homographs; lexical information; semantic hierarchy; semantic information; syntactic relations; word sense disambiguation; Context modeling; Data mining; Dictionaries; Graphics; Natural language processing; Natural languages; Speech; Thesauri; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Electronics Society, 2004. IECON 2004. 30th Annual Conference of IEEE
  • Print_ISBN
    0-7803-8730-9
  • Type

    conf

  • DOI
    10.1109/IECON.2004.1432309
  • Filename
    1432309