• Title of article

    Automatic Extraction of Collocations From Korean Text

  • Author/Authors

    Kim، Seonho نويسنده , , Yoon، Juntae نويسنده , , Song، Mansuk نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2001
  • Pages
    -272
  • From page
    273
  • To page
    0
  • Abstract
    In this paper, we propose a statistical method to automatically extract collocations from Korean POS-tagged corpus. Since a large portion of language is represented by collocation patterns, the collocational knowledge provides a valuable resource for NLP applications. One difficulty of collocation extraction is that Korean has a partially free word order, which also appears in collocations. In this work, we exploit four statistics, `frequencyʹ, `randomnessʹ, `convergenceʹ, and `correlationʹ in order to take into account the flexible word order of Korean collocations. We separate meaningful bigrams using an evaluation function based on the four statistics and extend the bigrams to n-gram collocations using a fuzzy relation. Experiments show that this method works well for Korean collocations.
  • Keywords
    continental deformation , orogeny , crustal deformation , radioactivity , topography , viscosity , isostasy
  • Journal title
    COMPUTER AND THE HUMANITIES
  • Serial Year
    2001
  • Journal title
    COMPUTER AND THE HUMANITIES
  • Record number

    32064