• DocumentCode
    2346938
  • Title

    Automatic acquisition of wordnet relations by the morpho-syntactic patterns extracted from the corpora in Polish

  • Author

    Kurc, Roman ; Piasecki, Maciej

  • Author_Institution
    Inst. of Appl. Inf., Wroclaw Univ. of Technol., Wroclaw
  • fYear
    2008
  • fDate
    20-22 Oct. 2008
  • Firstpage
    181
  • Lastpage
    188
  • Abstract
    In this paper we present an adaptation of the Espresso algorithm for the extraction of lexical semantic relations to the specific requirements of Polish. Some of the introduced changes are technical in character, such as the adaptation to existing Polish language tools, but we also investigate a pattern structure that takes into account specific features of Polish as an inflectional language. A new method of computing the reliability measure is proposed. The modified version of the algorithm, called Estratto, was compared with a more direct reimplementation of Espresso on several corpora of Polish. We tested the influence of different algorithm parameters and different corpora on the obtained results.
  • Keywords
    natural language processing; pattern recognition; Espresso algorithm; Estratto; Polish corpora; Polish language tools; automatic acquisition; lexical semantic relation; morpho-syntactic pattern extraction; wordnet relations; Computational linguistics; Computer architecture; Computer science; Control systems; Data mining; Informatics; Information technology; Paper technology; Software engineering; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
  • Conference_Location
    Wisla
  • Print_ISBN
    978-83-60810-14-9
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2008.4747237
  • Filename
    4747237