Title :
Automatic acquisition of wordnet relations by the morpho-syntactic patterns extracted from the corpora in Polish
Author :
Kurc, Roman ; Piasecki, Maciej
Author_Institution :
Inst. of Appl. Inf., Wroclaw Univ. of Technol., Wroclaw
Abstract :
In the paper we present an adaptation of the Espresso algorithm of the extraction of lexical semantic relation to specific requirements of Polish. The introduced changes are of more technical character like the adaptation to the existing Polish language tools, but also we investigate the structure of the patterns that takes into account specific features of Polish as an inflectional language. A new method of the reliability measure computation is proposed. The modified version of the algorithm called Estratto was compared with the more direct reimplementation of Espresso on several corpora of Polish. We tested the influence of different algorithm parameters and different corpora on the received results.
Keywords :
natural language processing; pattern recognition; Espresso algorithm; Estratto; Polish corpora; Polish language tools; automatic acquisition; lexical semantic relation; morpho-syntactic pattern extraction; wordnet relations; Computational linguistics; Computer architecture; Computer science; Control systems; Data mining; Informatics; Information technology; Paper technology; Software engineering; Testing;
Conference_Titel :
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Conference_Location :
Wisia
Print_ISBN :
978-83-60810-14-9
DOI :
10.1109/IMCSIT.2008.4747237