DocumentCode :
2346938
Title :
Automatic acquisition of wordnet relations by the morpho-syntactic patterns extracted from the corpora in Polish
Author :
Kurc, Roman ; Piasecki, Maciej
Author_Institution :
Inst. of Appl. Inf., Wroclaw Univ. of Technol., Wroclaw
fYear :
2008
fDate :
20-22 Oct. 2008
Firstpage :
181
Lastpage :
188
Abstract :
This paper presents an adaptation of the Espresso algorithm for extracting lexical semantic relations to the specific requirements of Polish. Some of the introduced changes are technical in character, such as the adaptation to existing Polish language tools, but we also investigate a pattern structure that takes into account the specific features of Polish as an inflectional language. A new method of computing the reliability measure is proposed. The modified version of the algorithm, called Estratto, was compared with a more direct reimplementation of Espresso on several Polish corpora. We tested the influence of different algorithm parameters and different corpora on the results obtained.
Keywords :
natural language processing; pattern recognition; Espresso algorithm; Estratto; Polish corpora; Polish language tools; automatic acquisition; lexical semantic relation; morpho-syntactic pattern extraction; wordnet relations; Computational linguistics; Computer architecture; Computer science; Control systems; Data mining; Informatics; Information technology; Paper technology; Software engineering; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Conference_Location :
Wisła
Print_ISBN :
978-83-60810-14-9
Type :
conf
DOI :
10.1109/IMCSIT.2008.4747237
Filename :
4747237