DocumentCode
2346938
Title
Automatic acquisition of wordnet relations by the morpho-syntactic patterns extracted from the corpora in Polish
Author
Kurc, Roman ; Piasecki, Maciej
Author_Institution
Inst. of Appl. Inf., Wroclaw Univ. of Technol., Wroclaw
fYear
2008
fDate
20-22 Oct. 2008
Firstpage
181
Lastpage
188
Abstract
In the paper we present an adaptation of the Espresso algorithm of the extraction of lexical semantic relation to specific requirements of Polish. The introduced changes are of more technical character like the adaptation to the existing Polish language tools, but also we investigate the structure of the patterns that takes into account specific features of Polish as an inflectional language. A new method of the reliability measure computation is proposed. The modified version of the algorithm called Estratto was compared with the more direct reimplementation of Espresso on several corpora of Polish. We tested the influence of different algorithm parameters and different corpora on the received results.
Keywords
natural language processing; pattern recognition; Espresso algorithm; Estratto; Polish corpora; Polish language tools; automatic acquisition; lexical semantic relation; morpho-syntactic pattern extraction; wordnet relations; Computational linguistics; Computer architecture; Computer science; Control systems; Data mining; Informatics; Information technology; Paper technology; Software engineering; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology, 2008. IMCSIT 2008. International Multiconference on
Conference_Location
Wisia
Print_ISBN
978-83-60810-14-9
Type
conf
DOI
10.1109/IMCSIT.2008.4747237
Filename
4747237
Link To Document