DocumentCode :
2709107
Title :
Generalized Framework for Syntax-Based Relation Mining
Author :
Coppola, Bonaventura ; Moschitti, Alessandro ; Pighin, Daniele
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Trento, Trento
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
153
Lastpage :
162
Abstract :
Supervised approaches to data mining are particularly appealing as they allow for the extraction of complex relations from data objects. In order to facilitate their application in different areas, ranging from protein to protein interaction in bioinformatics to text mining in computational linguistics research, a modular and general mining framework is needed. The major constraint to the generalization process concerns the feature design for the description of relational data. In this paper, we present a machine learning framework for the automatic mining of relations, where the target objects are structurally organized in a tree. Object types are generalized by means of the use of roles, whereas the relation properties are described by means of the underlying tree structure. The latter is encoded in the learning algorithm thanks to kernel methods for structured data, which represent structures in terms of their all possible subparts. This approach can be applied to any kind of data disregarding their very nature. Experiments with support vector machines on two text mining datasets for relation extraction, i.e. the PropBank and FrameNet corpora, show both that our approach is general, and that it reaches state-of-the-art accuracy.
Keywords :
data mining; learning (artificial intelligence); support vector machines; FrameNet; PropBank; automatic relation mining; data mining; generalization; kernel methods; machine learning framework; relational data; roles; structured data; supervised learning; support vector machines; syntax-based relation mining; text mining datasets; tree; Bioinformatics; Computational linguistics; Data mining; Kernel; Machine learning; Machine learning algorithms; Proteins; Support vector machines; Text mining; Tree data structures; frame recognition; kernel methods; relation mining; semantic role labeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.153
Filename :
4781110
Link To Document :
بازگشت