Title :
Unsupervised Relation Extraction by Massive Clustering
Author :
Gonzalez, E. ; Turmo, Jordi
Author_Institution :
TALP Res. Center, Univ. Politec. de Catalunya, Barcelona, Spain
Abstract :
The goal of Information Extraction is to automatically generate structured pieces of information from the relevant information contained in text documents. Machine Learning techniques have been applied to reduce the cost of Information Extraction system adaptation. However, elements of human supervision strongly bias the learning process. Unsupervised learning approaches can avoid these biases. In this paper, we propose an unsupervised approach to learning for Relation Detection, based on the use of massive clustering ensembles. The results obtained on the ACE Relation Mention Detection task outperform in terms of F1 score by 5 points the state of the art of unsupervised techniques for this evaluation framework, in addition to being simpler and more flexible.
Keywords :
data mining; information retrieval; pattern clustering; text analysis; unsupervised learning; ACE relation mention detection; automatic content extraction; information extraction system adaptation; machine learning techniques; massive clustering; relation detection; text documents; unsupervised learning approach; unsupervised relation extraction; Adaptive systems; Automatic testing; Costs; Data mining; Humans; Learning systems; Machine learning; Proposals; Text mining; Unsupervised learning; Ensemble Clustering; Relation Detection; Unsupervised Methods;
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2009.81