Title :
Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning
Author :
Qiu, Jing ; Liao, Lejian ; Li, Peng
Author_Institution :
Beijing Lab. of Intell. Inf. Technol., Beijing Inst. of Technol., Beijing, China
Abstract :
Extracting instances of a given target relation from a given Web page corpus seems to be the basic work to exploit nearly endless source of knowledge which provided by the World Wide Web. Supervised learning requires a large amount of labeled data, but the data labeling process can be expensive and time consuming. In this paper we present a kernel-based weakly supervised machine learning algorithm for relation extraction. It takes a small set of target relations as input. The goal is to automatically extract arbitrary binary relations from Web documents in the domain of football games. Bootstrapping is used to improve the performance of the system. We also compare the performances on different input example sizes. Experimental results show the effectiveness and benefits of our approach.
Keywords :
Internet; computer games; document handling; information retrieval; learning (artificial intelligence); Chinese news Web documents; Web page corpus; World Wide Web; arbitrary binary relation extraction; bootstrapping; data labeling process; football games; kernel-based weakly supervised machine learning algorithm; Data mining; Intelligent networks; Kernel; Knowledge engineering; Machine learning; Supervised learning; Support vector machine classification; Support vector machines; Web pages; Web sites; Kernel method; Machine learning; Relation extraction; Weakly supervised;
Conference_Titel :
Intelligent Networking and Collaborative Systems, 2009. INCOS '09. International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-5165-4
Electronic_ISBN :
978-0-7695-3858-7
DOI :
10.1109/INCOS.2009.14