Title :
Relation extraction from wikipedia articles by entities clustering
Author :
Song Liu ; Fuji Ren
Author_Institution :
Sch. of Comput. Sci., Beijing Univ. of Posts & Telecommun., Beijing, China
fDate :
Oct. 30 2012-Nov. 1 2012
Abstract :
Wikipedia is an encyclopedia based on wiki technology. It is multilingual high quality knowledge base. In this work a episode based extraction method are proposed to extract relations from Wikipedia articles. The entities are clustered and labeled. The relation extraction is benefited by the information redundancy provided by the clusters. A strict Wikipedia entities clustering algorithm based on the category system and first sentence of the article is approached. This work required less manual assist. And the relations are abundant. The results are comparable with other works [1, 2].
Keywords :
Web sites; information retrieval; pattern clustering; Wikipedia articles relation extraction; Wikipedia entities clustering algorithm; category system; encyclopedia; entity labelling; episode based extraction method; information redundancy; multilingual high quality knowledge base; wiki technology; Clustering algorithms; Data mining; Electronic publishing; Encyclopedias; Internet; Kernel; entities clustering; episode; relation extraction;
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
DOI :
10.1109/CCIS.2012.6664633