DocumentCode
3696918
Title
Entity Recognition and Relations Extraction Based on the Structure of Online Encyclopedia
Author
Qing Song;Yue Yang
Author_Institution
New Media Inst., Commun. Univ. of China, Beijing, China
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
478
Lastpage
482
Abstract
In order to construct the knowledge base in the field of journalism, this paper improved the knowledge representation framework of Freebase to make it more suitable for the knowledge in journalism domain. On the basis, we chose to extract entities, entity attributes and relationships between entities from Baidu Encyclopedia websites in order to analyze its structure. According to the infobox, the Character Relationship, the Relevant Characters and the Category Labels templates on the Baidu Encyclopedia webpages, we harvested entity triples (Entity1, Relation, Entity2). Then, we supplemented the characters relationship types through the Entity Relation template in Hudong Encyclopedia webpages. Through the natural language processing technology, the entity similarity algorithm, the association rules reasoning algorithm and other methods, we cleaned and supplemented the knowledge. After that, we stored the knowledge to the graph database according to the knowledge representation model. Finally, we got a better knowledge base in the field of journalism.
Keywords
"Encyclopedias","Knowledge based systems","Uniform resource locators","Knowledge representation","Text recognition","Data mining"
Publisher
ieee
Conference_Titel
Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI), 2015 3rd International Conference on
Type
conf
DOI
10.1109/ACIT-CSI.2015.91
Filename
7336111
Link To Document