• DocumentCode
    3696918
  • Title

    Entity Recognition and Relations Extraction Based on the Structure of Online Encyclopedia

  • Author

    Qing Song;Yue Yang

  • Author_Institution
    New Media Inst., Commun. Univ. of China, Beijing, China
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    478
  • Lastpage
    482
  • Abstract
    In order to construct the knowledge base in the field of journalism, this paper improved the knowledge representation framework of Freebase to make it more suitable for the knowledge in journalism domain. On the basis, we chose to extract entities, entity attributes and relationships between entities from Baidu Encyclopedia websites in order to analyze its structure. According to the infobox, the Character Relationship, the Relevant Characters and the Category Labels templates on the Baidu Encyclopedia webpages, we harvested entity triples (Entity1, Relation, Entity2). Then, we supplemented the characters relationship types through the Entity Relation template in Hudong Encyclopedia webpages. Through the natural language processing technology, the entity similarity algorithm, the association rules reasoning algorithm and other methods, we cleaned and supplemented the knowledge. After that, we stored the knowledge to the graph database according to the knowledge representation model. Finally, we got a better knowledge base in the field of journalism.
  • Keywords
    "Encyclopedias","Knowledge based systems","Uniform resource locators","Knowledge representation","Text recognition","Data mining"
  • Publisher
    ieee
  • Conference_Titel
    Applied Computing and Information Technology/2nd International Conference on Computational Science and Intelligence (ACIT-CSI), 2015 3rd International Conference on
  • Type

    conf

  • DOI
    10.1109/ACIT-CSI.2015.91
  • Filename
    7336111