• DocumentCode
    3752790
  • Title
    Towards automatic document classification by exploiting only knowledge resources
  • Author
    Gleidson Antonio Cardoso da Silva;Carina F. Dorneles
  • Author_Institution
    Federal University of Santa Catarina, Florianopolis, Brazil
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Document classification is critical to optimizing information retrieval tasks, especially on the web. In this environment, the open-domain nature and growing volume of available data remain a challenge for the classification task. In this paper, we address these problems using only knowledge resources. Our approach relies on concept instances derived from the document and on an open-domain knowledge base for concept generalization. The set of broader concepts is ranked according to a disparity value, and the top-ranked concept is taken as the document's class label. Experimental results on real-world datasets show that this approach can achieve document classification without the need to build an ontology or to train and maintain a classification model.
  • Keywords
    "Knowledge based systems","Ontologies","Training","Proposals","Semantics","Informatics","Information retrieval"
  • Publisher
    IEEE
  • Conference_Titel
    2015 34th International Conference of the Chilean Computer Science Society (SCCC)
  • Type
    conf
  • DOI
    10.1109/SCCC.2015.7416573
  • Filename
    7416573