• DocumentCode
    2834604
  • Title

    Learning Link-Based Classifiers from Ontology-Extended Textual Data

  • Author

    Caragea, Cornelia ; Caragea, Doina ; Honavar, Vasant

  • Author_Institution
    Comput. Sci. Dept., Iowa State Univ., Ames, IA, USA
  • fYear
    2009
  • fDate
    2-4 Nov. 2009
  • Firstpage
    354
  • Lastpage
    361
  • Abstract
    Real-world data mining applications call for effective strategies for learning predictive models from richly structured relational data. In this paper, we address the problem of learning classifiers from structured relational data that are annotated with relevant meta data. Specifically, we show how to learn classifiers at different levels of abstraction in a relational setting, where the structured relational data are organized in an abstraction hierarchy that describes the semantics of the content of the data. We show how to cope with some of the challenges presented by partial specification in the case of structured data, that unavoidably results from choosing a particular level of abstraction. Our solution to partial specification is based on a statistical method, called shrinkage. We present results of experiments in the case of learning link-based Naive Bayes classifiers on a text classification task that (i) demonstrate that the choice of the level of abstraction can impact the performance of the resulting link-based classifiers and (ii) examine the effect of partially specified data.
  • Keywords
    Bayes methods; data mining; meta data; ontologies (artificial intelligence); Naive Bayes classifiers; abstraction hierarchy; learning link-based classifiers; meta data; ontology-extended textual data; real-world data mining applications; shrinkage; statistical method; structured relational data; text classification task; Application software; Artificial intelligence; Computer science; Data mining; Machine learning; Ontologies; Predictive models; Statistical analysis; Text categorization; Yield estimation; link-based classifier; ontology-extended data sources;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2009. ICTAI '09. 21st International Conference on
  • Conference_Location
    Newark, NJ
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4244-5619-2
  • Electronic_ISBN
    1082-3409
  • Type

    conf

  • DOI
    10.1109/ICTAI.2009.111
  • Filename
    5364346