• DocumentCode
    769277
  • Title

    Learning Contextual Dependency Network Models for Link-Based Classification

  • Author

    Tian, Yonghong ; Yang, Qiang ; Huang, Tiejun ; Ling, Charles X. ; Gao, Wen

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing
  • Volume
    18
  • Issue
    11
  • fYear
    2006
  • Firstpage
    1482
  • Lastpage
    1496
  • Abstract
    Links among objects contain rich semantics that can be very helpful in classifying the objects. However, many irrelevant links can be found in real-world link data such as Web pages. Often, these noisy and irrelevant links do not provide useful and predictive information for categorization. It is thus important to automatically identify which links are most relevant for categorization. In this paper, we present a contextual dependency network (CDN) model for classifying linked objects in the presence of noisy and irrelevant links. The CDN model makes use of a dependency function that characterizes the contextual dependencies among linked objects. In this way, CDNs can differentiate the impacts of the related objects on the classification and consequently reduce the effect of irrelevant links on the classification. We show how to learn the CDN model effectively and how to use the Gibbs inference framework over the learned model for collective classification of multiple linked objects. The experiments show that the CDN model demonstrates relatively high robustness on data sets containing irrelevant links
  • Keywords
    classification; inference mechanisms; learning (artificial intelligence); text analysis; Gibbs inference framework; Web pages; contextual dependency network models; link-based classification; text categorization; Accuracy; Context modeling; Inference algorithms; Machine learning; Markov random fields; Predictive models; Robustness; Web pages; Data dependencies; Gibbs inference.; contextual dependency networks; hypertext/hypermedia; link context; link-based classification; machine learning;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.178
  • Filename
    1704801