• DocumentCode
    2139585
  • Title

    Web Image Annotation Based on the Decision Rules Inferred by the Statistical Analysis of Web Pages

  • Author

    Park, Joohyoun ; Choe, Giseok ; Lee, Jongwon ; Nang, Jongho

  • Author_Institution
    Sogang Univ., Seoul
  • fYear
    2007
  • fDate
    16-19 Oct. 2007
  • Firstpage
    183
  • Lastpage
    188
  • Abstract
    This paper proposes a rule based web image annotation method which improves the precision and recall of annotation by the use of decision tree. This decision tree learns the relationship between images and their annotations based on the proposed 17 attributes that specify the structural relationship between them in HTML documents and the visual characteristics of the images. By converting and pruning this learned tree, a set of rules with high estimated accuracy which determines whether or not a word can be the keyword of an image can be generated. Upon experimental results, the proposed method made 57 rules and the precision and recall of annotation by these rules were about 88% and 95% for the various concepts, respectively. We argue the contribution of this work in two aspects. First, we suggest the clear criteria for precise annotation inferred by the statistical analysis of many web pages. Second, to cope with the deterioration of recall caused by the lack of measure for the visual characteristics, the visual similarity between an image and its concept combines to the attributes that used for tree learning.
  • Keywords
    Internet; decision trees; document image processing; hypermedia markup languages; information analysis; statistical analysis; HTML documents; Web image annotation; Web pages; decision rules; decision tree; statistical analysis; structural relationship; tree learning; visual characteristics; visual similarity; Computer science; Cultural differences; Decision trees; Educational institutions; HTML; Image retrieval; Information technology; Statistical analysis; Web pages; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
  • Conference_Location
    Aizu-Wakamatsu, Fukushima
  • Print_ISBN
    978-0-7695-2983-7
  • Type

    conf

  • DOI
    10.1109/CIT.2007.123
  • Filename
    4385078