• DocumentCode
    3752202
  • Title
    Features of web subject-related image and its retrieval significance
  • Author
    Daning Zhan; Yongli Zou
  • Author_Institution
    Sun Yat-Sen University, Guangzhou, China
  • fYear
    2015
  • Firstpage
    1151
  • Lastpage
    1154
  • Abstract
    Keyword-based search is currently the mainstream form of information retrieval. Users must scan a large amount of text in the search results to find the information they want, which is inefficient and makes for a poor user experience. Images in web pages, by contrast, offer a direct, visual, and fast route to retrieval. By studying the features of web subject-related images and achieving their automatic identification and extraction, such images can be displayed as thumbnails alongside the page title and summary on the results pages, helping users filter and browse information more conveniently. This paper establishes a web image attribute system covering both HTML attributes and external attributes. Using corresponding automatic extraction algorithms and data analysis, it then derives 16 feature rules for web subject-related images and builds a web subject-related image feature model. When applied to the sample data, the model achieves extraction and filtering rates above 99%, demonstrating its value in web information retrieval.
  • Keywords
    "Feature extraction","Web pages","Data mining","Information filters"
  • Publisher
    IEEE
  • Conference_Title
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
  • Type
    conf
  • DOI
    10.1109/APSIPA.2015.7415452
  • Filename
    7415452