• DocumentCode
    3165903
  • Title

    Automatic classification of Web information based on site structure

  • Author

    Kening, Gao ; Leiming, Yang ; Bin, Zhang ; Qiaozi, Chai ; Anxiang, Ma

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang
  • fYear
    2005
  • fDate
    23-25 Nov. 2005
  • Lastpage
    558
  • Abstract
    How to classify automatically Web information that grows explosive is becoming an imminent problem needed to be resolved. Based on site structure, we propose, in this paper, a new mechanism of automatic classification of Web information, which downloads Web pages within a Web site, records the hyperlinks among Web pages, catches the site structure, extracts the classifying system of the site itself, and then links categorizing information with the correspondent position in the site structure. Therefore automatic classification of Web information can be realized through matching the positions of categorizing information with the positions of Web pages. Experiments show that such classification based on site structure works more accurately and efficiently
  • Keywords
    Web sites; classification; Web information automatic classification; Web page; Web site structure; Classification tree analysis; Data mining; Educational institutions; Explosives; Information science; Information systems; Machine learning; Navigation; Statistics; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cyberworlds, 2005. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    0-7695-2378-1
  • Type

    conf

  • DOI
    10.1109/CW.2005.24
  • Filename
    1587594