• DocumentCode
    2412497
  • Title
    Automatic Identification of Home Pages on the Web
  • Author
    Kennedy, Alistair ; Shepherd, Michael
  • Author_Institution
    Dalhousie University, Halifax, Canada
  • fYear
    2005
  • fDate
    03-06 Jan. 2005
  • Abstract
    The research reported in this paper is the first phase of a larger project on the automatic classification of Web pages by genre. The long-term goal is to incorporate Web page genre into the search process to improve the quality of search results. In this phase, a neural net classifier was trained to distinguish home pages from non-home pages and to classify those home pages as personal, corporate, or organization home pages. Results indicate that the classifier can distinguish home pages from non-home pages and, within the home page genre, can distinguish personal from corporate home pages. Organization home pages, however, were more difficult to distinguish from personal and corporate home pages.
  • Keywords
    Computer science; Machine learning; Neural networks; Search engines; Society home pages; Web pages; Web sites
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    System Sciences, 2005. HICSS '05. Proceedings of the 38th Annual Hawaii International Conference on
  • ISSN
    1530-1605
  • Print_ISBN
    0-7695-2268-8
  • Type
    conf
  • DOI
    10.1109/HICSS.2005.114
  • Filename
    1385438