• DocumentCode
    456761
  • Title

    Mining Unstructured Web Pages to Enhance Web Information Retrieval

  • Author

    Chung-Hong Lee

  • Author_Institution
    Dept. of Electr. Eng., Nat. Kaohsiung Univ. of Appl. Sci.
  • Volume
    2
  • fYear
    2006
  • fDate
    Aug. 30 2006-Sept. 1 2006
  • Firstpage
    429
  • Lastpage
    432
  • Abstract
    One major approach for information finding in the WWW is to navigate through some Web directories and browse them for the goal pages. However, such directories are generally constructed manually and have disadvantages of narrow coverage and inconsistency. In this work, we propose a machine learning approach that automatically constructs a navigational structure for the WWW to help information finding. A self-organizing map is constructed to train the Web pages and obtain two feature maps, which reveal the relationships among Web pages and thematic keywords respectively. We then use these maps to develop a structure that may assist the users finding the information they need. We used a small set of Web pages in the experiments and obtained promising result
  • Keywords
    Internet; Web sites; data mining; information retrieval; learning (artificial intelligence); self-organising feature maps; Web directory; Web information retrieval; information navigation; machine learning approach; self-organizing feature map construction; unstructured Web page mining; Humans; Information management; Information retrieval; Machine learning; Navigation; Portals; Search engines; Text mining; Web pages; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Computing, Information and Control, 2006. ICICIC '06. First International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7695-2616-0
  • Type

    conf

  • DOI
    10.1109/ICICIC.2006.310
  • Filename
    1692017