Title :
Mining Unstructured Web Pages to Enhance Web Information Retrieval
Author_Institution :
Dept. of Electr. Eng., Nat. Kaohsiung Univ. of Appl. Sci.
fDate :
Aug. 30 2006-Sept. 1 2006
Abstract :
One major approach for information finding in the WWW is to navigate through some Web directories and browse them for the goal pages. However, such directories are generally constructed manually and have disadvantages of narrow coverage and inconsistency. In this work, we propose a machine learning approach that automatically constructs a navigational structure for the WWW to help information finding. A self-organizing map is constructed to train the Web pages and obtain two feature maps, which reveal the relationships among Web pages and thematic keywords respectively. We then use these maps to develop a structure that may assist the users finding the information they need. We used a small set of Web pages in the experiments and obtained promising result
Keywords :
Internet; Web sites; data mining; information retrieval; learning (artificial intelligence); self-organising feature maps; Web directory; Web information retrieval; information navigation; machine learning approach; self-organizing feature map construction; unstructured Web page mining; Humans; Information management; Information retrieval; Machine learning; Navigation; Portals; Search engines; Text mining; Web pages; World Wide Web;
Conference_Titel :
Innovative Computing, Information and Control, 2006. ICICIC '06. First International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7695-2616-0
DOI :
10.1109/ICICIC.2006.310