DocumentCode
456761
Title
Mining Unstructured Web Pages to Enhance Web Information Retrieval
Author
Chung-Hong Lee
Author_Institution
Dept. of Electr. Eng., Nat. Kaohsiung Univ. of Appl. Sci.
Volume
2
fYear
2006
fDate
Aug. 30 2006-Sept. 1 2006
Firstpage
429
Lastpage
432
Abstract
One major approach for information finding in the WWW is to navigate through some Web directories and browse them for the goal pages. However, such directories are generally constructed manually and have disadvantages of narrow coverage and inconsistency. In this work, we propose a machine learning approach that automatically constructs a navigational structure for the WWW to help information finding. A self-organizing map is constructed to train the Web pages and obtain two feature maps, which reveal the relationships among Web pages and thematic keywords respectively. We then use these maps to develop a structure that may assist the users finding the information they need. We used a small set of Web pages in the experiments and obtained promising result
Keywords
Internet; Web sites; data mining; information retrieval; learning (artificial intelligence); self-organising feature maps; Web directory; Web information retrieval; information navigation; machine learning approach; self-organizing feature map construction; unstructured Web page mining; Humans; Information management; Information retrieval; Machine learning; Navigation; Portals; Search engines; Text mining; Web pages; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovative Computing, Information and Control, 2006. ICICIC '06. First International Conference on
Conference_Location
Beijing
Print_ISBN
0-7695-2616-0
Type
conf
DOI
10.1109/ICICIC.2006.310
Filename
1692017
Link To Document