DocumentCode :
456761
Title :
Mining Unstructured Web Pages to Enhance Web Information Retrieval
Author :
Chung-Hong Lee
Author_Institution :
Dept. of Electr. Eng., Nat. Kaohsiung Univ. of Appl. Sci.
Volume :
2
fYear :
2006
fDate :
Aug. 30 2006-Sept. 1 2006
Firstpage :
429
Lastpage :
432
Abstract :
One major approach to finding information on the WWW is to navigate Web directories and browse them for the target pages. However, such directories are generally constructed manually and suffer from narrow coverage and inconsistency. In this work, we propose a machine learning approach that automatically constructs a navigational structure for the WWW to help information finding. A self-organizing map is trained on the Web pages to obtain two feature maps, which reveal the relationships among Web pages and among thematic keywords, respectively. We then use these maps to develop a structure that may help users find the information they need. We used a small set of Web pages in the experiments and obtained promising results.
Keywords :
Internet; Web sites; data mining; information retrieval; learning (artificial intelligence); self-organising feature maps; Web directory; Web information retrieval; information navigation; machine learning approach; self-organizing feature map construction; unstructured Web page mining; Humans; Information management; Information retrieval; Machine learning; Navigation; Portals; Search engines; Text mining; Web pages; World Wide Web;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Innovative Computing, Information and Control, 2006. ICICIC '06. First International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7695-2616-0
Type :
conf
DOI :
10.1109/ICICIC.2006.310
Filename :
1692017