DocumentCode
1802496
Title
An adaptive neural network approach to hypertext clustering
Author
Vlajic, Natalija ; Card, Howard C.
Author_Institution
Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
Volume
6
fYear
1999
fDate
36342
Firstpage
3722
Abstract
The WWW is an online hypertextual collection, and a more sophisticated algorithm for Web page clustering may have to be based on combined term-similarity and hyperlink-similarity measures. It has been observed that nearly all currently employed techniques for document classification on the Web make use of textual information only. In addition, most of these techniques are incapable of discovering the real nature of the collection to which they are applied due to rather inefficient clustering algorithms employed. This paper describes a novel technique for hypertext clustering, called an adaptive hypertext clustering (AHC) algorithm. This algorithm has been derived from a modified neural network algorithm, and adjusted to the problem of combined term-similarity and hyperlink-similarity measures. The results presented in the paper show that AHC can be easily adapted to enable the most appropriate Web page classification within collections of various thematic and functional profiles, suggesting its main benefits over the traditional techniques
Keywords
hypermedia; indexing; information resources; pattern clustering; self-organising feature maps; WWW; Web page classification; Web page clustering; World Wide Web; adaptive neural network; document classification; functional profiles; hyperlink-similarity; hyperlink-similarity measures; hypertext clustering; online hypertextual collection; term-similarity; term-similarity measures; thematic profiles; Adaptive systems; Artificial neural networks; Books; Clustering algorithms; Internet; Libraries; Neural networks; Resonance; Technological innovation; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location
Washington, DC
ISSN
1098-7576
Print_ISBN
0-7803-5529-6
Type
conf
DOI
10.1109/IJCNN.1999.830743
Filename
830743
Link To Document