Title :
An adaptive neural network approach to hypertext clustering
Author :
Vlajic, Natalija ; Card, Howard C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
Abstract :
The WWW is an online hypertextual collection, and a more sophisticated algorithm for Web page clustering may have to be based on combined term-similarity and hyperlink-similarity measures. It has been observed that nearly all currently employed techniques for document classification on the Web make use of textual information only. In addition, most of these techniques are incapable of discovering the real nature of the collection to which they are applied due to rather inefficient clustering algorithms employed. This paper describes a novel technique for hypertext clustering, called an adaptive hypertext clustering (AHC) algorithm. This algorithm has been derived from a modified neural network algorithm, and adjusted to the problem of combined term-similarity and hyperlink-similarity measures. The results presented in the paper show that AHC can be easily adapted to enable the most appropriate Web page classification within collections of various thematic and functional profiles, suggesting its main benefits over the traditional techniques
Keywords :
hypermedia; indexing; information resources; pattern clustering; self-organising feature maps; WWW; Web page classification; Web page clustering; World Wide Web; adaptive neural network; document classification; functional profiles; hyperlink-similarity; hyperlink-similarity measures; hypertext clustering; online hypertextual collection; term-similarity; term-similarity measures; thematic profiles; Adaptive systems; Artificial neural networks; Books; Clustering algorithms; Internet; Libraries; Neural networks; Resonance; Technological innovation; Web pages;
Conference_Titel :
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-5529-6
DOI :
10.1109/IJCNN.1999.830743