مرکز منطقه ای اطلاع رساني علوم و فناوري - An efficient web document clustering algorithm for building dynamic similarity profile in Similarity-aware web caching

DocumentCode :

2316857

Title :

An efficient web document clustering algorithm for building dynamic similarity profile in Similarity-aware web caching

Author :

Xiao, Ji-Tian

Author_Institution :

Sch. of Comput. & Security Sci., Edith Cowan Univ., Mount Lawley, WA, Australia

Volume :

fYear :

2012

fDate :

15-17 July 2012

Firstpage :

1268

Lastpage :

1273

Abstract :

Discovering and establishing similarities among web documents have been one of the key research streams in web usage mining community in the recent years. The knowledge obtained from the exercise can be used for many applications such as optimizing web cache organization and improving the quality of web document pre-fetching. This paper presents an efficient matrix-based method to cluster web documents based on a predetermined similarity threshold. Our preliminary experiments have demonstrated that the new algorithm outperforms existing algorithms. The clustered web documents are then applied to a Similarity-aware web content management system, facilitating offline building of the similarity-ware web caches and online updating similarity profiles of the system.

Keywords :

Internet; cache storage; data mining; document handling; matrix algebra; pattern clustering; Web cache organization optimization; Web document clustering algorithm; Web document pre-fetching quality improvement; Web usage mining community; dynamic similarity profile; matrix-based method; online updating similarity profiles; predetermined similarity threshold; similarity discovery; similarity establishment; similarity-aware Web caching; similarity-aware Web content management system; Abstracts; Algorithm design and analysis; Clustering algorithms; Similarity profile; Web caching; Web document clustering;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Cybernetics (ICMLC), 2012 International Conference on

Conference_Location :

Xian

ISSN :

2160-133X

Print_ISBN :

978-1-4673-1484-8

Type :

conf

DOI :

10.1109/ICMLC.2012.6359547

Filename :

6359547

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2316857