DocumentCode :
3496818
Title :
MCBDist: A Novel Markov-Chain-Based Measure of Distance among Webpages
Author :
Xiong, Zhi ; Wu, Guangguan
Author_Institution :
Shantou Univ., Shantou
fYear :
2008
fDate :
6-8 April 2008
Firstpage :
1577
Lastpage :
1582
Abstract :
In the Web server cluster adopting content-aware request dispatching, the persistent connection feature of HTTP/1.1 may bring extra cost. If the webpages, that are likely to be visited together by users, are grouped into webpage cluster appropriately, and the webpage cluster is viewed as document distribution unit, such cost may decrease. How to measure the distance among webpages is a key problem of webpage clustering. In this paper, we propose a novel Markov-chain-based measure of distance among webpages, called MCBDist. It not only considers the temporal correlation of users´ visit, but also considers the path of users´ visit. In addition, for the dimension of the transition probability matrix is usually very large, transition probability matrix compression is used to deduce the computational complexity. Finally, we give an example to illustrate the effectiveness of MCBDist.
Keywords :
Internet; Markov processes; computational complexity; data compression; document handling; matrix algebra; probability; workstation clusters; Markov-chain-based measure; Web page clustering; Web page distance measure; Web server cluster; computational complexity; content-aware request dispatching; document distribution unit; temporal correlation; transition probability matrix compression; Computational complexity; Costs; Delay; Dispatching; File servers; File systems; Network servers; Routing; Splicing; Web server;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Networking, Sensing and Control, 2008. ICNSC 2008. IEEE International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-1685-1
Electronic_ISBN :
978-1-4244-1686-8
Type :
conf
DOI :
10.1109/ICNSC.2008.4525472
Filename :
4525472
Link To Document :
بازگشت