Title :
Mining Web logs to improve hit ratios of prefetching and caching
Author :
Huang, Yin-Fu ; Hsu, Jhao-Min
Author_Institution :
Inst. of Electron. & Inf. Eng., Nat. Yunlin Univ. of Sci. & Technol., Taiwan
Abstract :
On the Internet, proxy servers play a key role between users and Web sites: they can reduce the response time of user requests and save network bandwidth. To achieve better response times, a proxy server needs an efficient buffer manager that caches frequently accessed documents. In this paper, we develop an access sequence miner that mines popular surfing 2-sequences, together with their conditional probabilities, from the proxy log and stores them in a rule table. Based on the buffer contents and the rule table, a prediction-based buffer manager, also developed here, takes appropriate actions such as document caching, document prefetching, and even adjusting the cache/prefetch buffer sizes to achieve better buffer utilization. Simulation results show that our approach outperforms the other approaches compared, in quantitative measures such as the hit ratio and byte hit ratio of accessed documents.
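The idea of mining 2-sequences with conditional probabilities into a rule table can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a 2-sequence means one document requested immediately after another within a user session, that the conditional probability is estimated as count(a then b) / count(a followed by anything), and that rules are filtered by hypothetical `min_support` and `min_confidence` thresholds. The function names and sample sessions are invented for illustration.

```python
from collections import Counter, defaultdict


def mine_2_sequences(sessions, min_support=2, min_confidence=0.5):
    """Build a rule table {a: [(b, P(b|a)), ...]} from access sequences.

    Assumption: a 2-sequence (a, b) is counted when b directly follows a
    in the same session; thresholds are illustrative, not from the paper.
    """
    first_counts = Counter()  # times a is followed by any request
    pair_counts = Counter()   # times a is immediately followed by b
    for session in sessions:
        for a, b in zip(session, session[1:]):
            first_counts[a] += 1
            pair_counts[(a, b)] += 1

    rule_table = defaultdict(list)
    for (a, b), n in pair_counts.items():
        confidence = n / first_counts[a]  # estimate of P(b | a)
        if n >= min_support and confidence >= min_confidence:
            rule_table[a].append((b, confidence))

    # Most probable successor first, so a prefetcher can take the top entries.
    for a in rule_table:
        rule_table[a].sort(key=lambda rule: rule[1], reverse=True)
    return dict(rule_table)


def prefetch_candidates(rule_table, requested, cached):
    """Return predicted successors of `requested` that are not yet buffered."""
    return [b for b, _ in rule_table.get(requested, []) if b not in cached]


# Hypothetical sessions extracted from a proxy log.
sessions = [
    ["/index", "/news", "/sports"],
    ["/index", "/news", "/weather"],
    ["/index", "/about"],
    ["/index", "/news", "/sports"],
]
rules = mine_2_sequences(sessions)
print(rules)
print(prefetch_candidates(rules, "/index", cached=set()))
```

In this sketch, a request for `/index` would suggest prefetching `/news` unless it is already buffered. How the buffer manager divides space between the cache and prefetch buffers is not covered here.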
Keywords :
Web sites; cache storage; data mining; Internet; Web caching; Web log mining; Web prefetching; Web site; access sequence miner; buffer utilization; cache buffer; conditional probability; document caching; document prefetching; prediction-based buffer manager; prefetch buffer; proxy log; proxy server; rule table; Bandwidth; Content management; Data mining; Delay; Filters; IP networks; Network servers; Predictive models; Prefetching; Web server
Conference_Title :
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2415-X
DOI :
10.1109/WI.2005.100