DocumentCode
2185340
Title
Mining Web logs to improve hit ratios of prefetching and caching
Author
Huang, Yin-Fu ; Hsu, Jhao-Min
Author_Institution
Inst. of Electron. & Inf. Eng., Nat. Yunlin Univ. of Sci. & Technol., Taiwan
fYear
2005
fDate
19-22 Sept. 2005
Firstpage
577
Lastpage
580
Abstract
In the Internet, proxy servers play the key roles between users and Web sites, which could reduce the response time of user requests and save network bandwidth. Basically, an efficient buffer manager should be built in a proxy server to cache frequently accessed documents in the buffer, thereby achieving better response time. In the paper, we developed an access sequence miner to mine popular surfing 2-sequences with their conditional probabilities from the proxy log, and stored them in the rule table. Then, according to buffer contents and the rule table, a prediction-based buffer manager also developed here makes appropriate actions such as document caching, document prefetching, and even cache/prefetch buffer size adjusting to achieve better buffer utilization. Through the simulation, we found that our approach has better performance than the others, in the quantitative measures such as hit ratios and byte hit ratios of accessed documents.
Keywords
Web sites; cache storage; data mining; Internet; Web caching; Web log mining; Web prefetching; Web site; access sequence miner; buffer utilization; cache buffer; conditional probability; document caching; document prefetching; prediction-based buffer manager; prefetch buffer; proxy log; proxy server; rule table; Bandwidth; Content management; Data mining; Delay; Filters; IP networks; Network servers; Predictive models; Prefetching; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2415-X
Type
conf
DOI
10.1109/WI.2005.100
Filename
1517911
Link To Document