Title :
Towards semantics-based prefetching to reduce Web access latency
Author :
Xu, Cheng-Zhong ; Ibrahim, Tamer I.
Author_Institution :
Dept. of Electr. & Comput. Eng., Wayne State Univ., Detroit, MI, USA
Abstract :
Prefetching is an important technique for tolerating Web access latency. Existing prefetching algorithms are mostly based on URL graphs. While they have been shown to be effective at prefetching frequently accessed documents, few of them can prefetch documents whose URLs have never been accessed. We propose a semantics-based prefetching technique to overcome this limitation. It predicts future requests based on the semantic preferences exhibited by previously retrieved documents. We apply this technique to news reading and prototype a client-side prefetching system, NewsAgent. The system extracts document semantics by identifying keywords in the anchor texts of URLs and relies on neural networks over the keyword set to predict future requests. We cross-examine the system in daily browsing of the ABC News, CNN, and MSNBC News sites over three months and demonstrate the effectiveness of the technique.
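The keyword-based prediction described in the abstract can be illustrated with a minimal sketch. This is not the NewsAgent implementation: the stopword list, the single-layer perceptron, the threshold, the learning rate, and all names (`extract_keywords`, `KeywordPerceptron`, `select_prefetch`) are assumptions chosen for illustration, since the abstract does not specify the network design.

```python
import re

# Hypothetical stopword list; the abstract does not say how keywords are filtered.
STOPWORDS = {"a", "an", "the", "of", "in", "on", "to", "for", "and", "at", "with", "by", "is"}


def extract_keywords(anchor_text):
    """Return the set of lowercase alphabetic tokens that are not stopwords."""
    tokens = re.findall(r"[a-z]+", anchor_text.lower())
    return {t for t in tokens if t not in STOPWORDS}


class KeywordPerceptron:
    """Single-layer perceptron with one weight per keyword (an assumed stand-in
    for the neural networks mentioned in the abstract)."""

    def __init__(self, threshold=0.5, learning_rate=0.3):
        self.weights = {}
        self.threshold = threshold
        self.learning_rate = learning_rate

    def score(self, keywords):
        # Unseen keywords contribute zero.
        return sum(self.weights.get(k, 0.0) for k in keywords)

    def predict(self, keywords):
        return self.score(keywords) >= self.threshold

    def update(self, keywords, clicked):
        """Perceptron rule: adjust weights only when the prediction was wrong."""
        target = 1.0 if clicked else 0.0
        output = 1.0 if self.predict(keywords) else 0.0
        error = target - output
        for k in keywords:
            self.weights[k] = self.weights.get(k, 0.0) + self.learning_rate * error


def select_prefetch(model, links):
    """Return URLs whose anchor-text keywords the model predicts will be requested.

    `links` is a list of (url, anchor_text) pairs taken from the current page.
    """
    return [url for url, text in links if model.predict(extract_keywords(text))]
```

In use, the model would be updated with the anchor text of each link the reader follows (`clicked=True`) or ignores (`clicked=False`), and `select_prefetch` would then be called on the links of each newly loaded page. Because prediction depends on keywords rather than on URLs, a link that has never been seen before can still be selected.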
Keywords :
Internet; Web sites; information retrieval; neural nets; storage management; ABC News site; CNN site; MSNBC News site; NewsAgent; URL anchor texts; Web access latency reduction; client-side prefetching system; daily browsing; document semantics extraction; future request prediction; keyword set; neural networks; news reading activities; semantic preferences; semantics-based prefetching; Bandwidth; Cellular neural networks; Delay; History; Neural networks; Prefetching; Probability; Prototypes; Uniform resource locators; Web and internet services;
Conference_Title :
Applications and the Internet, 2003. Proceedings. 2003 Symposium on
Print_ISBN :
0-7695-1872-9
DOI :
10.1109/SAINT.2003.1183065