Title :
Mining generalized query patterns from web logs
Author :
Ling, Charles X. ; Gao, Jianfeng ; Zhang, Huajie ; Qian, Weining ; Zhang, Hongjiang
Author_Institution :
Dept. of Comput. Sci., Univ. of Western Ontario, London, Ont., Canada
Abstract :
User logs of a popular search engine keep track of user activities including user queries, user click-through from the returned list, and user browsing behaviors. Knowledge about user queries discovered from user logs can improve the performance of the search engine. We propose a data-mining approach that produces generalized query patterns or templates from the raw user logs of a popular commercial knowledge-based search engine that is currently in use. Our simulation shows that such templates can improve search engine´s speed and precision, and can cover queries not asked previously. The templates are also comprehensible so web editors can easily discover topics in which most users are interested.
Keywords :
Internet; data mining; digital simulation; information resources; knowledge based systems; query processing; search engines; data-mining approach; generalized query patterns; generalized query patterns mining; knowledge-based search engine; search engine; simulation; templates; user activities; user browsing; user click-through; user logs; user queries; web editors; web logs; Bayesian methods; Computer science; Data mining; Electrical capacitance tomography; Internet; Read only memory; Search engines; Web pages; Web server; Web sites;
Conference_Titel :
System Sciences, 2001. Proceedings of the 34th Annual Hawaii International Conference on
Conference_Location :
Maui, HI, USA
Print_ISBN :
0-7695-0981-9
DOI :
10.1109/HICSS.2001.926534