Title :
Semantic log analysis based on a user query behavior model
Author :
Noriaki, Kawamae ; Takeya, Mukaigaito ; Miyoshi, Hanaki
Author_Institution :
NTT Inf. Sharing Platform Labs., Tokyo, Japan
Abstract :
We propose a novel log analysis method to capture the semantic relations among words appearing in Web search logs. Our method focuses on the reciprocal relations among a user´s intentions, stages of information need, and query behavior in seeking information via a search engine. The approach works because it is based on the assumption that a user´s intentions in each query can be derived as a model on the basis of his stage of information need and query behavior, through multiple empirical observations of search logs. The user´s intentions drive user to change the words in each successive queries and can thus be used to clarify the semantic relations among words. As a result, this method has the advantage of capturing the semantic relations among words without requiring either manual or natural language processing. Our experimental results indicate that semantic relations could successfully be derived from search logs, confirming that an ontology and thesaurus could be constructed automatically.
Keywords :
Internet; data mining; information needs; query formulation; search engines; semantic Web; thesauri; word processing; Web search logs; information need; natural language processing; ontology; search engine; semantic log analysis; semantic word relation; thesaurus; user query behavior model; Data mining;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250909