Title :
Clustering web search results using semantic information
Author :
Wen, Han ; Huang, Guo-shun ; Li, Zhao
Author_Institution :
Sch. of Sci., FOSHAN Univ., Foshan, China
Abstract :
Clustering Web search results will help users finding relevant information quickly. Suffix tree clustering (STC) algorithm is well fit for clustering Web documents. This paper puts forward an improved Web search results clustering algorithm based on STC. It uses latent semantic indexing method to assist finding common descriptive and meaningful topic phrases for the final document clusters. Using semantic information for clustering web snippets is able to make search engine results easy to browse and help users quickly find Web information interested. Evaluation of experiment results demonstrates that clustering Web search results based on the improved suffix tree algorithm gets better performance in cluster label quality and snippets assignment precision.
Keywords :
document handling; indexing; information retrieval; online front-ends; pattern clustering; search engines; semantic Web; trees (mathematics); Web browser; Web search results clustering; Web snippets; cluster label quality; clustering Web documents; document clusters; latent semantic indexing method; search engine; semantic information; snippets assignment precision; suffix tree clustering algorithm; Clustering algorithms; Cybernetics; Frequency; Indexing; Internet; Machine learning; Machine learning algorithms; Search engines; Singular value decomposition; Web search; Latent semantic indexing; Singular value decomposition; Suffix tree clustering;
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
DOI :
10.1109/ICMLC.2009.5212332