Title :
Hierarchical Clustering Based on Co-word for Web Information Retrieval
Author :
Li, Fenglin ; He, Zhoufang
Author_Institution :
Sch. of Inf. Manage., Wuhan Univ., Wuhan, China
Abstract :
This paper proposes a novel method to generate labels for grouping and organizing the search results returned by auxiliary search engines. It has applied statistical techniques to measure the quantities of co-occurrence keywords for forming the label matrix of them, and then agglomerated them into higher-level clusters by clustering algorithm in order to classify the results which return from the source search engine. Compared with Lingo, the experimental results show that the labels generated by our algorithm are of more readability and generality. What´s more, F-measure index also shows that our algorithm has improved the quality of text clustering to some extent.
Keywords :
Internet; information retrieval; pattern clustering; search engines; statistical analysis; F-measure index; Web information retrieval; auxiliary search engines; cooccurrence keyword; hierarchical clustering; higher level cluster; source search engine; statistical techniques; text clustering; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Home appliances; Labeling; Search engines; Web search; clustering; co-occurrence keywords; retrieval results;
Conference_Titel :
Computational and Information Sciences (ICCIS), 2010 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-8814-8
Electronic_ISBN :
978-0-7695-4270-6
DOI :
10.1109/ICCIS.2010.138