Title :
A Clustering Algorithm of No-Word-Segmentation for Chinese Search Engine
Author :
Wang, Deqing ; Zhang, Hui ; Zhao, Liping ; Xie, Ke
Author_Institution :
Beihang Univ., Beijing
Abstract :
As the amount of information on the Internet grows dramatically, people usually rely on search engines to find the information they need. Clustering search engine results is an effective way to help people pick out the information they need from the result list. This paper presents a clustering algorithm without word segmentation for Chinese search engine results (CANWS). The algorithm first preprocesses the search engine results, then computes the pairwise similarities of the results based on the substrings they share, and finally clusters the results based on the resulting similarity matrix. The paper also tests and analyzes the algorithm's performance through experiments.
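The three stages named in the abstract (preprocessing, substring-based similarity, clustering on the similarity matrix) can be illustrated with a minimal sketch. The abstract does not give CANWS's actual formulas, so everything here is an assumption chosen for illustration: the punctuation-stripping `preprocess`, the similarity defined as longest-common-substring length divided by the shorter string's length, the single-link threshold grouping, and all function names and the `threshold` parameter.

```python
import re


def preprocess(text):
    """Assumed preprocessing: drop whitespace and punctuation, keep letters, digits, CJK."""
    return re.sub(r"[\W_]+", "", text)


def longest_common_substring(a, b):
    """Length of the longest contiguous substring shared by a and b (dynamic programming)."""
    best = 0
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0] * (len(b) + 1)
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def similarity(a, b):
    """Assumed measure: shared-substring length normalized by the shorter string."""
    if not a or not b:
        return 0.0
    return longest_common_substring(a, b) / min(len(a), len(b))


def similarity_matrix(docs):
    """Symmetric matrix of pairwise similarities; the diagonal is 1.0."""
    n = len(docs)
    return [
        [1.0 if i == j else similarity(docs[i], docs[j]) for j in range(n)]
        for i in range(n)
    ]


def cluster(matrix, threshold):
    """Assumed clustering: single-link grouping of items whose similarity >= threshold."""
    n = len(matrix)
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for i in range(n):
        for j in range(i + 1, n):
            if matrix[i][j] >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical result titles; no word segmentation is applied at any point.
titles = ["北京大学招生简章", "北京大学图书馆", "苹果手机价格", "苹果手机评测"]
docs = [preprocess(t) for t in titles]
clusters = cluster(similarity_matrix(docs), threshold=0.5)
```

Because the comparison works directly on character sequences, no dictionary or segmenter is needed, which is the property the title refers to. The quadratic pairwise comparison is acceptable for a single page of results but would need a more efficient structure, such as a suffix tree, for larger inputs.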
Keywords :
pattern clustering; search engines; text analysis; Chinese search engine results; Internet; clustering algorithm; information location; no-word-segmentation; Algorithm design and analysis; Clustering algorithms; Feedback; Performance analysis; Programming; Software algorithms; Testing; Web search
Conference_Title :
Semantics, Knowledge and Grid, Third International Conference on
Conference_Location :
Shan Xi
Print_ISBN :
0-7695-3007-9
Electronic_ISBN :
978-0-7695-3007-9
DOI :
10.1109/SKG.2007.11