DocumentCode :
2865369
Title :
A Clustering Algorithm of No-Word-Segmentation for Chinese Search Engine
Author :
Wang, Deqing ; Zhang, Hui ; Zhao, Liping ; Xie, Ke
Author_Institution :
Beihang Univ., Beijing
fYear :
2007
fDate :
29-31 Oct. 2007
Firstpage :
258
Lastpage :
261
Abstract :
Along with information on the Internet increasing dramatically, People usually search and locate information that they needed by search engines. Clustering search engine results is an effective method to help people select information needed from the list of search engine results. The paper presents a clustering algorithm of no-word-segmentation for Chinese search engine results (CANWS). The algorithm firstly preprocesses the search engine results and then computes the similarities of the results based on the same sub-string. Lastly it clusters the results based on the similarity matrix. The paper also gives test and analysis of the algorithm performance by experiments.
Keywords :
pattern clustering; search engines; text analysis; Chinese search engine results; Internet; clustering algorithm; information location; no-word-segmentation; search engines; Algorithm design and analysis; Clustering algorithms; Feedback; Internet; Performance analysis; Programming; Search engines; Software algorithms; Testing; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantics, Knowledge and Grid, Third International Conference on
Conference_Location :
Shan Xi
Print_ISBN :
0-7695-3007-9
Electronic_ISBN :
978-0-7695-3007-9
Type :
conf
DOI :
10.1109/SKG.2007.11
Filename :
4438544
Link To Document :
بازگشت