Title :
CQIG: An Improved Web Search Results Clustering Algorithm
Author :
Ren, Yong-gong ; Fan, Dan
Author_Institution :
Sch. of Comput. & Inf. Technol., Liaoning Normal Univ., Dalian, China
Abstract :
Massive linear search results returned from traditional search engines bring much inconvenience to users when extract the information they need. Search result clustering is of critical need for grouping similar topics of documents. The existing algorithm has drawbacks in clustering labels screening, cluster quality assessment, overlapping clusters controlling. The improved clustering algorithm-CQIG, which based on LINGO, improved the cluster and cluster label scoring function, increased the cluster merging process and improved the processing effect of Chinese. Finally, a recommended platform for Web search results clustering is established based on carrot framework to prove the accuracy, distinction and readability of CQIG.
Keywords :
Internet; document handling; natural language processing; pattern clustering; search engines; CQIG; LINGO; Web search results clustering algorithm; cluster label scoring function; cluster merging process; cluster quality assessment; clustering labels screening; linear search results; overlapping clusters controlling; search engines; Algorithm design and analysis; Clustering algorithms; Matrix decomposition; Merging; Open systems; Singular value decomposition; Web search; cluster label Introduction; clustering quality assessment; search results clustering; singular value decomposition;
Conference_Titel :
Web Information Systems and Applications Conference (WISA), 2010 7th
Conference_Location :
Hohhot
Print_ISBN :
978-1-4244-8440-9
DOI :
10.1109/WISA.2010.36