DocumentCode
3745842
Title
Search Results Clustering without External Resources
Author
Chris Staff;Joel Azzopardi;Colin Layfield;Daniel Mercieca
Author_Institution
Fac. of Inf. &
fYear
2015
Firstpage
276
Lastpage
280
Abstract
Our unsupervised Search Results Clustering (SRC) system partitions into clusters the top-n results returned by a search engine. We present the results of experiments with our SRC system that performs incremental clustering on document titles and snippets only and does not use external resources, yet which outperforms the best performers to date on the SemEval-2013 Task 11 gold standard. We include Latent Semantic Analysis (LSA) as an optional step, using the snippets themselves as the background corpus. We demonstrate that better results are achieved by leaving the query terms out of the clustering process, and that currently, the version without LSA outperforms the version with LSA.
Keywords
"Clustering algorithms","Conferences","Semantics","Indexes","Search engines","Standards","Gold"
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications (DEXA), 2015 26th International Workshop on
ISSN
1529-4188
Print_ISBN
978-1-4673-7581-8
Electronic_ISBN
2378-3915
Type
conf
DOI
10.1109/DEXA.2015.67
Filename
7406306
Link To Document