DocumentCode
553235
Title
Approximating Google´s rankings with Latent Semantic Analysis
Author
Cheng-Jye Luh ; Kazembe, C.W. ; Chun-Ju Li
Author_Institution
Dept. of Inf. Manage., Yuan Ze Univ., Taoyuan, Taiwan
Volume
3
fYear
2011
fDate
26-28 July 2011
Firstpage
1773
Lastpage
1777
Abstract
This study proposed a Latent Semantic Analysis based method to approximate Google´s ranking. We conduct Latent Semantic Analysis on Google´s search results for a given query to find terms with highest LSA weights as features. Then the correlation coefficients between the features and the given query are obtained for use as the feature values. Each result is scored and re-ranked based on a linear combination of weighted sum of feature values that appear in its title, snippet and URL. Experimental results on a small number of popular keywords show that this method is promising to achieve R-Precision up to 0.8 for some combination of search results and features used.
Keywords
approximation theory; query processing; search engines; semantic Web; Google ranking approximation; Google search; R-Precision; URL; correlation coefficient; latent semantic analysis; snippet; Correlation; Google; Matrix decomposition; Patents; Portable media players; Search engines; Semantics; Latent Semantic Analysis; Search Results Ranking; Web Search;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-61284-180-9
Type
conf
DOI
10.1109/FSKD.2011.6019915
Filename
6019915
Link To Document