• DocumentCode
    553235
  • Title

    Approximating Google´s rankings with Latent Semantic Analysis

  • Author

    Cheng-Jye Luh ; Kazembe, C.W. ; Chun-Ju Li

  • Author_Institution
    Dept. of Inf. Manage., Yuan Ze Univ., Taoyuan, Taiwan
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1773
  • Lastpage
    1777
  • Abstract
    This study proposed a Latent Semantic Analysis based method to approximate Google´s ranking. We conduct Latent Semantic Analysis on Google´s search results for a given query to find terms with highest LSA weights as features. Then the correlation coefficients between the features and the given query are obtained for use as the feature values. Each result is scored and re-ranked based on a linear combination of weighted sum of feature values that appear in its title, snippet and URL. Experimental results on a small number of popular keywords show that this method is promising to achieve R-Precision up to 0.8 for some combination of search results and features used.
  • Keywords
    approximation theory; query processing; search engines; semantic Web; Google ranking approximation; Google search; R-Precision; URL; correlation coefficient; latent semantic analysis; snippet; Correlation; Google; Matrix decomposition; Patents; Portable media players; Search engines; Semantics; Latent Semantic Analysis; Search Results Ranking; Web Search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019915
  • Filename
    6019915