Title :
Hybrid pre-query term expansion using latent semantic analysis
Author :
Park, Laurence A F ; Ramamohanarao, Kotagiri
Author_Institution :
Dept. of Comput. Sci., Melbourne Univ., Vic., Australia
Abstract :
Latent semantic retrieval methods (unlike vector space methods) take the document and query vectors and map them into a topic space to cluster related terms and documents. This produces a more precise retrieval but also a long query time. We present a new method of document retrieval which allows us to process the latent semantic information into a hybrid latent semantic-vector space query mapping. This mapping automatically expands the users query based on the latent semantic information in the document set. This expanded query is processed using a fast vector space method. Since we have the latent semantic data in a mapping, we are able to store and retrieve vector information in the same fast manner that the vector space method offers. Multiple mappings are combined to produce hybrid latent semantic retrieval which provide precision results 5% greater than the vector space method and fast query times.
Keywords :
information retrieval; text analysis; document retrieval; hybrid latent semantic; hybrid prequery term expansion; latent semantic analysis; latent semantic retrieval; query mapping; query vectors; user query; vector space method; Computational complexity; Computer science; Hybrid power systems; Information retrieval; Machine intelligence; Vectors; Writing;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10085