DocumentCode :
2461546
Title :
An Efficient LSI based Information Retrieval Framework using Particle swarm optimization and simulated annealing approach
Author :
Latha, K. ; Rajaram, R.
Author_Institution :
IT/CSE Dept., Thiagarajar Coll. of Eng., Madurai
fYear :
2008
fDate :
14-17 Dec. 2008
Firstpage :
94
Lastpage :
101
Abstract :
The number of users and the amount of information available has exploded since the advent of the World Wide Web (WWW). Most of Web users use various search engines to get specific information. A key factor in the success of Web search engines are their ability to rapidly find good quality results to the queries that are based on specific terms. This paper aims at retrieving more relevant documents from a huge corpus based on the required information. We propose a text mining framework that consists of four distinct stages: 1. Text preprocessing 2. Dimensionality reduction using latent semantic indexing 3. Clustering based on hybrid combination of particle swarm optimization (PSO) and k-means algorithm 4. Information retrieval process using simulated annealing (SA). This framework provides more relevant documents to the user and reduces the irrelevant documents.
Keywords :
data mining; indexing; information retrieval; particle swarm optimisation; pattern clustering; simulated annealing; text analysis; clustering; dimensionality reduction; document retrieval; information retrieval; k-means algorithm; latent semantic indexing; particle swarm optimization; simulated annealing; text mining; text preprocessing; Indexing; Information retrieval; Large scale integration; Particle swarm optimization; Search engines; Simulated annealing; Text mining; Web search; Web sites; World Wide Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computing and Communications, 2008. ADCOM 2008. 16th International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4244-2962-2
Electronic_ISBN :
978-1-4244-2963-9
Type :
conf
DOI :
10.1109/ADCOM.2008.4760433
Filename :
4760433
Link To Document :
بازگشت