Title :
Fast latent semantic index using random mapping in text processing
Author_Institution :
Sch. of Manage., Lanzhou Jiaotong Univ., Lanzhou
Abstract :
The calculation bottleneck problems of Kohonen self organizing feature map (SOFM) neural network in the high-dimensional vector environment of text processing and problems of input vector spaces had been analyzed in this paper, and then based on the theoretic analysis of RM (random mapping) and LSI (latent semantic indexing) method respectively, a RM-based fast latent semantic indexing method used in text processing was presented. The fast LSI method could greatly emerges original semantic links and settles the above mentioned problems in a low-cost, efficient and controllable way in the experiment. So the size and the calculation cost of Kohonen SOFM neural network were greatly reduced in text processing environment.
Keywords :
indexing; random processes; self-organising feature maps; text analysis; vectors; Kohonen self organizing feature map neural network; high-dimensional vector environment; input vector space; latent semantic indexing; random mapping; text processing; Costs; Functional analysis; Gaussian distribution; Indexing; Large scale integration; Neural networks; Neurons; Organizing; Pattern analysis; Text processing; Latent semantic indexing; Random mapping; Self organizing feature map neural network; Text classification;
Conference_Titel :
Wavelet Analysis and Pattern Recognition, 2008. ICWAPR '08. International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-2238-8
Electronic_ISBN :
978-1-4244-2239-5
DOI :
10.1109/ICWAPR.2008.4635884