DocumentCode :
2475061
Title :
An optimal locality preserving indexing algorithm for text mining
Author :
Tao, Jian-Wen ; Cheng, Guang-Hua ; Xin-Rong Lv ; Jie-Yu Zhao
Author_Institution :
Dept. of Inf. Eng., Zhejiang Bus. Technol. Inst., Ningbo
fYear :
2008
fDate :
25-27 June 2008
Firstpage :
165
Lastpage :
170
Abstract :
LPI is not efficient in time and memory which makes it difficult to be applied to very large data set. We propose a optimal algorithm called FLPI. FLPI decomposes the LPI problem as a graph embedding problem plus a regularized least squares problem. Such modification avoids eigen decomposition of dense matrices and can significantly reduce both time and memory cost in computation. Moreover, with a specifically designed graph in supervised situation, LPI only needs to solve the regularized least squares problem which is a further saving of time and memory. Real data experimental results show that FLPI obtains similar or better results comparing to LPI.
Keywords :
data mining; eigenvalues and eigenfunctions; graph theory; least squares approximations; matrix algebra; text analysis; very large databases; FLPI; dense matrices; eigen decomposition; graph embedding problem; optimal algorithm; optimal locality preserving indexing algorithm; regularized least squares problem; specifically designed graph; supervised situation; text mining; very large data set; Automation; Computational efficiency; Data engineering; Educational institutions; Indexing; Information science; Intelligent control; Least squares methods; Matrix decomposition; Text mining; Dimensionality Reduction; Document Indexing; Locality Preserving Indexing; Text Clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
Type :
conf
DOI :
10.1109/WCICA.2008.4592918
Filename :
4592918
Link To Document :
بازگشت