DocumentCode :
1988588
Title :
Empirical study of a novel approach to LSI for text categorisation
Author :
Jaber, T. ; Amira, A. ; Milligan, P.
Author_Institution :
Sch. of Electron. Electr. Eng. & Comput. Sci., Queen´´s Univ., Belfast
fYear :
2007
fDate :
12-15 Feb. 2007
Firstpage :
1
Lastpage :
4
Abstract :
Latent Semantic Indexing (LSI) is a technique used in Information Retrieval (IR) as an effective tool in correlating and retrieving relevant documents. The authors presented a new philosophy for LSI analysis and evaluation based on the use of image processing tools. In this new approach the Term Document Matrix (TDM) generated in the LSI process is visualized and treated as an image enabling techniques from image processing to be applied. This paper presents a novel extension to this work in which various features of the target databases can be used to predict, and pre-select, search criteria. This latest approach has been evaluated and validated by applying it to a range of sample databases.
Keywords :
image processing; information retrieval; text analysis; image enabling techniques; image processing tools; information retrieval; latent semantic indexing; relevant document retrieval; term document matrix; text categorisation; Image analysis; Image databases; Image processing; Indexing; Information retrieval; Large scale integration; Spatial databases; Text categorization; Time division multiplexing; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Its Applications, 2007. ISSPA 2007. 9th International Symposium on
Conference_Location :
Sharjah
Print_ISBN :
978-1-4244-0778-1
Electronic_ISBN :
978-1-4244-1779-8
Type :
conf
DOI :
10.1109/ISSPA.2007.4555496
Filename :
4555496
Link To Document :
بازگشت