DocumentCode :
2699228
Title :
A Comparative Study of Feature Vector-Based Topic Detection Schemes A Comparative Study of Feature Vector-Based Topic Detection Schemes
Author :
Hamamot, Masafumi ; Kitagawa, Hiroyuki ; Pan, Jia-Yu ; Faloutsos, Christos
Author_Institution :
Graduate Sch. of Syst. & Inf. Eng., Tsukuba Univ.
fYear :
2005
fDate :
8-9 April 2005
Firstpage :
122
Lastpage :
127
Abstract :
Topic detection is an important subject when voluminous text data is sent continuously to a user. We examine a method to detect topics in text data using feature vectors. Feature vectors represent the main distribution of data and they are obtained by various data analysis methods. This paper examines three methods: singular value decomposition (SVD), clustering, and independent component analysis (ICA). SVD and clustering are popular existing methods. Clustering, especially, is applied to many topic detection methods. ICA was recently developed in signal processing research. In applications related to text data, however, ICA has not been compared with SVD and clustering, nor has its relationship with them been explored. This paper reports comparative experiments for these three methods and then shows properties as they apply to text data
Keywords :
data analysis; independent component analysis; pattern clustering; singular value decomposition; text analysis; data analysis; data distribution; feature vector-based topic detection schemes; independent component analysis; singular value decomposition; text data; text streams; Computer science; Computer vision; Data analysis; Data engineering; Data mining; Feature extraction; Independent component analysis; Postal services; Singular value decomposition; Systems engineering and theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
Conference_Location :
Tokyo
Print_ISBN :
0-7695-2414-1
Type :
conf
DOI :
10.1109/WIRI.2005.1
Filename :
1553004
Link To Document :
بازگشت