DocumentCode :
3177036
Title :
Trend detection from large text data
Author :
Abe, Hidenao ; Tsumoto, Shusaku
Author_Institution :
Dept. of Med. Inf., Shimane Univ., Izumo, Japan
fYear :
2010
fDate :
10-13 Oct. 2010
Firstpage :
310
Lastpage :
315
Abstract :
In temporal text mining, some importance indices such as simple appearance frequency, tf-idf, and differences of some indices play the key role to point out remarkable trends of terms in sets of documents. However, almost of conventional methods have treated their remarkable trends as discrete statuses for each time-point or fixed period. In this paper, we present a method to find out remarkable temporal behaviors of technical terms by using several importance indices and temporal clustering on the indices. The implemented method with three indices and k-means clustering performed on research document sets. The results of the case study show that the method has a feasibility to point out emergent, popular, and subsiding terms based on the linear trend of the temporal clusters of the technical terms.
Keywords :
data mining; pattern clustering; k-means clustering; remarkable temporal behavior; research document; temporal clustering; temporal text mining; trend detection; Jaccard´s Matching Coefficient; Linear Regression; TF-IDF; Temporal Clustering; Text Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1062-922X
Print_ISBN :
978-1-4244-6586-6
Type :
conf
DOI :
10.1109/ICSMC.2010.5641682
Filename :
5641682
Link To Document :
بازگشت