Title :
Trend detection from large text data
Author :
Abe, Hidenao ; Tsumoto, Shusaku
Author_Institution :
Dept. of Med. Inf., Shimane Univ., Izumo, Japan
Abstract :
In temporal text mining, some importance indices such as simple appearance frequency, tf-idf, and differences of some indices play the key role to point out remarkable trends of terms in sets of documents. However, almost of conventional methods have treated their remarkable trends as discrete statuses for each time-point or fixed period. In this paper, we present a method to find out remarkable temporal behaviors of technical terms by using several importance indices and temporal clustering on the indices. The implemented method with three indices and k-means clustering performed on research document sets. The results of the case study show that the method has a feasibility to point out emergent, popular, and subsiding terms based on the linear trend of the temporal clusters of the technical terms.
Keywords :
data mining; pattern clustering; k-means clustering; remarkable temporal behavior; research document; temporal clustering; temporal text mining; trend detection; Jaccard´s Matching Coefficient; Linear Regression; TF-IDF; Temporal Clustering; Text Mining;
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-6586-6
DOI :
10.1109/ICSMC.2010.5641682