DocumentCode :
3732071
Title :
How to Perform Incremental Clustering - A SOM Based View
Author :
Chen Lei;Wu Chong
Author_Institution :
Sch. of Manage., Harbin Inst. of Technol., Harbin, China
fYear :
2015
Firstpage :
450
Lastpage :
455
Abstract :
Due to fast development of network technique, internet users have to face to massive textual data every day. Because of unsupervised merit of clustering, clustering is a good solution for users to analyze and organize texts into categories. However, most of recent clustering algorithms conduct in static situation. That indicates, it doesn´t allow clustering algorithm to deal with novel data efficiently. When novel data appear, traditional clustering algorithms can´t change their structure easily. Obviously, this restrict is not fit to internet, since novel data appear at any time. For this reason, an incremental clustering algorithm is proposed in this paper to cluster incremental data. This algorithm has two factors. (a) It designs two measures to calculate feature´s ability and integrate them in similarity measurement by replacing concurrence based similarity measurements. (b) Based on proposed similarity measurement, this algorithm selects few samples from original texts to perform incremental clustering. Experimental results demonstrate that, after integrating feature´s capacity, our algorithm can obtain high quality to cluster texts.
Keywords :
"Transportation","Big data","Smart cities"
Publisher :
ieee
Conference_Titel :
Intelligent Transportation, Big Data and Smart City (ICITBS), 2015 International Conference on
Type :
conf
DOI :
10.1109/ICITBS.2015.117
Filename :
7384063
Link To Document :
بازگشت