Title :
Survey on text clustering algorithm
Author :
Liu, Fasheng ; Xiong, Lu
Author_Institution :
Jiangxi Univ. of Sci. & Technol., Ganzhou, China
Abstract :
With the popularity of Internet and large-scale improvement in the level of enterprise information, the explosive growth of resources, the research of text mining, information filtering and information search appear the unprecedented prospect. So, the cluster technology is becoming the core of text information mining technologies. Clustering is an important form of data mining. This paper introduces common text clustering algorithms, analyses and compares some aspects of clustering algorithms which contains the applicable scope, the initial parameters, termination conditions and noise sensitivity. Algorithms contain hierarchical clustering, partitioned clustering, density-based algorithm and self-organizing maps algorithm.
Keywords :
data mining; pattern clustering; text analysis; data mining; information filtering; information search; self-organizing maps algorithm; text clustering algorithm; text information mining; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Clustering methods; Partitioning algorithms; Shape; Signal processing algorithms; cluster text; hierarchical clustering; k-means algorithm; text clustering algorithm;
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2011 IEEE 2nd International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-9699-0
DOI :
10.1109/ICSESS.2011.5982485