Title :
Research of text clustering based on fuzzy granular computing
Author :
Xia, Zhang ; Yixin, Yin ; Mingzhu, Xu ; Hailong, Zhao
Author_Institution :
Comput. Center, Hebei Univ. of Econ. & Bus., Shijiazhuang, China
Abstract :
The typical algorithm of text clustering is a ldquoHard Partitionrdquo one, Actually, Chinese text is better to treat with ldquoSoft Partitionrdquo for its diversity and largeness. The fuzzy-set theory supply a powerful analyzing tool to this ldquoSoft partitionrdquo. Traditional fuzzy text clustering methods mostly are getting the fuzzy equivalent matrix or fuzzy division by iterating the matrix of membership degree, huge storage space is necessary for that process. The text clustering based on fuzzy granular computing will work as: first provide a normalized distance function in the fuzzy granularity space of text set, then use the function to do a dynamic clustering work to text who has a less distance than granularity. Approved by the test, this method has such advantages on reducing the computing complexity and space complexity, suitable for the status that many samples need to be processed.
Keywords :
fuzzy set theory; iterative methods; matrix algebra; pattern clustering; text analysis; fuzzy equivalent matrix; fuzzy granular computing; fuzzy granularity space; fuzzy-set theory; hard partition; iterative method; normalized distance function; soft partition; text clustering; Clustering algorithms; Clustering methods; Fuzzy sets; Internet; Machine learning; Machine learning algorithms; Mathematics; Partitioning algorithms; Power supplies; Railway engineering; fuzzy; granular computing; normalized distance function; text cluster;
Conference_Titel :
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4519-6
Electronic_ISBN :
978-1-4244-4520-2
DOI :
10.1109/ICCSIT.2009.5234519