DocumentCode
1843735
Title
A Fuzzy Approach to Clustering of Text Documents Based on MapReduce
Author
Hu Zongzhen ; Zhu Weina ; Li Yu E ; Du Xiaojuan ; Yan Fan
Author_Institution
Dept. of Comput. Sci., Yunnan Univ., Kunming, China
fYear
2013
fDate
21-23 June 2013
Firstpage
666
Lastpage
669
Abstract
This paper discusses text clustering based on a parallel computing platform called Hadoop. According to the concept of fuzzy set, this paper presents a fuzzy clustering approach for document categorization. Furthermore, a parallel text clustered framework based on MapReduce was designed according to the proposed text clustering procedure.
Keywords
fuzzy set theory; parallel programming; pattern clustering; text analysis; Hadoop parallel computing platform; MapReduce; document categorization; fuzzy approach; fuzzy clustering approach; fuzzy set concept; text clustering procedure; text document clustering; Algorithm design and analysis; Clustering algorithms; Data mining; Educational institutions; Information entropy; Programming; Training; Distributed computing; Fuzzy approach; Hadoop; MapReduce; Parallel computing; Text document clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on
Conference_Location
Shiyang
Type
conf
DOI
10.1109/ICCIS.2013.181
Filename
6643097
Link To Document