• DocumentCode
    1843735
  • Title

    A Fuzzy Approach to Clustering of Text Documents Based on MapReduce

  • Author

    Hu Zongzhen ; Zhu Weina ; Li Yu E ; Du Xiaojuan ; Yan Fan

  • Author_Institution
    Dept. of Comput. Sci., Yunnan Univ., Kunming, China
  • fYear
    2013
  • fDate
    21-23 June 2013
  • Firstpage
    666
  • Lastpage
    669
  • Abstract
    This paper discusses text clustering based on a parallel computing platform called Hadoop. According to the concept of fuzzy set, this paper presents a fuzzy clustering approach for document categorization. Furthermore, a parallel text clustered framework based on MapReduce was designed according to the proposed text clustering procedure.
  • Keywords
    fuzzy set theory; parallel programming; pattern clustering; text analysis; Hadoop parallel computing platform; MapReduce; document categorization; fuzzy approach; fuzzy clustering approach; fuzzy set concept; text clustering procedure; text document clustering; Algorithm design and analysis; Clustering algorithms; Data mining; Educational institutions; Information entropy; Programming; Training; Distributed computing; Fuzzy approach; Hadoop; MapReduce; Parallel computing; Text document clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on
  • Conference_Location
    Shiyang
  • Type

    conf

  • DOI
    10.1109/ICCIS.2013.181
  • Filename
    6643097