• DocumentCode
    3127814
  • Title

    A Novel Co-clustering Method with Intra-similarities

  • Author

    Wu, Jian-Sheng ; Lai, Jian-Huang ; Wang, Chang-Dong

  • Author_Institution
    Sch. of Inf. Sci. & Technol., Sun Yat-sen Univ., Guangzhou, China
  • fYear
    2011
  • fDate
    11-11 Dec. 2011
  • Firstpage
    300
  • Lastpage
    306
  • Abstract
    Recently, co-clustering has become a topic of much interest because of its applications to many problems. It has been proved more effective than one-way clustering methods. But the existing co-clustering approaches just treat the document as a collection of words, disregarding the word sequences. They only consider the co-occurrence counts of words and documents, but do not take into account the similarities between words and similarities between documents. However, these similarity information can help improving the co-clustering. In this paper, we incorporate the word similarities and document similarities into the co-clustering algorithm, and propose a new co-clustering method. And we provide a theoretical analysis that our algorithm can converge to a local minimum. The empirical evaluation on publicly available data sets also shows that our algorithm is effective.
  • Keywords
    pattern clustering; text analysis; coclustering method; cooccurrence word count; document clustering; document similarity; similarity information; word collection; word sequence; word similarity; Algorithm design and analysis; Clustering algorithms; Decision support systems; Joints; Mutual information; Partitioning algorithms; Prototypes; co-clustering; document similarities; word similarities;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4673-0005-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2011.15
  • Filename
    6137394