• DocumentCode
    76116
  • Title

    Multitask Spectral Clustering by Exploring Intertask Correlation

  • Author

    Yang Yang ; Zhigang Ma ; Yi Yang ; Feiping Nie ; Heng Tao Shen

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • Volume
    45
  • Issue
    5
  • fYear
    2015
  • fDate
    May-15
  • Firstpage
    1069
  • Lastpage
    1080
  • Abstract
    Clustering, as one of the most classical research problems in pattern recognition and data mining, has been widely explored and applied to various applications. Due to the rapid evolution of data on the Web, more emerging challenges have been posed on traditional clustering techniques: 1) correlations among related clustering tasks and/or within individual task are not well captured; 2) the problem of clustering out-of-sample data is seldom considered; and 3) the discriminative property of cluster label matrix is not well explored. In this paper, we propose a novel clustering model, namely multitask spectral clustering (MTSC), to cope with the above challenges. Specifically, two types of correlations are well considered: 1) intertask clustering correlation, which refers the relations among different clustering tasks and 2) intratask learning correlation, which enables the processes of learning cluster labels and learning mapping function to reinforce each other. We incorporate a novel l2,p -norm regularizer to control the coherence of all the tasks based on an assumption that related tasks should share a common low-dimensional representation. Moreover, for each individual task, an explicit mapping function is simultaneously learnt for predicting cluster labels by mapping features to the cluster label matrix. Meanwhile, we show that the learning process can naturally incorporate discriminative information to further improve clustering performance. We explore and discuss the relationships between our proposed model and several representative clustering techniques, including spectral clustering, k -means and discriminative k -means. Extensive experiments on various real-world datasets illustrate the advantage of the proposed MTSC model compared to state-of-the-art clustering approaches.
  • Keywords
    learning (artificial intelligence); matrix algebra; pattern clustering; MTSC model; cluster label matrix; clustering performance; clustering tasks; data mining; discriminative k-mean clustering; discriminative property; individual task; intertask clustering correlation; intratask learning correlation; l2,p -norm regularizer; low-dimensional representation; mapping function learning process; multitask spectral clustering; out-of-sample data clustering; pattern recognition; representative clustering techniques; Algorithm design and analysis; Clustering algorithms; Coherence; Correlation; Educational institutions; Laplace equations; Linear programming; Clustering; multitask; out-of-sample;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2344015
  • Filename
    6902787