• DocumentCode
    2461181
  • Title

    A Novel K-Means Clustering Algorithm Based on Positive Examples and Careful Seeding

  • Author

    Liu, Qinchao ; Zhang, Bangzuo ; Sun, Haichao ; Guan, Yu ; Zhao, Lei

  • Author_Institution
    Sch. of Comput. Sci. & Inf. Technol., Northeast Normal Univ., Changchun, China
  • fYear
    2010
  • fDate
    17-19 Dec. 2010
  • Firstpage
    767
  • Lastpage
    770
  • Abstract
    Positive and unlabeled learning (PU Learning) is a special semi-supervise learning method. Its most important feature is that training set only includes two parts: positive examples and unlabeled examples. Many real-world classification applications appeal to PU Learning problem. The K-means++ clustering algorithm proposed a new seeding method. This paper describes a semi-supervised learning algorithm for positive and unlabeled examples (PU learning). Our approach extends K-means++, an enhancement to K-means that seeds the algorithm with suitably chosen cluster centers, to such situations. The experiments on the Spam and 20-newsgroup data sets shown that our proposed algorithm has better performances.
  • Keywords
    learning (artificial intelligence); pattern clustering; K-means clustering algorithm; positive examples; positive learning; seeding method; semi-supervise learning method; unlabeled learning; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Learning; Machine learning; Radio access networks; Unsolicited electronic mail; K-means++; PU Learning; Positive Examples; Semi-Supervise Clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational and Information Sciences (ICCIS), 2010 International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-8814-8
  • Electronic_ISBN
    978-0-7695-4270-6
  • Type

    conf

  • DOI
    10.1109/ICCIS.2010.191
  • Filename
    5709200