• DocumentCode
    1554735
  • Title

    Robust and Scalable Graph-Based Semisupervised Learning

  • Author

    Liu, Wei ; Wang, Jun ; Chang, Shih-Fu

  • Author_Institution
    Department of Electrical Engineering, Columbia University, New York, NY, USA
  • Volume
    100
  • Issue
    9
  • fYear
    2012
  • Firstpage
    2624
  • Lastpage
    2638
  • Abstract
    Graph-based semisupervised learning (GSSL) provides a promising paradigm for modeling the manifold structures that may exist in massive data sources in high-dimensional spaces. It has been shown effective in propagating a limited amount of initial labels to a large amount of unlabeled data, matching the needs of many emerging applications such as image annotation and information retrieval. In this paper, we provide reviews of several classical GSSL methods and a few promising methods in handling challenging issues often encountered in web-scale applications. First, to successfully incorporate the contaminated noisy labels associated with web data, label diagnosis and tuning techniques applied to GSSL are surveyed. Second, to support scalability to the gigantic scale (millions or billions of samples), recent solutions based on anchor graphs are reviewed. To help researchers pursue new ideas in this area, we also summarize a few popular data sets and software tools publicly available. Important open issues are discussed at the end to stimulate future research.
  • Keywords
    Cost function; Image classification; Image processing; Laplace equations; Noise measurement; Semisupervised learning; Supervised learning; Anchor graphs; graph-based semisupervised learning (GSSL); image annotation; image classification; image search; label diagnosis; large scale; noisy labels;
  • fLanguage
    English
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    ieee
  • ISSN
    0018-9219
  • Type

    jour

  • DOI
    10.1109/JPROC.2012.2197809
  • Filename
    6235979