• DocumentCode
    29237
  • Title

    Classification in the Presence of Label Noise: A Survey

  • Author

    Frenay, Benoit ; Verleysen, Michel

  • Author_Institution
    Inst. of Inf. & Commun. Technol., Electron. & Appl. Math., Univ. Catholique de Louvain, Louvain-la-Neuve, Belgium
  • Volume
    25
  • Issue
    5
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    845
  • Lastpage
    869
  • Abstract
    Label noise is an important issue in classification, with many potential negative consequences. For example, the accuracy of predictions may decrease, whereas the complexity of inferred models and the number of necessary training samples may increase. Many works in the literature have been devoted to the study of label noise and the development of techniques to deal with label noise. However, the field lacks a comprehensive survey on the different types of label noise, their consequences and the algorithms that consider label noise. This paper proposes to fill this gap. First, the definitions and sources of label noise are considered and a taxonomy of the types of label noise is proposed. Second, the potential consequences of label noise are discussed. Third, label noise-robust, label noise cleansing, and label noise-tolerant algorithms are reviewed. For each category of approaches, a short discussion is proposed to help the practitioner to choose the most suitable technique in its own particular field of application. Eventually, the design of experiments is also discussed, what may interest the researchers who would like to test their own algorithms. In this paper, label noise consists of mislabeled instances: no additional information is assumed to be available like e.g., confidences on labels.
  • Keywords
    learning (artificial intelligence); pattern classification; label noise classification; label noise cleansing algorithm; label noise-robust algorithm; label noise-tolerant algorithms; machine learning; potential negative consequences; Context; Labeling; Noise; Noise measurement; Reliability; Taxonomy; Training; Class noise; classification; label noise; mislabeling; robust methods; survey; survey.;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2013.2292894
  • Filename
    6685834