• DocumentCode
    453878
  • Title

    On the Scarcity of Labeled Data

  • Author

    Bouchachia, Abdelhamid

  • Author_Institution
    Dept. of Inf., Klagenfurt Univ.
  • Volume
    1
  • fYear
    2005
  • fDate
    28-30 Nov. 2005
  • Firstpage
    402
  • Lastpage
    407
  • Abstract
    Scarcity of labeled data can be encountered in various engineering applications due to several factors. This raises the question of how to generate sufficient amounts of labeled data when it is sparse in order to build effective learning tools. One approach to overcome this problem is to use unlabeled data. In this paper, we propose two approaches, each is a two-step process for learning from data that is dominantly unlabeled. In the first approach, the k-NN algorithm is applied to pre-label the unlabeled data. A multi-layer perceptron is then used to classify the pre-labeled data. In the second approach, a prototypicality rule based on FCM is used to pre-label unlabeled data before training the MLP classifier. The evaluation, conducted on three data sets, shows how unlabeled data enhances the accuracy of the neural classifier
  • Keywords
    data analysis; learning (artificial intelligence); multilayer perceptrons; pattern classification; MLP classifier; k-NN algorithm; labeled data scarcity; multilayer perceptron; Clustering algorithms; Computational modeling; Data engineering; Informatics; Kernel; Labeling; Multilayer perceptrons; Neural networks; Prototypes; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence for Modelling, Control and Automation, 2005 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
  • Conference_Location
    Vienna
  • Print_ISBN
    0-7695-2504-0
  • Type

    conf

  • DOI
    10.1109/CIMCA.2005.1631299
  • Filename
    1631299