• DocumentCode
    1315971
  • Title

    The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter

  • Author

    Castelli, Vittori ; Cover, Thomas M.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    42
  • Issue
    6
  • fYear
    1996
  • fDate
    11/1/1996 12:00:00 AM
  • Firstpage
    2102
  • Lastpage
    2117
  • Abstract
    We observe a training set Q composed of l labeled samples {(X11),...,(Xl, θl )} and u unlabeled samples {X1´,...,Xu´}. The labels θi are independent random variables satisfying Pr{θi=1}=η, Pr{θi=2}=1-η. The labeled observations Xi are independently distributed with conditional density fθi(·) given θi. Let (X0 0) be a new sample, independently distributed as the samples in the training set. We observe X0 and we wish to infer the classification θ0. In this paper we first assume that the distributions f1(·) and f2(·) are given and that the mixing parameter is unknown. We show that the relative value of labeled and unlabeled samples in reducing the risk of optimal classifiers is the ratio of the Fisher informations they carry about the parameter η. We then assume that two densities g1(·) and g2(·) are given, but we do not know whether g1(·)=f1 (·) and g2(·)=f2(·) or if the opposite holds, nor do we know η. Thus the learning problem consists of both estimating the optimum partition of the observation space and assigning the classifications to the decision regions. Here, we show that labeled samples are necessary to construct a classification rule and that they are exponentially more valuable than unlabeled samples
  • Keywords
    decision theory; estimation theory; learning (artificial intelligence); pattern classification; random processes; Fisher information; classification rule; conditional density; decision region; independent random variables; independently distributed samples; labeled samples; learning problem; observation space; optimal classifiers; optimum partition; pattern recognition; training set; unknown mixing parameter; unlabeled samples; Random variables; Time of arrival estimation;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/18.556600
  • Filename
    556600