• DocumentCode
    2207489
  • Title

    Semisupervised mixture modeling with fine-grained component-conditional class labeling and transductive inference

  • Author

    Miller, David J. ; Lin, Chu-Fang ; Kesidis, George ; Collins, Christopher M.

  • Author_Institution
    Dept. of Electr. Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2009
  • fDate
    1-4 Sept. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper introduces a new generative semisupervised (transductive) mixture model with a more fine-grained class label generation mechanism than that of previous works. Our approach effectively combines the advantages of standard semisupervised mixtures, which achieve label extrapolation over a mixture component when there are few labeled samples, and nearest-neighbor (NN) classification, which achieves accurate classification in the local vicinity of labeled samples. Toward this end, we propose a two-stage stochastic data generation mechanism, with the unlabeled samples first produced and then the labeled samples generated conditioned on both the unlabeled data and on their components of origin. This nested data generation entails a more complicated (albeit still closed-form) E-step evaluation than that for standard mixtures. Our model is advantageous, compared with previous semisupervised mixtures, when mixture components model data from more than one class and when within-component class proportions are not constant over the feature space region ldquoownedrdquo by a component. Experiments demonstrate gains in classification accuracy over both the previous semisupervised mixture of experts model and over K-NN classification on data sets from the UC Irvine Repository.
  • Keywords
    inference mechanisms; pattern classification; stochastic processes; fine-grained component-conditional class labeling; nearest-neighbor classification; semisupervised mixture modeling; stochastic data generation; transductive inference; Bayesian methods; Data mining; Educational institutions; Extrapolation; Labeling; Machine learning; Nearest neighbor searches; Neural networks; Semisupervised learning; Stochastic processes; Expectation-Maximization; Semisupervised learning; mixture models; nearest neighbor classification; transductive inference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2009. MLSP 2009. IEEE International Workshop on
  • Conference_Location
    Grenoble
  • Print_ISBN
    978-1-4244-4947-7
  • Electronic_ISBN
    978-1-4244-4948-4
  • Type

    conf

  • DOI
    10.1109/MLSP.2009.5306229
  • Filename
    5306229