• DocumentCode
    3455294
  • Title

    An Improved EM-Based Semi-supervised Learning Method

  • Author

    Fan, Xinghua ; Guo, Zhiyi ; Ma, Houfeng

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Chongqing Univ. of Posts & Telecommun., Chongqing, China
  • fYear
    2009
  • fDate
    3-5 Aug. 2009
  • Firstpage
    529
  • Lastpage
    532
  • Abstract
    The iterative process in the standard EM-based semi-supervised learning includes two steps: firstly, use the classifier constructed in previous iteration to classify all unlabeled samples; then, train a new classifier based on the reconstructed training set, which is composed of labeled samples and all unlabeled samples. There is a problem in the process of reconstructing the training set: unlabeled samples should be divided into two parts by the classifier, reliable and misclassified, the misclassified part is considered as training samples. This affects the finally classification performance and convergence efficiency. This paper presents an improved method, which reconstructs the labeled samples set and then considers it as the training set in each iterative process. The process is implemented by adding the reliable part in unlabeled samples to labeled samples and forming a new labeled samples set. Experimental results in text classification show that the proposed method improves the classification performance and convergence efficiency.
  • Keywords
    data handling; iterative methods; learning (artificial intelligence); pattern classification; classification performance; convergence efficiency; data reconstruction; iterative process; reconstructed training set; semisupervised learning method; unlabeled samples classification; Bioinformatics; Convergence; Electromagnetic compatibility; Intelligent systems; Iterative algorithms; Iterative methods; Machine learning; Semisupervised learning; Systems biology; Text categorization; Data reconstruction; EM algorithm; Semi-supervised learning; ensemble learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3739-9
  • Type

    conf

  • DOI
    10.1109/IJCBS.2009.27
  • Filename
    5260445