• DocumentCode
    3307405
  • Title

    An enhanced EM method of semi-supervised classification based on Naive Bayesian

  • Author

    Wen Han ; Xiao Nan-Feng ; Li Zhao

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Technol., Guangzhou, China
  • Volume
    2
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    987
  • Lastpage
    991
  • Abstract
    Semi-supervised learning (SSL) based on Naïve Bayesian and Expectation Maximization (EM) combines small limited numbers of labeled data with a large amount of unlabeled data to help train classifier and increase classification accuracy. With the aim of improving the efficiency problem of the basic EM algorithm, an enhanced EM method is proposed. Firstly, a feature selection function of strong category information is constructed to control the dimension of feature vector and preserve useful feature terms. Secondly, an intermediate classifier gradually transfers unlabeled documents of maximum posterior category probability to labeled collection during each iteration process of the EM algorithm. The iteration number of the enhanced EM is obviously less than the basic EM. Finally, experiments shows that the improved method obtains very effective performance in terms of macro average accuracy and algorithm efficiency.
  • Keywords
    Bayes methods; expectation-maximisation algorithm; learning (artificial intelligence); text analysis; algorithm efficiency; automatic text classification; category information; expectation maximization method enhancement; feature selection function; feature terms; feature vector; intermediate classifier; iteration process; macro average accuracy; maximum posterior category probability; naive Bayesian; semisupervised learning; Accuracy; Bayesian methods; Classification algorithms; Educational institutions; Machine learning; Mathematical model; Text categorization; Naïve Bayesian; Semi-supervised classification; enhanced EM; feature selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019690
  • Filename
    6019690