• DocumentCode
    124256
  • Title

    Online Classification with Partially Labelled Texts

  • Author

    Shirai, Mikiyasu ; Miura, Tsuyoshi

  • Author_Institution
    Dept..of Electr. & Electr. Eng., HOSEI Univ., Koganei, Japan
  • Volume
    2
  • fYear
    2014
  • fDate
    11-14 Aug. 2014
  • Firstpage
    421
  • Lastpage
    428
  • Abstract
    In this investigation, we propose a novel approach to document stream classification using both online topic model and partially labelled documents. Although we may have several features for the classification, it seems natural that these features may vary dynamically depending upon the contents of stream. This is because they depend heavily on each theme within one class while we should follow dynamic mixture of them. Especially in the stream of news articles, word frequency changes dramatically because of bursts of the themes. Here we propose a dynamically learning method based on topic models assuming prior distribution of probabilities over classes adjusted by partially labelled documents in stream.
  • Keywords
    Internet; learning (artificial intelligence); pattern classification; statistical distributions; text analysis; document stream classification; dynamically learning method; news article stream; online classification; online topic model; partially labelled documents; partially labelled texts; probability distribution; stream content; word frequency; Adaptation models; Context modeling; Data models; Feature extraction; Maximum likelihood estimation; Probability distribution; Resource management; adaptive classification; document stream; online topic model; partially labelled documents; topic-burst;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2014.128
  • Filename
    6927655