DocumentCode
124256
Title
Online Classification with Partially Labelled Texts
Author
Shirai, Mikiyasu ; Miura, Tsuyoshi
Author_Institution
Dept..of Electr. & Electr. Eng., HOSEI Univ., Koganei, Japan
Volume
2
fYear
2014
fDate
11-14 Aug. 2014
Firstpage
421
Lastpage
428
Abstract
In this investigation, we propose a novel approach to document stream classification using both online topic model and partially labelled documents. Although we may have several features for the classification, it seems natural that these features may vary dynamically depending upon the contents of stream. This is because they depend heavily on each theme within one class while we should follow dynamic mixture of them. Especially in the stream of news articles, word frequency changes dramatically because of bursts of the themes. Here we propose a dynamically learning method based on topic models assuming prior distribution of probabilities over classes adjusted by partially labelled documents in stream.
Keywords
Internet; learning (artificial intelligence); pattern classification; statistical distributions; text analysis; document stream classification; dynamically learning method; news article stream; online classification; online topic model; partially labelled documents; partially labelled texts; probability distribution; stream content; word frequency; Adaptation models; Context modeling; Data models; Feature extraction; Maximum likelihood estimation; Probability distribution; Resource management; adaptive classification; document stream; online topic model; partially labelled documents; topic-burst;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location
Warsaw
Type
conf
DOI
10.1109/WI-IAT.2014.128
Filename
6927655
Link To Document