DocumentCode :
2774918
Title :
Expert-Driven Topical Classification of Short Message Streams
Author :
Kamath, Krishna Y. ; Caverlee, James
Author_Institution :
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
fYear :
2011
fDate :
9-11 Oct. 2011
Firstpage :
388
Lastpage :
393
Abstract :
We study the problem of expert-driven topical classification of short messages in time-evolving streams like Face book status updates, Twitter messages, and SMS communication. While high-level topics in these streams may be fixed (e.g., Sports, News), the content associated with these topics is typically less static, reflecting temporal change in interest as these streams evolve (e.g., tweets about the Olympics wane, while tweets about the World Cup rise in popularity). Coupled with this rapid concept drift, short messages themselves provide little contextual information and result in sparse features for effective classification. With these challenges in mind, we present an expert-driven framework for time-aware topical classification framework of short messages. Three of the salient features of the framework are (i) a novel expert-centric classifier, (ii) a sliding-window training for adaptive topical classification, and (iii) a suite of enrichment-based methods (lexical, link, collocation) for overcoming feature sparsity in short messages.
Keywords :
electronic messaging; pattern classification; social networking (online); Facebook status updates; SMS communication; Twitter messages; enrichment based methods; expert centric classifier; expert driven topical classification; feature sparsity; rapid concept drift; short message streams; sliding window training; time aware topical classification; time evolving streams; Entropy; Feature extraction; Frequency measurement; Medical services; Training; Twitter; classification; short-text; social media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4577-1931-8
Type :
conf
DOI :
10.1109/PASSAT/SocialCom.2011.213
Filename :
6113139
Link To Document :
بازگشت