DocumentCode :
3433001
Title :
Classifying twitter data with Naïve Bayes Classifier
Author :
Tseng, Chien-Chih ; Pateli, Nishant ; Paranjape, Hemant ; Lin, T.Y. ; SooTee Teoh
fYear :
2012
fDate :
11-13 Aug. 2012
Firstpage :
294
Lastpage :
299
Abstract :
Information Classification is the categorization of the huge amount of data in an efficient and useful way. In the current scenario data is growing exponentially due to the rise of internet rich applications. One such source of information is the blogs. Blogs are web logs maintained by their authors that contain information related to a certain topic and also contain authors view about that topic. Micro blogs, on the other hands, are variations of blogs that contain smaller data as compared to blogs. In this project, Twitter, a micro blogging website has been targeted to gather information on certain trending topics. The information is in the form of tweets. A tweet is a post or an update on status on the Twitter website. These tweets are extracted using Twitter Search APIs. This data is then classified into different classes based on its content. Using the classified data, features are extracted from the tweets and suggestions are given to the users based on the trending topics.
Keywords :
application program interfaces; belief networks; pattern classification; social networking (online); Internet rich application; Twitter Search API; Twitter data classification; Web log; application program interface; data categorization; feature extraction; information classification; microblogging Web site; naive Bayes classifier; trending topic; Blogs; Computers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2012 IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-2310-9
Type :
conf
DOI :
10.1109/GrC.2012.6468706
Filename :
6468706
Link To Document :
بازگشت