DocumentCode :
2237739
Title :
Categorization and Monitoring of Internet Public Opinion Based on Latent Semantic Analysis
Author :
Wan, Yuan ; Tong, Hengqing
Author_Institution :
Math Dept., Wuhan Univ. of Technol., Wuhan
Volume :
2
fYear :
2008
fDate :
19-19 Dec. 2008
Firstpage :
121
Lastpage :
124
Abstract :
Rapid progress of network arouses much attention on Internet public opinion. To address this issue, we propose a novel system for categorization and monitoring of Internet public opinion. Due to the text format of Internet public opinion and the semantic relationship between words in such documents, we introduce latent semantic analysis (LSA) to represent document of public opinion. Compared to the traditional vector space model (VSM), LSA overcomes the problem of high dimensional space. We use two classifiers to perform text categorization on a corpus collected from a hot Website. For the monitoring, we give the structure of this module and introduce its main functions.
Keywords :
Internet; classification; social sciences computing; text analysis; Internet public opinion; Web site; latent semantic analysis; monitoring; public opinion document; text categorization; Data mining; Electronic mail; IP networks; Information analysis; Information management; Internet; Monitoring; Seminars; Space technology; Text categorization; -Internet public opinion; latent semantic analysi; monitoring; text categorization; vector space model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Business and Information Management, 2008. ISBIM '08. International Seminar on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3560-9
Type :
conf
DOI :
10.1109/ISBIM.2008.98
Filename :
5116436
Link To Document :
بازگشت