Title :
Categorization and Monitoring of Internet Public Opinion Based on Latent Semantic Analysis
Author :
Wan, Yuan ; Tong, Hengqing
Author_Institution :
Math Dept., Wuhan Univ. of Technol., Wuhan
Abstract :
Rapid progress of network arouses much attention on Internet public opinion. To address this issue, we propose a novel system for categorization and monitoring of Internet public opinion. Due to the text format of Internet public opinion and the semantic relationship between words in such documents, we introduce latent semantic analysis (LSA) to represent document of public opinion. Compared to the traditional vector space model (VSM), LSA overcomes the problem of high dimensional space. We use two classifiers to perform text categorization on a corpus collected from a hot Website. For the monitoring, we give the structure of this module and introduce its main functions.
Keywords :
Internet; classification; social sciences computing; text analysis; Internet public opinion; Web site; latent semantic analysis; monitoring; public opinion document; text categorization; Data mining; Electronic mail; IP networks; Information analysis; Information management; Internet; Monitoring; Seminars; Space technology; Text categorization; -Internet public opinion; latent semantic analysi; monitoring; text categorization; vector space model;
Conference_Titel :
Business and Information Management, 2008. ISBIM '08. International Seminar on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3560-9
DOI :
10.1109/ISBIM.2008.98