DocumentCode :
571629
Title :
Study on the Subjective and Objective Text Classification and Pretreatment of Chinese Network Text
Author :
Chao, Chupin ; Jiang, Wenbao
Author_Institution :
Sch. of Inf. Manage., Beijing Inf. Sci. & Technol. Univ., Beijing, China
Volume :
2
fYear :
2012
fDate :
26-27 Aug. 2012
Firstpage :
25
Lastpage :
29
Abstract :
Subjective and objective text classification is widely used in product reviews, video reviews, social public opinion analysis and micro-blogging attitude analysis. To address the problem of formalizing network text in subjective and objective text classification, a machine learning classification method based on network informal language (NIL) is proposed. First, a network informal dictionary is constructed by writing a web crawler to collect informal words, which are divided into two categories: typical type and fuzzy type. Then, different methods are put forward to formalize the informal network text according to the two types of informal words. Finally, the Naive Bayes classifier and the Sequential Minimal Optimization classifier are adopted to distinguish the subjectivity and objectivity of the text. The experimental results show that the proposed method can improve the accuracy of subjective and objective text classification.
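The pipeline the abstract describes (dictionary-based formalization followed by classification) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the dictionary entries, the training sentences, and the names `INFORMAL_DICT`, `formalize` and `NaiveBayes` are hypothetical. It uses English tokens split on whitespace, whereas Chinese text would first need word segmentation. It also covers only the typical type of informal word through direct lookup, and only a Naive Bayes classifier; the Sequential Minimal Optimization classifier is not shown.

```python
import math
from collections import Counter, defaultdict

# Hypothetical informal-to-formal dictionary; entries are illustrative only
# and are not taken from the paper.
INFORMAL_DICT = {
    "gr8": "great",
    "luv": "love",
    "thx": "thanks",
}


def formalize(text, mapping=INFORMAL_DICT):
    """Replace each whitespace-separated token that appears in the dictionary."""
    return " ".join(mapping.get(tok, tok) for tok in text.lower().split())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.split())
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            for tok in text.split():
                score += math.log((counts[tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data (made up for this sketch).
train = [
    ("i luv this phone it is gr8", "subjective"),
    ("thx this movie is wonderful", "subjective"),
    ("the phone has a 5 inch screen", "objective"),
    ("the movie was released in 2012", "objective"),
]
model = NaiveBayes().fit([formalize(t) for t, _ in train], [y for _, y in train])
prediction = model.predict(formalize("this camera is gr8"))
```

Formalizing before training and before prediction lets the informal token "gr8" share counts with "great", which is the kind of effect the abstract credits for the accuracy improvement.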
Keywords :
Bayes methods; Web sites; learning (artificial intelligence); optimisation; pattern classification; text analysis; NIL; Chinese network text; machine learning classification method; microblogging attitude analysis; naive Bayes classifier; network informal dictionary; network informal language; network text formalization; objective text classification; product reviews; sequential minimal optimization classifier; social public opinion analysis; subjective text classification; video reviews; web crawler; Classification algorithms; Dictionaries; Feature extraction; Formal languages; Support vector machines; Text categorization; Training; feature extraction; network informal language; sequential-covering algorithm; subjective and objective text classification;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2012 4th International Conference on
Conference_Location :
Nanchang, Jiangxi
Print_ISBN :
978-1-4673-1902-7
Type :
conf
DOI :
10.1109/IHMSC.2012.102
Filename :
6305716