DocumentCode :
2307984
Title :
A New Method to Compute Chinese Text Concept
Author :
Yang, Feng ; Sen-Lin, Luo ; Li-Min, Pan ; Li-li, Liu ; Kai-Jiang, Chen
Author_Institution :
Lab. of Inf. Security & Countermeasures Technol., BIT, Beijing, China
Volume :
1
fYear :
2010
fDate :
6-7 March 2010
Firstpage :
59
Lastpage :
62
Abstract :
A new method to compute Chinese text concept is proposed in this paper. In this method, we construct sentence vectors from the text by extracting and quantifying some syntax and semantic features such as concept elements, dependent relations and correlative relations. Then, we combine these sentence vectors to the text vector to represent the text concept. Experimental results show that, in the application of text classification, the precision and the recall of this method achieve 91.5% and 91.3%. Contrast with TF-IDF and LSA, the proposed method is more accurate, less affected by text class and more stable on text concept approaching.
Keywords :
natural language processing; pattern classification; text analysis; vectors; Chinese text concept; concept elements; correlative relations; dependent relations; semantic features; sentence vectors; syntax; text classification; text vector; Computer science; Computer science education; Data mining; Educational technology; Electronic countermeasures; Frequency; Information processing; Information security; Speech; Text categorization; Chinese Information Processing; concept element; correlative relation; dependent relation; text concept computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Education Technology and Computer Science (ETCS), 2010 Second International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6388-6
Electronic_ISBN :
978-1-4244-6389-3
Type :
conf
DOI :
10.1109/ETCS.2010.607
Filename :
5460298
Link To Document :
بازگشت