DocumentCode :
518181
Title :
A text categorization framework based on concept structure
Author :
Chen, Lin ; Zhou Jie ; Bi-Cheng, Li
Author_Institution :
Zhengzhou Inf. Technol. Inst., Zhengzhou, China
Volume :
3
fYear :
2010
fDate :
16-18 April 2010
Abstract :
From the view of concept structure in psychology, this paper analyses the cause of classes overlapping of training sample set in text categorization, and finds it is reflection about fuzzy boundary of nature concept in space distribution between training samples. The paper proposes a categorization framework based on concept structure. In the framework, concept structure and relation between concepts are shown more obviously through anew partitioning training sample set, and then based above processing results we construct hierarchy classifier to improve classification performance. The experiments show that the framework is effective for improving classification performance and its expansibility is good.
Keywords :
classification; fuzzy set theory; text analysis; classification performance; concept structure; fuzzy boundary; hierarchy classifier; space distribution; text categorization; Automatic testing; Cities and towns; Fuzzy sets; Information analysis; Information technology; Internet; Management training; Psychology; Reflection; Text categorization; Categorization Framework; Classes Overlapping; Concept Structure; Text Categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6347-3
Type :
conf
DOI :
10.1109/ICCET.2010.5485805
Filename :
5485805
Link To Document :
بازگشت