Title :
A text categorization framework based on concept structure
Author :
Chen, Lin ; Zhou, Jie ; Li, Bi-Cheng
Author_Institution :
Zhengzhou Inf. Technol. Inst., Zhengzhou, China
Abstract :
From the perspective of concept structure in psychology, this paper analyzes the cause of class overlap in the training sample sets used for text categorization, and finds that the overlap reflects, in the spatial distribution of the training samples, the fuzzy boundaries of natural concepts. The paper proposes a categorization framework based on concept structure. In the framework, the training sample set is repartitioned so that the concept structure and the relations between concepts become more apparent, and a hierarchical classifier is then constructed from the resulting partition to improve classification performance. Experiments show that the framework improves classification performance and has good extensibility.
Keywords :
classification; fuzzy set theory; text analysis; classification performance; concept structure; fuzzy boundary; hierarchy classifier; space distribution; text categorization; Automatic testing; Cities and towns; Fuzzy sets; Information analysis; Information technology; Internet; Management training; Psychology; Reflection; Categorization Framework; Classes Overlapping
Conference_Title :
2010 2nd International Conference on Computer Engineering and Technology (ICCET)
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6347-3
DOI :
10.1109/ICCET.2010.5485805