Title :
The improved features selection for text classification
Author :
Yang, Yun ; Wu, Yanan
Author_Institution :
Sch. of Electr. & Inf. Eng., Shaanxi Univ. of Sci. & Technol., Xi´´an, China
Abstract :
With the development of modern social technology and people´s constant pursuit for knowledge, a more diverse and open cyber space is becoming the fact in academic field. In this environment, it is more necessary than ever before that people extract comprehensive, efficient, and valuable information from a wide range of cultural information. Therefore, text categorization research has become more important. The paper improved the precision of the traditional text categorization by the process that we mended the weight of words and mined potential keywords, then found their relationship. In the end of the paper, an experiment was used to illustrate the process of features extraction.
Keywords :
data mining; feature extraction; pattern classification; text analysis; cultural information; feature extraction; feature selection; information extraction; keyword mining; text categorization; text classification; Cultural differences; Data mining; Dictionaries; Feature extraction; Frequency; Knowledge engineering; Natural languages; Probability; Space technology; Text categorization; high-features selection; potential words; text classification;
Conference_Titel :
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6347-3
DOI :
10.1109/ICCET.2010.5486261