Title :
An Algorithm of Feature Selection and Feature Weighting Adjustment Based on Chinese FrameNet
Author :
Zhao Xu ; Liu Xi ; Hao Xiaoyan ; Liu KaiYing
Author_Institution :
Academe of Comput. & Software Eng., Taiyuan Univ. of Technol., Taiyuan, China
Abstract :
The combination of term frequency (TF) and document frequency (DF) for feature selection, together with the TF-IDF algorithm for feature weighting, is frequently used in text categorization. For a small training set, however, the combination of TF and DF filters out many low-frequency words that have strong discriminative power, which directly affects the feature weights. This paper presents an algorithm for feature selection and feature weighting adjustment based on Chinese FrameNet (CFN) that aims to solve this problem. Experimental results indicate that the algorithm reaches a precision of 67.3%, higher than that of the traditional algorithm, and fits small training sets well.
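The filtering effect described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: the thresholds `min_tf` and `min_df`, the toy corpus, the function names, and the `tf * log(N / df)` weighting variant are all assumptions chosen for illustration, and the CFN-based adjustment is not reproduced because the abstract does not specify it.

```python
import math
from collections import Counter


def select_features(docs, min_tf=2, min_df=2):
    """Keep terms whose corpus-wide TF and DF both reach the (assumed) thresholds."""
    tf = Counter()  # total occurrences of each term across the corpus
    df = Counter()  # number of documents containing each term
    for doc in docs:
        tf.update(doc)
        df.update(set(doc))
    return {t for t in tf if tf[t] >= min_tf and df[t] >= min_df}


def tfidf_weights(doc, docs, vocabulary):
    """One common TF-IDF variant: tf(t, d) * log(N / df(t)), over selected terms only."""
    n_docs = len(docs)
    counts = Counter(doc)
    weights = {}
    for term in vocabulary:
        if counts[term] == 0:
            continue
        df = sum(1 for d in docs if term in d)
        weights[term] = counts[term] * math.log(n_docs / df)
    return weights


# Hypothetical tokenized training set: two finance and two sports documents.
docs = [
    ["market", "stock", "bank", "dividend"],
    ["market", "match", "goal", "team"],
    ["market", "stock", "bank", "loan"],
    ["team", "goal", "match", "referee"],
]

vocab = select_features(docs)
weights = tfidf_weights(docs[0], docs, vocab)
```

In this toy corpus, "dividend", "loan", and "referee" each occur once and are dropped by the thresholds even though each appears in only one category, which is the kind of loss on small training sets that the abstract describes.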
Keywords :
natural languages; text analysis; Chinese FrameNet; feature selection algorithm; feature weighting adjustment; text categorization; Filters; Frequency; Information technology; Large-scale systems; Layout; Software algorithms; Software engineering; Spatial databases; Statistics
Conference_Title :
Image and Signal Processing, 2009. CISP '09. 2nd International Congress on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4244-4129-7
Electronic_ISBN :
978-1-4244-4131-0
DOI :
10.1109/CISP.2009.5303776