DocumentCode :
2336797
Title :
A rough set-based hybrid feature selection method for topic-specific text filtering
Author :
Li, Qiang ; Li, Jian-Hua ; Liu, Gong-Shen ; Li, Sheng-Hong
Author_Institution :
Modern Commun. Res. Dept., Shanghai Jiao Tong Univ., China
Volume :
3
fYear :
2004
fDate :
26-29 Aug. 2004
Firstpage :
1464
Abstract :
With the proliferation of harmful Internet content such as pornography, violence, and hate messages, effective content-filtering systems are essential. However, a non-trivial obstacle in good text filtering is the high dimensionality of the data. We introduced a hybrid method to select features more accurately using some feature selection method and rough set theory. We can select features firstly using one of feature selection methods, such as x2 statistic, mutual information, information gain, and then further select features using rough set. Thus more accurate and less features are extracted. In experiments, we used UCI machine learning dataset as our dataset. We use naive Bayes model to evaluate our feature selection method, the result shows our method has high precision and high recall, and is very effective and efficient.
Keywords :
Bayes methods; feature extraction; information filtering; learning (artificial intelligence); rough set theory; text analysis; Internet; UCI machine learning dataset; content filtering systems; feature extraction; hate messages; hybrid feature selection method; information gain method; mutual information method; naive Bayes model; nontrivial obstacle; pornography; rough set theory; topic specific text filtering; violence; x2 statistical method; Electronic mail; Frequency; Gain measurement; Information filtering; Information filters; Internet; Mutual information; Rough sets; Set theory; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
Type :
conf
DOI :
10.1109/ICMLC.2004.1382004
Filename :
1382004
Link To Document :
بازگشت