DocumentCode :
2477868
Title :
A Novel Knowledge Discovery Method for Chinese Architectural Document
Author :
Zhang, Xiang ; Li, Changhua ; Zhou, Mingquan ; Ye, Na ; Dong, Lehong
Author_Institution :
Coll. of Inf. Sci. & Technol., Northwest Univ., Xi´´an, China
fYear :
2010
fDate :
22-23 May 2010
Firstpage :
1
Lastpage :
4
Abstract :
Aiming at the problem of the traditional feature selection that threshold filtering loses a lot of effective architectural information and to improve the precise of Chinese architectural document classification, a new algorithm based on rough set and C4.5Bagging is proposed for Chinese architectural document categorization. Firstly the cores of attribute are found by discernibility matrix and one of the cores is regarded as the start point. Then attributes´ significance and dependency are used as the heuristic information to do feature selection. Finally the c4.5bagging is designed to architectural document classifier. The experimental results show that the novel method is not only easy to implement but can effectively reduce the dimensional space, and improve the accuracy of classification.
Keywords :
data mining; document handling; natural language processing; rough set theory; C4.5Bagging; chinese architectural document; feature selection; novel knowledge discovery method; rough set; Classification algorithms; Control engineering; Educational institutions; Filtering algorithms; Frequency; Information filtering; Information filters; Information science; Rough sets; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5872-1
Electronic_ISBN :
978-1-4244-5874-5
Type :
conf
DOI :
10.1109/IWISA.2010.5473263
Filename :
5473263
Link To Document :
بازگشت