• DocumentCode
    2477868
  • Title

    A Novel Knowledge Discovery Method for Chinese Architectural Document

  • Author

    Zhang, Xiang ; Li, Changhua ; Zhou, Mingquan ; Ye, Na ; Dong, Lehong

  • Author_Institution
    Coll. of Inf. Sci. & Technol., Northwest Univ., Xi'an, China
  • fYear
    2010
  • fDate
    22-23 May 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    To address the loss of useful architectural information caused by threshold filtering in traditional feature selection, and to improve the precision of Chinese architectural document classification, a new algorithm based on rough sets and C4.5Bagging is proposed for Chinese architectural document categorization. First, the core attributes are found using the discernibility matrix, and one of them is taken as the starting point. Then attribute significance and dependency are used as heuristic information for feature selection. Finally, a C4.5Bagging classifier is designed for architectural documents. The experimental results show that the method is easy to implement, effectively reduces the dimensionality of the feature space, and improves classification accuracy.
  • Keywords
    data mining; document handling; natural language processing; rough set theory; C4.5Bagging; Chinese architectural document; feature selection; novel knowledge discovery method; rough set; Classification algorithms; Control engineering; Educational institutions; Filtering algorithms; Frequency; Information filtering; Information filters; Information science; Rough sets; Text categorization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-5872-1
  • Electronic_ISBN
    978-1-4244-5874-5
  • Type

    conf

  • DOI
    10.1109/IWISA.2010.5473263
  • Filename
    5473263