• DocumentCode
    3591108
  • Title
    A HowNet-based Feature Selection Method for Chinese Text Representation
  • Author
    Zhao, Changwei ; Yao, Xueli ; Sun, Suhuan
  • Author_Institution
    Sch. of Electron. & Inf. Eng., Henan Univ. of Sci. & Technol., Luoyang, China
  • Volume
    1
  • fYear
    2009
  • Firstpage
    26
  • Lastpage
    30
  • Abstract
    Dimension reduction plays an important role in text representation. An effective dimension reduction method can not only reduce computational complexity but also help improve the accuracy of text classification. This paper presents a new dimension reduction method based on the semantic similarity of words. Unlike traditional methods, which usually rely on statistical information about words, our method uses natural language processing knowledge and considers both the semantic information and the part-of-speech (POS) information of feature terms. The experimental results show that our method is effective in reducing the dimensionality of text representation and achieves higher text classification accuracy. The semantic-similarity-based method is well suited to text representation.
  • Keywords
    computational complexity; data mining; natural language processing; text analysis; Chinese text; HowNet method; computation complexity; data dimension reduction; feature selection method; natural language processing knowledge; part-of-speech information; semantic similarity; text representation; Classification algorithms; Frequency shift keying; Functional analysis; Fuzzy systems; Knowledge engineering; Natural language processing; Noise reduction; Sun; Text categorization; Text mining; Hownet; dimension reduction; text representation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Print_ISBN
    978-0-7695-3735-1
  • Type
    conf
  • DOI
    10.1109/FSKD.2009.280
  • Filename
    5358671