• DocumentCode
    2084020
  • Title

    A feature extraction method using base phrase and keyword in Chinese text

  • Author

    Li, Xin-fu ; Zhao, Lei-lei ; Wu, Li-hong

  • Author_Institution
    Coll. of Math. & Comput. Sci., Hebei Univ., Baoding, China
  • Volume
    1
  • fYear
    2008
  • fDate
    17-19 Nov. 2008
  • Firstpage
    680
  • Lastpage
    684
  • Abstract
    The feature extraction is the most key technology of text categorization. The word is used as the feature in the traditional text classification, and its effect for the text classification is evidence. The feature extraction method using base phrase and keyword changes the feature extraction of Chinese text from syntax and semantic further. In the first, analyzing the feature of baseNP and basedVP, and then make some words into baseNP and baseVP which accord to the rules of phrase, give WSD to other words in the finally. The paper proposes a stepwise feature extraction from word to phrase. The experiment results show that this method can perform much better than traditional feature extraction method, it can improve the text classification precision and recall.
  • Keywords
    feature extraction; natural language processing; pattern classification; text analysis; Chinese text; base phrase; baseNP; basedVP; feature extraction method; keyword; text categorization; text classification; Competitive intelligence; Feature extraction; Frequency; Intelligent systems; Knowledge engineering; Learning systems; Support vector machine classification; Support vector machines; Tagging; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent System and Knowledge Engineering, 2008. ISKE 2008. 3rd International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4244-2196-1
  • Electronic_ISBN
    978-1-4244-2197-8
  • Type

    conf

  • DOI
    10.1109/ISKE.2008.4731016
  • Filename
    4731016