• DocumentCode
    2379433
  • Title

    Automatic term extraction from Chinese scientific texts

  • Author

    Zheng, Qinghua ; Luo, Junying ; Liu, Jun

  • Author_Institution
    MOE KLINNS Lab., Xi'an Jiaotong Univ., Xi'an, China
  • fYear
    2011
  • fDate
    8-10 June 2011
  • Firstpage
    727
  • Lastpage
    734
  • Abstract
    Automatic term extraction is an essential task in information processing and plays an important role in many fields, such as information retrieval and knowledge acquisition. However, existing methods were mostly proposed for English domain terms, so they cannot fully adapt to term extraction from Chinese scientific texts. This paper presents a new approach based on an analysis of the characteristics of Chinese domain terms. First, we introduce a new feature, which we call “max article time”, to distinguish terms from non-terms. Then, we use the classification of terms and the links between different terms to obtain the maximum discrimination of this feature. Our method is also combined with linguistic methods. Experiments conducted on two different domains for Chinese term extraction indicate that our approach significantly improves on existing techniques and also confirm the relative domain independence of the approach.
  • Keywords
    information retrieval; text analysis; Chinese scientific texts; English domain terms; automatic term extraction; information processing; linguistic methods; max article time; Data mining; Feature extraction; Mutual information; Noise; Operating systems; Pragmatics; Speech; Chinese domain terminology; automatic term extraction; machine learning;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Supported Cooperative Work in Design (CSCWD), 2011 15th International Conference on
  • Conference_Location
    Lausanne
  • Print_ISBN
    978-1-4577-0386-7
  • Type

    conf

  • DOI
    10.1109/CSCWD.2011.5960199
  • Filename
    5960199