• DocumentCode
    2665308
  • Title

    Automatic recognition of Chinese scientific and technological terms using integrated linguistic knowledge

  • Author

    Sui, Zhifang ; Yirong Chen ; Wei, Zhouchao

  • Author_Institution
    Inst. of Comput. Linguistics, Peking Univ., Beijing, China
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    444
  • Lastpage
    451
  • Abstract
    We introduce our research on using integrated linguistic knowledge to automatically recognize Chinese scientific and technological terms based on the careful analysis of the characteristics of this kind of terms. The system of automatic term recognition includes two phases: learning stage and application stage. In the stage of learning, we use a series of machine learning methods to get various kinds of integrated knowledge for automatic term recognition from a large-scale corpus and a term bank. These knowledge includes the inner structural knowledge of terms, the statistical domain features of term component, the statistical mutual information between the components of terms, the outer environment features of terms and the distinct text-level features of term recognition etc.. In the stage of application, through an efficient model, we use all these various types of knowledge into automatic term recognition. The experiments show that the system can give great help to the expert of term standardization to discover new terms.
  • Keywords
    character recognition; knowledge based systems; learning (artificial intelligence); linguistics; natural languages; automatic Chinese scientific term recognition; integrated linguistic knowledge; machine learning; term standardization; Biological materials; Bluetooth; Bridges; Business; Character recognition; Computational linguistics; Materials science and technology; Natural languages; Paper technology; Standardization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1275948
  • Filename
    1275948