• DocumentCode
    2447364
  • Title

    A Tibetan and Uygur Sensitive Word Automatically Add System Based on Co-occurrence

  • Author

    Yan, Xiaodong ; Sun, Yuan ; Zhao, Xiaobing ; Yang, Guosheng

  • Author_Institution
    Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
  • fYear
    2012
  • fDate
    1-3 Nov. 2012
  • Firstpage
    242
  • Lastpage
    244
  • Abstract
    In this paper, we first built a sensitive word vocabulary and classified the sensitive words. Then in order to add Tibetan and Uygur sensitive word automatically, we adopted a method based on co-occurrence. In our system, we used a simple algorithm to calculate the relevance of sensitive words. According to the relevance of words, the new sensitive words are added to the vocabulary automatically.
  • Keywords
    natural language processing; Tibetan sensitive word automatically add system; Uygur sensitive word automatically add system; sensitive word vocabulary; word relevance; Correlation; Educational institutions; Indexes; Monitoring; Real-time systems; Sensitivity; Vocabulary; co-occurrence; sensitive words; vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Networks and Intelligent Systems (ICINIS), 2012 Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-1-4673-3083-1
  • Type

    conf

  • DOI
    10.1109/ICINIS.2012.87
  • Filename
    6376532