Title :
A Hybrid Approach Using Maximum Entropy Model and Rules to Identify Tibetan Person Names
Author :
Yangji Jia ; Jing Jiang ; Hongzhi Yu
Author_Institution :
China Inst. of Minorities Inf. Technol., Northwest Univ. for Nat., Lanzhou, China
Abstract :
Tibetan person name recognition is one of the most difficult tasks in the area of Tibetan information processing, and the effect of recognition impacts directly on the precision of Tibetan word segmentation and the performance of relative application systems, which include Tibetan-Chinese machine translation, Tibetan information search, text categorization, etc. Based on the analysis of wording rules and features of Tibetan name, this paper proposed a method which combines maximum entropy and rules to identify Tibetan person names. The experiment shows that this approach works really well for the value of F1-measure reaches 95.92%.
Keywords :
maximum entropy methods; natural language processing; text analysis; word processing; F1-measure value; Tibetan information processing; Tibetan information search; Tibetan name feature analysis; Tibetan person name identification rules; Tibetan person name recognition; Tibetan word segmentation; Tibetan-Chinese machine translation; hybrid approach; maximum entropy model; text categorization; wording rule analysis; Character recognition; Dictionaries; Educational institutions; Entropy; Natural language processing; Text recognition; Training; Tibetan name recognition; maximum entropy; rule-based approaches;
Conference_Titel :
Computer Sciences and Applications (CSA), 2013 International Conference on
Conference_Location :
Wuhan
DOI :
10.1109/CSA.2013.95