Title :
Personal Name Recognition Based on Categorized Linguistic Knowledge
Author :
Qu, Weiguang ; Tang, Xuri ; Li, Bin
Author_Institution :
Sch. of Math. & Comput. Sci., Nanjing Normal Univ., Nanjing
Abstract :
This paper proposes an integrated approach for personal name recognition (PNR) in Chinese by utilizing both statistical language models and categorized linguistic knowledge. Various formulas are proposed for calculating personal name credibility and context credibility for different types of personal names. Experiment is conducted on large-scale corpus to evaluate the approach and the F-1 scores has reached 98.85% and 92.73% respectively in close and open test.
Keywords :
character recognition; computational linguistics; natural language processing; statistical analysis; text analysis; vocabulary; categorized linguistic knowledge; context credibility; out of vocabulary; personal name recognition; statistical language model; Character recognition; Databases; Europe; Intelligent agent; Large-scale systems; Mathematics; Natural languages; Probability; Statistical analysis; Statistics; Credibility; Discourse; Knowledge database; Personal Name Recognition;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.155