DocumentCode :
2282386
Title :
Personal Name Recognition Based on Categorized Linguistic Knowledge
Author :
Qu, Weiguang ; Tang, Xuri ; Li, Bin
Author_Institution :
Sch. of Math. & Comput. Sci., Nanjing Normal Univ., Nanjing
Volume :
3
fYear :
2008
fDate :
9-12 Dec. 2008
Firstpage :
311
Lastpage :
315
Abstract :
This paper proposes an integrated approach for personal name recognition (PNR) in Chinese by utilizing both statistical language models and categorized linguistic knowledge. Various formulas are proposed for calculating personal name credibility and context credibility for different types of personal names. Experiment is conducted on large-scale corpus to evaluate the approach and the F-1 scores has reached 98.85% and 92.73% respectively in close and open test.
Keywords :
character recognition; computational linguistics; natural language processing; statistical analysis; text analysis; vocabulary; categorized linguistic knowledge; context credibility; out of vocabulary; personal name recognition; statistical language model; Character recognition; Databases; Europe; Intelligent agent; Large-scale systems; Mathematics; Natural languages; Probability; Statistical analysis; Statistics; Credibility; Discourse; Knowledge database; Personal Name Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
Type :
conf
DOI :
10.1109/WIIAT.2008.155
Filename :
4740787
Link To Document :
بازگشت