DocumentCode :
3368368
Title :
Chinese Personal Name Recognition in Web Queries via Bootstrapping
Author :
Xueqiang Lv ; Ruihong Wu ; Bin Wen
Author_Institution :
Beijing Key Lab. of Internet Culture & Digital Dissemination Res., Beijing Inf. Sci. & Technol. Univ., Beijing, China
fYear :
2013
fDate :
14-15 Dec. 2013
Firstpage :
415
Lastpage :
419
Abstract :
The bootstrapping method of recognizing names in query logs is useful in some ways, but it is affected by noisy data, which comes from irrelevant queries and irrelevant template matching. We propose a trend selection method which aims to select templates more relevant to personal names rather than other categories. In order to eliminate noisy data when matching, the boundaries of candidates are relocated by the presented method named forward-backward keyword matching based on the corpus from People´s Daily. Experimental results on Sogou corpus indicate that the trend selection method is better while compared to other template selection method. And the forward-backward keyword matching method is helpful for boundary demarcation and recognition.
Keywords :
character recognition; natural language processing; query processing; Chinese personal name recognition; Sogou corpus; Web queries; bootstrapping method; boundary demarcation; boundary recognition; forward-backward keyword matching; query logs; template matching; trend selection method; Accuracy; Computational linguistics; Context; Educational institutions; Market research; Noise measurement; Search engines; Bootstrapping; Context; Personal name recognition; Query logs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security (CIS), 2013 9th International Conference on
Conference_Location :
Leshan
Print_ISBN :
978-1-4799-2548-3
Type :
conf
DOI :
10.1109/CIS.2013.94
Filename :
6746430
Link To Document :
بازگشت