Title :
Scholar search-oriented author disambiguation
Author :
Wu, Hao ; Pei, Yijian ; Li, Bo
Author_Institution :
Sch. of Inf. Sci. & Eng., Yunnan Univ., Kunming, China
Abstract :
Name ambiguity problem brings many challenges to scholar search. This problem has attracted many attentions in research communities, and various disambiguation algorithms combined with different citation features are proposed. However, there is still significant room for improvement. In this paper, we propose an unsupervised two-steps method to deal with the name disambiguation problems as an end user makes a scholar search. In the first step, the returned author´s citations are blocked by using co-authorship relation, and then in second step, these blocks are merged by the classical hierarchical agglomerative clustering method. We test various linkage criteria and pairwise distances during hierarchical clustering, and find the best components to disambiguate citations. Also, we propose some approaches to improve the disambiguation performance in each step. According to experiments, our method outperforms 15% a best state-of-the-art work using the same recognized dataset without the need for any training.
Keywords :
citation analysis; natural language processing; pattern clustering; classical hierarchical agglomerative clustering; coauthorship relation; hierarchical clustering; linkage criteria; name ambiguity problem; name disambiguation problem; pairwise distance; research communities; scholar search-oriented author disambiguation; unsupervised two-steps method; Clustering methods; Conferences; Couplings; Joints; Libraries; Measurement; Training; author disambiguation; hierarchical clustering; scholar search;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6234371