Title :
Detecting Ambiguous Author Names in Crowdsourced Scholarly Data
Author :
Sun, Xiaoling ; Kaur, Jasleen ; Possamai, Lino ; Menczer, Filippo
Author_Institution :
Dept. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
Abstract :
The name ambiguity problem is a challenge in many areas, especially in the field of bibliographic digital libraries. For example, in services that use citation data to compute the impact of authors, ambiguous names lead to biased measures. The problem is amplified where names are collected from heterogeneous sources, including crowd sourced annotations. This is the case in the Scholaro meter system, which cross-correlates author names in user queries with those retrieved from bibliographic data. The uncontrolled nature of user-generated annotations is very valuable, but creates the need to detect ambiguous names. In this paper, we propose an approach to detect ambiguous names at query time, which makes it applicable in the context of a social computing application. We explore two kinds of heuristic features based on citations and crowd sourced topics. Our approach can detect ambiguous author names in crowd sourced scholarly data with an accuracy of 75%.
Keywords :
citation analysis; digital libraries; Scholaro meter system; ambiguous author name; bibliographic digital libraries; citation data; crowd sourced annotation; crowd sourced topic; crowdsourced scholarly data; heuristic features; name ambiguity problem; social computing; user-generated annotation; Accuracy; Collaboration; Computer science; Educational institutions; Feature extraction; Indexes; Libraries; ambiguous name detection; citation analysis; crowdsourcing; discipline annotations; scholarly data; social tagging;
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4577-1931-8
DOI :
10.1109/PASSAT/SocialCom.2011.43