DocumentCode :
3194079
Title :
Effects of Unpopular Citation Fields in Citation Matching Performance
Author :
Koo, Hee-Kwan ; Kim, Taehong ; Chun, Hong-Woo ; Seo, Dongmin ; Jung, Hanmin ; Lee, Sungin
Author_Institution :
Dept. of Inf. Sci. & Technol., Univ. of Sci. & Technol., Daejeon, South Korea
fYear :
2011
fDate :
26-29 April 2011
Firstpage :
1
Lastpage :
7
Abstract :
Citation matching is a problem of identifying which citations correspond to the same publication. Previous studies on citation matching select typically from a corpus or database of citation records, such as CORA, an arbitrary set of citation record fields such as author, title - a practice informed by "common sense" - in order to automatically group citations that refer to the same document. This study describes a systematic and computational approach to extract out the \´best candidate\´ citation record fields, to propose that there is always the best combination of citation record fields that helps increase citation matching performance and is applicable regardless of which research framework one may adopt, such as Machine Learning methods or Information Retrieval algorithms. Cross comparisons between previous studies and our approach, shown as pairwise F1 measures, within our framework based on field selection are presented.
Keywords :
citation analysis; pattern matching; citation matching performance; information retrieval algorithms; machine learning methods; unpopular citation fields; Clustering methods; Indexing; Ontologies; Systematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Applications (ICISA), 2011 International Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-1-4244-9222-0
Electronic_ISBN :
978-1-4244-9223-7
Type :
conf
DOI :
10.1109/ICISA.2011.5772372
Filename :
5772372
Link To Document :
بازگشت