DocumentCode :
1826811
Title :
Measuring inter-indexer consistency using a thesaurus
Author :
Medelyan, Olena ; Witten, Ian H.
Author_Institution :
Dept. of Comput. Sci., Waikato Univ., Hamilton
fYear :
2006
fDate :
June 2006
Firstpage :
274
Lastpage :
275
Abstract :
When professional indexers independently assign terms to a given document, the term sets generally differ between indexers. Studies of inter-indexer consistency measure the percentage of matching index terms, but none of them consider the semantic relationships that exist amongst these terms. We propose to represent multiple-indexer data in a vector space and use the cosine metric as a new consistency measure that can be extended by semantic relations between index terms. We believe that this new measure is more accurate and realistic than existing ones and therefore more suitable for evaluation of automatically extracted index terms.
Keywords :
indexing; thesauri; consistency measure; cosine metric; index term automatic extraction; index term matching; inter-indexer consistency measurement; multiple-indexer data representation; term semantic relationship; thesaurus; Agriculture; Bidirectional control; Bismuth; Computer science; Data mining; Documentation; Indexing; Thesauri; Vocabulary; controlled indexing; inter-indexer consistency
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Digital Libraries, 2006. JCDL '06. Proceedings of the 6th ACM/IEEE-CS Joint Conference on
Conference_Location :
Chapel Hill, NC
Print_ISBN :
1-59593-354-9
Type :
conf
DOI :
10.1145/1141753.1141816
Filename :
4119139