Title of article :
Automatic extraction of document keyphrases for use in digital libraries: Evaluation and applications
Author/Authors :
Steve Jones، نويسنده , , Gordon W. Paynter ، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2002
Pages :
25
From page :
653
To page :
677
Abstract :
This article describes an evaluation of the Kea automatic keyphrase extraction algorithm. Document keyphrases are conventionally used as concise descriptors of document content, and are increasingly used in novel ways, including document clustering, searching and browsing interfaces, and retrieval engines. However, it is costly and time consuming to manually assign keyphrases to documents, motivating the development of tools that automatically perform this function. Previous studies have evaluated Keaʹs performance by measuring its ability to identify author keywords and keyphrases, but this methodology has a number of well-known limitations. The results presented in this article are based on evaluations by human assessors of the quality and appropriateness of Kea keyphrases. The results indicate that, in general, Kea produces keyphrases that are rated positively by human assessors. However, typical Kea settings can degrade performance, particularly those relating to keyphrase length and domain specificity. We found that for some settings, Keaʹs performance is better than that of similar systems, and that Keaʹs ranking of extracted keyphrases is effective. We also determined that author-specified keyphrases appear to exhibit an inherent ranking, and that they are rated highly and therefore suitable for use in training and evaluation of automatic keyphrasing systems.
Journal title :
Journal of the American Society for Information Science and Technology
Serial Year :
2002
Journal title :
Journal of the American Society for Information Science and Technology
Record number :
993245
Link To Document :
بازگشت