Title of article :
Using Wikipedia concepts and frequency in language to extract key terms from support documents
Author/Authors :
Romero، نويسنده , , M. and Moreo، نويسنده , , A. and Castro، نويسنده , , J.L. and Zurita، نويسنده , , J.M.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2012
Pages :
12
From page :
13480
To page :
13491
Abstract :
In this paper, we present a new key term extraction system able to handle with the particularities of “support documents”. Our system takes advantages of frequency-based and thesaurus-based approaches to recognize two different classes of key terms. On the one hand, it identifies multi-domain key terms of the collection using Wikipedia as knowledge resource. On the other hand, the system extracts specific key terms highly related with the context of a support document. We use the frequency in language as a criterion to detect and rank such terms. To prove the validity of our system we have designed a set of experiment using a Frequently Asked Questions (FAQ) collection of documents. Since our approach is generic, minor modifications should be undertaken to adapt the system to other kind of support documents. The empirical results evidence the validity of our approach.
Keywords :
Automatic Keyword Extraction , Natural language , word sense disambiguation , Wikipedia , Support documents , FAQ
Journal title :
Expert Systems with Applications
Serial Year :
2012
Journal title :
Expert Systems with Applications
Record number :
2352839
Link To Document :
بازگشت