Title of article :
Categorization-Driven Cross-Language Retrieval
of Medical Information
Author/Authors :
Hermes R. Freitas-Junior، نويسنده , , Berthier Ribeiro-Neto، نويسنده , , Rodrigo F. Vale، نويسنده , , Alberto H. F. Laender، نويسنده , , Luciano R. S. Lima، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2006
Abstract :
The Web has become a large repository of documents (or
pages) written in many different languages. In this context,
traditional information retrieval (IR) techniques cannot
be used whenever the user query and the documents
being retrieved are in different languages. To address
this problem, new cross-language information retrieval
(CLIR) techniques have been proposed. In this work, we
describe a method for cross-language retrieval of medical
information. This method combines query terms and
related medical concepts obtained automatically through
a categorization procedure. The medical concepts are
used to create a linguistic abstraction that allows
retrieval of information in a language-independent way,
minimizing linguistic problems such as polysemy. To
evaluate our method, we carried out experiments using
the OHSUMED test collection, whose documents are
written in English, with queries expressed in Portuguese,
Spanish, and French. The results indicate that our crosslanguage
retrieval method is as effective as a standard
vector space model algorithm operating on queries and
documents in the same language. Further, our results are
better than previous results in the literature
Journal title :
Journal of the American Society for Information Science and Technology
Journal title :
Journal of the American Society for Information Science and Technology