Title of article :
Multilingual Web Retrieval: An Experiment
in English–Chinese Business Intelligence
Author/Authors :
Jialun Qin and Yilu Zhou, Author; Michael Chau, Author; Hsinchun Chen, Author
Issue Information :
Monthly; serial issue, year 2006
Abstract :
As increasing numbers of non-English resources have become
available on the Web, the interesting and important
issue of how Web users can retrieve documents in different
languages has arisen. Cross-language information retrieval
(CLIR), the study of retrieving information in one
language by queries expressed in another language, is a
promising approach to the problem. Cross-language information
retrieval has attracted much attention in recent
years. Most research systems have achieved satisfactory
performance on standard Text REtrieval Conference
(TREC) collections such as news articles, but CLIR techniques
have not been widely studied and evaluated for applications
such as Web portals. In this article, the authors
present their research in developing and evaluating a multilingual
English–Chinese Web portal that incorporates
various CLIR techniques for use in the business domain.
A dictionary-based approach was adopted that combines
phrasal translation, co-occurrence analysis, and pre- and
posttranslation query expansion. The portal was evaluated
by domain experts, using a set of queries in both
English and Chinese. The experimental results showed
that co-occurrence-based phrasal translation achieved a
74.6% improvement in precision over simple word-by-word
translation. When used together, pre- and posttranslation
query expansion improved the performance
slightly, achieving a 78.0% improvement over the baseline
word-by-word translation approach. In general, applying
CLIR techniques in Web applications shows promise.
Journal title :
Journal of the American Society for Information Science and Technology