Title of article :
A Lexical Knowledge Base Approach for English–Chinese Cross-Language Information Retrieval
Author/Authors :
Jiangping Chen، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2006
Pages :
11
From page :
233
To page :
243
Abstract :
This study proposes and explores a natural language processing- (NLP) based strategy to address out-ofdictionary and vocabulary mismatch problems in query translation based English–Chinese Cross-Language Information Retrieval (EC-CLIR). The strategy, named the LKB approach, is to construct a lexical knowledge base (LKB) and to use it for query translation. In this article, the author describes the LKB construction process, which customizes available translation resources based on the document collection of the EC-CLIR system. The evaluation shows that the LKB approach is very promising. It consistently increased the percentage of correct translations and decreased the percentage of missing translations in addition to effectively detecting the vocabulary gap between the document collection and the translation resource of the system. The comparative analysis of the top EC-CLIR results using the LKB and two other translation resources demonstrates that the LKB approach has produced significant improvement in EC-CLIR performance compared to performance using the original translation resource without customization. It has also achieved the same level of performance as a sophisticated machine translation system. The study concludes that the LKB approach has the potential to be an empirical model for developing real-world CLIR systems. Linguistic knowledge and NLP techniques, if appropriately used, can improve the effectiveness of English–Chinese crosslanguage information retrieval
Journal title :
Journal of the American Society for Information Science and Technology
Serial Year :
2006
Journal title :
Journal of the American Society for Information Science and Technology
Record number :
844062
Link To Document :
بازگشت