DocumentCode :
1885620
Title :
Query translation architecture for Malay-English Cross-Language information retrieval system
Author :
Rais, Nurjannaton Hidayah ; Abdullah, Muhamad Taufik ; Kadir, Rabiah Abdul
Author_Institution :
Dept. of Multimedia, Univ. Putra Malaysia, Serdang, Malaysia
Volume :
2
fYear :
2010
fDate :
15-17 June 2010
Firstpage :
990
Lastpage :
993
Abstract :
This paper discusses research on query translation events in a Malay-English Cross-Language Information Retrieval (CLIR) system. We assume that improving query translation accuracy improves information retrieval performance. A dictionary-based CLIR system faces three main problems: translation ambiguity, compound and phrase handling, and proper-name translation. Natural language processing (NLP) techniques such as stemming and Part-of-Speech (POS) tagging are useful in the query translation process. In addition, the n-gram matching technique has been successfully applied in information retrieval (IR) systems for phrase and proper-name translation. The proposed query translation architecture, which consists of stemming, Part-of-Speech (POS) tagging, and n-gram matching techniques, is useful in CLIR systems as well as in search engine applications.
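The abstract does not say which n-gram variant the authors use. As a rough illustration of how n-gram matching can pair a query term with a dictionary entry or proper name, the sketch below assumes character bigrams and the Dice coefficient. The function names, the threshold, and the sample word list are hypothetical and not taken from the paper.

```python
def char_ngrams(text, n=2):
    """Return the set of character n-grams of a lowercased string."""
    s = text.lower()
    if len(s) < n:
        return {s} if s else set()
    return {s[i:i + n] for i in range(len(s) - n + 1)}


def dice(a, b, n=2):
    """Dice coefficient between the n-gram sets of two strings."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    if not ga or not gb:
        return 0.0
    return 2 * len(ga & gb) / (len(ga) + len(gb))


def best_match(term, candidates, n=2, threshold=0.5):
    """Return the candidate most similar to term, or None if below threshold."""
    scored = [(dice(term, c, n), c) for c in candidates]
    score, cand = max(scored, default=(0.0, None))
    return cand if score >= threshold else None


# Hypothetical target-side word list (an assumption, for illustration only).
names = ["Malaysia", "Indonesia", "Singapore"]
print(best_match("malaysia", names))
print(best_match("xyz", names))
```

In a pipeline like the one the abstract outlines, a matcher of this kind would presumably run after stemming and POS tagging, on terms that a dictionary lookup fails to translate directly. That ordering is an assumption here, not a detail stated in the record.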
Keywords :
information retrieval systems; language translation; natural language processing; query processing; Malay-English cross-language information retrieval system; dictionary-based CLIR system; n-gram matching techniques; natural language processing techniques; part-of-speech tagging; query translation architecture; search engine; Multimedia communication; Part-of-Speech (POS) tagging; cross-language information retrieval; n-gram matching; query ambiguity; query translation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology (ITSim), 2010 International Symposium in
Conference_Location :
Kuala Lumpur
ISSN :
2155-897
Print_ISBN :
978-1-4244-6715-0
Type :
conf
DOI :
10.1109/ITSIM.2010.5561589
Filename :
5561589