DocumentCode
2100449
Title
Automatic translation of Arabic queries for bilingual information retrieval
Author
Mallat, S. ; Zouaghi, Anis ; Hkiri, Emna ; Zrigui, M.
Author_Institution
Comput. Sci. Dept., Fac. of Sci. of Monastir, Monastir, Tunisia
fYear
2013
fDate
24-26 Oct. 2013
Firstpage
1
Lastpage
6
Abstract
In this paper, we present an automatic query translation in Arabic for information retrieval. This system, implements a method for lexical disambiguation of terms in a query. To choose the best sense, each term of the query is projected on the French EuroWordNet and we extract his related concepts in order to form a semantic network. In a second time using the same type of network, but by integrating LSI (Latent Semantic Index), and extraction of contextual hidden links between the concepts of the list (listSRF). This list extracted by the automatic alignment by Mkalign tool´s, from the knowledge base defined by the Monde Diplomatic parallel corpus. This tool will allow us to map between the relevant sentences in Arabic identified in our previous work with French sentences to build a listSRF. This list forms the basis of the lexical disambiguation method. Finally, we propose a mechanism for selecting the best sense of the ambiguous term of the query, based on the matching between each network corresponds to a word in the query with the network listSRF to extract an adequate sense with the highest degree of similarity. An evaluation and comparison are conducted to measure the quality of our translation system.
Keywords
language translation; natural language processing; query processing; semantic networks; Arabic query automatic translation; French EuroWordNet; French sentences; LSI; Monde Diplomatic parallel corpus; bilingual information retrieval; concept extraction; contextual hidden link extraction; latent semantic index; listSR; semantic network; term lexical disambiguation; Context; Dictionaries; Indexes; Knowledge based systems; Measurement; Semantics; Training; automatic evaluation metrics; contextual relationships; lexical disambiguation; machine translation; matching; semantic and conceptual indexing; systems of multilingual information retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technology and Accessibility (ICTA), 2013 Fourth International Conference on
Conference_Location
Hammamet
Type
conf
DOI
10.1109/ICTA.2013.6815308
Filename
6815308
Link To Document