Title :
Learning to rank for determining relevant document in Indonesian-English cross language information retrieval using BM25
Author :
Sari, Syandra ; Adriani, Mima
Author_Institution :
Fac. of Comput. Sci., Univ. of Indonesia, Depok, Indonesia
Abstract :
One important task in cross-language information retrieval (CLIR) is to determine the relevance of a document from a number of documents based on user query. In this paper we applied pointwise learning to rank in SVM (Support Vector Machine) to determine the relevance of a document and used BM25 (Best Match 25) ranking function for selecting words as features. We did the experiment in Indonesian-English CLIR The results show an average ability of SVM to identify relevant documents is 88.51%, while the average accuracy of SVM to identify non relevant documents is 88%.
Keywords :
document handling; learning (artificial intelligence); natural language processing; query processing; support vector machines; user interfaces; BM25; Best Match 25 ranking function; CLIR; Indonesian-English cross language information retrieval; SVM; pointwise learning; relevant document; support vector machine; user query; Computer science; Data collection; Decision support systems; Handheld computers; Research and development; Support vector machines;
Conference_Titel :
Advanced Computer Science and Information Systems (ICACSIS), 2014 International Conference on
DOI :
10.1109/ICACSIS.2014.7065896