Title :
An incremental learning strategy for search results optimization
Author :
Xiang Liu ; Dequan Zheng ; Bing Xu
Abstract :
The traditional search engines rarely consider features of the document set, so the retrieval results are not so satisfactory after new documents are added into the retrieval system. In this paper we combine the features of document set with traditional retrieval models and propose an incremental learning strategy to optimize the retrieval results. We got a feature thesaurus by extracting the document set. Then we collected some new features from the newly added documents and refreshed the feature thesaurus. Finally, the search results were reordered according to how well they matched the feature thesaurus with a query. Several parts of experiments show that this method averagely rises by 9.4% in precision, 14.9% in MAP, 4.6% in DCG towards the top 10 results than traditional retrieval means, which means that it processes better while making a query, even better while querying to the newly added documents, and faster while locating the required information.
Keywords :
document handling; feature extraction; learning (artificial intelligence); query processing; search engines; DCG; document querying; document set feature extraction; feature thesaurus; incremental learning strategy; query processing; retrieval model; retrieval system; search engine; search result optimization; Abstracts; Feature extraction; Optimization; Thesauri; Feature thesaurus; incremental learning; reorder; search results optimization;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2013 International Conference on
Conference_Location :
Tianjin
DOI :
10.1109/ICMLC.2013.6890514