DocumentCode
3045253
Title
Pairwise Learning to Rank for Search Query Correction
Author
Novak, A. ; Sedivy, Jan
Author_Institution
Dept. of Cybern., Czech Tech. Univ., Prague, Czech Republic
fYear
2013
fDate
13-16 Oct. 2013
Firstpage
3054
Lastpage
3059
Abstract
This article introduces a new algorithm for a Search Query Spelling Correction System. It is based on learning to rank approach and allows to use large number of various signals leading to an improved accuracy. The performance will be tested against the conventional solution - the Noisy Channel Model. The new system was developed on a Czech Internet search query set, but the feature vector structure and the algorithm can be easily adapted for any other language when sufficient data is available. We will describe the algorithm details, the training and validation data sets. Further, we will discuss the selection and impact of the new feature vector signals.
Keywords
Internet; learning (artificial intelligence); query processing; spelling aids; Czech Internet search query set; feature vector signals; feature vector structure; learning to rank model; noisy channel model; pairwise learning; search query correction rank; search query spelling correction system; Data models; Electronic publishing; Encyclopedias; Internet; Noise measurement; Vectors; feature vector; learning to rank; machine learning; noisy channel model; query correction;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
Conference_Location
Manchester
Type
conf
DOI
10.1109/SMC.2013.521
Filename
6722274
Link To Document