Abstract :
Predicting users´ future requests in the World Wide Web can be applied effectively in many important applications, such as web search, latency reduction, and personalization systems. Such application has traditional tradeoffs between modeling complexity and prediction accuracy. In this paper, we study several hybrid models that combine different classification techniques, namely, Markov models, artificial neural networks (ANNs), and the All-Kth-Markov model, to resolve prediction using Dempster´s rule. Such fusion overcomes the inability of the Markov model in predicting beyond the training data, as well as boosts the accuracy of ANN, particularly, when dealing with a large number of classes. We also employ a reduction technique, which uses domain knowledge, to reduce the number of classifiers to improve the predictive accuracy and the prediction time of ANNs. We demonstrate the effectiveness of our hybrid models by comparing our results with widely used techniques, namely, the Markov model, the All-Kth-Markov model, and association rule mining, based on a benchmark data set.
Keywords :
Internet; Markov processes; data mining; data reduction; neural nets; pattern classification; uncertainty handling; Dempster rule; Markov model; Web navigation prediction; Web search; artificial neural network; association rule mining; domain knowledge; latency reduction; multiple evidence combination; pattern classification; personalization system; reduction technique; Accuracy; Artificial neural networks; Association rules; Data mining; Delay; Navigation; Predictive models; Training data; Web search; Web sites; Artificial neural networks (ANNs); Dempster´s rule; Markov model; N-gram; association rule mining (ARM);
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on