Title :
A Web surfer model incorporating topic continuity
Author_Institution :
Machine Intelligence Unit., Indian Stat. Inst., Calcutta, India
fDate :
5/1/2005 12:00:00 AM
Abstract :
This paper describes a surfer model which incorporates information about topic continuity derived from the surfer´s history. Therefore, unlike earlier models, it captures the interrelationship between categorization (context) and ranking of Web documents simultaneously. The model is mathematically formulated. A scalable and convergent iterative procedure is provided for its implementation. Its different characteristic features, as obtained from the joint probability matrix, and their significance in Web intelligence are mentioned. Experiments performed on Web pages obtained from WebBase confirm the superiority of the model.
Keywords :
Internet; information retrieval; iterative methods; Web document; Web intelligence; Web surfer model; WebBase; iterative method; probability matrix; text categorization; topic continuity; Context modeling; Convergence; History; Internet; Mathematical model; State-space methods; Stochastic processes; Uniform resource locators; Web pages; Index Terms- Web intelligence; categorization.; context identification; page ranking; probabilistic surfer history; stochastic processes;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2005.69