Title :
Using discretization and Bayesian inference network learning for automatic filtering profile generation
Author :
Lam, Wai ; Low, Kon Fan
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, China
fDate :
8/1/2000 12:00:00 AM
Abstract :
We develop a new approach for text document filtering based on automatic construction of filtering profiles using Bayesian inference network learning. Bayesian inference networks, based on probability theory, offer a suitable framework to harness the uncertainty found in the nature of the filtering problem. In order to learn the networks effectively, we explore three different techniques for discretization. Good features of high predictive power are automatically obtained from the training document content. Our approach does not need to know in advance the subject or content of documents as well as the information needs expressed as topics. A series of experiments on a set of topics were conducted on two large-scale real-world document corpora. The empirical results demonstrate that our Bayesian inference network learning with advanced discretization achieves better performance over the simple naive Bayesian approach.
Keywords :
belief networks; inference mechanisms; information needs; information retrieval; learning (artificial intelligence); probability; uncertainty handling; Bayesian inference network learning; automatic filtering profile generation; discretization; information needs; probability theory; text document filtering; Bayesian methods; Databases; Feedback; Filtering theory; Information filtering; Information filters; Large-scale systems; Satellite broadcasting; Training data; Uncertainty;
Journal_Title :
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
DOI :
10.1109/5326.885115