DocumentCode :
1407974
Title :
Using discretization and Bayesian inference network learning for automatic filtering profile generation
Author :
Lam, Wai ; Low, Kon Fan
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, China
Volume :
30
Issue :
3
fYear :
2000
fDate :
8/1/2000 12:00:00 AM
Firstpage :
340
Lastpage :
351
Abstract :
We develop a new approach for text document filtering based on automatic construction of filtering profiles using Bayesian inference network learning. Bayesian inference networks, based on probability theory, offer a suitable framework to harness the uncertainty found in the nature of the filtering problem. In order to learn the networks effectively, we explore three different techniques for discretization. Good features of high predictive power are automatically obtained from the training document content. Our approach does not need to know in advance the subject or content of documents as well as the information needs expressed as topics. A series of experiments on a set of topics were conducted on two large-scale real-world document corpora. The empirical results demonstrate that our Bayesian inference network learning with advanced discretization achieves better performance over the simple naive Bayesian approach.
Keywords :
belief networks; inference mechanisms; information needs; information retrieval; learning (artificial intelligence); probability; uncertainty handling; Bayesian inference network learning; automatic filtering profile generation; discretization; information needs; probability theory; text document filtering; Bayesian methods; Databases; Feedback; Filtering theory; Information filtering; Information filters; Large-scale systems; Satellite broadcasting; Training data; Uncertainty;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
Publisher :
ieee
ISSN :
1094-6977
Type :
jour
DOI :
10.1109/5326.885115
Filename :
885115
Link To Document :
بازگشت