Title :
Discovering and filtering text information from Internet based on inductive learning
Author :
Bo, Yang ; Fei, Liu ; Yiqi, Sun
Author_Institution :
Shandong Univ. of Building Mater., Jinan, China
Abstract :
The paper aims at the discovering and filtering of text information from the Internet. Based on the theories of probability and statistics, an essential key word list is established by the statistics of functional words in the sample text and whether or not the text is suspicious. A probability-based inductive learning algorithm is presented to show how to set up and optimize the key words and further more classify the text by the parameter estimation of the text-class attributes. A knowledge-based network information classification and filtering scheme is studied by using the algorithm to different sets of sample text belongs to several domains. The theoretical basis and algorithm implementation are discussed in detail, the experimental results and application examples are also given in the paper
Keywords :
Internet; data mining; learning by example; multi-agent systems; parameter estimation; probability; Internet; essential key word list; knowledge-based network information classification; probability-based inductive learning algorithm; text information; text-class attribute; Building materials; Filtering algorithms; Information filtering; Information filters; Internet; Parameter estimation; Probability; Statistics; Sun;
Conference_Titel :
Intelligent Control and Automation, 2000. Proceedings of the 3rd World Congress on
Conference_Location :
Hefei
Print_ISBN :
0-7803-5995-X
DOI :
10.1109/WCICA.2000.862559