DocumentCode :
2923425
Title :
Eliminating Redundant and Less-Informative RSS News Articles Based on Word Similarity and a Fuzzy Equivalence Relation
Author :
Garcia, Ian ; Ng, Yiu-Kai
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT
fYear :
2006
fDate :
Nov. 2006
Firstpage :
465
Lastpage :
473
Abstract :
The Internet has marked this era as the information age. There is no precedent in the amazing amount of information, especially network news, that can be accessed by Internet users these days. As a result, the problem of seeking information in online news articles is not the lack of them but being overwhelmed by them. This brings huge challenges in processing online news feeds, e.g., how to determine which news article is important, how to determine the quality of each news article, and how to filter irrelevant and redundant information. In this paper, we propose a method for filtering redundant and less-informative RSS news articles that solves the problem of excessive number of news feeds observed in RSS news aggregators. Our filtering approach measures similarity among RSS news entries by using the fuzzy-set information retrieval model and a fuzzy equivalent relation for computing word/sentence similarity to detect redundant and less-informative news articles
Keywords :
Internet; information filtering; information resources; word processing; Internet; fuzzy equivalence relation; fuzzy equivalent relation; fuzzy set information retrieval model; irrelevant information filtering; less-informative news article; network news feeds; news aggregator; news entry; online news article; online news feeds; redundant information filtering; redundant news article; sentence similarity; word similarity; Computer science; Feeds; IP networks; Information filtering; Information filters; Information retrieval; Internet; Large-scale systems; Monitoring; Portals;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on
Conference_Location :
Arlington, VA
ISSN :
1082-3409
Print_ISBN :
0-7695-2728-0
Type :
conf
DOI :
10.1109/ICTAI.2006.54
Filename :
4031932
Link To Document :
بازگشت