DocumentCode
2053432
Title
Automatic abstracting important sentences of web articles
Author
Ren, Fuji ; Li, Shigang ; Kita, Kenji
Author_Institution
Fac. of Eng., Tokushima Univ., Japan
Volume
3
fYear
2001
fDate
2001
Firstpage
1705
Abstract
Being increasingly popular, the Internet greatly changes our live. We can conveniently receive and send information via the Internet. With the information explosion in Web, it is becoming crucial to develop means to automatically extract important sentences from the Web articles. In this paper, we propose a method which uses both statistical and structural information in sentence extraction. In addition, following the analysis of human´s extractions, several heuristic rules are added to filter out non-important sentences and to prevent similar sentences from being extracted. Our experimental results proved the effectiveness of these means. In particular, once the heuristic rules being added, a significant improvement has been observed
Keywords
computational linguistics; information resources; information retrieval; natural languages; Internet; Web articles; heuristic rules; sentence extraction; statistical information; structural information; Data mining; Explosions; Frequency; Humans; Information filtering; Information filters; Internet; Production; Statistics; Surface reconstruction;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location
Tucson, AZ
ISSN
1062-922X
Print_ISBN
0-7803-7087-2
Type
conf
DOI
10.1109/ICSMC.2001.973531
Filename
973531
Link To Document