DocumentCode
2102150
Title
Classification with reject option in text categorisation systems
Author
Fumera, Giorgio ; Pillai, Ignazio ; Roli, Fabio
Author_Institution
Dept. of Electr. & Electron. Eng., Cagliari, Italy
fYear
2003
fDate
17-19 Sept. 2003
Firstpage
582
Lastpage
587
Abstract
The aim of this paper is to evaluate the potential usefulness of the reject option for text categorisation (TC) tasks. The reject option is a technique used in statistical pattern recognition for improving classification reliability. Our work is motivated by the fact that, although the reject option proved to be useful in several pattern recognition problems, it has not yet been considered for TC tasks. Since TC tasks differ from usual pattern recognition problems in the performance measures used and in the fact that documents can belong to more than one category, we developed a specific rejection technique for TC problems. The performance improvement achievable by using the reject option was experimentally evaluated on the Reuters dataset, which is a standard benchmark for TC systems.
Keywords
information retrieval; pattern classification; statistical analysis; text analysis; Reuters dataset benchmark; classification reliability; documents; performance measures; reject option; statistical pattern recognition; text categorisation systems; Indexing; Information filtering; Information filters; Information retrieval; Knowledge engineering; Learning systems; Machine learning; Pattern recognition; Text categorization; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Analysis and Processing, 2003.Proceedings. 12th International Conference on
Print_ISBN
0-7695-1948-2
Type
conf
DOI
10.1109/ICIAP.2003.1234113
Filename
1234113
Link To Document