DocumentCode
1303068
Title
Automatic text categorization and its application to text retrieval
Author
Lam, Wai ; Ruiz, Miguel ; Srinivasan, Padmini
Author_Institution
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Volume
11
Issue
6
fYear
1999
Firstpage
865
Lastpage
879
Abstract
We develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instance-based learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the effectiveness of our categorization approach using two real-world document collections from the MEDLINE database. Next, we investigate the application of automatic categorization to text retrieval. Our experiments clearly indicate that automatic categorization improves the retrieval performance compared with no categorization. We also demonstrate that the retrieval performance using automatic categorization achieves the same retrieval quality as the performance using manual categorization. Furthermore, detailed analysis of the retrieval performance on each individual test query is provided
Keywords
information retrieval; MEDLINE database; automatic text categorization; detailed analysis; document retrieval technique; instance-based learning; learning paradigm; real-world document collections; retrieval feedback; retrieval quality; text retrieval; Cardiac disease; Classification tree analysis; Databases; Feedback; Humans; Information retrieval; Performance analysis; Query processing; Testing; Text categorization;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.824599
Filename
824599
Link To Document