DocumentCode :
1303068
Title :
Automatic text categorization and its application to text retrieval
Author :
Lam, Wai ; Ruiz, Miguel ; Srinivasan, Padmini
Author_Institution :
Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Volume :
11
Issue :
6
fYear :
1999
Firstpage :
865
Lastpage :
879
Abstract :
We develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instance-based learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the effectiveness of our categorization approach using two real-world document collections from the MEDLINE database. Next, we investigate the application of automatic categorization to text retrieval. Our experiments clearly indicate that automatic categorization improves the retrieval performance compared with no categorization. We also demonstrate that the retrieval performance using automatic categorization achieves the same retrieval quality as the performance using manual categorization. Furthermore, detailed analysis of the retrieval performance on each individual test query is provided
Keywords :
information retrieval; MEDLINE database; automatic text categorization; detailed analysis; document retrieval technique; instance-based learning; learning paradigm; real-world document collections; retrieval feedback; retrieval quality; text retrieval; Cardiac disease; Classification tree analysis; Databases; Feedback; Humans; Information retrieval; Performance analysis; Query processing; Testing; Text categorization;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.824599
Filename :
824599
Link To Document :
بازگشت