• DocumentCode
    1303068
  • Title

    Automatic text categorization and its application to text retrieval

  • Author

    Lam, Wai ; Ruiz, Miguel ; Srinivasan, Padmini

  • Author_Institution
    Dept. of Syst. Eng. & Eng. Manage., Chinese Univ. of Hong Kong, Shatin, Hong Kong
  • Volume
    11
  • Issue
    6
  • fYear
    1999
  • Firstpage
    865
  • Lastpage
    879
  • Abstract
    We develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instance-based learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the effectiveness of our categorization approach using two real-world document collections from the MEDLINE database. Next, we investigate the application of automatic categorization to text retrieval. Our experiments clearly indicate that automatic categorization improves the retrieval performance compared with no categorization. We also demonstrate that the retrieval performance using automatic categorization achieves the same retrieval quality as the performance using manual categorization. Furthermore, detailed analysis of the retrieval performance on each individual test query is provided
  • Keywords
    information retrieval; MEDLINE database; automatic text categorization; detailed analysis; document retrieval technique; instance-based learning; learning paradigm; real-world document collections; retrieval feedback; retrieval quality; text retrieval; Cardiac disease; Classification tree analysis; Databases; Feedback; Humans; Information retrieval; Performance analysis; Query processing; Testing; Text categorization;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.824599
  • Filename
    824599