• DocumentCode
    1542780
  • Title

    Document ranking and the vector-space model

  • Author

    Lee, Dik L. ; Chuang, Huei ; Seamons, Kent

  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., Hong Kong
  • Volume
    14
  • Issue
    2
  • fYear
    1997
  • Firstpage
    67
  • Lastpage
    75
  • Abstract
    Efficient and effective text retrieval techniques are critical in managing the increasing amount of textual information available in electronic form. Yet text retrieval is a daunting task because it is difficult to extract the semantics of natural language texts. Many problems must be resolved before natural language processing techniques can be effectively applied to a large collection of texts. Most existing text retrieval techniques rely on indexing keywords. Unfortunately, keywords or index terms alone cannot adequately capture the document contents, resulting in poor retrieval performance. Yet keyword indexing is widely used in commercial systems because it is still the most viable way by far to process large amounts of text. Using several simplifications of the vector-space model for text retrieval queries, the authors seek the optimal balance between processing efficiency and retrieval effectiveness as expressed in relevant document rankings
  • Keywords
    document handling; indexing; information retrieval; natural language interfaces; statistical analysis; vocabulary; commercial systems; document rankings; index terms; indexing; keywords; natural language text; retrieval performance; statistical analysis; text retrieval techniques; textual information; vector-space model; Content based retrieval; Data mining; Database languages; Feedback; Indexing; Information retrieval; Information systems; Particle measurements; Prototypes; Technology management;
  • fLanguage
    English
  • Journal_Title
    Software, IEEE
  • Publisher
    ieee
  • ISSN
    0740-7459
  • Type

    jour

  • DOI
    10.1109/52.582976
  • Filename
    582976