• DocumentCode
    3022540
  • Title

    A novel context matching based technique for Web document retrieval

  • Author

    Zakos, John ; Verma, Brijesh

  • Author_Institution
    Sch. of Inf. & Comm. Technol., Griffith Univ., Brisbane, Qld., Australia
  • fYear
    2005
  • fDate
    29 Aug.-1 Sept. 2005
  • Firstpage
    909
  • Abstract
    This paper presents a novel context matching technique for the retrieval of Web documents. The aim of the technique is to dynamically generate a context-based measure of document term significance during retrieval that can be used as a substitute or co-contributor of the term frequency measure. Unlike term frequency, which relies on a term to occur multiple times within a document to be considered significant, context matching is based on the notion that if a term in a given document occurs in that document in the context of the query, then that term is deemed to be significant. Context matching has the ability to potentially determine a term to be significant even if it occurs only once in a large document. The proposed technique has been implemented and the experiments were conducted using a TREC benchmark database. A comparative analysis shows that context matching significantly improves retrieval effectiveness and outperforms previously published results.
  • Keywords
    Internet; document handling; information retrieval; pattern matching; TREC benchmark database; Web document retrieval; context matching; Australia; Data analysis; Data mining; Databases; Frequency measurement; Indexing; Information retrieval; Information technology; Web sites; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
  • ISSN
    1520-5263
  • Print_ISBN
    0-7695-2420-6
  • Type

    conf

  • DOI
    10.1109/ICDAR.2005.26
  • Filename
    1575676