• DocumentCode
    476774
  • Title
    Hash join algorithms used in text-based information retrieval: Guidelines for users
  • Author
    Rahman, Nurazzah Abd; Saad, Tareq Salahi
  • Author_Institution
    Faculty of Information Technology & Quantitative Sciences, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia
  • Volume
    2
  • fYear
    2008
  • fDate
    26-28 Aug. 2008
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Text-based information retrieval (IR) is a field in which searching large documents is a basic concept. Concise queries are fundamental to satisfying a user's need for information. One of the fundamental techniques for implementing queries in IR is hashing, and different types of hashing algorithms are used in IR. This paper discusses guidelines for users who implement hash join algorithm variations in their IR applications. The algorithms vary in the techniques involved in their join operations. Three variations of the hash join algorithm, namely the XJoin algorithm, the Hash Merge Join (HMJ) algorithm, and the Early Hash Join (EHJ) algorithm, are studied experimentally. An analysis of the results obtained is given. A user guideline based on three factors is presented: overall execution time, response time, and input/output operations performed.
  • Keywords
    Delay; Design optimization; Guidelines; Indexing; Information retrieval; Information technology; Query processing; Relational databases; System performance; Time factors
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Technology, 2008. ITSim 2008. International Symposium on
  • Conference_Location
    Kuala Lumpur, Malaysia
  • Print_ISBN
    978-1-4244-2327-9
  • Electronic_ISBN
    978-1-4244-2328-6
  • Type
    conf
  • DOI
    10.1109/ITSIM.2008.4631743
  • Filename
    4631743