DocumentCode
476774
Title
Hash join algorithms used in text-based information retrieval: Guidelines for users
Author
Rahman, Nurazzah Abd ; Saad, Tareq Salahi
Author_Institution
Faculty of Information Technology & Quantitative Sciences, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia
Volume
2
fYear
2008
fDate
26-28 Aug. 2008
Firstpage
1
Lastpage
7
Abstract
Text-based information retrieval (IR) is a field in which searching large documents is a basic concept. Concise queries are fundamental to satisfying a user’s need for information. One of the fundamental techniques used to implement queries in IR is hashing, and different types of hashing algorithms are used in IR. This paper discusses guidelines for users who implement hash join algorithm variations in their IR applications. The algorithms differ in the techniques they use in join operations. Three variations of the hash join algorithm, namely XJoin, Hash Merge Join (HMJ), and Early Hash Join (EHJ), are studied experimentally, and an analysis of the results is given. A user guideline based on three factors is presented: overall execution time, response time, and input/output operations performed.
Keywords
Delay; Design optimization; Guidelines; Indexing; Information retrieval; Information technology; Query processing; Relational databases; System performance; Time factors;
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Technology, 2008. ITSim 2008. International Symposium on
Conference_Location
Kuala Lumpur, Malaysia
Print_ISBN
978-1-4244-2327-9
Electronic_ISBN
978-1-4244-2328-6
Type
conf
DOI
10.1109/ITSIM.2008.4631743
Filename
4631743