DocumentCode :
476774
Title :
Hash join algorithms used in text-based information retrieval: Guidelines for users
Author :
Rahman, Nurazzah Abd ; Saad, Tareq Salahi
Author_Institution :
Faculty of Information Technology & Quantitative Sciences, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia
Volume :
2
fYear :
2008
fDate :
26-28 Aug. 2008
Firstpage :
1
Lastpage :
7
Abstract :
Text-Based Information Retrieval (IR) is a field where the search in a large document is a basic concept. Concise queries are very fundamentals process in order to satisfy user’s need for information. One of the basic fundamental techniques in IR to implement queries is Hashing. Different types of hashing algorithms are used in IR. This paper discussed about guidelines for users who are implementing Hash join algorithm variations in their IR applications. Algorithms are varied based on its techniques involved in join operations. Three different variations of hash join algorithm, namely, XJoin algorithm, Hash Merge Join (HMJ) algorithm, and Early Hash Join (EHJ) algorithm are studied experimentally. Analysis on the results obtained is given. A user guideline based on three factors: overall execution time, response time and input/output operations performed are presented.
Keywords :
Delay; Design optimization; Guidelines; Indexing; Information retrieval; Information technology; Query processing; Relational databases; System performance; Time factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology, 2008. ITSim 2008. International Symposium on
Conference_Location :
Kuala Lumpur, Malaysia
Print_ISBN :
978-1-4244-2327-9
Electronic_ISBN :
978-1-4244-2328-6
Type :
conf
DOI :
10.1109/ITSIM.2008.4631743
Filename :
4631743
Link To Document :
بازگشت