DocumentCode :
2463106
Title :
An Efficient Theta-Join Query Processing Algorithm on MapReduce Framework
Author :
Chen, Shih-Ying ; Chang, Tsui-Ping ; Chang, Zhi-Hong
fYear :
2012
fDate :
4-6 June 2012
Firstpage :
686
Lastpage :
689
Abstract :
As the rapid development of hardware and network technology, cloud computing has become an important research topic. For applications of large-scale data processing, such as data warehouse, Map Reduce is the most famous platform for parallel data processing in cloud computing. To support the star-join queries in data warehouse, Scatter-Gather-Merge (SGM) proposes an efficient algorithm on the Map Reduce framework. However, SGM supports only the equi-join queries. Nonequi-join queries may cause SGM to fail. In this paper, we propose a method to cope with theta-join queries, i.e., both equi-join and nonequi-join queries. Our proposed method uses a novel manipulation of keys for partitioning data. The key manipulation matches up the Map Reduce paradigm, and makes theta-join queries workable on the Map Reduce platform. Our experimental results show that the proposed method achieves similar performance to SGM, but our method supports more join-query types. Our method performs even better than SGM in some query types of high data selectivity.
Keywords :
Internet; cloud computing; data handling; data warehouses; query processing; search engines; Internet; MapReduce framework; SGM; Scatter-Gather-Merge; cloud computing; data selectivity; data warehouse; efficient theta join query processing algorithm; equijoin queries; hardware technology; network technology; parallel data processing; search engine; star-join queries; Algorithm design and analysis; Benchmark testing; Data processing; File systems; Filtering algorithms; Operating systems; Radio frequency; MapReduce platform; query processing large-scale data; theta-join queries;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer, Consumer and Control (IS3C), 2012 International Symposium on
Conference_Location :
Taichung
Print_ISBN :
978-1-4673-0767-3
Type :
conf
DOI :
10.1109/IS3C.2012.178
Filename :
6228401
Link To Document :
بازگشت