DocumentCode :
150215
Title :
Efficient way of searching data in MapReduce paradigm
Author :
Shah, Ghafoor ; Annappa ; Shet, K.C.
fYear :
2014
fDate :
5-7 March 2014
Firstpage :
305
Lastpage :
310
Abstract :
Cloud computing has emerged as an effective solution in the computing world. When the cloud is used for large amounts of data storage, searching for any required data takes lots of time. A framework is required to distribute the work of searching and fetching from thousands of computers. The data in Hadoop Distributed File System is scattered and needs lots of time to retrieve. MapReduce function on data sets of key & value pair is the programming paradigm of large distributed operation. The proposed work aims to minimize the data retrieval time taken by the MapReduce program in the cloud. The major idea is to design a web server in the map phase using the jetty web server which shall give a fast and efficient way of searching data in MapReduce paradigm. For real time processing on Hadoop, a search mechanism is implemented in HDFS. The load balancer is used to balance the workload across servers to improve its availability, performance and scalability.
Keywords :
cloud computing; information retrieval; parallel programming; resource allocation; Hadoop distributed file system; MapReduce paradigm; Web server; cloud computing; data fetching; data retrieval time; data search; data storage; load balancer; programming paradigm; Cloud computing; Computer architecture; Distributed databases; File systems; Indexes; Web servers; Hadoop; MapReduce; indexing; jetty server; load balancing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing for Sustainable Global Development (INDIACom), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-93-80544-10-6
Type :
conf
DOI :
10.1109/IndiaCom.2014.6828149
Filename :
6828149
Link To Document :
بازگشت