DocumentCode :
3719541
Title :
Study of Hadoop-MapReduce on Google N-Gram Datasets
Author :
Satyajit Bhowmick;Suryadip Chakraborty;Dharma P. Agrawal
Author_Institution :
Dept. of Electr. Eng. &
fYear :
2015
Firstpage :
488
Lastpage :
490
Abstract :
In previous decades, there has been a significant paradigm shift in the domain of computer architecture and processing mechanisms of large-scale data due to the increase of computational power caused by an overwhelming flow of massive amount of data. Hadoop and MapReduce are very powerful concepts which enable the efficient development of scalable and parallel applications required for processing vast amounts of data. In this paper, we investigate the concept of Hadoop and MapReduce and eventually use the programming tool of MapReduce and Apache Pig to solve existing computation problems of very complex and complicated Google Ngrams datasets.
Keywords :
"Google","Dictionaries","Distributed databases","Electrical engineering","Programming","File systems"
Publisher :
ieee
Conference_Titel :
Mobile Ad Hoc and Sensor Systems (MASS), 2015 IEEE 12th International Conference on
Type :
conf
DOI :
10.1109/MASS.2015.105
Filename :
7366980
Link To Document :
بازگشت