Title :
A Solution for Privacy Protection in MapReduce
Author :
Tran, Quang ; Sato, Hiroyuki
Author_Institution :
Grad. Sch. of Eng., Univ. of Tokyo, Tokyo, Japan
Abstract :
Recently, the development of storage and networking technology have made processing tremendous data become real. As a result, the demand of discovering knowledge from the bigdata by using tools such as statistical analysis and data mining become higher. Using MapReduce a software framework introduced by Google in 2004 to implement computations on clusters of commodity computers is an economical solution. However, malicious MapReduce framework or source codes can leak the sensitive data through computation process. Giving user the least privilege on MapReduce-based system can solve the problem. Therefore, in our research, we propose a MapReduce-based computational system limiting the access to system resource by using RBAC and TE. Moreover, noise were added to the output of the Reduce to ensure the computational result can not signal the presence of a sensitive data. Our prototype implementation demonstrates the efficiency of preserving privacy on several cases.
Keywords :
data mining; data privacy; data reduction; source coding; statistical analysis; Google; MapReduce-based computational system; RBAC; TE; commodity computer clusters; data mining; economical solution; knowledge discovery; malicious MapReduce framework; networking technology; privacy protection; sensitive data; source codes; statistical analysis; storage technology; system resource; tremendous data processing; Access control; Data privacy; Noise; Privacy; Sensitivity; Usability; Cloud Computing; Privacy; Randomization; Security; Static Code Analysis;
Conference_Titel :
Computer Software and Applications Conference (COMPSAC), 2012 IEEE 36th Annual
Conference_Location :
Izmir
Print_ISBN :
978-1-4673-1990-4
Electronic_ISBN :
0730-3157
DOI :
10.1109/COMPSAC.2012.70