Title :
Research on private cloud platform of seed tracing based on Hadoop parallel computing
Author :
Li Dongming; Li Yan; Yuan Chao; Chen Haochuan; Zhang Lijuan
Author_Institution :
School of Information Technology Jilin Agriculture University, Changchun, China
Abstract :
It is important for the management of seeds tracing which includes the collection and calculation of seeds sale. In the paper, firstly, we designed and implemented the seeds Hadoop-based trace data processing model, through the processing of the crawl data of seed, such as ETL process, parallel computing, distributed storage. Secondly, we studied the Consistent Hash Algorithm to optimize the database cluster configuration and the great data parallel computing of MapReduce. The experiment results show that compared with the traditional single node approach, using private cloud parallel calculation can greatly improve parallel computing, the efficiency of storage and load capacity of the platform. The processing efficiency of cloud platform is increased by 33.3%.
Keywords :
"Cloud computing","Computational modeling","Data models","Distributed databases","Parallel processing","Algorithm design and analysis"
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2015 4th International Conference on
DOI :
10.1109/ICCSNT.2015.7490722