DocumentCode :
567056
Title :
Distrim: Parallel GMM learning on multicore cluster
Author :
Yang, Renyong ; Xiong, Tengke ; Chen, Tao ; Huang, Zhexue ; Feng, Shengzhong
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Volume :
2
fYear :
2012
fDate :
25-27 May 2012
Firstpage :
630
Lastpage :
635
Abstract :
Learning GMM model on extreme large data is challenging. We provide theoretical support for the feasibility of parallel EM-based GMM learning via distributed computing, and also design and implement a distributed memory sharing GMM learning system on multicore clusters, which is named as Distrim. Distrim aims to maximize the usage of computational power and minimize the communication overheads as much as possible. The experimental results show that Distrim is much more efficient than Hadoop, and also has a good scalability with respect to the number of computing nodes.
Keywords :
Gaussian Mixture Model; MPI; distributed computing; memory sharing; parallel learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
Conference_Location :
Zhangjiajie, China
Print_ISBN :
978-1-4673-0088-9
Type :
conf
DOI :
10.1109/CSAE.2012.6272849
Filename :
6272849
Link To Document :
بازگشت