Title :
Distrim: Parallel GMM learning on multicore cluster
Author :
Yang, Renyong ; Xiong, Tengke ; Chen, Tao ; Huang, Zhexue ; Feng, Shengzhong
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Abstract :
Learning GMM model on extreme large data is challenging. We provide theoretical support for the feasibility of parallel EM-based GMM learning via distributed computing, and also design and implement a distributed memory sharing GMM learning system on multicore clusters, which is named as Distrim. Distrim aims to maximize the usage of computational power and minimize the communication overheads as much as possible. The experimental results show that Distrim is much more efficient than Hadoop, and also has a good scalability with respect to the number of computing nodes.
Keywords :
Gaussian Mixture Model; MPI; distributed computing; memory sharing; parallel learning;
Conference_Titel :
Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
Conference_Location :
Zhangjiajie, China
Print_ISBN :
978-1-4673-0088-9
DOI :
10.1109/CSAE.2012.6272849