DocumentCode :
2977677
Title :
LPFSC: A Light Weight Parallel Framework for Super Computing
Author :
Yulong Ou ; Bo Li ; Zheng Yuan ; Qiang Hao ; Zhongzhi Luan ; Depei Qian
Author_Institution :
Dept. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
fYear :
2012
fDate :
14-16 Dec. 2012
Firstpage :
453
Lastpage :
458
Abstract :
Supercomputing on heterogeneous architectures that integrate multi-core or many-core processors has developed at a dramatic pace. It is widely used in theoretical physics, theoretical chemistry, climate modeling, biological simulation, and medical research for high-performance and energy-efficient computing. Yet running scientific applications efficiently on large-scale supercomputers built from heterogeneous multiprocessors remains a big challenge for users. At the same time, the overhead of resource management, job scheduling, and system reliability in a large supercomputer is becoming more and more important. In this paper, LPFSC, a lightweight parallel framework for supercomputing, is presented, which helps programmers plan their tasks on a supercomputer. A huge supercomputer system may have hundreds of thousands of nodes, over a million processor cores, and many other kinds of processors; a general main-slave computing mode can hardly handle such a huge number of heterogeneous processors. LPFSC consists of modules for multiple master-slave support, load balancing among a huge number of computing tasks, and reliability support. Additional features will be added in the near future, and the framework is intended to provide good support for large heterogeneous computer systems. Finally, a large number of 2D-FFT tasks of varying sizes are tested under the framework for evaluation, showing that it can scale to more than 300 processors.
Keywords :
multiprocessing systems; parallel machines; parallel programming; processor scheduling; resource allocation; 2D-FFT; LPFSC; biology simulation; climate modeling; energy-efficient computing; heterogeneous architectures; heterogeneous multiprocessors; high-performance computing; job scheduling; large-scale heterogeneous supercomputer system; lightweight parallel framework for super computing; load balancing; main-slave computing mode; many-cores processors; medicine research; multicore processors; multiple master-slave support; overhead cost; reliability support; resource managements; system reliability; theoretical chemistry; theoretical physics; Central Processing Unit; Computer architecture; Heart beat; Master-slave; Processor scheduling; Program processors; Supercomputers; LPSFC; heterogeneous architecture; supercomputing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2012 13th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-0-7695-4879-1
Type :
conf
DOI :
10.1109/PDCAT.2012.89
Filename :
6589320