DocumentCode
2647274
Title
A novel parallel prefix sum algorithm and its implementation on multi-core platforms
Author
Zhang, Nan
Author_Institution
Dept. of Comput. Sci. & Software Eng., Xi´´an Jiaotong-Liverpool Univ., Suzhou, China
Volume
2
fYear
2010
fDate
16-18 April 2010
Abstract
This paper presents a novel parallel prefix sum algorithm on n numbers by p processors. A parallel time analysis shows that this proposed algorithm is more efficient and easy-scalable than Blelloch´s classic parallel solution for the same problem. Unlike Blelloch´s method, the proposed algorithm does not require the number p of processors to be a power of 2, and it needs no extra buffers apart from the array that holds the numbers. For practical considerations, a variant of the proposed algorithm is further developed which incurs less overhead due to thread management and synchronisation. The Blelloch´s algorithm and the variant of the proposed method are implemented via POSIX Threads, and are tested on two multicore platforms. The results showed that the variant outperformed the Blelloch´s method provided that the size of the array not exceeded the total capacity of the L2 cache of the host machine. In some of the tested cases, the increase in performance was as high as 50%.
Keywords
Unix; cache storage; multi-threading; multiprocessing systems; synchronisation; Blelloch classic parallel solution; L2 cache; POSIX threads; multicore platforms; parallel prefix sum algorithm; parallel time analysis; synchronisation; thread management; Algorithm design and analysis; Computer science; Concurrent computing; Multicore processing; Parallel algorithms; Parallel processing; Scalability; Software algorithms; Testing; Yarn; Multi-core computing; Parallel algorithm; Parallel computing; Parallel prefix sum;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4244-6347-3
Type
conf
DOI
10.1109/ICCET.2010.5485315
Filename
5485315
Link To Document