DocumentCode :
792811
Title :
An efficient parallel prefix sums architecture with domino logic
Author :
Lin, Rong ; Nakano, Koji ; Olariu, Stephan ; Zomaya, Albert Y.
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Geneseo, NY, USA
Volume :
14
Issue :
9
fYear :
2003
Firstpage :
922
Lastpage :
931
Abstract :
The main contribution of this work is to propose an efficient parallel prefix sums architecture based on the recently-developed technique of shift switching with domino logic, where the charge/discharge signals propagate along the switch chain producing semaphores in a network that is fast and highly hardware-compact. The proposed architecture for computing the prefix sums of N-1 bits features a total delay of (4 log N + √N-2)*Td, where Td is the delay for charging or discharging a row of two prefix sum units of eight shift switches. Our simulation results show that, under 0.8-micron CMOS technology, the delay Td does not exceed 1 ns. As it turns out, our design is faster than any design known to us for values on N in the range 1 ≤ N ≤ 210. Yet, another important and novel feature of the proposed architecture is that it requires very simple controls, partially driven by the semaphores. This significantly reduces the hardware complexity of the design and fully utilizes the inherent speed of the process.
Keywords :
CMOS digital integrated circuits; VLSI; delays; formal logic; logic design; parallel architectures; CMOS; VLSI design; binary counting; delay; domino logic; hardware complexity; parallel prefix sums architecture; semaphores; shift switching; simulation; switch chain; Arithmetic; CMOS technology; Compaction; Computer architecture; Concurrent computing; Delay; Hardware; Logic; Switches; Very large scale integration;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2003.1233714
Filename :
1233714
Link To Document :
بازگشت