DocumentCode
14616
Title
Modular Design of High-Throughput, Low-Latency Sorting Units
Author
Farmahini-Farahani, Amin ; Duwe, H.J. ; Schulte, M.J. ; Compton, Katherine
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Wisconsin-Madison, Madison, WI, USA
Volume
62
Issue
7
fYear
2013
fDate
Jul-13
Firstpage
1389
Lastpage
1402
Abstract
High-throughput and low-latency sorting is a key requirement in many applications that deal with large amounts of data. This paper presents efficient techniques for designing high-throughput, low-latency sorting units. Our sorting architectures utilize modular design techniques that hierarchically construct large sorting units from smaller building blocks. The sorting units are optimized for situations in which only the M largest numbers from N inputs are needed, because this situation commonly occurs in many applications for scientific computing, data mining, network processing, digital signal processing, and high-energy physics. We utilize our proposed techniques to design parameterized, pipelined, and modular sorting units. A detailed analysis of these sorting units indicates that as the number of inputs increases their resource requirements scale linearly, their latencies scale logarithmically, and their frequencies remain almost constant. When synthesized to a 65-nm TSMC technology, a pipelined 256-to-4 sorting unit with 19 stages can perform more than 2.7 billion sorts per second with a latency of about 7 ns per sort. We also propose iterative sorting techniques, in which a small sorting unit is used several times to find the largest values.
Keywords
sorting; TSMC technology; data mining; digital signal processing; high-energy physics; high-throughput sorting unit design; high-throughput sorting units; iterative sorting techniques; large sorting units; low-latency sorting units; modular design techniques; network processing; pipelined 256-to-4 sorting unit; scientific computing; sorting architectures; Algorithm design and analysis; Computer aided engineering; Hardware; Large Hadron Collider; Merging; Sorting; Throughput; Algorithm design and analysis; Computer aided engineering; Hardware; Large Hadron Collider; Merging; Sorting; TSMC technology; Throughput; VLSI designs; data mining; design optimization; digital signal processing; high-energy physics; high-throughput sorting unit design; high-throughput sorting units; iterative sorting; iterative sorting techniques; large sorting units; low-latency sorting units; modular design techniques; network processing; parallel sorting algorithms; partial sorting; pipelined 256-to-4 sorting unit; scientific computing; sorting; sorting architectures;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2012.108
Filename
6205742
Link To Document