DocumentCode :
3260241
Title :
Optimal Parallel Hardware K-Sorter and Top K-Sorter, with FPGA Implementations
Author :
Matsumoto, Naoyuki ; Nakano, Koji ; Ito, Yasuaki
Author_Institution :
Dept. of Inf. Eng., Hiroshima Univ., Higashi-Hiroshima, Japan
fYear :
2015
fDate :
June 29 2015-July 2 2015
Firstpage :
138
Lastpage :
147
Abstract :
This paper presents a FIFO-based parallel merge sorter optimized for the latest FPGA. More specifically, we show a sorter that sorts K keys in latency K+log2 K-1 using log2 K comparators. It uses K/M +log2 K +log2 M-1 memory blocks with capacity M to implement FIFOs. It receives K keys one by one in every clock cycle and outputs the sorted sequence of them from K + log2 K - 1 clock cycles after. Since K clock cycles are necessary to input all K keys, our sorter is almost optimal in terms of the latency. Also, since the total FIFO capacity is only K +M log2 K +M log2 M -M and at least K keys must be stored in the sorter, our sorter is also almost optimal in terms of the total FIFO capacity if M is small. This paper also presents top K-sorter, which outputs top K keys in N input keys for any large N. Our top K-sorter runs in latency N + log2 K using log2 K + 1 comparators. It uses memory blocks of size M and the total FIFO capacity is only 2K+M log2 K +M log2 M - 2M. Quite surprisingly, the total FIFO capacity is independent of N. Also, since the latency must be at least N, that of our top Ksorter is almost optimal in terms of the latency. Finally, we have implemented our K-sorter and top K-sorter in a Xilinx Virtex-7 FPGA using built-in Distributed RAMs and Block RAMs. The implementation results show that our K-sorter reduces the used memory resources by half, and both K-sorter and top K-sorter are practical and efficient.
Keywords :
field programmable gate arrays; random-access storage; FIFO; FIFO capacity; FPGA Implementations; Top K-Sorter; Xilinx Virtex-7 FPGA; block RAM; clock cycle; distributed RAM; memory blocks; optimal parallel hardware K-sorter; Clocks; Computer architecture; Corporate acquisitions; Field programmable gate arrays; Sorting; Standards; Timing; big data; data mining; hardware algorithms; parallel sorting algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing (ISPDC), 2015 14th International Symposium on
Conference_Location :
Limassol
Print_ISBN :
978-1-4673-7147-6
Type :
conf
DOI :
10.1109/ISPDC.2015.23
Filename :
7165140
Link To Document :
بازگشت