Title :
A high throughput LDPC decoder using a mid-range GPU
Author :
Xie Wen ; Jiao Xianjun ; Jaaskelainen, Pekka ; Kultala, Heikki ; Chen Canfeng ; Berg, Heikki ; Bie Zhisong
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
A standard-throughput-approaching LDPC decoder has been implemented on a mid-range GPU in this paper. Turbo-Decoding Message-Passing algorithm is applied to achieve high throughput. Different from traditional host managed multi-streams to hide host-device transfer delay, we use kernel maintained data transfer scheme to achieve implicit data transfer between host memory and device shared memory, which eliminates an intermediate stage of global memory. Data type optimization, memory accessing optimization, and low complexity Soft-In Soft-Out algorithm are also used to improve efficiency. Through these optimization methods, the 802.11n LDPC decoder on NVIDIA GTX480 GPU, which is released in 2010 with Fermi architecture, has achieved a high throughput of 295Mb/s when decoding 512 codewords simultaneously, which is close to highest bit rate 300Mb/s with 20MHz bandwidth in 802.11n standard. Decoding 1024 and 4096 codewords achieve 330 and 365Mb/s. A 802.16e LDPC decoder is also implemented, 374Mb/s (512 codewords), 435Mb/s (1024 codewords) and 507Mb/s (4096 codewords) throughputs have been achieved.
Keywords :
codecs; message passing; parity check codes; turbo codes; 802.11n LDPC decoder; 802.16e LDPC decoder; Fermi architecture; NVIDIA GTX480 GPU; bandwidth 20 MHz; bit rate 295 Mbit/s; bit rate 330 Mbit/s; bit rate 365 Mbit/s; bit rate 374 Mbit/s; bit rate 435 Mbit/s; bit rate 507 Mbit/s; data type optimization; device shared memory; global memory; host managed multistreams; host memory; host-device transfer delay; kernel maintained data transfer scheme; memory accessing optimization; mid-range GPU; soft-in soft-out algorithm; turbo-decoding message-passing algorithm; Decoding; Graphics processing units; IEEE 802.11n Standard; Kernel; Message systems; Parity check codes; Throughput;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6855061