DocumentCode :
2369473
Title :
Parallel LDPC decoder implementation on GPU based on unbalanced memory coalescing
Author :
Kang, Soonyoung ; Moon, Jaekyun
Author_Institution :
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
fYear :
2012
fDate :
10-15 June 2012
Firstpage :
3692
Lastpage :
3697
Abstract :
We consider flexible decoder implementation of low density parity check (LDPC) codes via compute-unified-device-architecture (CUDA) programming on graphics processing unit (GPU), a research subject of considerable recent interest. An important issue in LDPC decoder design based on CUDA-GPU is realizing coalesced memory access, a technique that reduces memory transaction time considerably. In previous works along this direction, it has not been possible to achieve coalesced memory access in both the read and write operations due to the asymmetric nature of the bipartite graph describing the LDPC code structure. In this paper, a new algorithm is proposed that enables coalesced memory access in both the read and write operations for one half of the decoding process - either the bit-to-check or the check-to-bit message passing. For the remaining half of the decoding step our scheme requires address transformation in both the read and write operations but one translating array is sufficient. We also describe the use of on-chip shared memory and texture cache. Overall, experimental results show that proposed GPU-based LDPC decoder achieves more than 234×-speedup compared to CPU-based LDPC decoders and also outperforms existing GPU-based decoders by a significant margin.
Keywords :
decoding; graph theory; graphics processing units; message passing; parallel architectures; parity check codes; CUDA-GPU decoder; LDPC code structure; bipartite graph; bit-to-check message passing; check-to-bit message passing; coalesced memory access; compute-unified-device-architecture programming; decoding process; graphics processing unit; low density parity check codes; memory transaction time reduction; on-chip shared memory; parallel LDPC decoder design; texture cache; unbalanced memory coalescing; Arrays; Decoding; Error analysis; Graphics processing units; Memory management; Message passing; Parity check codes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (ICC), 2012 IEEE International Conference on
Conference_Location :
Ottawa, ON
ISSN :
1550-3607
Print_ISBN :
978-1-4577-2052-9
Electronic_ISBN :
1550-3607
Type :
conf
DOI :
10.1109/ICC.2012.6363991
Filename :
6363991
Link To Document :
بازگشت