Title :
Memory System Optimization for FPGA-Based Implementation of Quasi-Cyclic LDPC Codes Decoders
Author :
Chen, Xiaoheng ; Kang, Jingyu ; Lin, Shu ; Akella, Venkatesh
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, Davis, CA, USA
Abstract :
Designers are increasingly relying on field-programmable gate array (FPGA)-based emulation to evaluate the performance of low-density parity-check (LDPC) codes empirically down to bit-error rates of 10-12 and below. This requires decoding architectures that can take advantage of the unique characteristics of a modern FPGA to maximize the decoding throughput. This paper presents two specific optimizations called vectorization and folding to take advantage of the configurable data-width and depth of embedded memory in an FPGA to improve the throughput of a decoder for quasi-cyclic LDPC codes. With folding it is shown that quasi-cyclic LDPC codes with a very large number of circulants can be implemented on FPGAs with a small number of embedded memory blocks. A synthesis tool called QCSyn is described, which takes the H matrix of a quasi-cyclic LDPC code and the resource characteristics of an FPGA and automatically synthesizes a vector or folded architecture that maximizes the decoding throughput for the code on the given FPGA by selecting the appropriate degree of folding and/or vectorization. This helps not only in reducing the design time to create a decoder but also in quickly retargeting the implementation to a different (perhaps new) FPGA or a different emulation board.
Keywords :
cyclic codes; decoding; field programmable gate arrays; optimisation; parity check codes; FPGA; QCSyn; bit-error rates; decoding throughput; embedded memory blocks; emulation board; field-programmable gate array; low-density parity-check codes; memory system optimization; quasicyclic LDPC code decoders; vectorization; Arrays; Bandwidth; Decoding; Emulation; Field programmable gate arrays; Iterative decoding; Message passing; NASA; Parity check codes; Programmable logic arrays; Random access memory; Throughput; Very large scale integration; Alignment; field programmable logic array (FPGA); folding; low-density parity-check (LDPC) decoder; memory system optimization; normalized min-sum algorithm; quasi-cyclic low-density parity-check (QC-LDPC) codes; very large scale integration (VLSI) implementation;
Journal_Title :
Circuits and Systems I: Regular Papers, IEEE Transactions on
DOI :
10.1109/TCSI.2010.2055250