Title :
High-throughput LDPC decoders
Author :
Mansour, Mohammad M. ; Shanbhag, Naresh R.
Author_Institution :
Univ. of Illinois, Urbana, IL, USA
Abstract :
A high-throughput memory-efficient decoder architecture for low-density parity-check (LDPC) codes is proposed based on a novel turbo decoding algorithm. The architecture benefits from various optimizations performed at three levels of abstraction in system design-namely LDPC code design, decoding algorithm, and decoder architecture. First, the interconnect complexity problem of current decoder implementations is mitigated by designing architecture-aware LDPC codes having embedded structural regularity features that result in a regular and scalable message-transport network with reduced control overhead. Second, the memory overhead problem in current day decoders is reduced by more than 75% by employing a new turbo decoding algorithm for LDPC codes that removes the multiple checkto-bit message update bottleneck of the current algorithm. A new merged-schedule merge-passing algorithm is also proposed that reduces the memory overhead of the current algorithm for low to moderate-throughput decoders. Moreover, a parallel soft-input-soft-output (SISO) message update mechanism is proposed that implements the recursions of the Balh-Cocke-Jelinek-Raviv (BCJR) algorithm in terms of simple "max-quartet" operations that do not require lookup-tables and incur negligible loss in performance compared to the ideal case. Finally, an efficient programmable architecture coupled with a scalable and dynamic transport network for storing and routing messages is proposed, and a full-decoder architecture is presented. Simulations demonstrate that the proposed architecture attains a throughput of 1.92 Gb/s for a frame length of 2304 bits, and achieves savings of 89.13% and 69.83% in power consumption and silicon area over state-of-the-art, with a reduction of 60.5% in interconnect length.
Keywords :
VLSI; algorithm theory; decoding; elemental semiconductors; integrated circuit interconnections; network routing; parity check codes; power consumption; silicon; turbo codes; 288 byte; Balh-Cocke-Jelinek-Raviv algorithm; dynamic transport network; interconnect complexity; interconnect length; low-density parity-check decoders; memory overhead; memory-efficient decoder; merged-schedule merge-passing algorithm; multiple check to-bit message; optimizations; power consumption; routing messages; scalable message-transport network; silicon; soft-input soft-output message update mechanism; storing; turbo decoding algorithm; Algorithm design and analysis; Decoding; Design optimization; Energy consumption; Memory architecture; Parity check codes; Performance loss; Routing; Silicon; Throughput;
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
DOI :
10.1109/TVLSI.2003.817545