A Low-Power 1-Gbps Reconfigurable LDPC Decoder Design For Multiple 4G Wireless Standards
A Low-Power 1-Gbps Reconfigurable LDPC Decoder Design For Multiple 4G Wireless Standards
2008
Abstract— In this paper we present an efficient system-on- 0, where x is a codeword (x ∈ C) and H can be viewed
chip implementation of a 1-Gbps LDPC decoder for 4G (or as a bipartite graph where each column and row in H
beyond 3G) wireless standards. The decoder has a scalable represent a variable node and check node, respectively.
datapath and can be dynamically reconfigured to support mul-
tiple 4G standards. We utilize a pipelined version of the layered
belief propagation algorithm to achieve partial-parallel decoding
A. Block structured LDPC codes
of structured LDPC codes. Instead of using the sub-optimal Min- Non-zero elements in H are typically placed at random
sum algorithm, we propose to use the powerful belief propaga- positions to achieve good performance. However, this
tion (BP) decoding algorithm by designing an area-efficient soft-
input soft-output (SISO) decoder. Two power saving schemes randomness is unfavorable for efficient VLSI implemen-
are employed to reduce the power consumption up to 65%. The tation that calls for structured design. To address this
decoder has been synthesized, placed, and routed on a TSMC issue, block-structured LDPC codes are recently pro-
90nm 1.0V 8-metal layer CMOS technology with a total area of posed for several new communication standards such
3.5 mm2 . The maximum clock frequency is 450 MHz and the as IEEE 802.11n, IEEE 802.16e, DVB-S2 and DMB-
estimated peak power consumption is 410 mW.
T. As shown in Fig. 1, a block structured parity check
I. INTRODUCTION matrix can be viewed as a 2-D array of square sub
matrices. Each sub matrix is either a zero matrix or a
The approaching fourth-generation (4G) wireless sys- cyclically shifted identity matrix Ix . Generally, a block
tems are projected to provide 100 Mbps to 1 Gbps structured parity check matrix H consists of a j × k
speeds by 2010, which consequently leads to orders of array of z × z cyclically shifted identity matrices with
magnitude complexity increases in the wireless receiver random shift values x (0 ≤ x < z). Table 1 summarizes
SoC (System-on-Chip). As a core technology in wireless the design parameters for H in several standards. In
communications, FEC (forward error correction) coding
has migrated from 2G convolutional/block codes to more x
log(1 + e − (||a|−|b||) )
8 |b| - proposed LDPC decoder architecture. In the proposed
b ABS - ABS LUT + M
8
U
D ...
Core Core Core
1 2 z
Fig. 5. One level look-ahead transform of f (·) recursion
Λ’ mn L’ n
400 425
IV. RESULTS
400
350
A multi-mode LDPC decoder which supports both IEEE
250
Fig. 8 shows the VLSI layout view of the LDPC decoder. 325
CTRL Misc
Logic
banks when the LDPC code size is small.
L-Mem Circular
Shifter
In/Out V. CONCLUSION
Buffer
A high performance LDPC decoder has been de-
R4-SISO Decoder + scribed that achieves a throughput of 1 Gbps. The de-
Distributed Λ-Mem coder has a scalable datapath and can be dynamically
x96 reconfigured to support multiple 4G wireless standards.
VI. Acknowledgement
This work was supported in part by Nokia and by NSF
under grants CCF-0541363, CNS-0551692, and CNS-
Fig. 8. VLSI layout view of the LDPC decoder 0619767.
References
Table 3: LDPC decoder architecture comparison [1] R. Gallager, “Low-density parity-check codes,” IEEE Trans. Inf.
This Work [3] [4] Theory, vol. 8, pp. 21–28, Jan. 1962.
Flexibility 802.16e/.11n 802.16e 2048-bit fixed [2] T. Brack, M. Alles, F. Kienle, and N. Wehn, “A Synthesizable IP
Max Throughput 1 Gbps 111 Mbps 640 Mbps Core for WIMAX 802.16e LDPC Code Decoding,” in IEEE 17th
Total Area 3.5 mm2 8.29 mm2 14.3 mm2 Int. Symp. Personal, Indoor and Mobile Radio Communications
Max Frequency 450 MHz 83 MHz 125 MHz (PIMRC), 2006, pp. 1 – 5.
Peak Power 410 mW 52 mW 787 mW [3] X.-Y. Shih, C.-Z. Zhan, C.-H. Lin, and A.-Y. Wu, “A 19-mode
Technology 90 nm 0.13 µm 0.18 µm 8.29mm2 52-mW LDPC Decoder Chip for IEEE 802.16e System,”
Max Iteration 10 8 10 in 2007 Symposium on VLSI Circuits, June 2007.
Algorithm Full BP Min-Sum Linear Apprx. [4] M.M. Mansour and N.R. Shanbhag, “A 640-Mb/s 2048-Bit Pro-
grammable LDPC Decoder Chip,” IEEE Journal of Solid-State
Circuits, vol. 41, pp. 684–698, March 2006.
As low power design is critical for wireless receivers, [5] A.J. Blanksby and C.J. Howland, “A 690-mW 1-Gb/s 1024-b, rate-
in order to save power, we have implemented a simple 1/2 low-density parity-check code decoder,” IEEE Journal of Solid-
State Circuits, vol. 37, no. 3, pp. 404–412, 2002.
and effective early termination criteria for stopping the [6] D. Hocevar, “A reduced complexity decoder architecture via lay-
iteration process. The decoding will stop if the following ered decoding of LDPC codes,” in IEEE Work. on Signal Process-
two conditions are satisfied: 1) the hard decisions for the ing Syst. (SIPS), Oct 2004, pp. 107–112.
[7] T. Zhang, Z. Wang, and K. Parhi, “On finite precision implementa-
information bits based on their LLR values do not change tion of low density parity check codes decoder,” in Int. Symposium
over two successive iterations, and 2) the minimum of the on Circuits and Systems (ISCAS), vol. 4, May 2001, pp. 202–205.
absolute values of the information bit LLRs is larger than [8] J. Hagenauer, E. Offer, and L. Papke, “Iterative decoding of binary
block and convolutional codes,” IEEE Trans. Inf. Theory, vol. 42,
a pre-defined threshold. As shown in Fig. 9 (a), when no. 2, pp. 429 – 445, 1996.
the wireless channel is good, the decoding needs fewer [9] X.-Y. Hu, E. Eleftheriou, D.-M. Arnold, and A. Dholakia, “Efficient
iterations to converge, which therefore saves substantial implementations of the sum-product algorithm for decoding LDPC
codes,” in IEEE GLOBECOM, Oct. 2001, pp. 1036–1036.
power (up to 65% power reduction). Another power sav- [10] K. Gunnam, G. S. Choi, M. B. Yeary, and M. Atiquzzaman, “VLSI
ing technique is to use distributed SISO decoders and Architectures for Layered Decoding for Irregular LDPC Codes of
memory banks. Fig. 9 (b) shows the power reduction from WiMax,” in Int. Conf. Commun. (ICC), June 2007.