Efficient Stochastic Successive Cancellation List Decoder for Polar Codes
SCIENCE CHINA Information Sciences
RESEARCH PAPER. October 2020, Vol. 63 202303:1–202303:19
https://doi.org/10.1007/s11432-019-2924-6
Received 29 December 2019/Revised 13 March 2020/Accepted 22 May 2020/Published online 21 September 2020
Abstract Polar codes are one of the most favorable capacity-achieving codes owing to their simple structures and low decoding complexity. Successive cancellation list (SCL) decoders with large list sizes achieve performances very close to those of maximum-likelihood (ML) decoders. However, hardware cost is a severe problem because an SCL decoder with list size L consists of L copies of a successive cancellation (SC) decoder. To address this issue, a stochastic SCL (SSCL) polar decoder is proposed. Although stochastic computing can achieve a good hardware reduction compared with the deterministic one, its straightforward application to an SCL decoder is not well-suited owing to the precision loss and severe latency. Therefore, a doubling probability approach and adaptive distributed sorting (DS) are introduced. A corresponding hardware architecture is also developed. Field programmable gate array (FPGA) results demonstrate that the proposed stochastic SCL polar decoder can achieve a good performance and complexity tradeoff.
Keywords SCL polar decoder, stochastic computing, 2-bit decoding, distributed sorting, hardware
Citation Liang X, Wang H Z, Shen Y F, et al. Efficient stochastic successive cancellation list decoder for polar
codes. Sci China Inf Sci, 2020, 63(10): 202303, https://doi.org/10.1007/s11432-019-2924-6
1 Introduction
Polar codes, proposed by Arıkan’s breakthrough paper [1], are an exciting new class of channel codes
that can asymptotically achieve the capacity for symmetric binary-input discrete memoryless channels.
Because of its FFT-like structure and low O(N log N) complexity, where N denotes the code length, the successive cancellation (SC) decoding algorithm has become one of the most popular polar decoding algorithms. Nevertheless, compared with the maximum likelihood (ML) decoder [2], the decoding performance of the SC polar decoder still suffers from an evident degradation. To narrow the performance gap caused by the sub-optimality of the traditional successive cancellation decoder, the successive cancellation list (SCL) polar decoding algorithm was developed in [2, 3]. Simulation results have revealed that the SCL polar decoder can outperform low density parity check (LDPC) codes even within high error-rate regions. To realize the spectrum efficiency and low latency required by next-generation wireless systems, Refs. [4, 5] proposed two latency-reduced SCL decoders for polar codes.
The main drawback of the SCL polar decoder is that its complexity increases linearly with the list size L. To realize satisfactory performance, the SCL polar decoder suffers from a high complexity cost, especially when
* Corresponding author (email: [email protected])
† Xiao LIANG and Huizheng WANG have the same contribution to this work.
L is considerably large. According to simulations over the BPSK-AWGN channel [2], a list size L > 32 is required to achieve a frame error rate (FER) below 10^-5 when the cyclic redundancy check (CRC) length is 16 bits and the signal-to-noise ratio (SNR) is 2 dB. Owing to the O(LN log N) complexity of SCL decoding, a large L introduces huge complexity that obstructs the efficient implementation of the SCL decoder. To address this issue, a stochastic computing based SCL polar decoding algorithm is proposed in this paper.
Stochastic computing [6], as an approximate computing technique, has the potential to provide a significantly lower hardware footprint with high energy efficiency and scalability [7]. In the stochastic computing framework, a probability is represented by a bit-stream; hence, fundamental arithmetic operations such as additions and multiplications can be realized as simply as multiplexers (MUXs) and AND gates, respectively [8]. Moreover, even some complex arithmetic operations can be implemented with very elementary hardware logic [9, 10]. In [11], a fully parallel stochastic Markov chain Monte Carlo (MCMC) multiple-input multiple-output (MIMO) detector is developed, which achieves a higher throughput with a lower hardware cost than conventional MIMO detectors. Further, a corresponding design supporting iterative detection is proposed in [12]. Considering the low-complexity arithmetic and extensive computation in the SCL decoding algorithm, stochastic computing offers a colossal design space for SCL optimization owing to its advantages in area reduction and soft-error resiliency.
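As a brief illustration of these primitives (not taken from the paper; the stream length and probabilities are arbitrary), the following Python sketch emulates stochastic multiplication with a bitwise AND and scaled addition with a MUX driven by a fair select stream.

```python
import numpy as np

rng = np.random.default_rng(0)
l = 4096  # bitstream length (illustrative)

def to_stream(p, length=l):
    """Encode a probability p as a Bernoulli(p) bitstream."""
    return (rng.random(length) < p).astype(np.uint8)

a, b = to_stream(0.7), to_stream(0.4)

# Multiplication: an AND gate gives Pr(out = 1) = Pr(a = 1) * Pr(b = 1)
# for independent input streams.
prod = a & b

# Scaled addition: a MUX with a fair (p = 0.5) select stream gives
# Pr(out = 1) = (Pr(a = 1) + Pr(b = 1)) / 2.
sel = to_stream(0.5)
summed = np.where(sel == 1, a, b)

print(prod.mean(), summed.mean())  # roughly 0.28 and 0.55
```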
In [13], stochastic computing was incorporated into SCL decoders with L = 1, 2. Thanks to stochastic computing, the stochastic SCL (SSCL) decoder is expected to be an efficient paradigm that can realize complexity reduction, satisfactory performance, and fault tolerance. Nevertheless, applying stochastic computing to the SCL decoder directly leads to unacceptable performance degradation and latency. This paper is devoted to proposing solutions that reduce the hardware complexity of the SCL decoder. The major novelties of this paper are summarized as follows. First, channel message scaling and enlarging probability (EP) are proposed to mitigate the randomness loss of stochastic computing. Second, to reduce decoding latency, we propose a stochastic-based double-level (DL) decoding scheme that can estimate two bits at a time. Furthermore, a 2^p-level decoding method is developed to realize an arbitrary tradeoff between performance and latency. Third, based on distributed sorting (DS), we develop an adaptive DS (ADS) scheme that can compromise between different sorting scales in stochastic multiple-level decoding. Finally, the detailed hardware architecture for the proposed stochastic SCL decoder is developed. To the best of our knowledge, this is the first paper to implement a stochastic SCL decoder on a field programmable gate array (FPGA) platform. Compared with the deterministic decoding framework, it shows advantages in the balance between performance and complexity.
The remainder of this paper is organized as follows. Section 2 first reviews the SSCL decoding algorithm with L = 1 proposed in [13]. Section 3 presents the SSCL decoding with L = 2. Further, the SSCL decoding with L = 2^p is presented in Section 4. The implementation of the SSCL decoder is developed in Section 5. Comparison and conclusion are presented in Sections 6 and 7.
2 Stochastic SCL decoder for polar codes with single-level decoding method
We use the notation a_1^N as shorthand for the row vector (a_1, ..., a_N). Given a vector a_1^N, we write a_i^j, 1 ≤ i, j ≤ N, to denote the subvector (a_i, ..., a_j); if j < i, a_i^j is regarded as void. Given a_1^N and A ⊂ {1, ..., N}, we write a_A to denote the subvector (a_i : i ∈ A).
A polar code can be defined by the parameters (N, K, A, u_{A^c}) [1], where N, K, A, and u_{A^c} are the code length, the number of information bits, the set of information bits, and the frozen bits whose values are always zero, respectively. The transition probabilities of binary-input discrete memoryless channels (B-DMCs) are defined as W(y|x), and for simplicity, let (N, K) denote a polar code in this study.
For the polar encoding process, we denote n = log_2 N and $F \triangleq \begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix}$. Then the codeword x = [x_1, x_2, ..., x_N] can be generated from x = F^{⊗n} u. For SC decoding, the estimated bit sequence is defined as û = [û_1, û_2, ..., û_N]. If u_i is a frozen bit, we can simply assign û_i = 0. Otherwise, the decoded bit can
be determined by
$$ \hat{u}_i = \begin{cases} 0, & \text{if } W_N^{(i)}(y_1^N, \hat{u}_1^{i-1} \,|\, 0) > W_N^{(i)}(y_1^N, \hat{u}_1^{i-1} \,|\, 1), \\[2pt] 1, & \text{if } W_N^{(i)}(y_1^N, \hat{u}_1^{i-1} \,|\, 0) < W_N^{(i)}(y_1^N, \hat{u}_1^{i-1} \,|\, 1), \end{cases} \qquad (1) $$
where W_N^{(i)}(y_1^N, û_1^{i-1} | u_i = û_i) is defined as the bit channel transition probability, (y_1^N, û_1^{i-1}) denotes the output of W_N^{(i)}, and u_i is the input. The deterministic successive cancellation decoder calculates this decision metric by recursively utilizing two basic processing nodes, namely the f node and the g node, whose likelihood-ratio (LR) domain functions are
$$ f(a, b) = \frac{1 + ab}{a + b}, \qquad g(a, b, \hat{u}_{\mathrm{sum}}) = a^{\,1 - 2\hat{u}_{\mathrm{sum}}}\, b. \qquad (2) $$
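For reference, the node functions of (2) in the deterministic LR domain can be written down directly; this short sketch is only illustrative and is not part of the paper's hardware description.

```python
def f_node(a, b):
    """f node of (2) in the LR domain: f(a, b) = (1 + a*b) / (a + b)."""
    return (1.0 + a * b) / (a + b)

def g_node(a, b, u_sum):
    """g node of (2): g(a, b, u_sum) = a^(1 - 2*u_sum) * b, with u_sum in {0, 1}."""
    return (a ** (1 - 2 * u_sum)) * b

# Example with channel LRs a = 3.0 and b = 0.5 and partial-sum bit u_sum = 1.
print(f_node(3.0, 0.5), g_node(3.0, 0.5, 1))  # roughly 0.714 and 0.167
```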
Stochastic decoding utilizes bit-streams composed of '0's and '1's to represent a number, rather than the weighted binary representation used in conventional deterministic decoding. In stochastic computing, the proportion of 1's in the whole bitstream denotes the corresponding value. The first stochastic SC decoder for polar codes was proposed in [16], which showed that, when normalization is applied, the bit conditional probability denoted in (3) can be utilized as the decision metric for the SC polar decoder.
$$ p(u_i = \hat{u}_i \,|\, \hat{u}_1^{i-1}) = \begin{cases} \dfrac{W_N^{(i)}(y_1^N, \hat{u}_1^{i-1} \,|\, u_i = \hat{u}_i)}{\sum_{u_i \in \{0,1\}} W_N^{(i)}(y_1^N, \hat{u}_1^{i-1} \,|\, u_i)}, & \text{if } i \in A, \\[4pt] 1_{u_i = 0}, & \text{if } i \in A^c. \end{cases} \qquad (3) $$
According to the definition of the likelihood ratio (LR) [16], a = Pr(A=0)/Pr(A=1) and b = Pr(B=0)/Pr(B=1), so the equation of the f node can be written as
$$ f(a, b) = \frac{1 + ab}{a + b} = \frac{\Pr(A=0)\Pr(B=0) + \Pr(A=1)\Pr(B=1)}{\Pr(A=0)\Pr(B=1) + \Pr(A=1)\Pr(B=0)} = \frac{\Pr(F=0)}{\Pr(F=1)}, \qquad (4) $$
where Pr(F = 0) + Pr(F = 1) = 1. Therefore, we can get Pr_f = Pr(F = 1) = (1 − Pr(A = 1))Pr(B = 1) + Pr(A = 1)(1 − Pr(B = 1)).
Figure 1 SCL decoding steps with L = 2.
Figure 2 (Color online) Stochastic SCL decoding with L = 2.
To derive the stochastic version of the g node equation, we first set ûsum = 0; then
$$ g(a, b, 0) = ab = \frac{\Pr(A=0)\Pr(B=0)}{\Pr(A=1)\Pr(B=1)} = \frac{\dfrac{\Pr(A=0)\Pr(B=0)}{\Pr(A=1)\Pr(B=1)+\Pr(A=0)\Pr(B=0)}}{\dfrac{\Pr(A=1)\Pr(B=1)}{\Pr(A=1)\Pr(B=1)+\Pr(A=0)\Pr(B=0)}} = \frac{\Pr(G=0)}{\Pr(G=1)}, \qquad (5) $$
where both the numerator and the denominator are divided by Pr(A=1)Pr(B=1) + Pr(A=0)Pr(B=0) in order to meet the requirement that Pr(G=0) + Pr(G=1) = 1. Then we get Pr_g = Pr(G=1) = Pr_a Pr_b / ((1 − Pr_a)(1 − Pr_b) + Pr_a Pr_b). Secondly, we set ûsum = 1; similarly, Pr_g = Pr(G=1) = (1 − Pr_a) Pr_b / (Pr_a (1 − Pr_b) + (1 − Pr_a) Pr_b). In this instance, if we define b = 1_{ûsum=0}, the stochastic form of (2) can be rewritten as
$$ \Pr_f = (1-\Pr_a)\Pr_b + \Pr_a(1-\Pr_b), \qquad \Pr_g = \frac{\big[(1-\hat{u}_{\mathrm{sum}})\Pr_a + \hat{u}_{\mathrm{sum}}(1-\Pr_a)\big]\Pr_b}{\big[(1-\hat{u}_{\mathrm{sum}})(1-\Pr_a) + \hat{u}_{\mathrm{sum}}\Pr_a\big](1-\Pr_b) + \big[(1-\hat{u}_{\mathrm{sum}})\Pr_a + \hat{u}_{\mathrm{sum}}(1-\Pr_a)\big]\Pr_b}. \qquad (6) $$
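In bit-stream form, the f node reduces to an XOR gate and the g node to a JK flip-flop fed by AND gates, as in [16] and the mixed nodes described in Section 5. The following behavioural sketch (illustrative only; the stream length, seed, and probabilities are assumptions) checks that long streams reproduce Pr_f and Pr_g from (6).

```python
import numpy as np

rng = np.random.default_rng(1)
l = 1 << 14  # a long stream so the empirical ratios converge (illustrative)

def stream(p):
    return (rng.random(l) < p).astype(np.uint8)

def f_stream(a, b):
    """Stochastic f node: XOR, so Pr_f = (1 - Pa) * Pb + Pa * (1 - Pb)."""
    return a ^ b

def g_stream(a, b, u_sum):
    """Stochastic g node: JK flip-flop whose J/K inputs are AND gates of a and b
    (a is inverted when u_sum = 1), giving the regenerative ratio of (6)."""
    a_eff = a if u_sum == 0 else 1 - a
    q, out = 0, np.empty_like(a)
    for t in range(l):
        j = a_eff[t] & b[t]              # set condition
        k = (1 - a_eff[t]) & (1 - b[t])  # reset condition
        if j and not k:
            q = 1
        elif k and not j:
            q = 0
        out[t] = q                        # hold otherwise
    return out

pa, pb = 0.8, 0.6
a, b = stream(pa), stream(pb)
print(f_stream(a, b).mean())      # ~0.44 = 0.8*0.4 + 0.2*0.6
print(g_stream(a, b, 0).mean())   # ~0.857 = 0.48 / (0.48 + 0.08)
```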
From (6), the bit conditional probabilities in SC decoding can be updated by recursively applying
$$ \Pr_N^{(2i-1)}(y_1^N, \hat{u}_1^{2i-2}) = \Pr_{N/2}^{(i)}(y_1^{N/2}, \hat{u}_{1,o}^{2i-2} \oplus \hat{u}_{1,e}^{2i-2})\,\big(1 - \Pr_{N/2}^{(i)}(y_{N/2+1}^{N}, \hat{u}_{1,e}^{2i-2})\big) + \big(1 - \Pr_{N/2}^{(i)}(y_1^{N/2}, \hat{u}_{1,o}^{2i-2} \oplus \hat{u}_{1,e}^{2i-2})\big)\,\Pr_{N/2}^{(i)}(y_{N/2+1}^{N}, \hat{u}_{1,e}^{2i-2}), \qquad (7) $$
$$ \Pr_N^{(2i)}(y_1^N, \hat{u}_1^{2i-1}) = \frac{\Big((1-\hat{u}_{2i-1})\Pr_{N/2}^{(i)}(y_1^{N/2}, \hat{u}_{1,o}^{2i-2} \oplus \hat{u}_{1,e}^{2i-2}) + \hat{u}_{2i-1}\big(1 - \Pr_{N/2}^{(i)}(y_1^{N/2}, \hat{u}_{1,o}^{2i-2} \oplus \hat{u}_{1,e}^{2i-2})\big)\Big)\Pr_{N/2}^{(i)}(y_{N/2+1}^{N}, \hat{u}_{1,e}^{2i-2})}{\hat{u}_{2i-1}\Pr_N^{(2i-1)}(y_1^N, \hat{u}_1^{2i-2}) + (1-\hat{u}_{2i-1})\big(1 - \Pr_N^{(2i-1)}(y_1^N, \hat{u}_1^{2i-2})\big)}. \qquad (8) $$
Please note that these two formulas only calculate the bit conditional probability instead of the path probability P(û_1^i). After calculating the bit conditional probability as denoted in
$$ p(u_i = \hat{u}_i \,|\, \hat{u}_1^{i-1}) = \begin{cases} \Pr_N^{i}, & \text{if } i \in A \text{ and } \hat{u}_i = 1, \\ 1 - \Pr_N^{i}, & \text{if } i \in A \text{ and } \hat{u}_i = 0, \\ 1_{u_i = 0}, & \text{if } i \in A^c, \end{cases} \qquad (9) $$
in order to obtain the path probability as the path metric, all the bit conditional probabilities along one path, [p(u_1 = û_1), ..., p(u_N = û_N)], have to be multiplied as
$$ P(\hat{u}_1^i) = P(\hat{u}_1^{i-1})\, p(u_i = \hat{u}_i \,|\, \hat{u}_1^{i-1}) = \prod_{n=1}^{i} p(u_n = \hat{u}_n \,|\, \hat{u}_1^{n-1}). \qquad (10) $$
Take the SSCL decoding with L = 2 depicted in Figure 2 as an example. At each level, the SCL decoding tree expands the paths and updates the path probabilities, and then selects the paths with the largest L path probabilities instead of keeping only the best path. In general, at the i-th decoding level of the SSCL decoder, a total of 2L ordered probabilities P_{1,...,2L}(û_1^i) are calculated as the metric to select the L most probable candidates.
Figure 3 (Color online) The architecture for generating the path probability in stochastic SCL decoding.
Figure 4 (Color online) Comparison of different SC polar decoders.
Therefore, the path probabilities at the i-th decoding level can be calculated by multiplying the (i − 1)-th level's path probabilities with the corresponding ordered bit conditional probabilities. As revealed by (10), only one two-input AND gate is needed to implement this multiplication, with the corresponding architecture depicted in Figure 3. Moreover, the calculation of the conditional probability of each ordered bit entails the recursive operation between the f node and the g node. In this successive way, the most probable L candidates at each stage are always kept, and the best one at the last stage is selected.
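A behavioural sketch of this path-metric update (the names and toy probabilities are illustrative, not from the paper): the path stream is ANDed with the bit-conditional-probability stream as in Figure 3, a counter yields the PPM, and the L largest PPMs survive.

```python
import numpy as np

rng = np.random.default_rng(2)
l, L = 1024, 4  # bitstream length and list size (illustrative)

def stream(p):
    return (rng.random(l) < p).astype(np.uint8)

# Path probability update of (10): one two-input AND per path and per decoded bit.
old_path = stream(0.9)          # stream of P(u_1^{i-1})
bit_cond = stream(0.7)          # stream of p(u_i = u_hat_i | u_1^{i-1})
new_path = old_path & bit_cond  # stream of P(u_1^i), Pr ~ 0.63

# A counter converts the stream into a deterministic path probability metric (PPM).
ppm = int(new_path.sum())       # roughly 0.63 * l

# List management: among the 2L candidate PPMs, keep the L largest.
candidate_ppms = rng.integers(0, l, size=2 * L)
survivors = np.sort(candidate_ppms)[-L:]
print(ppm, survivors)
```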
As illustrated in Figure 4, compared with the deterministic design, the decoding performance of the stochastic SC decoder suffers from a sharp degradation. The reason is that the channel message represented by bitstreams loses randomness in high SNR regions. For stochastic LDPC decoders, an essential scaling method was proposed in [17], which provides a similar level of switching activity over different ranges of SNRs. This method can efficiently improve the bit error rate (BER) performance of stochastic decoders.
In the scaling method, the received channel reliabilities are scaled by a factor that is proportional to the noise in the channel. The scaled LLRs, where the LLR is calculated as
$$ \mathrm{LR}(y_i) = \log \frac{\Pr(x_i = +1 \,|\, y_i)}{\Pr(x_i = -1 \,|\, y_i)}, \qquad (11) $$
are independent of the channel noise; that is to say, the decoder does not need to estimate the noise in the channel. Furthermore, according to the work in [17], the scaled LLR for the i-th received symbol is LR′(y_i) = αN_0 LR(y_i), where the initial channel likelihood ratio is LR(y_i) = 4y_i/N_0 and N_0 is the single-sided noise power density. The channel message scaling approach thus introduces a scaling coefficient α, and the scaled channel likelihood ratio becomes LR′(y_i) = 4αy_i, which is now used for generating the input stochastic bitstreams for SC polar decoders. The input probability can therefore be rewritten as
$$ \Pr(y_i = 1) \approx \frac{1}{e^{-\mathrm{LR}'(y_i)} + 1} = \frac{1}{e^{-4\alpha y_i} + 1}. \qquad (12) $$
Considering that different choices of α result in different decoding performances, and that previous work [18] has shown that the best performance of the stochastic SC decoder is achieved with α = 0.5, the scaling factor α = 0.5 is employed in all numerical simulations in the remainder of this paper.
To verify the effectiveness of the channel message scaling method, simulation comparisons of the deterministic design, the stochastic design, and the stochastic design with scaling factor α = 0.5 are depicted in Figure 4. Simulation results for both the (64, 32) code and the (256, 128) code demonstrate that the stochastic SC decoders with the scaling approach achieve an improvement over the ones without.
The complexity of comparing path probabilities and of memory management grows drastically with L. Thus, the path selection in the SSCL decoder should be designed to reduce this heavy consumption. However, it is difficult to sort directly in the stochastic representation; in other words, the stochastic data streams of path probabilities should first be converted into deterministic data. Here, DS [20] is adopted for path selection. In order to convert stochastic data streams into deterministic values, a counter is applied after the path probability of each candidate is calculated. The counter counts the number of '1's in the stochastic stream, and this number is denoted as the path probability metric (PPM). Then the paths with the largest L PPM values are selected.
For clarity, the children from the same father node with the larger and smaller PPM are denoted as the first child (FC) and the next child (NC), respectively. In traditional direct sorting algorithms, such as insertion and bubble sorting, when the m-th maximum NC node and the m-th minimum FC node are found in the m-th round, the top m maximum NCs are all compared with the m-th minimum FC value. In the DS algorithm, this step is simplified to comparing only the m-th maximum NC and the m-th minimum FC, and the comparison result then decides whether the next round of comparison needs to be conducted. DS reduces the comparison complexity from O(L^2) to O(L), and an upper bound k on the number of rounds is set to reduce the latency from kL^2 to kL [20]. The selection of candidate paths with the DS algorithm is depicted in Figure 5. In our stochastic decoding system, the L FC PPM values are superior to the L NC PPM values at most levels of the binary tree; that is to say, even when the coefficient k is removed, the average number of comparison rounds required is still quite small, and the DS method has no performance gap compared with direct sorting.
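A minimal software model of the DS selection described above (the PPM values are made up for illustration, and the upper bound k of the hardware is not modelled):

```python
def distributed_sort(fc_ppms, nc_ppms):
    """DS sketch: keep the L largest PPMs out of L FCs and L NCs. The FCs start as
    the survivors; in each round the best remaining NC challenges the current worst
    survivor, and the first failed comparison ends the sorting."""
    survivors = list(fc_ppms)
    for cand in sorted(nc_ppms, reverse=True):   # at most L rounds
        worst = min(range(len(survivors)), key=lambda i: survivors[i])
        if cand <= survivors[worst]:
            break                                # every kept FC beats the remaining NCs
        survivors[worst] = cand                  # the NC replaces the worst FC
    return survivors

# L = 4 example: only one NC is good enough, so DS stops after two rounds.
print(sorted(distributed_sort([700, 650, 620, 300], [640, 500, 200, 100])))
# -> [620, 640, 650, 700]
```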
For the stochastic polar SCL decoder, the scaling factor is set to α = 0.5, the bitstream length is chosen as l = 1024, and the doubling probability approach is employed in the stochastic system. Figure 6 shows the numerical results for different decoders with a (256, 128) codeword. From Figure 6, it can be observed that the performance of DS-based SSCL decoding with list size 2L is similar to that of deterministic SCL decoding with list size L.
The latency of the DS algorithm is related to the average number of comparison rounds. For the (256, 128) polar decoder with L = 8, the average comparison rounds of DS are illustrated in Figure 7. When the SNR is in [1.5, 3.0] dB, the average number of rounds is no larger than 1.76; hence, the low latency of the DS algorithm is guaranteed.
Figure 5 (Color online) Candidate paths' selection in the polar SCL decoder (L = 4).
Figure 6 (Color online) Performance comparison of (256, 128) deterministic and stochastic decoders with single-level DS (l = 1024).
Figure 7 Comparisons in the (256, 128) stochastic SCL decoder with single-level DS decoding (L = 8, l = 1024).
3 Stochastic SCL decoder for polar codes with double-level decoding method
The biggest problem of the SSCL decoder is its decoding latency. With the single-level decoding described in Section 2, decoding every bit of a stochastic polar code consumes a number of clock periods that almost equals the length of the stochastic bitstream. To deal with this dilemma, a double-level decoding method is proposed in this section to reduce the decoding latency. Each time, two bits are estimated, no matter whether they are information bits or frozen bits, and the feedback signal is then utilized to calculate the next two bits.
This double-level method is very similar to the pre-computation scheme in deterministic SC decoding [21]. Ref. [21] proposed a pre-computation scheme for all Pr_j^{(2i)}(y_1^j, û_1^{2i-1}) calculations, j = 1, 2, ..., N/2, N, in order to save one clock period when estimating one bit value in deterministic decoding. In stochastic calculation, our proposed double-level scheme only employs pre-computation in Pr_N^{(2i)}(y_1^N, û_1^{2i-1}) to ensure that the two neighbouring bit conditional probabilities p(u_{2i-1} = û_{2i-1} | û_1^{2i-2}) and p(u_{2i} = û_{2i} | û_1^{2i-1}) are output simultaneously. Since decoding each bit conditional probability needs almost l clock periods, the double-level scheme can save Nl/2 clock periods. In stochastic decoding, pre-computation in the other stages of the Pr_j^{(2i)}(y_1^j, û_1^{2i-1}) calculation, j = 1, 2, ..., N/2, is meaningless.
Figure 8 (Color online) Double-level decoding scheme, with frozen and information bits marked on the decoding tree.
The double-level decoding scheme is illustrated in Figure 8. For each father node, there are two children at the first level (the (2i − 1)-th level) and four sub-children (the children of the children nodes) at the second level (the 2i-th level). According to the different distributions of information bits and frozen bits, there are three cases in the double-level scheme. As shown in Figure 8, the first and second cases have a frozen bit and an information bit in the two neighbouring levels; then DS is utilized to select the paths with the largest L PPM values from 2L children. The third case has information bits at both levels, and an adaptive DS is then employed to select L candidates from 4L children. Details of the adaptive DS are given in Subsection 3.2. In double-level SCL decoding, every two levels need one sorting process.
The ADS method is proposed to meet the sorting requirement of stochastic double-level SCL decoding. Employing the ADS algorithm, the selection of candidate paths from 4L down to L is illustrated in Figure 9. For each father node, the four sub-children are divided into one FC and three NCs. The ADS algorithm is composed of at most L − 1 rounds of comparison; hence, it is dynamic and approximates DS.
Figure 9 (Color online) Candidate paths' selection in the polar SCL decoder with double-level decoding (L = 4).
Firstly, all L FCs are picked out, and the remaining 3L nodes are NCs.
In the first round of comparison, the NC with the maximal PPM (NC1) and the FC with the minimal PPM (FC3) are picked out (requiring 4L − 2 comparisons), and then NC1 and FC3 are compared (requiring one comparison). If the PPM of FC3 is larger, the sorting is finished with the selected candidates FC1, FC2, FC3, and FC4. In this case, only one round of comparison is required, with 4L − 1 PPM comparisons. Otherwise, FC3 is replaced by NC1, and these two PPMs are not compared again in the following process. The second round then operates in a similar pattern: the NC with the sub-maximal PPM (NC10) and the FC with the sub-minimal PPM (FC2) are picked out (requiring 4L − 4 comparisons), and then NC10 and FC2 are compared (requiring one comparison). If the PPM of FC2 is larger, the sorting process finishes with the selected candidates FC1, FC2, NC1, and FC4. In this case, two rounds of comparison are required, with 8L − 4 PPM comparisons. Otherwise, FC2 is replaced by NC10, the final round of comparison repeats the above process, and the final candidates are obtained after 12L − 9 PPM comparisons.
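The selection just described can be modelled compactly in software; the sketch below is a behavioural equivalent of the ADS rounds (the PPM values are illustrative, and the per-round comparison counts of the hardware are not modelled).

```python
def ads_select(fc_ppms, nc_groups):
    """ADS sketch for double-level decoding: each father contributes one FC (its best
    sub-child) and three NCs. Starting from the L FCs, the globally best remaining NC
    replaces the current worst survivor until a comparison fails (at most L-1 rounds)."""
    survivors = list(fc_ppms)
    remaining = sorted((x for grp in nc_groups for x in grp), reverse=True)
    for cand in remaining:
        worst = min(range(len(survivors)), key=lambda i: survivors[i])
        if cand <= survivors[worst]:
            break                                 # sorting finished; the FCs win
        survivors[worst] = cand
    return survivors

# L = 4 example: father 0 dominates, so three of its NCs displace the weakest FCs.
fc  = [900, 400, 350, 300]
ncs = [[880, 860, 840], [120, 110, 100], [90, 80, 70], [60, 50, 40]]
print(sorted(ads_select(fc, ncs)))  # -> [840, 860, 880, 900]
```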
To reasonably evaluate the performance of the proposed stochastic SCL decoder, numerical simulations are carried out for the 1/2-rate (1024, 512) code, with scaling factor α = 0.5 and bit-stream length l = 1024. As shown in Figure 10, owing to the random fluctuations of stochastic decoding, a performance gap between deterministic SCL decoding and SSCL decoding does exist. Nevertheless, the performance of double-level stochastic SCL (DL-SSCL) decoding with list size 2L can transcend that of the single-level deterministic SCL decoder with list size L. Furthermore, the performance gap between the single-level and double-level stochastic decoding schemes results from multiplying two bit conditional probabilities into the path probability simultaneously: the enlarging probability approach in double-level decoding is applied half as often as in the single-level scheme. Accordingly, the randomness of the PPM in the double-level scheme is worse than that in the single-level scheme, but the resulting gap between the stochastic decoders with list size 2, depicted in Figure 10, is still acceptable.
Figure 10 (Color online) Performance comparison of the 1/2-rate (1024, 512) deterministic SCL decoder and stochastic SCL decoder with different decoding levels.
Figure 11 (Color online) The sum of rounds in the SCL decoder with double-level DS, 1/2-rate (1024, 512) decoding (L = 4).
In stochastic double-level SCL decoding, the ADS method needs to select the L best paths from either 2L or 4L children. These two kinds of sorting require different numbers of comparisons in each round. Therefore, the latency of ADS is related to the average number of comparison rounds. For the (1024, 512) stochastic polar decoder with L = 4, the sum of comparison rounds is shown in Figure 11: the red line presents the sum of comparison rounds resulting from selecting the L best paths from 4L children, and the black line denotes the sum of comparison rounds resulting from selecting the L best paths from 2L children.
According to Figures 5 and 9, the DS from 2L or 4L down to L approximately requires 2L or 4L comparisons per round, respectively. Using 2L comparisons per round as the evaluation standard, Figure 12 summarizes the average number of rounds for each information bit in double-level decoding. When the SNR is in [1.5, 3.0] dB, the average number of rounds is no larger than 1.47, so the low latency of the ADS algorithm is guaranteed. Hence, the ADS can compromise between different sorting scales in stochastic double-level decoding, as concluded in Algorithm 1. The FER performance comparison with other FPGA-based SCL decoders is given in Figure 13.
4 Stochastic SCL decoder for polar codes with 2^p-level decoding method
As analyzed in Section 3, a multiple-level scheme on the binary tree is efficient for the SSCL decoder, for it can estimate several bits simultaneously rather than bit by bit. However, the multiple-level scheme causes a larger sorting scale. Direct sorting cannot afford such large-scale complexity, and the dramatically increasing sorting complexity would cancel out the advantages brought by stochastic computing. To this end, the ADS is improved to keep the complexity of multiple-level decoding at O(L).
Figure 12 Average rounds in the SCL decoder with single-/double-level DS, 1/2-rate (1024, 512) decoding (L = 4).
Figure 13 (Color online) The FER performance of different SCL decoders with the 1/2-rate (1024, 512) codeword (L = 4, l = 1024).
The multiple-level scheme provides different depths of pre-computation in the stochastic decoding process. Denote by p the number of stages that perform pre-computation operations. In the single-level scheme, no pre-computation is employed in the decoding process, that is to say, p = 0 and 2^p = 1. Therefore, the decoding binary tree selects L candidates bit by bit. In the double-level scheme, one stage of pre-computation is applied, with [Pr_N^{(2i)}(y_1^N, 0), Pr_N^{(2i)}(y_1^N, 1)] pre-calculated without feedback information, so p = 1 and 2^p = 2. Therefore, the decoding binary tree selects L candidates from 2L or 4L nodes every two levels. And so on: when the number of pre-computation stages increases to two (p = 2, 2^p = 4), [Pr_{N/2}^{(i)}(y_1^{N/2}, 0), Pr_{N/2}^{(i)}(y_1^{N/2}, 1)] and the corresponding [Pr_N^{(2i)}(y_1^N, 0), Pr_N^{(2i)}(y_1^N, 1)] are pre-calculated without feedback information. The decoding binary tree then selects L candidates from 2L, 4L, 8L, or 16L nodes every four levels (2^p = 4).
In general, when the number of pre-computation stages is p, the 2^p-level scheme of the SSCL decoder is obtained. The last p stages, from [Pr_{N/2^{p-1}}^{(i/2^{p-2})}(y_1^{N/2^{p-1}}, 0), Pr_{N/2^{p-1}}^{(i/2^{p-2})}(y_1^{N/2^{p-1}}, 1)] (p ≥ 2) up to the corresponding [Pr_N^{(2i)}(y_1^N, 0), Pr_N^{(2i)}(y_1^N, 1)], need to be pre-calculated without feedback information. Because the information bits are dispersed in the N-bit code, the sorting scale is dynamic: the decoding binary tree selects L candidates from 2L, 4L, ..., 2^{2^p} L nodes every 2^p levels. The proposed 2^p-level scheme reduces the decoding latency from Nl to Nl/2^p, which really improves the efficiency of stochastic decoding. Because data are processed bit by bit in stochastic computation, the additional computational elements for pre-computation are negligible; more details of the architecture design are presented in Section 5.
In the 2^p-level decoding scheme, the sorting scale is dynamic and large. The worst situation is selecting the L best candidates from 2^{2^p} L nodes. The DS scheme picks the L FC nodes from the L paths, and then sequentially chooses the worst FC and the best NC and compares them until the candidate set no longer changes. However, when p is large, the number of NC nodes is (2^{2^p} − 1)L, and choosing the best NC by comparison becomes difficult. To this end, in the first round, the L best NCs are picked from the L paths in parallel, and these L NCs are then compared to select the best one; this round needs (2^{2^p} − 1)L NC comparisons. In the following rounds, only the path of the last selected NC needs to be compared, so these rounds need only (2^p + L) NC comparisons. Algorithm 2 shows that only the first round requires (2^{2^p} − 1)L NC comparisons (line 7), while the following rounds need (2^p + L) NC comparisons (lines 10 and 18).
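To make these comparison counts concrete, a few values under the stated formulas (purely a worked illustration of the expressions, not measured data):

```python
L = 4
for p in (1, 2, 3):
    children = 2 ** (2 ** p)              # sub-children per father after 2^p levels
    first_round = (children - 1) * L      # NC comparisons in the first round
    later_rounds = 2 ** p + L             # NC comparisons in each following round
    print(f"p={p}: {children} children/father, "
          f"{first_round} first-round and {later_rounds} later-round NC comparisons")
```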
In stochastic 2^p-level SCL polar decoding, when p increases, the decoding latency is reduced from Nl to Nl/2^p. However, increasing p also causes a performance loss to some extent. As shown in Figure 10, there is a slight performance gap between the double-level scheme and the single-level scheme. To update the path probability every 2^p levels, the 2^p bit probabilities are multiplied together and then multiplied with the old path probability. Meanwhile, to keep the probabilities for stochastic computing in a reasonable range, the enlarging probability approach is employed: every time the path probability is updated, we re-scale all the path probabilities with the same multiplicative factor so that the best PPM is kept in the range [0.5, 1]. However, the number of re-scaling operations decreases as p grows. Consequently, multiplying the 2^p bit probabilities at the same time with only one re-scaling step makes the stochastic randomness worse as p increases. To conclude, increasing p lowers the latency of stochastic 2^p-level SCL polar decoding at the expense of a slight performance loss. Therefore, the selection of p depends on the realistic situations and requirements of designers.
Figure 14 (Color online) The bit stream generation module.
Figure 15 (Color online) The architecture of the proposed mixed node I.
5 Hardware architecture of the proposed stochastic SCL decoder
Considering the tradeoff between latency and performance of the SCL decoder, we employ p = 1 in our proposed architecture design. This section focuses on the design details of every module in the proposed SSCL decoder architecture.
As illustrated in Figure 14, the deterministic value x is compared with a random number R, R ∈ [0, 1]. In reality, R is pseudo-random, generated by a linear feedback shift register (LFSR). The stochastic bitstream output of the comparator is denoted by x_s(t): if x > R, x_s(t) = 1; otherwise, x_s(t) = 0. Consequently, the deterministic value of a stochastic stream x_s is obtained by x = (1/l) \sum_{t=1}^{l} x_s(t), where l denotes the length of the stochastic bitstream. This module converts the deterministic value of the channel transition probability into stochastic streams. To decode an N-bit polar code, N parallel stochastic bit stream generation (SG) modules are required as the interface to the main stochastic decoder. Since the N parallel SG modules share the same pseudo-random sequence, the complexity of this interface is determined by that of the comparators, which depends on the bitstream length l. Hence, the complexity of this interface module is O(Nl).
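A software model of the SG interface (the LFSR polynomial, seed, and the use of (12) for the input probability are illustrative assumptions; the paper only specifies an LFSR of log2(l) bits and a comparator):

```python
import math

def lfsr_sequence(length, nbits=10, taps=(10, 7), seed=1):
    """Fibonacci LFSR producing pseudo-random integers in [0, 2^nbits - 1].
    Taps (10, 7) give a maximal-length 10-bit sequence (assumed for illustration)."""
    state, out = seed, []
    for _ in range(length):
        out.append(state)
        fb = 0
        for t in taps:
            fb ^= (state >> (t - 1)) & 1
        state = ((state << 1) | fb) & ((1 << nbits) - 1)
    return out

def sg_module(x, l=1024, nbits=10):
    """SG module: emit 1 whenever x exceeds the scaled pseudo-random number R,
    so the fraction of 1's in the stream approximates x."""
    return [1 if x > r / (1 << nbits) else 0 for r in lfsr_sequence(l, nbits)]

alpha, y = 0.5, 0.8                           # scaling factor and a received symbol
p_in = 1.0 / (math.exp(-4 * alpha * y) + 1)   # input probability from (12)
bits = sg_module(p_in)
print(p_in, sum(bits) / len(bits))            # the stream mean tracks p_in
```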
Through adopting a pre-computation similar to [21], a mixed node I is defined here to realize single-level decoding. More specifically, the f node and the g node are merged in mixed node I to make the whole decoder more succinct. On the basis of the elementary nodes presented in [16], mixed node I is depicted in Figure 15. This node can execute the functions of the f and g nodes based on stochastic computing. Table 1 summarizes the hardware complexity of each mixed node I. Although mixed node I can complete the general stochastic element processing, it is unsuitable for realizing the proposed double-level decoding.
Figure 16 (Color online) Mixed node II for the last stage, also mixed node III for the first stage in SCL decoding.
To realize double-level decoding, mixed node II, which can improve the efficiency of stochastic polar decoding, is proposed. To implement the proposed double-level decoding algorithm, the stochastic bitstream of p(u_{2i} = û_{2i} | û_1^{2i-1}) needs to be pre-computed so that it is output simultaneously with the stream of p(u_{2i-1} = û_{2i-1} | û_1^{2i-2}); hence, the node module in the last stage should be changed to mixed node II when performing double-level decoding.
As shown in Figure 16, mixed node II (the architecture in the blue box) is employed in the last stage of the SCL decoder. According to the property of probabilities, only an inverter is required to implement the complementary operation; the inverter in Figure 16 calculates the stochastic stream of p(û_{2i-1} = 0 | û_{sum}). Unlike mixed node I, which outputs only one bit stream, mixed node II simultaneously outputs four bit streams for the products of two adjacent bit probabilities, namely p(û_{2i} = 1, û_{2i-1} = 1 | û_{sum}), p(û_{2i} = 1, û_{2i-1} = 0 | û_{sum}), p(û_{2i} = 0, û_{2i-1} = 1 | û_{sum}), and p(û_{2i} = 0, û_{2i-1} = 0 | û_{sum}).
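Behaviourally, the four outputs can be modelled as AND combinations of the odd-bit stream, its inverted version, and the two pre-computed even-bit streams; the sketch below is an assumption-level model (independent streams, made-up probabilities), not the gate-level circuit of Figure 16.

```python
import numpy as np

rng = np.random.default_rng(3)
l = 1 << 14

def stream(p):
    return (rng.random(l) < p).astype(np.uint8)

s_odd   = stream(0.7)   # p(u_{2i-1} = 1 | feedback)
s_even0 = stream(0.2)   # pre-computed p(u_{2i} = 1 | u_{2i-1} = 0)
s_even1 = stream(0.9)   # pre-computed p(u_{2i} = 1 | u_{2i-1} = 1)

inv_odd = 1 - s_odd      # the single inverter: stream for p(u_{2i-1} = 0)

joint_11 = s_odd   & s_even1        # p(u_{2i} = 1, u_{2i-1} = 1)
joint_10 = inv_odd & s_even0        # p(u_{2i} = 1, u_{2i-1} = 0)
joint_01 = s_odd   & (1 - s_even1)  # p(u_{2i} = 0, u_{2i-1} = 1)
joint_00 = inv_odd & (1 - s_even0)  # p(u_{2i} = 0, u_{2i-1} = 0)

print([round(float(s.mean()), 3) for s in (joint_11, joint_10, joint_01, joint_00)])
# roughly [0.63, 0.06, 0.07, 0.24]; the four values sum to 1
```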
The SCL decoding architecture consists of L SC decoders in parallel. However, this parallel design extends the hardware complexity dramatically. For the first stage of SCL decoding, only three outputs are possible for each node module (one f node calculation result and two possible g node calculation results); that is to say, the first stage does not need the L-parallel architecture. Therefore, mixed node III is proposed to output all possible results for the second-stage calculation. As shown in Figure 16, mixed node III (the architecture in the green box) is applied in the first stage of the SCL decoder.
As illustrated in Figure 17, the architecture of the stochastic SC decoder with double-level decoding is composed of two blocks, the interface and the main decoder. The figure presents an example of an 8-bit stochastic decoder; an N-bit decoder obeys the same design. The interface includes N SG modules, which generate the stochastic bit streams of the channel bit probabilities as the inputs of the main decoder.
The main decoder consists of three parts: the node processing, estimation, and feedback modules. The node processing of an N-bit SSCL decoder consists of (N − 2) mixed nodes I and one mixed node II over a total of log2 N stages. For double-level decoding, mixed node II is applied at the last stage of the decoder.
Figure 17 (Color online) The 8-bit stochastic SC decoder with double-level decoding.
Figure 18 (Color online) The N -bit stochastic SCL decoder with double-level decoding.
Because stochastic bit streams are not suitable for direct comparison, four counters are employed to convert the stochastic streams into the corresponding deterministic data, and a maximum module then selects the best probability. The Output block in Figure 17 is an estimation module that outputs the path bits û_{2i-1} and û_{2i} corresponding to the selected probability.
The feedback part is utilized to produce the û_{sum} signal for the g node calculation. Considering that this architecture has been well researched [21], we omit its details here. It is worth mentioning that, for stochastic decoding, the clock of the feedback part is different from that of the stochastic node processing framework: the clock period of the feedback part is l times that of the stochastic calculation.
As shown in Figure 18, the architecture of the SSCL decoder with the double-level scheme is similar to that of the SC design. The interface block includes N SG modules, while the main decoder expands to L parallel basic frames. However, the node modules in the first stage can only output three possible probabilities, as mentioned in Subsection 5.4. To this end, N/2 mixed nodes III are employed in the first stage, instead of NL/2 expanded mixed nodes I. All nodes in stages 2 to n expand to L parallel modules. LC denotes the list core module, which uses the ADS method to select the L best paths from 4L candidates.
Figure 19 (Color online) The architecture of the list core module with the ADS method and enlarging probability approach.
The list core module implements the improved DS and the enlarging probability approach in the proposed SSCL decoder. To realize the ADS method, the PPMs are categorized into two groups, FCs and NCs. Subsequently, the L FCs are saved into memory block II, and the 3L NCs are stored in memory block I. As presented in the ADS method, the PPMs in memory block II have higher priority to be selected. As illustrated in Figure 19, two multiplexers respectively read data from memory blocks I and II in order in each sorting round, and select the best PPM NCi from block I and the worst PPM FCj from block II. If NCi is better, the control signal is activated to execute instructions ① and ②: instruction ① removes NCi from block I, and instruction ② replaces the value of FCj with that of NCi. The next round of sorting then remains active. If FCj is better, the sorting process is over, the data in memory block II are the selected L best paths, and instruction ③ is executed. Instruction ③ is an indicator of the end of sorting and the beginning of the enlarging probability step, and the corresponding L best paths are output.
Instruction ③ also starts up the enlarging probability approach. If the best selected path metric is smaller than l/2 (i.e., the corresponding probability is smaller than 0.5), instruction ④ executes left-shift operations on all PPMs in block II, and instruction ⑤ then employs L SG modules to regenerate the enlarged PPM stochastic streams. If the best selected path metric is larger than l/2, instruction ⑤ is executed directly.
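A behavioural sketch of this enlarging probability step (the function name is illustrative; the power-of-two doubling mirrors the left-shift operations of instruction ④):

```python
def enlarge_ppms(ppms, l):
    """Left-shift (double) all PPMs by the same amount until the best one corresponds
    to a probability of at least 0.5, i.e. a count of at least l/2."""
    shift = 0
    while ppms and (max(ppms) << shift) < l // 2:
        shift += 1
    return [p << shift for p in ppms]

# l = 1024: the best PPM (130 < 512) is doubled twice; every path uses the same
# factor, so the relative ordering of the surviving paths is preserved.
print(enlarge_ppms([130, 90, 40, 10], 1024))  # -> [520, 360, 160, 40]
```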
Figure 20 (Color online) Decoding schedule for the N-bit stochastic SCL decoder with double-level decoding.
As shown in Figure 20, the decoding schedule consists of three parts: node processing, DS, and the enlarging probability approach. When decoding every two code bits û_{2i} and û_{2i+1}, all stochastic streams flow through n stages (n = log2 N), and the latency is (n + l − 1) clock cycles. The latency of DS is dynamic, and more details are presented in Subsection 2.5. The enlarging probability approach includes an SG module, and the latency of its output is l clock cycles. When decoding an N-bit codeword, the latency L is
$$ L = (n + l - 1) \times \frac{N}{2} + L(\mathrm{DS}) \approx \frac{Nl}{2} + L(\mathrm{DS}). \qquad (14) $$
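Plugging representative numbers into (14) (purely a worked example of the formula, not a reported measurement):

```python
N, l = 1024, 1024
n = N.bit_length() - 1                 # n = log2(N) = 10
latency_exact = (n + l - 1) * N // 2   # node-processing part of (14)
latency_approx = N * l // 2            # the N*l/2 approximation
print(latency_exact, latency_approx)   # 528896 vs 524288 clock cycles, plus L(DS)
```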
6 Comparison
Table 2 compares the proposed SSCL decoder with other FPGA-based SCL decoders. Compared with the deterministic SCL designs [20, 22], the single-level SSCL only requires around 8% and 5.6% of the ALMs and 20.8% and 14.2% of the registers, respectively, at the expense of increased latency and reduced throughput. With double-level decoding, the SSCL can efficiently improve its throughput at a slight cost in hardware consumption. Moreover, the FER performances of these SCL decoders are shown in Figure 13. Compared with [20, 22], the single-level SSCL decoder has a loss of about 0.5 and 1 dB, respectively, and the loss of the double-level scheme is a little larger. Although double-level decoding expends a little more hardware resources than single-level decoding, it achieves much better throughput and latency. Therefore, our proposed SSCL decoder can successfully realize a good tradeoff between hardware cost and decoding performance.
7 Conclusion
Stochastic SCL polar decoders are proposed in this paper, and approaches that improve their decoding performance have been discussed. It is shown that the stochastic SCL decoder can achieve error-correcting performance similar to its deterministic counterpart with considerable hardware reduction, at the expense of increased latency. Hence, it achieves a suitable tradeoff between hardware cost and error performance. To our knowledge, this is the first stochastic SCL polar decoder, and it has potential applications in wearable and IoT devices that do not require high speed but have strict hardware constraints.
Acknowledgements This work was supported in part by National Key R&D Program of China (Grant No. 2020YFB220-
5503), National Natural Science Foundation of China (Grant Nos. 61871115, 61501116), Jiangsu Provincial NSF for
Excellent Young Scholars (Grant No. BK20180059), Six Talent Peak Program of Jiangsu Province (Grant No. 2018-
DZXX-001), Distinguished Perfection Professorship of Southeast University, Fundamental Research Funds for the Central
Universities, and Student Research Training Program of Southeast University.
References
1 Arıkan E. Channel polarization: a method for constructing capacity-achieving codes for symmetric binary-input memoryless channels. IEEE Trans Inform Theor, 2009, 55: 3051–3073
2 Tal I, Vardy A. List decoding of polar codes. In: Proceedings of the IEEE International Symposium on Information
Theory Proceedings, Petersburg, 2011. 1–5
3 Chen K, Niu K, Lin J R. List successive cancellation decoding of polar codes. Electron Lett, 2012, 48: 500–501
4 Zhou H Y, Zhang C, Song W Q, et al. Segmented CRC-aided SC list polar decoding. In: Proceedings of 2016 IEEE
83rd Vehicular Technology Conference (VTC Spring), 2016. 1–5
5 Shen Y F, Zhang C, Yang J M, et al. Low-latency software successive cancellation list polar decoder using stage-located
copy. In: Proceedings of IEEE International Conference on Digital Signal Processing (DSP), 2016. 84–88
6 Gaines B R. Stochastic computing systems. In: Advances in Information Systems Science. Boston: Springer, 1969.
37–172
7 Moons B, Verhelst M. Energy-efficiency and accuracy of stochastic computing circuits in emerging technologies. IEEE
J Emerg Sel Top Circuits Syst, 2014, 4: 475–486
8 Brown B D, Card H C. Stochastic neural computation. I. Computational elements. IEEE Trans Comput, 2001, 50:
891–905
9 Wang H Z, Zhang Z C, You X H, et al. Low-complexity Winograd convolution architecture based on stochastic
computing. In: Proceedings of the IEEE International Conference on Digital Signal Processing, Shanghai, 2018. 1–5
10 Ceroici C, Gaudet V C. FPGA implementation of a clockless stochastic LDPC decoder. In: Proceedings of the IEEE
Workshop on Signal Processing Systems, Belfast, 2014. 1–5
11 Chen J N, Hu J H, Sobelman G E. Stochastic MIMO detector based on the Markov chain Monte Carlo algorithm.
IEEE Trans Signal Process, 2014, 62: 1454–1463
12 Chen J N, Hu J H, Sobelman G E. Stochastic iterative MIMO detection system: algorithm and hardware design. IEEE
Trans Circ Syst I, 2015, 62: 1205–1214
13 Liang X, Zhang C, Xu M H, et al. Efficient stochastic list successive cancellation decoder for polar codes. In:
Proceedings of the IEEE International System-on-Chip Conference, Beijing, 2015. 421–426
14 Tal I, Vardy A. List decoding of polar codes. In: Proceedings of the IEEE International Symposium on Information
Theory Proceedings, Petersburg, 2011. 1–5
15 Bakulin M, Kreyndelin V, Rog A, et al. MMSE based K-best algorithm for efficient MIMO detection. In: Proceedings
of the International Congress on Ultra Modern Telecommunications and Control Systems and Workshops, Munich,
2017. 258–263
16 Yuan B, Parhi K K. Successive cancellation decoding of polar codes using stochastic computing. In: Proceedings of
the IEEE International Symposium on Circuits and Systems, Lisbon, 2015. 3040–3043
17 Tehrani S S, Mannor S, Gross W J. Fully parallel stochastic LDPC decoders. IEEE Trans Signal Process, 2008, 56:
5692–5703
18 Xu Z L, Niu K. Successive cancellation decoders of polar codes based on stochastic computation. In: Proceedings of
the IEEE 25th Annual International Symposium on Personal, Indoor, and Mobile Radio Communication, Washington,
2014. 908–912
19 Zhang Z Y, Zhang L, Wang X B, et al. A split-reduced successive cancellation list decoder for polar codes. IEEE J
Sel Areas Commun, 2016, 34: 292–302
20 Liang X, Yang J M, Zhang C, et al. Hardware efficient and low-latency CA-SCL decoder based on distributed sorting.
In: Proceedings of the IEEE Global Communications Conference, Washington, 2016. 1–6
21 Zhang C, Parhi K K. Low-latency sequential and overlapped architectures for successive cancellation polar decoder.
IEEE Trans Signal Process, 2013, 61: 2429–2441
22 Xiong C, Zhong Y, Zhang C, et al. An FPGA emulation platform for polar codes. In: Proceedings of the IEEE
Workshop on Signal Processing Systems, 2016. 148–153