Low-Power Design of Reed-Solomon Encoders: Wei Zhang, Jing Wang Xinmiao Zhang
Low-Power Design of Reed-Solomon Encoders: Wei Zhang, Jing Wang Xinmiao Zhang
0
g 2 1 t
g
1
g 2 2 t
g
( ) f x
( ) c x
0
a
b
Fig. 1. Asymmetric encoder
a
b
D D D D
1
g
t
g
( ) f x
( ) c x
0
a
b
1
and we can get it by transforming
0
directly. The way
of elements generating is
0
= 1,
j
= (
j1
<< 1) mod p(x) (j Z) (2)
where Z is the integer set. The inversion of (2) is
(p(x)q +
j
) >> 1 =
j1
(j Z) (3)
where q is the quotient. We get
2
m
2
=
1
= (p(x) 1 +
0
) >> 1 (4)
in (4), q is always equal to 1. Consequently, the last element is
equal to the most signicant m bits of p(x). It can be proved
that if a 5-tuple representation of the element is similar to
p(x), there would be more redundant items in the product and
low power CMs can be obtained. We denote the multipliers
described in (b) and (c) by the set R.
As an example, for b(x) =
30
= x
4
+x, p(x) = x
5
+x
2
+1,
since the word-level addition is the bit-wise XOR operation
over the nite eld, by eliminating redundant items and
1561
TABLE I
XOR GATES COMPLEXITY IN DIFFERENT PRIMITIVE POLYNOMIALS
GF(2
5
) GF(2
8
) GF(2
10
)
prime 37 47 351 451 1153 2041
(m + 1)-Tuple 100101 101111 101011111 111000011 10010011111 11111111001
Weight 3 5 7 5 7 9
Area of CM
1
1 3.5 5.5 3 1 7.5
Area of CM
(2
m
2)
1 3.5 5.5 3 1 7.5
Mean area of all CM 6.8 6.1 17.4 17.8 27.9 27.3
Variance(normalization) 4.4 1 1 2.03 2.23 1
TABLE II
RESULTS IN ALGORITHM A STEP 2
d g
0
g
1
g
2
g
3
g
4
l
7
3
16
30
0
2
8
7
19
0
1
12
23
10
0
1
14
0
14
0
0
intermediate variables, we have the following simple results:
4
=
0
;
3
=
4
;
2
=
3
;
1
=
2
0
;
0
=
1
; (5)
Power of CMs over GF(2
5
) and GF(2
6
) are shown in Fig.
3, with their primes to be 37 and 67 respectively (In brief,
p(x) = x
5
+ x
2
+ 1 also can be considered as a prime 37).
The corresponding power of CMs for a symmetric encoder
with t = 2 are marked in Fig. 3. Even the number of CMs in
symmetric encoder is t, much less than that of an asymmetric
counterpart, the former do not have the lowest power in
the 2
m
1 CMs. Hence there can be other possibilities in
asymmetric encoders that lead to more low-power.
B. Primitive Polynomial
Table I shows the effect of primitive polynomials on the
CMs over GF(2
5
), GF(2
8
) and GF(2
10
). Each AND gate
requires 3/4 the area of an XOR. The area consumptions to be
equivalent XOR gate complexities are listed [8]. Obviously, the
hardware requirements in
1
,
2
m
2
R are much less than
the mean area of all the multipliers. What should be noticed
is that the CMs with high-weighted p(x) have less means
and variances over the whole elds while the low-weighted
lead the CMs in R to reduce circuit complexity and power
consumption more effectively. That is because higher weights
of primitive polynomials tend to increase the complexity
of the expression between input and output. Nevertheless,
more intermediate computation items have opportunities to be
reused so that the circuit complexity and power consumption
decline. The expressions about CMs in R are just simple
without much intermediates, so the low-weighted primitive
polynomials are better.
C. Generator Polynomial
At the last of Section III-A, we present that the symmetric
encoder is usually considered to be the most energy-efcient
but there will be other possibilities in asymmetric encoders that
478
591
981
596
762
1325
811
1083
2030
462
602
1000
559
789
1378
801
1145
2199
0
500
1000
1500
2000
2500
t=2 t=3 t=6 t=2 t=3 t=6 t=2 t=3 t=6
GF(2^5) pri me=37 GF(2^6) pri me=67 GF(2^8) pri me=451
power of symmetric coefficients encoder the lowest power of rest asymmetric encoders
/ W P
Fig. 4. The power consumption of RS encoders over different GF(2
m
)
lead to even lower power. The basic idea for the low power
design of RS encoders is that the encoder includes multipliers
in R
{
0
} as many as possible. A way to nd the proper d
is shown in algorithm A.
Algorithm A: Search for generator polynomials
Input: g(x) = (x +
d
)(x +
d+1
) . . . (x +
d+2t1
),
d {0, 1, 2, . . . 2
m
2}, l = 0, v = 0, S =
Step 1: for i = 0 to 2t 1
begin
for j = 0 to 2
m
2
if (g
i
(d
j
) mod
2
m
1
=
0
and d
j
/ S)
v = v + 1, d
v
= d
j
;
end
S = {d
1
, d
2
, . . . , d
v
};
Step 2: for = 1 to v
begin
g(x) = (x+
d
)(x+
d+1
) . . . (x+
d+2t1
);
for = 0 to 2t 1
if g
R l = l + 1;
d
= argmax
l
(d);
end
Output: d
14
7+13=20 XOR
prime = 37 d
min
=7
3
16
,
30
3+8+1=12 XOR
dgen=1
10
29
19
24
10+2+8+12=32 XOR
(63, 59) dsym=30
48
36
13+17=30 XOR
prime = 67 d
min
=14
62
32
1+4+9=14 XOR
dgen=1
10
24
41
19
7+10+17+16=50 XOR
(255, 251) dsym=126
87
208
24+28=52 XOR
prime = 451 d
min
=97
139
150
58
24+16+12=52 XOR
dgen=1
10
222
213
217
33+26+26+23=108 XOR
Considering that the only difference among these circuits
constitution is the CM, the comparison of CM can be enough
to represent the point. Moreover the important performances
of encoder circuits, such as the throughput, latency etc. are
not deteriorated. Without loss of generality, the roots of a
generator polynomial are expressed with
1
,
2
, . . . ,
nk
,
and we denote d
gen
= 1. Additionally, d
sym
= 2
m1
t is
regarded as the parameters that can make g(x) symmetrical.
We have already known the exact number of XOR gates of
each CM over different nite elds. Therefore, by comparing
the number of XOR gates of total CMs involved in each
encoder, the hardware requirement with beginning root d
min
is no more than that of its counterpart d
sym
, even though
the latter has more CMs than the former. Furthermore, the
common parameter d
gen
= 1 may lead to more extra hardware
requirements. These prove that the proposed algorithm is an
effective way to nd the parameters of encoder with the lowest
power consumption.
V. CONCLUSION
In this paper, we analyze the low power design of RS
encoders. All factors such as multipliers, primitive polynomial
and generator polynomial are discussed in details. Simulation
results show that in the case of t = 2, there exists g(x)
with asymmetric coefcients which makes the encoder power
consumption lower than the symmetric encoders. And low-
weighted primitive polynomials are better. While t > 2,
the symmetric encoders have better power performance. In
addition, a method to nd the proper generator and primitive
polynomial quickly is also proposed.
REFERENCES
[1] M. Ayinala and K. K. Parhi, High-Speed Parallel Architectures for
Linear Feedback Shift Registers, IEEE Trans. Signal Processing, vol.
59, no. 9, pp. 4459-4469,Sept. 2011.
[2] C. Cheng and K. K. Parhi, High Speed VLSI Architecture for General
Linear Feedback Shift Register (LFSR), in Proc. Signals, Systems and
Computers, 2009, pp. 713 - 717.
[3] S. Lin and D. J. Costello Error Control Coding: Fundamentals and
Applications. Pearson-Prentice Hall, 2004.
[4] G. Seroussi, A Systolic Reed-Solomon Encoder, IEEE Trans. Info.
Theory, vol. 37, no. 4, pp. 1217-1220, Jul. 1991.
[5] C. K. Koc and T. Acar, Montgomery Multiplication In GF(2
k
), Designs,
Codes And Cryptography, vol. 14, no. 1, pp. 57-69, Apr. 1998.
[6] E. D. Mastrovito, VLSI Designs For Multiplication Over Finite Fields
GF(2
m
), vol. 357, pp. 297-309, 1989.
[7] J. Lv and P. Kalla, Formal Verication Of Galois Field Multipliers
Using Computer Algebra Techniques, in Proc. IEEE Intl. VLSI Design
(VLSID), 25th, 2012, pp. 388-393.
[8] X. Wu, X. Shen and Z. Zeng, An Improved RS Encoding Algorithm ,
in Proc. IEEE Consumer Electronics, Communications and Networks,
2nd, 2012, pp. 1648-1652 .
1563