Dynamic Programming Training Period For An MSE Adaptive Equalizer
Dynamic Programming Training Period For An MSE Adaptive Equalizer
average excess mean-squared error is included, and the set of step sizes
isdetermined by invoking the principle of optimality in dynamic
programming.
I B
The resulting algorithm iscompared to the popular fixed step-size
algorithm. Predicted and experimental results are given. A fairly well (b)
conditioned and a poorly conditioned channel are considered. Results Fig. 1. Adaptiveequalizer for slowlyvaryingchannels. (a) Channel
show that the new algorithm has a faster adaptation time. It is more model. (b) Adaptive equalizer.
complex than the fixed step-size algorithm, but for longtransversal
filters requires little additionalkmputation time. training period algorithm that will shorten the training period.
For the .two channels considered, the training period has been
I. INTRODUCTION found to be considerably shorter than that of the commonly
used training period algorithm.
T. HEPROBLEMof transmitting information over an un-
known channel in the presence of intersymbol interference
and additive noise has been investigated for many years [ 11-
The channels t h a t we consider here are either time invariant
or areslowlyvarying with respect to the signaling rate. The
[3]. Recently,aconsiderable amount of attention has been channelthereforeexhibitsadelay spread butno Doppler
given to the adaptiveequalizer as a solution to the problem spread. In addition, white additive Gaussian noise is assumed.
[4]-[6], [lo], [12], [13], [lS]-[18]. The basic operation An adaptive equalizer in the form of a nonrecursive trans-
versal filter is used to reducetheintersymbolinterference
of the equalizers is the same for many proposed systems. A
nonrecursive digital filterwith variable tap gains learnsthe caused by the time dispersion. Each tap of the filter is delayed
T seconds from the previous tap, The tap weights { C i } areN
characteristic of thechannel. Thetap gains are adjusted to
in numbkr arid are variable. Fig. 1showsthechanneland
minimize some error criterion. When the filter has reduced the
Adaptive equalizer that we consider here. The channel output
error criterion sufficiently, this period of operation, called the
samples at t = kT, ?nk = m(kT), are assumed to be the sum of
learningortrainingperiod, is terminatedand reliable com-
the signal samples xk and noisesamples nk. The samples
munications may take place. The equalizer either then tracks
y k = y ( k T ) are the equalizer output samples. These samples
slow changes in the channel characteristics or remains fixed at
make up vectors m and y. If we assume that the channel is
the last tap gain settings, and periodically enters a short train-
dispersive by L T seconds, then mk = 0 for k < 1 and k > L ,
ing period to update the tap gains.
a n d y k = O f o r k < l a n d k > L + N - 1.
Clearly, there is a delay in data transmission proportional to
The training period consists of the transmission of isolated
the length of the, training period, and a decrease in this delay
pulses p ( t ) of duration T seconds.Each pulse is received
is desirable. Thispaperconcernsitselfwiththe design ofa
smeared out in time and is passed through the equalizer. Be-
fore thenext puiseis transmitted,theequalizer is adjusted.
Pa' er approved by the Communication Theory Committee of the The transmitted level vector corresponding to the L + N - 1
IEEB Communications Society for publication after partial presentation
at the 1971 IEEE Decision and Control Conference, Miami Beach, Fla., equalizer output samples is a. Clearly, this vector is composed
December 15-17, 1971.Manuscript.receivedSeptember 20, 1971; re- of zero elements, except for one element that corresponds to
vised February 29, 1972. It is based in part on a portion of a disserta-
tion by S. H. Richman in patial fulfillment of the requirements for the the transmitted pulse amplitude.
Ph.D. degree, at the Polytechnic Institute of Brooklyn, Brooklyn, N.Y., When the equalizer output is approximately a, the training
and was supportedinpart byNASA Grant NGR-33-006-040and
Signatron, Inc., Lexington, Mass. period ends. Two measures of the difference betweeny and u
s. H. Richman is with Signatron, Inc., Lexington, Mass. 02173. have beenpopularlyused:absoluteandsquarederror. We
M. Schwartz is with the Department of Electrical Engineering, Poly-
technic Ibstitute of Brooklyn, Brooklyn, N.Y. 11201. choose squared erroras our performance criterion.
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
858 IEEE TRANSACTIONS ON COMMUNICATIONS, OCTOBER 1972
If matrix M of dimension (L t N - 1) X has the form vert directly. The equivalent set of equations
... 0
QC opt = g (8)
0.. . 0 is therefore iteratively solved for C opt.
The iteration algorithm that we propose to use to solve (8)
is of the form
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
RICHMAN
PERIOD
PROGRAMMING
TRAINING
ANDDYNAMIC
SCHWARTZ: 859
y *‘
ming [9] , [ 1 11, and we define the functionals problem by using the average step size on each iteration:
b f = E (ai}. (22)
The simulation results show that this assumption is valid. The
use of bf provides an upper bound on thesystem performance.
The true performance results when b: = ai for all i. Since we
where we have taken use the average stepsize,theactual convergence should be
P-1 much better than the boundswill indicate.
a; 0, It is found [lo] that the convergence of the tap gains of the
r=P DPE is given by2
we findthe recursive relationship forthefunctionals to be
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
860 IEEE TRANSACTIONS ON COMMUNICATIONS,OCTOBER 1972
t
2 4 8 10
Moxlrnum No. of Iterotions, P
Maxlmum No. of I t e r a l l o n g P
Fig. 2. Tap gain convergence. Fig. 3. Tapgain convergence.
RESULTS
IV. SIMULATION
A computer program was written to compare the DP and FS
equalizers. An IBM 1130 computer wasused with at least
P < 10 iterations. Figs. 2 and 3 show the simulation results of
tap gain error convergence for both channels, while Figs. 6 and
7 showthestandard d e ~ i a t i o n . ~In all cases, thepredicted
trends are evident in the simulation results. In fact, the DPE
converges faster than predicted by theaverage step-size assump-
tion. The simulated tap gain standard deviation appears to be
hlgher for the DPE. This was also found in [ 151 and [ 161 for
the Chebyschev equalizer. Jt turns out that the equalizer out-
put error is a shallow function of the tap gains, and has the
same standard deviation for the DP and FS equalizers.
Since the physical quantitythat is mostimportantwith
respect to detection and probability of error is the equalizer
2
10-20 5
IO 15 20 25 30
Maximum No. of lteratlons, P
output error, we present these simulation results [13]. In all
cases, wehave constrainedthe averageexcess mean-squared Fig. 4. Predicted standard deviation bound.
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
RICHMAN AND SCHWARTZ:DYNAMICPROGRAMMINGTRAININGPERIOD 861
-
h? 17.8; = 164
9- Pmin Pmln
hx17.8; = 164.
Pmin mln SNR = 30 dB
8-
. SNR = 30 dB 8= 0.0974
7- = 0.0974
6- -
5-
002 -
4-
- -------
Fig. 7. Simulation-SD { I q I }.
DP Dynamlc Programmlng
A FixedStepSize
I
) - l I l I I I I I l I I I
0 2 4 6 8 10
Moxlmum No. of lterottons. P
A _2_
reachedthis region. The vertical markson each curve mark it is expected that the DPE will also outperform the FSE. The
off *1 standard deviation of output error. The results for the standard deviation of the output error was found to be rela-
DPE aresignificantly different from those for the FSE, even tively equal for bothequalizers and both channels.
when the standard deviation is included.
Forthepoorlyconditionedchannel,the DPE hasentered V . REMARKS
the convergence region after four iterations, as shown in Fig. 9. With the inclusionofadditivenoise,aconstraint on the
However, after 11 iterations, both equalizers are close in per- averageexcess mean-squarederror,andlimitingthetraining
formance, and as shown by the vertical marks, are not signifi- period to P iterations, we have found a dynamic programming
cantlydifferent. Still, however,thetrainingperiodwould algorithm for adjusting the tap weights of an adaptive equalizer
have been terminated after five iterations had the DPE been during its training p e ~ i o d . ~As compared to the FSE, a con-
used. Forthetwochannelsconsidered,itappearsthatthe
DPE can be used to speed up the training period. For channels 4The equalizer may then continue to adapt to andtrack channel
whose condition falls between the two extremes we have used, changes that are slow compared t o the convergence time.
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
862 IEEE TRANSACTIONS ON COMMUNICATIONS, OCTOBER 1972
clL-L+Lu
00 2 6 8 IO .
Fig. 10. Tap controller and DPE step-size calculator. (a) Tap controller.
(b) DPE stepsize calculator.
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
RICHMAN AND SCHWARTZ: DYNAMIC PROGRAMMING TRAINING PERIOD 863
of the gradient that is readily available. In the FSE, the gradi- APPENDIXA
ent was multiplied by a fixed constant. In the DPE, the gradi- DYNAMIC PROGRAMMING STEP SIZES
entmust be multipliedbya constant times itssquared The recursive dynamic programming relationship that we use
magnitude. Theadditionalhardwarerequired is aset of N to determinethe step-size relationship is given by (15) and
squarers, an accumulator, and a multiplier. Evidently, if the repeated here:
FSE can operate in real time, so can the DPE.
fp-s(fs)= min [ A+pf 4p q s + l ) )I.
The tap controller of Fig. l(b) is shown in Fig. lO(a) for the { “ r 1% \
rth tap and is the same fortheDPandFS equalizers. The
Starting at the last iteration s = P - 1, we begin to determine
present equalizer output error multiplies the contents of the
the step sizes. Let
rth tap cell. Thisprocedure is continuedfor (L + N - 1)T
secondsuntiltheisolatedpulsehasclearedtheline.Each Fp-1 = Ap&$-l + ~2 p ; (27)
product is accumulated in G,. After (L + N - l ) T seconds, then
the switch closes and G,, now equal to the rth component~,of f l ( ~ p -=~min
) Fp-l. (28)
V e 2 , entersthe step-size calculatorandthemultiplier. T% “P-1
step size is calculated as described in the following, and also
Using the approximation’
entersthemultiplier. The resultant product is subtracted in
the tap accumulator to form the newest update of the 7th tap € i + l = (I- ( ~ i A i€i) (29)
gain. At the start of equalization, the tap gain accumulator is where
set with an initial value of C,. Every (L + N - l ) T seconds the AiWiM?, (30)
gradient accumulator is reset to zero.
The above is requiredin boththeFS andDPequalizers. we find by substituting (29) into (27) and by differentiating
Only the step-size calculator is different. For the FSE, a fixed FPbl with respect to ap-p-lthat
step size equal to A is placed in the step-size calculator. It is T T 2
ap-p-1 = f p - 1 A p - 1 € p - l / ( A p + ep-1 AP-1 ep-1). (31)
this constant value that is outputed every (L + N - l ) T
This is the optimum step size to use for the last iteration, as-
seconds. In effect, the step-size calculator is a memory with suming that P - 1 iterations have already been performed. In
value equal to the chosenstep size. Thestep-sizecalculator order to proceed to determine the remaining step sizes, it is
for the DPE is shown in Fig. 10(b). In this case, each of the N necessary to write f l ( e p - l ) as afunction of theoptimum
gradientcomponents is squared,thenaddedtogether,and ~ . matrix (I - ap-l A P e 1 )becomes
o ~ p - The
scaled by l/Ap where Ap has been predetermined. The result is
the variable step size a . Theincreasedcomplexitytherefore
consists o f N squarers, an accumulator, and a multiplier.
Per sample interval, both equalizers require N multiplies and
adds to accomplish the convolution of input signal and filter
response. In addition, N multiplies and adds are required to
formthegradientoftheequalizererrorandoneadd is re-
quired to form the output error. After (L + N - l ) T seconds,
2 N ( L + N - 1) multiplies and ( 2 N + 1) (L + N -‘1) adds have
have beenperformed.Bothequalizersalsorequire N multi-
plies to scale the gradient by the step size and N adds to up-
date the tap gains, The DPE requires N additional multiplies
and adds to form the squared magnitude of the gradient andN
multiplies’ to scale thestep size bythe Lagrange multiplier.
Thetotal
number of
equalizer
operations is therefore
2N(L + N - 1/2) multiplies and 2N(L + N - 1/2) - 1/2 adds
for the FSE, and 2N(L + N + 1 / 2 ) multiplies and ( 2 N + 1) .
(L + N ) - 1 adds for the DPE. In terms of multiplies and adds,
the total number of operations for the FSE is approximately
2N(L + N - 1 / 2 ) and for the DPE 2N(L + N + 1/2). Actually,
the DPE equalizer requires ( N - 1) fewer adds. Ordinarily,N
is on the order of L , so that both equalizers require approxi-
mately 4 N 2 operations. The difference in the number of DPE
operations as compared to the FSE is 1/(L + N - 1/2), which
goes tozerofor large N. Fortheequalizers that wehave
simulated, L + N - 1/2 is 33.5, which meansthe DPE re-
quired 3 percentmoreoperationsthantheFSE.This is a T
SThe exactrelationship is given by ~ i =+ (Mi+,Ci
~ - a ) - c~iMi+~M q .i
small price to pay for
thespeedup of equalizer error I t has been found [ l o ] , however, that for signal-to-noise ratios of 10 dB
convergence. or more, (29) is a good approximation.
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.
864 IEEETRANSACTIONS ON COMMUNICATIONS,OCTOBER 1972
and that the stepsize to use on the final iterationis R. Bellman, Dynamic Programming. Princeton, N. J.: Princeton
T Univ. Press, 1957.
CXp-1 = Ep-1 A p - 1 E p - l l h p . (39) S. H. Richman, “Dynamic programming in adaptive equalization
of digital communication channels,”Ph. D. dissertation, Dep.
Clearly, since f l ( f p - , ) is of the same form as fO(+), the ex- Elec.Eng., Polytechnic Inst. Brooklyn,Brooklyn, N. Y., June
pression for the step size at the(P- 1)st iteration must be 1971.
J. T.Tou, Optimum Design of Digital ControlSystems. New
T T 2 York: Academic Press, 1963.
qP-2 = EP-2 AP-2 €P-Zl(XP + EP-2 A P - 2 EP-2). D. C. Coll, “A system for the optimum utilization of pulse com-
munication channels,” Defense Res. Telecommun. Establ., Rep.
The same assumptions are applied at the (P - 1)st iteration, 1168, Dec. 1968.
andfinallyforallprecedingiterations. A pointisreached P. Mmsen,“Feedbackequalizationforfading dispersive chan-
where the assumptions are no longer valid. This is the point nels, IEEE Trans. Inform. Theory, vol. IT-17,pp.
56-64,
where the set of step sizes deviates from a true dynamic pro- Jan. 1971.
L. Fox, An Introduction toNumerical LinearAlgebra. New York:
gramming solution. It appears, however, that the assumptions Oxford Univ. Press, 1965.
are true after a very small number of iterations. The DPE re- T. J. Schonfeld and M. Schwartz. “A raDidlv converging first-
sultsshowthatevenwiththeseassumptions,the DPE out- order training algorithm for an adaptive equalizer,” IEkE-Trans.
performs the FSE with respect to convergence. Inform. Theory, vol. IT-17, pp. 431-439, July 1971.
-, “Rapidly conver$ng second-order tracking algorithmsfor
The strategy of choosing the step sizes will be adaptiveequalization, IEEE Trans. Inform.Theory, vol. IT-17,
pp. 572-579, Sept. 1971.
= ETA , er/Xp. (40) D. George, R. Bowen, and J. Storey, “An adaptive decision feed-
back equalizer,” IEEE Trans. Commun. Technol., vol.COM-19,
pp. 281-293, June 1971.
APPENDIX
B R. W. Chang, “A new equalizer structure for fast start-up digital
communication,” Bell Sysf. Tech. J., vol. 50, July/Aug. 1971.
LAGRANGE MULTIPLIER S. H. Richman, Signatron,Inc., Lexington, Mass., Internal Memo.
The constant E in the constraint equationis f o u n d t o b e[ 101
Authorized licensed use limited to: BEIJING UNIVERSITY OF POST AND TELECOM. Downloaded on March 22,2023 at 03:53:56 UTC from IEEE Xplore. Restrictions apply.