Least Squares Quantization in PCM
Least Squares Quantization in PCM
Least Squares Quantization in PCM
2, MARCH 1982
129
each aj being one of the integers { 1,2,. . +,v}. The technology of this transmission does not concern us, except that
130
q,K(r
-co<ttoQ.
tj)P
j=-*
y(s(tj))K(t
ple,
F(x)
= P{s( t) 5 X})
-@3(x<m,
=/:mxzdF(x),
w-9
Moreover, the r and n processeshave this same property;
the averagereceiver-output power R is given by
R = E{r2(t)}
=I-Ly(x)
dF(x),
-03~<~<.
tj)9
~,MARCH 1982
=J-tz2(x)dF(x),
jz-03
(1-a
(7)
n(t)
r(t)
s(t),
-m<ttcc,
is given by
n(t)= 5 z(s(t,))K(t
- t,), -co~tt<,
jz-00
IV.
(8)
where
-Mexico,
z(x) =y(x> - x,
(9)
may be regarded as the quantization error added to a
sample which has voltage value x.
Note that we assumethat the receiver uses the nonrealizable pulses (2). If other pulses are used (e.g:, step functions
or other realizable pulses) there will be sampling noise, in
general, even without quantization [3]. Our noise (8) is due
strictly to quantization.
Finally we must emphasizethat we assumethat the {Q,}
and {q,) are constant in time. In deltamodulation and its
refinements the {Q,} and {q,} change from sampling
instant to sampling instant, depending on the past behavior of the signal being handled. Such systems are very
difficult to treat theoretically.
III.
( (q,-x)2dF(x).
(13)
NOISE POWER
4a =
JQ,
/ QndF(x)
a = 1,2;**,v >
(14
131
(16)
Qv-,= {x:x,-2(xIx,-,}
Q,= {x:x,~~~x~co},
where the endpoints {x~} are given
ji q:lQ dF(x),
a
LY=l
where the {q,} here are the optimum ones of (14). The sum
on the right is the receiver-output power from (11). Hence
when the {q,} are centers of mass of the {Q,}, optimum or
not, then S = R + N, which implies that the noise is
orthogonal to the receiver output. One expects this in a
least squaresapproximation, of course.
V. THE BEST PARTITION
{X:X<XIX~}
Xl = &I,
q2)
x2 = 5(42
q3)
07)
QUANTIZATION
PROCEDURES
132
will lie equidistant from x, (17), and from (19) the noise
will not change if we reassign the mass at x, to Q,, 1,
-00 <xp<xp<
. . . <xp, < CQ
retaining the given (4,) as quanta. But q, and qa+ , are
are chosen arbitrarily except that each of the resulting definitely not centers of mass of the corresponding mod{Qt)} should have nonvanishing mass. Then the centers of ified sets,and the noise will strictly decreaseas qa and qa+ 1
mass of these sets are taken as the first trial quanta {q:)}.
are moved to the new centers of mass. Thus the given
These values will not satisfy the midpoint conditions configuration is not a local minimum, contrary to assump(17), in general, so that the second trial point pc2) is taken tion. From this result and (19) we seethat N(p) is continuto be
ous in a neighborhood of a local minimum. We have
proved
also that there is no essential loss of generality in
qL2= q;),
a = 1,2;. ,V
assuming the form (16) for the {Q,}.
,@)
a = 1,2;-*,v - 1,
We refer to the above trial-and-error method as Method
a = i(q2
a + qa?,),
I. Another trial-and-error method is the following one,
with appropriate modifications if any of the resulting { QL2)} Method II. To simplify the discussion we assume for the
have vanishing mass. This step does not increasethe noise, moment that F is continuous and nowhere constant. We
in view of the discussion in Section V; that is, N(pc2))5
choose a trial value ql satisfying
N( p)).
The new {qf)}, centers of mass (c.m.) of the old {Q$},
41 < ( xdF(x).
J-cc
will not be centers of mass of the new {QL2)}, in general;
The condition that q, be the center of mass of Q, detrial point pc3) is determined by
termines x1 as the unique solution of
x(3) = .(2)
ff = 1,2,*.*,v - 1,
,a,
= (ac.m.
of Qi3)),
a = 1,2;*.,v.
41=
p))
L ....
/::
dF(x)
-00
N( p() r N(
x xdF(x)
= 2x1
41.
q2.
J
4, =
co xdF(x)
yz
dF(x)
(22)
X,-I
and the q, obtained from (21) will not satisfy (22) in
general. The discrepancy between the right members of
(21) and (22) will vary continuously with the starting,value
q,, and the method consists of running through such chains
133
TABLE I
GAUSSIAN,Y = 2
a
1
co
4a
0.4528
1.5104
1
2
X,
0.9816
w
TABLE III
GAUSSIAN,Y = 8
EXAMPLES
. ..<q.-,<x,-,<q,;
x,
TABLE II
GAUSSIAN,~ = 4
40
0.7919
a
1
2
3
4
5
6
7
8
-cQ~x-=c<.
0.7584
0.3880
0.6568
0.9423
1.2562
1.6181
2.0690
2.1326
0.%82
0.5224
0.7996
1.0993
1.4371
1.8435
2.4008
co
TABLE VIII
GAUSSIAN;OPTIMUMNOISEFORVARIOUSVALUES OF Y
Y
X,
0.5006
1.0500
1.7480
co
TABLE IV
GAUSSXAN,~= 16
(23)
the endpoints and quanta for the negative half-axis .are the
negativesof these.
We normalize to unit signal power S = 1. The {q,} and
{x~} for other values of S are to be obtained by multiplying the numbers in the tables by \rs.
The simplest case is the uniform distribution:
4a
0.245 1
0.7560
1.3439
2.1520
1
2
3
4
2
4
8
16
32
64
128
(03)
Y2N
YXl
0.3634
0.1175
3.455 x 10-2
9.500 x 10-3
1.452
1.880
2.205
2.430
3.93
4.00
4.13
(0)
(2.72)
(4.34)
-lxl!h -m<xX<.
f(x)=c-.6
ASYMPTOTIC PROPERTIES
Let us assume that the distribution F is absolutely continuous with density function f = F, which is itself dif4The author is indebted to V. Vyssotsky of the Acoustics Research
Group for this information (private communication).
134
TABLE XVII
APPROXIMATELASTENDPOINT
FROMASYMPTOTICFORMULA
2
4
8
16
32
64
128
dx = 0,
a = 2,3;..,v
- 1.
s, OL
-h a-,(
t(hi - hit-,)fk,)
+ +@a+ hi-,)f(d
= o( h3,) + o( hi-,),
a = 2,3;*.,v
Gaussian
b
0
1.168
1.992
2.657
Laplacian
bv
g(x)
--g(x)
- fb>
3fb)
-09~x~co.
(26)
f 13(x)
g(x) = ( fl/3(x,)dx
The numbers in Tables VIII and XVI suggest the ex--ooxx~oo,
(27)
istence of an asymptotic fractional density of quanta.
J-cc
Accordingly, we define the function g,(x), - cc < x < cc,
provided that the integral in the denominator exists.
by
The noise power becomes
-cQooxxq,
d-4 = 09
N=I
II- 1
4, <x 5 q&+1,
12v2
2vh,
(25)
a = 1,2,-a-,v - 1,
(28)
=[I^
f1i3(x)dx]3+o(f),
= 0,
q,<x<m.
12v2 --oo
The definition is arranged so that (for given v) the sets
Q2, Q3,.. -,Q,-, subtend equal areas of l/v each under
the graph of g,(x) versusx. We will proceed as if a limiting
density,
g(x) = p~&)~
-caooxx<,
existed.
We wish to express g in terms of the given sample
density function f. To do this we will use conditions (24),
lim v( ha- 1 + h,) = -!together with the following further assumptions. We asv-m
g(O)
400
sume that g has a derivative, and we assumethat for given
x and k the difference c(x) = g,(x) - g(x), -cc < x (
comparing with (25). In the Gaussian example we find
co, has the property
l/g(O) = G(=
4.34), and for the Laplacian: l/g(O) =
3a( = 4.24).
$ = Jbmd:) dx
Y
1
1
i&z, + ha) - dqa - ha-,) 1
g(qJ
= - 2v2g3(qa)
5Thenotationu(v)
a = 2,3;..,v
= O(Z)(Y)) meansinourcaselim,,,u(v)/o(v)
- 1,
= 0.
135
convergeboundedlyto
ACKNOWLEDGMENT
The numerical results presented in the tables are due to
Miss M. C. Gray and her assistants in the Numerical
Analysis and Digital ProcessesGroup; the programming of
Method I for the IBM-650 electronic computer was done
by Miss C. A. Conn.
After substantial progress had been made on the work
described here there appeared in [ 1l] a review of a paper
by J. Lukaszewicz and H. Steinhauson optimum go/no-go
gauge sets. The present author has not been able to obtain
a copy of this paper, but it seemslikely that these authors
have treated a problem similar or identical to the one
discussedin Sections IV-VI. M. P. Schtitzenbergerin [12]
examines the quantization problem in the casewhere v = 2
and where F increasesat 3 or 4 points.
APPENDIX
Suppose s(t), - w < t < co, is a continuous parameter stochastic process, real, separable, measurable, stationary, and of
finite power:
-w<t<co,
-co<t<w,
s(t) = jm e2*iAtdS(A),
(29)
--m
where the spectral process [(A), - cc < X < co, has orthogonal
increments [4, p. 5271. To say that s is band-limited to the
frequency band - W 5 X 5 W is to say that the [ process has
vanishing increments outside of this band with probability one,
and (29) becomes
s(t) =j-w;Joe2itdt(A).
-w<t<w.
(30)
+ 26,cos2aWt
S(X)=/lily,(h)
A=
i(t) =
X I W, has Fourier
(8:)
=E{8;}
=+E{)[(W+O)-+W-O)12},
we see from (32) that the sampling series (31) represents s with
probability 1, if and only if, the [ process has no power concentrated at the band edges,
E(J[()W+O)-[(W-0)\2}
. . . ?d4-,)),
cpMGd)~ cp(4~1))> ...
(33)
constitute a stationary discrete-parameter stochastic process. If
the number
@ = E{q2(+,))}
=j_m_lp(x)
pwwq
Q+(t,))
-co
=j-wwe2ilh/(2Wd$(h),
<j<
cc),
-co cjc
co,
-co
This process is stationary in the wide sense and has the given
process (33) as its samples, clearly
-al
<j -=cco.
= j~ww~{ldd~~ I} = j~mv2(x)df(x),
-w<t~co.
-wCj<w,
f - t,),
dF(x),
-[
=O.
E{e(i)}
in the notation of Section I. This function is of bounded variation, so that the partial sums
i
(32)
Moreover,
(2W))
29rW(t - j/ (2W))
jzz
(31)
converges(in stochasticmeansquare)to
e(t,) = cp(S(l,)),
= sin2mW(t -j/
s(t,)K(t - t,)
j=-m
1 w e-2Tijh/(2W)e277ihrd~
I = 2~ / _ w
ax)
w,
-cw,
e(t)
~(*W+O)-[(*W-O)=6,-Ci8,.
= K(t - tj),
-w<x<
,,+
- 26,sin2mWt,
-w~t~w,
= e2niXt
-woOttwoO,
(34)
- w 5 x 5 w,
136
IEEETRANSACTIONS
ON INFORMATIONTHEORY,VOL. IT-28, NO. 2, MARCH 1982
CP
Cl
[W.
The well-known mean-erogodic property,
b,
1.i.m.
m
2
m II-m'+1
mf~--m~-m j=mj
b2
b,
b3
x-
(-l)'cP(s(tj))
=Tj(W+O)-q(w-0)+q(-w+0)-l)(-W-O)
=2Re[q(W+O)--(tW-0)],
-I
O I
XI
X-
A simple example shows that the conditions (14) and (17) are
c2
Xl =T,
3(c2
q2
NE&-
- 3c,
4c2
cl)
9
(c2-3c~)3
16~;
Note 1981:
( ;
1
The main reason the paper was not submitted for publiagreeing with (27).)
cation previously was that the numerical calculations were
The following interesting example is due to J. L. Kelly, Jr., of
never completed. The Gaussian v = 32 case was done on
the Visual Research Group. Let the density f be as in Fig. 2, with
c2 > c,, c, + c2 = 1. The signalpoweris S = l/3, independently the IBM 650 card programmable calculator; the Laplacian
of c, and c2. Supposev = 2. One configurationfor which condi- cases were done only for v = 2. Some time later the 650
was replaced by an IBM 701 electronic computer, but no
tions (14) and (17) are satisfied is q, = - 4, x, = 0, q2 = 4,
clearly, and the resultingnoise in N = l/ 12. When c2 > 3c, ; quantizing program was written for it.
I was not satisfied with not having conditions for a
however, there is another solution-it is the one which in the
limit c, = 0, c2 = 1 goes into the scheme where (0,l) is divided
unique minimum but would have published the paper
N
c,(b2
bJ3
c2(hl
b313
137
[41
[51
[61
[71
PI
[91
[lOI
[Ill
REFERENCES
B. M. Oliver, J. R. Pierce, and C. E. Shannon, The philosophy of
PMC, Proc. I. R. E., vol. 36, pp. 1324-1331, 1948.
PI H. S. Black, Modulution Theory. Princeton, NJ: Van Nostrand,
1953.
[31 S. P. Lloyd and B. McMillan, Linear least squares filtering and
prediction of sampled signals, in Proc. Symp. on Modern Network
[II
[I21
1131
where
(JXZ- 5
18fi .
This fact and the following theorem verify Goldsteins
conjecture in a very sharp way.
Theorem: If S is the unit square and f( Z) is any (measurable complex valued) function on S, which takes only n
distinct values, then
p - f(Z)12
>f
where
5
u=iip.
Proof:
Let f*(Z)
trivial.)