Multivariate Approximation
Multivariate Approximation
1.1 Introduction
The primary problem in approximation theory is the choice of a successful method
of approximation. In this chapter and in Chapter 2 we test various approaches,
based on the concept of width, to the evaluation of the quality of a method of
approximation. We take as an example the approximation of periodic functions of
a single variable. The two main parameters of a method of approximation are its
accuracy and complexity. These concepts may be treated in various ways depend-
ing on the particular problems involved. Here we start from classical ideas about the
approximation of functions by polynomials. After Fourier’s 1807 article the repre-
sentation of a 2π -periodic function by its Fourier series became natural. In other
words, the function f (x) is approximately represented by a partial sum Sn ( f , x) of
its Fourier series:
n
Sn ( f , x) := a0 /2 + ∑ (ak cos kx + bk sin kx),
k=1
π π
1 1
ak := f (x) cos kx dx, bk := f (x) sin kx dx.
π −π π −π
1 2n−1
Vn ( f ) := ∑ Sl ( f )
n l=n
of order 2n − 1.
From the point of view of accuracy this method is close to the Chebyshev
method; de la Vallée Poussin proved that
f −Vn ( f ) ≤ 4En ( f ) p , 1 ≤ p ≤ ∞.
p
From the point of view of complexity it is close to the Fourier method, and the
property of linearity essentially distinguishes it from the Chebyshev method.
We see that common to all these methods is approximation by trigonometric
polynomials. However, the methods of constructing these polynomials differ: some
1.1 Introduction 3
We remark that at that time, early in the twentieth century, classes similar to
W r H α were used in other areas of mathematics for obtaining the orders of decrease
of various quantities. As an example we formulate a result of Fredholm (1903). Let
f (x, y) be continuous on [a, b] × [a, b] and
max f (x, y + t) − f (x, y) ≤ C|t|α , 0 < α ≤ 1.
x,y
Here and later we write an ≍ bn for two sequences an and bn if there are two positive
constants C1 and C2 such that C1 bn ≤ an ≤ C2 bn for all n.
The problem of approximation of functions in the classes W r H α by trigonomet-
ric polynomials was so natural that a tendency to find either asymptotic or exact
values of the following quantities appeared:
Sn (W r H α )∞ := sup f − Sn ( f )∞ , En (W r H α )∞ := sup En ( f )∞ .
f ∈W r H α f ∈W r H α
We now formulate the first results in this direction. Kolmogorov (1936) proved the
relation (in our notation W r = W∞,r
r , see §1.4)
4 ln n
Sn (W r )∞ = + O(n−r ), n → ∞.
π 2 nr
Independently, Favard (1937) and Akhiezer and Krein (1937) proved the equality
En (W r )∞ = Kr (n + 1)−r ,
where Kr is a number depending on the natural number r.
In 1936 Kolmogorov introduced the concept of the width dn of a class F in a
space X:
n
dn (F, X) := infn sup infn f − ∑ c j φ j .
{φ j } j−1 f ∈F {c j } j−1 j=1
X
1.1 Introduction 5
This concept allows us to find, for a fixed n and for a class F, a subspace of
dimension n that is optimal with respect to the construction of a best approxi-
mating element. In other words, the concept of width allows us to choose from
among various Chebyshev methods having the same quantitative characteristic of
complexity (the dimension of the approximating subspace) the one which has the
greatest accuracy.
The first result about widths (Kolmogorov, 1936), namely
d2n+1 (W2r , L2 ) = (n + 1)−r ,
showed that the best subspace of dimension 2n + 1 for the approximation of classes
of periodic functions is the subspace of trigonometric polynomials of order n. This
result confirmed that the approximation of functions in the class W2r by trigonomet-
ric polynomials is natural. Further estimates of the widths d2n+1 (Wq,r α , L p ), 1 ≤ q,
p ≤ ∞, some of which are discussed in §2.1 below, showed that, for some values
of the parameters q, p, the subspace of trigonometric polynomials of order n is
optimal (in the sense of the order of decay) but for other values of q, p this subspace
is not optimal.
The Ismagilov (1974) estimate for the quantity dn (W1r , L∞ ) gave the first exam-
ple, where the subspace of trigonometric polynomials of order n is not optimal.
This phenomenon was thoroughly studied by Kashin (1977).
In analogy to the problem of the Kolmogorov width, that is, to the problem con-
cerning the best Chebyshev method, problems concerning the best linear method
and the best Fourier method were considered.
Tikhomirov (1960b) introduced the linear width:
λn (F, L p ) := inf sup f − A f p ,
A:rank A≤n f ∈F
(2) In approximation theory (as well as in other fields of mathematics) it has turned
out that it is natural to unite functions with the same smoothness into a class.
(3) The subspace of trigonometric polynomials has been obtained in many cases as
the solution of problems regarding the most precise method for the classes of
smooth functions: the Chebyshev method (which uses the Kolmogorov width),
the linear method (which uses the linear width), or the Fourier method (which
uses the orthowidth).
On the basis of these remarks we may formulate the following general strategy
for investigating approximation problems; we remark that this strategy turns out to
be most fruitful in those cases where we do not know a priori a natural method
of approximation. First, we solve the width problem for a class of interest in the
simplest case, that of approximation in Hilbert space, L2 . Second, we study the
system of functions obtained and apply it to approximation in other spaces L p .
This strategy will be used in Chapters 3, 4, and 5.
Denote
xl := 2π l/(2n + 1), l = 0, 1, . . . , 2n.
Clearly, the points xl , l = 1, . . . , 2n, are zeros of the Dirichlet kernel Dn on [0, 2π ].
For any |k| ≤ n we have
2n 2n
l l
∑ eikx Dn (x − xl ) = ∑ eimx ∑ ei(k−m)x = eikx (2n + 1).
l=0 |m|≤n l=0
follows from (1.2.2). Applying the Hölder inequality (see (A.1.1) in the Appendix)
for estimating Dn 22 we get
Relations (1.2.7) and (1.2.8) imply for 1 < q < ∞ the relation
Theorem 1.2.1 The operator Sn does not change polynomials from T (n) and for
p = 1 or ∞ we have
Sn p→p ≤ C ln n, n = 2, 3, . . . ,
This theorem follows from (1.2.3) and the Marcinkiewicz multiplier theorem
(see Theorem A.3.6).
For t ∈ T (n),
n
t(x) = a0 /2 + ∑ (ak cos kx + bk sin kx),
k=1
t˜ p ≤ C(p)t p .
Clearly, it suffices to consider the case of odd n. Let this be the case and set m :=
(n + 1)/2, l := (n − 1)/2. Representing D̃n (x) in the form
n −1
1 1 imx
D̃n (x) = ∑ eikx − ∑ eikx = e Dl (x) − e−imx Dl (x) ,
i k=1 k=−n i
∑(sign k) fˆ(k)eikx .
k
1.2 Trigonometric Polynomials 9
The Marcinkiewicz multiplier theorem A.3.6 implies that this operator is bounded
as an operator from L p to L p for 1 < p < ∞. We denote by f˜ the conjugate function.
Jna 1 = 1. (1.2.16)
P0 := Q0 := 1,
i2 j x j
Pj+1 (x) := Pj (x) + e Q j (x), Q j+1 (x) := Pj (x) − ei2 x Q j (x).
12 Approximation of Univariate Functions
holds.
Gn 1 ≤ C. (1.2.23)
Further,
Gn (x) = ∑ φ (m/n)eimx
|m|≤n
φ (1) = ∑ al (−1)l = 0,
l
Theorem 1.2.4 Let ε > 0, and let a subspace Ψ ⊂ T (n) be such that dim Ψ ≥
ε (2n + 1). Then there exists a t ∈ Ψ such that
t∞ = 1,
and
t2 ≥ C(ε ) > 0.
where q is a natural number and l, q are coprime; that is, (l, q) = 1. We confine
ourselves to the case where q is an odd prime.
Theorem 1.2.5 Let q > 2 be a prime, l = 0 an integer, and k an integer. Then, for
q
2
S(q, l, k) := ∑ ei2π (l j +k j)/q ,
j=1
Proof We first consider the case k = 0. Note that the quantity S(q, l) does not
change if we sum over the complete system of remainders modulo q instead of the
segment [1, q]. Consequently, for any integer h,
q
2
S(q, l) = ∑ ei2π l( j+h) /q . (1.2.28)
j=1
Further,
q q
S(q, l)2 = ∑ e−i2π lh
2 /q 2
∑ ei2π l j /q .
h=1 j=1
Now let k be nonzero. Since q is a prime different from 2, the numbers 2lb,
b = 1, . . . , q, run through a complete system of remainders modulo q. Consequently,
there is a b such that
2lb ≡ k (mod q).
Then
l j2 + k j ≡ l( j + b)2 − lb2 (mod q)
and, consequently,
S(q, l, k) = S(q, l) = q1/2 .
Proof The proof of this theorem can easily be derived from a deep number the-
oretical result due to Hardy and Littlewood about estimating incomplete Gaussian
sums: for any n ∈ [1, q]
n
i2π l j2 /q
∑ e ≤ Cq1/2 , (l, q) = 1. (1.2.31)
j=1
G := {k j − a, j = 1, . . . , n}.
We set
tn (x) := ∑ eikx .
k∈G
Then
n
π
π 2 /q
tn (2π l/q) = ∑ ei2 lk/q = ∑ ei2 l j ,
k∈G j=1
which by (1.2.31) implies the required estimates for tn (2π l/q). The bound
tn (0) = n ≥ (n + 1)/2 is obvious.
For the sake of completeness we will prove Theorem 1.2.6 using Theorem 1.2.5.
Instead of (1.2.31) we prove the inequality
i2π l j2 /q
∑ (1 − | j − a|/n + e ≤ q1/2 , (l, q) = 1. (1.2.32)
j
We set
un (x) := t(x) ∗ Kn−1 (x).
Then by (1.2.5),
2a
un (x) = q−1 ∑ t(xk )Kn−1 (x − xk ),
k=0
1.3 The Bernstein–Nikol’skii Inequalities. The Marcienkiewicz Theorem 17
Further,
2
un (0) = ∑ 1 − | j − a|/n + ei2π l j /q ,
j
where the k j are the same as in the beginning of the proof of this theorem, we get
i2π lk /q i2π l j2 /q
tn (2π l/q) = ∑ 1 − | j − a|/n e j
= ∑ 1 − | j − a|/n + e ,
j + j
Proof Applying twice the Abel transformation (see (A.1.18) in the Appendix)
with aν = 0 for ν > n, we obtain
n n
t(x) := a0 + ∑ aν 2 cos ν x = ∑ Dν (x)Δaν
ν =1 ν =0
n ν n
= ∑ ∑ Dµ (x) Δ2 aν = ∑ (ν + 1)Kν (x)Δ2 aν . (1.3.1)
ν =0 µ =0 ν =0
18 Approximation of Univariate Functions
as required.
with ak = kr and bk = 1 − (k − n)/n, we see that the inequality (1.3.5) will be valid
for n ≤ k ≤ 2n − 3 too. For the remaining values of k = 0 we have
|Δ2 vk | ≤ |Δvk | + |Δvk+1 | ≤ C(r)nr−1 . (1.3.6)
From the inequality |Δ2 v0 | ≤ C(r) and relations (1.3.4)–(1.3.6) we get the rela-
tion (1.3.3) for r > 0 and α = 0.
Let α = 1 and let A˜s (x) denote the polynomial which is the trigonometric con-
jugate to As (x), which means that in the expression for As (x) the functions cos kx
are substituted by sin kx. We prove that
A˜s 1 ≤ C. (1.3.7)
Clearly, it suffices to consider s ≥ 3. It is not difficult to see that the equality
s−1 s−3
A˜s (x) = 2 Im As (x) ∗ 4K2s−1 −1 (x) − 3K2s−1 −2s−3 −1 (x) ei(2 +2 )x ,
holds. From this equality, by virtue of the Young inequality with p = q = a = 1
(see A.1.16)) and the properties of the functions Kn and As , we obtain (1.3.7).
Further, for n = 2m , we have
Vnr (x, 1) − 1 = V2nr (x, 0) − 1 ∗ Vn0 (x, 1)
m+1 m+1
=− ∑ V2nr (x, 0) ∗ A˜s(x) = − ∑ V2r (x, 0) ∗ A˜s (x).
s (1.3.8)
s=1 s=1
From (1.3.8) by means of the Young inequality and using (1.3.7) and relation
(1.3.3), which has been proved for α = 0, we get
r m+1
Vn (x, α )1 ≤ C(r) ∑ 2rs ≤ C(r)nr . (1.3.9)
s=0
The operator Dαr is defined in such a way that it has an inverse for each T (n).
This property distinguishes Dαr from the differential operator and it will be conve-
nient for us. On the other hand it is clear that
dr f
= Drr f − fˆ(0).
dxr
Theorem 1.3.2 For any t ∈ T (n) we have, for r > 0, α ∈ R, 1 ≤ p ≤ ∞,
(r)
t (x, α ) ≤ C(r)nr t p , n = 1, 2, . . .
p
and, consequently,
(0)
t (x, α ) = t p , 1 ≤ p ≤ ∞. (1.3.11)
p
holds.
It remains to consider the cases p = 1, ∞. It is sufficient to consider α = 1.
We have for t ∈ T (n),
t (0) (x, 1) = tˆ(0) − t˜(x) = tˆ(0) − t(x) ∗ D2n+1 (x).
Further,
2n+1
D̃2n+1 (x) = 2 ∑ sin kx = 2 Im Dn (x)ei(n+1)x ;
k=1
consequently,
D2n+1 1 ≤ C ln(n + 2).
1.3 The Bernstein–Nikol’skii Inequalities. The Marcienkiewicz Theorem 21
The relation (1.3.11) with α = 0 and (1.3.12) imply for all α the inequality
(0)
t (x, α ) ≤ C ln(n + 2)t p , p = 1, ∞. (1.3.13)
p
The upper estimate follows from (1.3.12). Let us prove the lower estimate. We
first consider the case p = ∞. Let f (x) = (π − x)/2, 0 < x < 2π , be a 2π -periodic
function; then
∞
f (x) = ∑ (sin kx)/k.
k=1
Vm 1 ≤ 3, (1.3.15)
(0)
Vm (x, 1) ≥ C ln(m + 2). (1.3.16)
1
Let us prove (1.3.16). For t we have from the above consideration for p = ∞,
(0) (0)
σ = |Vm (x, 1),t| ≤ Vm (x, 1)1 t∞ (1.3.17)
and
m
σ≥ ∑ 1/k ≥ C ln(m + 2). (1.3.18)
k=1
From relations (1.3.14), (1.3.17) and (1.3.18) we obtain (1.3.16). Then (1.3.15) and
(1.3.16) give the required lower estimate for p = 1.
22 Approximation of Univariate Functions
which means that the set T (n) as a subspace of L2 is isomorphic to ℓ22n+1 . Relation
(1.2.6) shows that a similar isomorphism can be
set up in another way: by mapping
2n
a polynomial t ∈ T (n) to the vector m(t) := t(xl ) l=0 of its values at the points
xl := 2π l/(2n + 1), l = 0, . . . , 2n.
Relation (1.2.6) gives
t2 = (2n + 1)−1/2 m(t)2 .
The following statement is the Marcinkiewicz theorem.
1.3 The Bernstein–Nikol’skii Inequalities. The Marcienkiewicz Theorem 23
Theorem 1.3.6 Let 1 < p < ∞; then for t ∈ T (n), n > 0, we have the relation
C1 (p)t p ≤ n−1/p m(t) p ≤ C2 (p)t p .
From relations (1.3.22) and (1.3.23), using the Riesz–Torin theorem (see Theorem
A.3.2) we find that
1−1/p
V ℓ2n+1
p →L p ≤ Cn ,
We now continue the proof of Theorem 1.3.6. Let Sn be the operator that takes
the partial Fourier sum of order n. Using Theorem 1.2.1 we derive from Lemma
1.3.7 the upper estimate (the first inequality in Theorem 1.3.6):
2n
t(x) = (2n + 1)−1 ∑ t(xl )Dn (x − xl )
l=0
2n
= Sn (2n + 1)−1 ∑ t(xl )Vn (x − xl ) .
l=0
Consequently,
t p ≤ C(p)n−1/p m(t) p .
24 Approximation of Univariate Functions
We now prove the lower estimate (the second inequality in Theorem 1.3.6) for
1 ≤ p < ∞. We have
2n 2n
m(t) p = ∑ t(xl ) p = ∑ t(xl )εl t(xl ) p−1
p
l=0 l=0
2π 2n p−1
= (2π )−1 t(x) ∑ εl t(xl ) Vn (x − xl )dx
0 l=0
2n p−1
≤ t p ∑ εl t(x ) Vn (x − x ) ,
l l
l=0 ′
p
which implies the required lower estimate and the theorem is proved.
Remark 1.3.8 In the proof of Theorem 1.3.6 we also proved the inequality
m(t)1 ≤ Cnt1 .
Proof In the same way as for Lemma 1.3.7 one can prove:
Lemma 1.3.10 with a = M(t) and relation (1.2.15) implies the upper estimate
t p ≤ Cn−1/p M(t) p .
The corresponding lower estimate for 1 ≤ p < ∞ can be proved in the same way as
above for m(t), substituting xl by x(l).
The lower estimate for p = ∞ is obvious.
1.4 Approximation of Functions in the Classes Wq,r α and Hqr 25
Let us prove that the definition of this operator is reasonable. To establish this it
suffices to prove that Fr ∈ L1 .
Further,
fsr (x, α ) = Dr−α fs2r (x, 0),
Theorem 1.4.2 Let 1 < q < p < ∞, β = 1/q − 1/p, r > β ; then
r−β
Wq,r α1 ⊂ Wp,α2 B, α1 , α2 ∈ R.
For the case B = 1 we simply write Hqr := Hqr 1, i.e., we drop the constant B.
Let us study these classes from the point of view of their approximation by
trigonometric polynomials.
En (Hqr )q ≍ (n + 1)−r , n = 0, 1, . . .
Proof Let us prove the upper estimate. Clearly, it suffices to consider the case
n > 0. Let f ∈ Hqr . We consider (see §1.2.4)
π
t(x) := (2π )−1 f (x) − Δay f (x) Jna (y)dy.
−π
which by the definition of the class Hqr and relation (1.2.19) implies that
f − tq ≤ C(r)n−r .
4n ≤ 2s ≤ 8n.
28 Approximation of Univariate Functions
We consider
f (x) := 2−(r+1−1/q)s As (x) (1.4.11)
and remark that to prove the theorem it suffices to consider the simpler function
f (x) = (n + 1)−r ei(n+1)x . Then, for any t ∈ T (n), we have on the one hand
f − t, As = f , As = 2−(r+1−1/q)s As 22 ≥ C2−(r−1/q)s . (1.4.12)
On the other hand using the definition of As and (1.2.14) we get
f − t, As ≤ f − tq As q′ ≤ C2s/q f − tq . (1.4.13)
From relations (1.4.12) and (1.4.13) we obtain
En ( f )q ≥ C2−rs ≥ Cn−r .
To show that f ∈ Hqr B, we prove the following auxiliary statement.
Lemma 1.4.4 Let g(x) be an a-times continuously differentiable 2π -periodic
function. Then for all 1 ≤ q ≤ ∞ we have
a
Δy g(x) ≤ | y|a g(a) (x) .
q q
as required.
From (1.4.11), (1.2.14), and the Bernstein inequality (Theorem 1.3.2) we get
f (a) q ≤ C(a)2(a−r)s . (1.4.14)
Using Lemma 1.4.4 and the simple inequality
a
Δy f (x) ≤ 2a f q ,
q
we obtain
a
Δy f (x) ≤ C(a) min |y|a na−r , n−r , (1.4.15)
q
which implies that f ∈ Hqr B with some B that is independent of n, and this proves
the lower estimate.
Let us now prove a representation theorem for the class Hqr B. Let
As ( f ) := As ∗ f
and denote the value of As ( f ) at a point x by As ( f , x).
1.4 Approximation of Functions in the Classes Wq,r α and Hqr 29
(The constants C(r, q) may be different for the cases of necessity and sufficiency.)
Proof
Sufficiency. Let
As ( f ) ≤ γ 2−rs , (1.4.16)
q
Let g(x) denote the sum of the series ∑∞s=0 As ( f , x) in the sense of convergence in
L p . From Corollary 2.2.7 below it follows that f and g are equivalent. From (1.4.21)
r−β
and the equality As ( f ) = As (g), by Theorem 1.4.5 we obtain g ∈ Hp B.
The theorem is proved.
With the aid of Theorem 1.4.8 we can prove the following statement.
Theorem 1.4.9 Let 1 ≤ q, p ≤ ∞, r > (1/q − 1/p)+ . Then
En (Wq,r α ) p ≍ En (Hqr ) p ≍ n−r+(1/q−1/p)+ .
Proof By relation (1.4.20) it suffices to prove the upper estimate for the H classes
and the lower estimate for the W classes. We first prove the upper estimate. Let
1 ≤ q ≤ p ≤ ∞. Then Theorems 1.4.8 and 1.4.3 give
En (Hqr ) p ≪ n−r+1/q−1/p . (1.4.22)
For 1 ≤ p < q ≤ ∞ we have, by the monotonicity of the L p -norms and Theorem
1.4.3,
En (Hqr ) p ≤ En (Hqr )q ≪ n−r .
From this and relation (1.4.22) the required upper estimates follow.
Let us prove the lower estimate. Let n and s be the same as in the proof of the
lower estimate in Theorem 1.4.3 and let f be defined by (1.4.11). Then by the
Bernstein inequality,
Drα f q ≤ C(r),
and f ∈ Wq,r α C(r).
Let 1 ≤ q ≤ p ≤ ∞. From relation (1.4.12) and relation (1.4.13) with p instead
of q we get
En ( f ) p ≥ Cn−r+1/q−1/p . (1.4.23)
For 1 ≤ p ≤ q ≤ ∞ it suffices to consider as an example
f (x) = 2(n + 1)−r cos(n + 1)x.
r and, for any t ∈ T (n),
Then f ∈ W∞, α
Theorem 1.4.12 Let 1 ≤ q, p ≤ ∞, (q, p) = (1, 1) or (∞, ∞), and r > (1/q −
1/p)+ . Then
Sn (Wq,r α ) p ≍ Sn (Hqr ) p ≍ n−r+(1/q−1/p)+ .
Proof In the case 1 < p < ∞ the theorem follows from Theorem 1.4.9 and relation
(1.4.27). It remains to consider the cases p = 1, q > 1 and 1 ≤ q < p = ∞. In the
case p = 1, q > 1 we have
Sn (Hqr )1 ≤ Sn (Hqr∗ )q∗ ≪ n−r ,
where q∗ = min(q, 2).
Now let 1 ≤ q < p = ∞. In the case 1 ≤ q < 2, by Theorem 1.4.8 we have
r−(1/q−1/2)
Hqr ⊂ H2 B,
1.4 Approximation of Functions in the Classes Wq,r α and Hqr 33
which indicates that it suffices to consider the case 2 ≤ q < ∞. In this case by
Theorem 1.2.1 and Corollary 1.4.6 we have for s > sn , where sn is such that 2sn −1 ≤
n < 2sn ,
δs ( f ) ≤ C(r, q)2−rs ,
q
δs ( f ) − Sn δs ( f ) ≤ C(r, q)2−rsn .
n n q
Proof The upper estimates follow from Theorem 1.4.9 and the inequality
(1.4.28). Owing to (1.4.20) it suffices to prove the lower estimates for the W
classes. We first remark that
Sn (W1,r α )1 = Sn (W∞,−
r
α )∞ . (1.4.29)
Therefore, to obtain the lower estimate it suffices to consider the case p = 1. Let n
be given. We consider
f (x) := einx Kn−1 (x);
34 Approximation of Univariate Functions