All 2014
All 2014
Bruno Clerckx
August 2014
1 / 494
Course Ojectives
Central question: How to deal with fading and interference in wireless networks?
• Massive MIMO
• Real-World MIMO Wireless Networks
– MIMO and Interference Management in 4G and beyond (LTE, LTE-Advanced,
WiMAX)
– Homogeneous and Heterogeneous Networks
– System-Level Performance Evaluations
4 / 494
Important Information
5 / 494
Important Information
• Reference book
6 / 494
Some fundamentals/revisions (matrix analysis,
probability, information theory)
7 / 494
Reference Book
– Appendix A, B
9 / 494
Matrix properties
A(1, 1)B . . . A(1, n)B
.. ..
• Kronecker product:A ⊗ B = . ... .,
A(m, 1)B . . . A(m, n)B
• (A ⊗ B) ⊗ C = A ⊗ (B ⊗ C)
• (A ⊗ B)H = AH ⊗ BH
• (A ⊗ B) (C ⊗ D) = (AC ⊗ BD)
• (A ⊗ B)−1 = A−1 ⊗ B−1 if A, B square and non singular.
• det (Am×m ⊗ Bn×n ) = det (A)n det (B)m
• Tr {A ⊗ B} = Tr {A} Tr {B}
2
• Tr {AB} ≥ Tr {A} σmin (B) with σmin (B) the smallest singular value of B
• vec (A) converts [m × n] matrix into mn × 1 vector by stacking the columns of A
on top of one another.
– vec (ABC) = CT ⊗ A vec (B)
• det (I + ǫA) = 1 + ǫTr {A} if ǫ << 1
10 / 494
Gaussian random variable
11 / 494
Gaussian random variable
Definition
A discrete channel is defined as a system consisting of an input alphabet X and
output alphabet Y and a probability transition matrix p(y|x) that expresses the
probability of observing the output symbols y given that the symbold x is sent.
Definition
The channel is memoryless if the probability distribution of the output depends
only on the input at that time and is conditionally independent of previous
channel inputs or outputs, i.e. if x1 ,...,xn are inputs, and y1 ,...,yn denote the
corresponding outputs, for n channel uses, then
Example
Binary Symmetric Channel (BSC): x and y take values in 0,1 such that
1 − p, y = x,
p(y|x) =
p, y = 1 − x.
13 / 494
Entropy
Lemma
H(X) ≥ 0
1
Proof: 0 ≤ p(x) ≤ 1 such that log2 p(x)
≥0
Definition
The joint entropy H(X, Y ) of a pair of discrete random variables X and Y
with a joint pmf p(x, y) is defined as
XX
H(X, Y ) = −E {log2 p(X, Y )} = − p(x, y) log2 p(x, y)
x y
15 / 494
Conditional Entropy
• The conditional entropy of a random variable given another is the expected value of
the entropies of the conditional distributions, averaged over the conditioning random
variable
Definition
The conditional entropy H(Y |X) is defined as
X
H(Y |X) = p(x)H(Y |X = x)
x
X X
=− p(x) p(y|x) log2 p(y|x)
x y
XX
=− p(x, y) log2 p(y|x)
x y
16 / 494
Joint Entropy
Theorem
Chain rule
H(X, Y ) = H(X) + H(Y |X)
Proof:
XX XX
H(X, Y ) = − p(x, y) log2 p(x, y) = − p(x, y) log2 p(x)p(y|x)
x y x y
XX XX
=− p(x, y) log2 p(x) − p(x, y) log2 p(y|x)
x y x y
X XX
=− p(x) log2 p(x) − p(x, y) log2 p(y|x)
x x y
Definition
The relative entropy between two pmf p(x) and q(x) is defined as
X p(x) p(X)
D(p||q) = p(x) log2 = Ep log2
x
q(x) q(X)
Theorem
The relative entropy is always nonnegative D(p||q) ≥ 0 and is zero if and only
if p = q.
18 / 494
Mutual Information
Definition
For a pair of discrete random variables X and Y with a joint pmf p(x, y) and
marginal pmf p(x) and p(y), the mutual information I(X; Y ) is the relative
entropy between p(x, y) and p(x)p(y)
p(X, Y )
I(X; Y ) = D(p(x, y)||p(x)p(y)) = Ep(x,y) log2
p(X)p(Y )
XX p(x, y)
= p(x, y) log2
x y
p(x)p(y)
19 / 494
Mutual Information
• The mutual information I(X; Y ) is the reduction in the uncertainty of one random
variable due to the knowledge of the other
X p(x, y)
I(X; Y ) = p(x, y) log2
x,y
p(x)p(y)
X p(x|y)
= p(x, y) log2
x,y
p(x)
X X
=− p(x, y) log2 p(x) + p(x, y) log2 p(x|y)
x,y x,y
!
X X
=− p(x) log2 p(x) − − p(x, y) log2 p(x|y)
x x,y
= H(X) − H(X|Y )
= H(Y ) − H(Y |X) = I(Y ; X)
20 / 494
Mutual Information
Theorem
Nonnegativity of mutual information: For any two random variables X,Y
I(X; Y ) ≥ 0
Theorem
Conditioning reduces entropy: For any two random variables X,Y
H(X|Y ) ≤ H(X)
21 / 494
Channel Coding Theorem
Theorem
(a) For a DMC with channel transition pmf p(y|x), we can use i.i.d. inputs
with pmf p(x) to communicate reliably, as long as the code rate satisfies
R < I(X; Y ).
(b) The achievable rate can be maximized over the input density p(x) to obtain
the channel capacity
C = max I(X; Y ).
p(x)
22 / 494
Differential Entropy
Definition
For a continuous random variable X, the differential entropy h(X) is defined as
Z
1
h(X) = E = −E {log2 p(x)} = − p(x) log2 p(x)dx,
log2 p(x)
Example
2
For X ∼ N (µ, σ 2 ), − log2 p(x) = (x−µ)
2σ 2
log2 (e) + 21 log2 (2πσ 2 ). Thus,
h(X) = −E {log2 p(x)} = 2 log2 (e) + 2 log2 (2πσ 2 ) = 21 log2 (2πeσ 2 ). The
1 1
Theorem
Consider a RV with zero mean and variance σ 2 . Then h(X) ≤ 1
2
log2 (2πeσ 2 ),
with equality iff X ∼ N (0, σ 2 ).
23 / 494
AWGN Channel
Y = X + N, N ∼ N (0, σ 2 )
where X is power-constrained input E X 2 ≤ Es
24 / 494
AWGN Channel Capacity
Theorem
The capacity of the real AWGN channel is
1 Es
C= max I(X; Y ) = log2 (1 + 2 ).
p(x):E {X }≤Es
2 2 σ
Proof: Consider Y = X + N , with N ∼ N (0, σ 2 ) and E X 2 ≤ Es . Given X = x,
h(Y |X = x) = h(N ), so that h(Y |X) = h(N ) and
I(X; Y ) = h(Y ) − h(Y |X) = h(Y ) − h(N ).
Maximizing
I(X;
Y ) comes
to maximize h(Y ). Since X and N are independent,
E Y 2 = E X 2 + E N 2 ≤ Es + σ 2 . We now know that
1
h(Y ) ≤ log2 (2πe(Es + σ 2 ))
2
and equality is achieved iff Y ∼ N (0, Es + σ 2 ). Y ∼ N (0, Es + σ 2 ) is achieved if
the input distribution is X ∼ N (0, Es ), independent of the noise. We then get
1 1 1 Es
I(X; Y ) = h(Y ) − h(Z) = log2 (2πe(Es + σ 2 )) − log2 (2πeσ 2 ) = log2 (1 + 2 ).
2 2 2 σ
25 / 494
Jensen’s inequality
Theorem
If f is a convex function and X is a random variable,
E {f (X)} ≥ f (E {X}).
26 / 494
Fading and Diversity
27 / 494
Reference Book
– Chapter 1
Section: 1.2, 1.3, 1.4, 1.5
Appendix A, B
28 / 494
Space-Time Wireless Channels:Discrete Time
Representation
• channel: the impulse response of the linear time-varying communication system
between one (or more) transmitter(s) and one (or more) receiver(s).
• Assume a SISO transmission where the digital signal is defined in discrete-time by
the complex time series {cl }l∈ and is transmitted at the symbol rate Ts .
• The transmitted signal is then represented by
∞
X √
c(t) = Es cl δ(t − lTs ),
l=−∞
where Es is the transmitted symbol energy, assuming that the average energy
constellation is normalized to unity.
• Define a function hB (t, τ ) as the time-varying (along variable t) impulse response of
the channel (along τ ) over the system bandwidth B = 1/Ts , i.e. hB (t, τ ) is the
response at time t to an impulse at time t − τ .
• The received signal y(t) is given by
y(t) = hB (t, τ ) ⋆ c(t) + n(t)
Z τmax
= hB (t, τ )c(t − τ )dτ + n(t)
0
where ⋆ denotes the convolution product, n(t) is the additive noise of the system and
τmax is the maximal length of the impulse response.
29 / 494
Discrete Time Representation
• hB is a scalar quantity, which can be further decomposed into three main terms,
hB (t, τ ) = fr ⋆ h(t, τ ) ⋆ ft ,
where
– ft is the pulse-shaping filter,
– h(t, τ ) is the electromagnetic propagation channel (including the transmit and receive
antennas) at time t,
– fr is the receive filter.
• Nyquist criterion: the cascade f = fr ⋆ ft does not create inter-symbol interference
when y(t) is sampled at rate Ts .
• In practice,
– difficult to model h(t, τ ) (infinite bandwidth is required).
– hB (t, τ ) is usually the modeled quantity, but is written as h(t, τ ) (abuse of notation).
– Same notational approximation: the channel impulse response writes as h(t, τ ) or ht [τ ].
• The input-output relationship reads thereby as
∞
X √
y(t) = h(t, τ ) ⋆ c(t) + n(t) = Es cl ht [t − lTs ] + n(t).
l=−∞
30 / 494
Discrete Time Representation
• Sampling the received signal at the symbol rate Ts (yk = y(t0 + kTs ), using the
epoch t0 ) yields
∞
X √
yk = Es cl ht0 +kTs [t0 + (k − l)Ts ] + n(t0 + kTs )
l=−∞
∞
X √
= Es cl hk [k − l] + nk
l=−∞
Example
At time k = 0, the channel has two taps: h0 [0], h0 [1]
√
y0 = Es [c0 h0 [0] + c−1 h0 [1]] + n0
• If Ts >> τmax ,
– hB (t, τ ) is modeled by a single dependence on t: write simply as hB (t) (or h(t) using
the same abuse of notation). In the sampled domain, hk = h(t0 + kTs ).
– the channel is then said to be flat fading or narrowband
p
y k = E s hk ck + nk
• Otherwise the channel is said to be frequency selective.
31 / 494
Path-Loss and Shadowing
where |dB indicates the conversion to dB, and L0 is the deterministic path-loss at a
reference distance R0 , and Λ is generally known as the path-loss.
32 / 494
Path-Loss and Shadowing
• Path loss models are identical for both single- and multi-antenna systems.
• For point to point systems, it is common to discard the path loss and shadowing and
only investigate the effect due to fading, i.e. the classical model for narrowband
channels √
y = Es hc + n,
where the time index isremoved
for better legibility and n is usually taken as white
Gaussian distributed, E nk n∗l = σn2 δ(k − l).
• Es can then be seen as an average received symbol energy. The average SNR is then
defined as ρ , Es /σn2 .
33 / 494
Fading
• Multipaths
receiver
diffusion
line-of-sight diffraction
specular reflection
transmitter
• Assuming that the signal reaches the receiver via a large number of paths of similar
energy,
– h is modeled such that its real and imaginary parts are i.i.d. zero mean Gaussian
variablesof variance
σ 2 (circularly symmetric complex Gaussian variable).
– Recall E |h|2 = 2σ 2 = 1.
34 / 494
Fading
35 / 494
Fading
• Illustration of the typical received signal strength of a Rayleigh fading channel over a
certain time interval
10
0
Received signal [dB]
−5
−10
−15
−20
−25
−30
0 1 2 3 4 5
Time [s]
– The signal level randomly fluctuates, with some sharp declines of power and
instantaneous received SNR known as fades.
– When the channel is in a deep fade, a reliable decoding of the transmitted signal may
not be possible anymore, resulting in an error.
– How to recover the signal? Use of diversity techniques
36 / 494
Maximum likelihood detection
• Decision rule: choose the hypothesis that maximizes the conditional density
37 / 494
Diversity in Multiple Antennas Wireless Systems
39 / 494
Diversity in Multiple Antennas Wireless Systems
array gain = SNR shift Careful that the curves have been
plotted against the single-branch
Error probability
SNR ρ [dB]
• Coding Gain: a shift of the error curve (error rate vs. SNR) to the left, similarly to
the array gain.
– If the error rate vs. the average receive SNR ρ̄out , any variation of the array gain is
invisible but any variation of the coding gain is visible: for a given SNR level ρ̄out at
the input of the detector, the error rates will differ.
40 / 494
SIMO Systems
• Receive diversity may be implemented via two rather different combining methods:
– selection combining : the combiner selects the branch with the highest SNR among the
nr receive signals, which is then used for detection,
– gain combining : the signal used for detection is a linear combination of all branches,
z = gy, where g = [g1 , . . . , gnr ] is the combining vector.
1 Equal Gain Combining
2 Maximal Ratio Combining
3 Minimum Mean Square Error Combining
• We assume that the receiver is able to acquire the perfect knowledge of the channel.
41 / 494
Receive Diversity via Selection Combining
• Assume that the nr channels are independant and identically Rayleigh distributed
(i.i.d.) with unit energy and that the noise levels are equal on each antenna.
• Choose the branch with the largest amplitude smax = max{s1 , . . . , snr }.
• The probability that s falls below a certain level S is given by its CDF
2
/2σ 2
P [s < S] = 1 − e−S .
• The probability that smax falls below a certain level S is given by
h i
2 nr
P [smax < S] = P [s1 , . . . , snr ≤ S] = 1 − e−S .
• The average SNR at the output of the combiner ρ̄out is eventually given by
Z ∞ nr
X 1 nr ր 1
ρ̄out = ρs2 psmax (s) ds = ρ ≈ ρ γ + log(nr ) + .
0 n=1
n 2nr
• For BPSK and a two-branch diversity, the SER as a function of the average SNR per
channel ρ writes as
Z ∞ p
P̄ = Q 2ρs psmax (s) ds
0
r r
1 ρ 1 ρ
= − +
2 1+ρ 2 2+ρ
ρր
∼ 3
= .
8ρ2
The slope of the bit error rate curve is equal to 2.
• In general, the diversity gain gdo of a nr -branch selection diversity scheme is equal to
nr . Selection diversity extracts all the possible diversity out of the channel.
43 / 494
Receive Diversity via Gain Combining
• In gain combining, the signal z used for detection is a linear combination of all
branches,
nr
X √
z = gy = gn yn = Es ghc + gn
n=1
where
– gn ’s are the combining weights and g , [g1 , . . . , gnr ]
– the data symbol c is sent through the channel and received by nr antennas
– h , [h1 , . . . , hnr ]T
• Assume Rayleigh distributed channels hn = |hn | ejφn , n = 1, . . . , nr , with unit
energy, all the channels being independent.
• Equal Gain Combining : fixes the weights as gn = e−jφn .
– Mean value of the output SNR ρ̄out (averaged over the Rayleigh fading):
h i2
P nr √
E n=1 E s |h n |
π
ρ̄out = 2
= . . . = ρ 1 + (n r − 1) ,
n r σn 4
where the expectation is taken over the channel statistics. The array gain grows
linearly with nr , and is therefore larger than the array gain of selection combining.
– The diversity gain of equal gain combining is equal to nr analogous to selection.
44 / 494
Receive Diversity via Gain Combining
where u = khk2 is χ2 distribution with 2nr degrees of freedom when the different
channels are i.i.d. Rayleigh
1
pu (u) = unr −1 e−u .
(nr − 1)!
At high SNR, P̄ becomes
2nr − 1
P̄ = (4ρ)−nr .
nr
The diversity gain is again equal to nr .
45 / 494
Receive Diversity via Gain Combining
where N̄e and dmin are respectively the number of nearest neighbors and minimum
distance of separation of the underlying constellation.
Since u is a χ2 variable with 2nr degrees of freedom, the above average upper-bound
is given by
nr
1
P̄ ≤ N̄e
1 + ρd2min /4
ρր ρd2min −nr
≤ N̄e .
4
The diversity gain gdo is equal to the number of receive branches in i.i.d. Rayleigh
channels.
46 / 494
Receive Diversity via Gain Combining
Example
Question: Assume a transmission of a signal c from a single antenna
transmitter to a multi-antenna receiver through a SIMO channel h. The
transmission is subject to the interference from another transmitter sending
signal x through the interfering SIMO channel hi .
The received signal model writes as
y = hc + hi x + n
where n is the zero mean complex additive white Gaussian noise (AWGN)
vector with E{nnH } = σn2 Inr .
We apply a combiner g at the receiver to obtain the observation z = gy.
Derive the expression of the MMSE combiner and the SINR at the output of
the combiner.
48 / 494
Receive Diversity via Gain Combining
Example
Answer: The MMSE combiner g is given by
g = hH R−1
ni
where Rni = E ni nH i with ni = hi x + n.
Hence Rni = hi Px hi + σn2 Inr with Px = E |x|2 , the power of the
H
interfering signal.
Hence, −1
g = hH hi Px hH 2
i + σn I nr .
z = gy = hH R−1 H −1
ni hc + h Rni ni .
49 / 494
Receive Diversity via Gain Combining
Example
Answer: The output SINR writes
H −1 2
h R n h P c
i
ρout = n H o
E hH R−1 H −1
ni i h Rni ni
n
H −1 2
h Rn h P c
= H −1 i H −1
E h Rni ni ni Rni h
H −1 2
h Rn h P c
i
=
hH R−1ni h
= hH R−1ni hPc
−1
H
= Pc h hi Px hH 2
i + σn I n r h
−1
= SNR hH INR hi hH i + I nr h
with Pc = E |c|2 , SNR = Pc /σn2 (the average SNR), INR = Px /σn2 (the
average INR - Interference to Noise Ratio).
50 / 494
MISO Systems
• MISO systems exploit diversity at the transmitter through the use of nt transmit
antennas in combination with pre-processing or precoding.
• A significant difference with receive diversity is that the transmitter might not have
the knowledge of the MISO channel.
– At the receiver, the channel is easily estimated.
– At the transmit side, feedback from the receiver is required to inform the transmitter.
• There are basically two different ways of achieving direct transmit diversity :
– when Tx has a perfect channel knowledge, beamforming can be performed to achieve
both diversity and array gains,
– when Tx has a partial or no channel knowledge of the channel, space-time coding is
used to achieve a diversity gain (but no array gain in the absence of any channel
knowledge).
51 / 494
Transmit Diversity via Matched Beamforming
• The actual transmitted signal is a vector c′ that results from the multiplication of
the signal c by a weight vector w.
• At the receiver, the signal reads as
√ √
y = Es hc′ + n = Es hwc + n,
where h , [h1 , . . . , hnt ] represents the MISO channel vector, and w is also known as
the precoder.
• The choice that maximizes the receive SNR is given by
hH
w= .
khk
• Transmit along the direction of the matched channel, hence it is also known as
matched beamforming or transmit MRC.
• The array gain is equal to the number of transmit antennas, i.e. ρ̄out = nt ρ.
• The diversity gain equal to nt as the symbol error rate is upper-bounded at high
SNR by 2 −nt
ρdmin
P̄ ≤ N̄e .
4
• Matched beamforming presents the same performance as receive MRC, but requires
a perfect transmit channel knowledge.
52 / 494
Transmit Diversity via Space-Time Coding
– The mean output SNR (averaged over the channel statistics) is thus equal to
( 2 )
Es khk2
ρ̄out = 2 E = ρ.
σn 2 khk2
No array gain owing to the lack of transmit channel knowledge.
– The average symbol error rate at high SNR can be upper-bounded according to
2 −2
ρdmin
P̄ ≤ N̄e .
8
The diversity gain is equal to nt = 2 despite the lack of transmit channel knowledge.
0
10
no spatial diversity
transmit MRC
Transmit MRC vs. Alamouti with 2
Alamouti scheme
transmit antennas in i.i.d. Rayleigh
10
−1 fading channels (BPSK).
Observations:
−2 – At high SNR, any increase in the
SER
10
SNR by 10dB leads to a decrease of
SER by 10−n for diversity order n.
10
−3
Alamouti, transmit MRC: 2
No spatial diversity: 1
−4
– Transmit MRC has 3 dB gain over
10
0 2 4 6 8 10 12 Alamouti
SNR [dB]
54 / 494
Indirect Transmit Diversity
• It is also possible to convert spatial diversity to time or frequency diversity, which are
then exploited using well-known SISO techniques.
• Assume that nt = 2 and that the signal on the second transmit branch is
– either delayed by one symbol period: the spatial diversity is converted into frequency
diversity (delay diversity)
– either phase-rotated: the spatial diversity is converted into time diversity
– The effective SISO channel resulting from the addition of the two branches seen by the
receiver now fades over frequency or time. This selective fading can be exploited by
conventional diversity techniques, e.g. FEC/interleaving.
55 / 494
MIMO Systems - Transmission
56 / 494
Reference Book
– Chapter 1
Section: 1.2.4, 1.3.2, 1.6
57 / 494
Introduction - Previous Lectures
58 / 494
MIMO Systems
• In MIMO systems, the fading channel between each transmit-receive antenna pair
can be modeled as a SISO channel.
• For uni-polarized antennas and small inter-element spacings (of the order of the
wavelength), path loss and shadowing of all SISO channels are identical.
• Stacking all inputs and outputs in vectors ck = [c1,k , . . . , cnt ,k ]T and
yk = [y1,k , . . . , ynr ,k ]T , the input-output relationship at any given time instant k
reads as √
yk = Es Hk c′k + nk ,
where
– c′k is a precoded version of ck that depends on the channel knowledge at the Tx.
– Hk is defined as the nr × nt MIMO channel matrix, Hk (n, m) = hnm,k , with hnm
denoting the narrowband channel between transmit antenna m (m = 1, . . . , nt ) and
receive antenna n (n = 1, . . . , nr ),
– nk = [n1,k , . . . , nnr ,k ]T is the sampled noise vector, containing the noise contribution
at each receive antenna, such that the noise is white in both time and spatial
dimensions, E nk nH l = σn2I
nr δ (k − l).
• Using the same channels normalization as for SISO channels, E kHk2F = nt nr .
• when Tx has a perfect channel knowledge: (dominant and multiple) eigenmode
transmission
• when Tx has no knowledge of the channel: space-time coding (with c′k = ck )
59 / 494
Space-Time Coding
60 / 494
Space-Time Coding
2
• Apply the matched filter HH H
ef f to y (Hef f Hef f = kHkF I2 )
z1 √ √
z= = Es HH ef f y = Es kHk2F I2 c + n′
z2
where n′ is such that E{n′ } = 02×1 and E{n′ n′H } = kHk2F σn2 I2 .
61 / 494
Dominant Eigenmode Transmission
• Decompose
H
H = U H ΣH V H ,
ΣH = diag{σ1 , σ2 , . . . , σr(H) }.
w = vmax
g = uH
max
where vmax and umax are respectively the right and left singular vectors
corresponding to the maximum singular value of H, σmax = max{σ1 , σ2 , . . . , σr(H) }.
Note the generalization of√matched beamforming (MISO) and MRC (SIMO)!
• Equivalent channel: z = Es σmax c + ñ where ñ = gn has a variance equal to σn2 .
62 / 494
Dominant Eigenmode Transmission
2
• Array gain: E{σmax } = E{λmax } where λmax is the largest eigenvalue of HHH .
Commonly, max{nt , nr } ≤ ga ≤ nt nr .
Example
Array gain changes depending on the channel properties and distribution
√ of Sight: H = 1nr ×nt . Only one singular value is non-zero and equal to
– Line
nt nr : ga = nt nr .
√ √ 2
– In the i.i.d. Rayleigh case: for large nt , nr , ga = nt + nr .
• Diversity gain: the dominant eigenmode transmission extracts a full diversity gain of
nt nr in i.i.d. Rayleigh channels.
63 / 494
Dominant Eigenmode Transmission
Example
Question: Show that the optimum (in the sense of SNR maximization) transmit
precoder and combiner in dominant eigenmode transmission is given by the
dominant right and left singular vector of the channel matrix, respectively.
Answer: Let us write
√ √
y = Es Hc′ + n = Es Hwc + n,
√
z = gy = Es gHwc + gn.
Example
Answer: The inequality is replaced by an equality if w = vmax . By choosing
w = vmax ,
g = wH HH = vmax
H
V H ΣH U H
H
= σmax uH
max
65 / 494
Multiple Eigenmode Transmission
where {s1 , . . . , snt } is the power allocation on each of the channel eigenmodes.
• The capacity scales linearly in nt . By contrast, this transmission does not necessarily
achieve the full diversity gain of nt nr but does at least provide nr -fold array and
diversity gains (still assuming nt ≤ nr ).
• In general, the rate scales linearly with the rank of H.
66 / 494
Multiple Eigenmode Transmission
Example
Question: Is the rate achievable in a MIMO channel with multiple eigenmode
transmission and uniform power allocation across modes always larger than
that achievable with dominant eigenmode transmission?
Answer: No! The achievable rate with multiple eigenmode transmission in the
MIMO channel is the sum of the SISO channel achievable rates
r(H)
X
R= log2 (1 + ρsk σk2 ),
k=1
• Array/diversity/coding gains are exploitable in SIMO, MISO and MIMO but MIMO
can offer much more than MISO and SIMO.
• The multiplexing gain is the pre-log factor of the rate at high SNR, i.e.
R ≈ gs log2 ρ
• Modeling only the individual SISO channels from one Tx antenna to one Rx antenna
not enough:
– MIMO performance depends on the channel matrix properties
– characterize all statistical correlations between all matrix elements necessary!
68 / 494
Interference Management
• With multiple antennas, it is possible to exploit the difference between the spatial
signatures of the desired vs. the interfering channels to reduce the intra-cell and
inter-cell interference:
– In a single-cell multi-user context, Multi-user MIMO (MU-MIMO)
– In a multi-cell context, Multi-Cell MIMO (MC-MIMO)
69 / 494
Channel Modelling
70 / 494
Reference Book
– Chapter 2
Section: 2.1.1, 2.1.2, 2.1.3, 2.1.5, 2.2,
2.3.1
– Chapter 3
Section: 3.2.1, 3.2.2, 3.4.1
71 / 494
Double-Directional Channel Modeling
h (t, pt , pr , τ, Ωt , Ωr ) = hk (t, pt , pr , τ, Ωt , Ωr ) ,
k=0
• In the case of a plane wave, and considering a fixed transmitter and a mobile receiver,
hk (t, pt , pr , τ, Ωt , Ωr ) , αk ejφk e−j∆ωk t δ(τ − τk ) δ(Ωt − Ωt,k ) δ(Ωr − Ωr,k ),
where
– αk is the amplitude of the kth contribution,
– φk is the phase of the kth contribution,
– ∆ωk is the Doppler shift of the kth contribution,
– τk is the time delay of the kth contribution,
– Ωt,k is the DoD of the kth contribution,
– Ωr,k is the DoA of the kth contribution.
• A more compact notation (all temporal variations are grouped into t)
nX
s −1
h (t, τ, Ωt , Ωr ) = hk (t, τ, Ωt , Ωr )
k=0
• Wide-Sense Stationary:
– Time correlations only depend on the time difference
– Signals arriving with different Doppler frequencies are uncorrelated
• Uncorrelated Scattering:
– Frequency correlations only depend on the frequency difference
– Signals arriving with different delays are uncorrelated
• Homogeneous:
– Spatial correlation only depends on the spatial difference at both transmit and receive
sides
– Signals departing/arriving with different directions are uncorrelated
74 / 494
Spectra
75 / 494
Angular Spread
L=2
L=1
L=0
76 / 494
The MIMO Channel Matrix
where
ZZ
hnm (t, τ ) , hnm t, τ, Ωt , Ωr dΩt dΩr
where kt (Ωt ) and kr (Ωr ) are the transmit and receive wave propagation 3 × 1
vectors.
77 / 494
Steering Vectors
• Under the plane wave and balanced narrowband array assumptions, the MIMO
channel matrix can be rewritten as a function of steering vectors as
Z Z
(1)
H(t, τ ) = h t, pt , p(1) T
r , τ, Ωt , Ωr ar (Ωr ) at (Ωt ) dΩt dΩr .
78 / 494
A Finite Scatterer MIMO Channel Representation
• The transmitter and receiver are coupled via a finite number of scattering paths with
ns,t DoDs at the transmitter and ns,r DoAs at the receiver.
−→ Replace the integral by a summation (assume for simplicity 2-D azimuthal
propagation)
ns,t ns,r
XX (l,p) (l)
H(t, τ ) = h11 (t, τ )ar (θr(p) )aTt (θt )
l=1 p=1
= Ar Hs (t, τ )ATt
where
– Ar and At represent the nr × ns,r and nt × ns,t matrices whose columns are the
steering vectors related to the directions of each path observed at Rx and Tx
– Hs (t, τ ) is a ns,r × ns,t matrix whose elements are the complex path gains between all
DoDs and DoAs at time instant t and delay τ
(l)
• Assume the columns of At are written as at (θt ), l = 1, ..., ns,t . Let us write
ns,t ns,t
X (l)
X
H = Ar Hs ATt = H̃s (:, l)aTt (θt ) = H(l) ,
| {z }
l=1 l=1
H̃s
where H(l) can be viewed as the channel matrix corresponding to the lth scatterer
(l)
located in the direction of departure θt .
79 / 494
Statistical Properties of the MIMO Channel Matrix
• Assume narrowband channels, the spatial correlation matrix of the MIMO channel
R = E{vec(HH )vec(HH )H }
This is a nt nr × nt nr positive semi-definite Hermitian matrix.
• It describes the correlation between all pairs of transmit-receive channels:
– E {H(n, m)H∗ (n, m)}: the average energy of the channel between antenna m and
antenna n,
(nq)
– rm = E {H(n, m)H∗ (q, m)}: the receive correlation between channels originating
from transmit antenna m and impinging upon receive antennas n and q,
(mp)
– tn = E {H(n, m)H∗ (n, p)}: the transmit correlation between channels originating
from transmit antennas m and p and arriving at receive antenna n,
– E {H(n, m)H∗ (q, p)}: the cross-channel correlation between channels (m, n) and
(q, p).
Example
2x2 MIMO
1 t∗1 r1∗ s∗1
t1 1 s∗2 r2∗ t1 = E {H(1, 1)H∗ (1, 2)}
R=
r1 s2 1 t∗2
s1 r2 t2 1 r1 = E {H(1, 1)H∗ (2, 1)}
80 / 494
Spatial Correlation
where
– ϕr,t (θr,t ) = 2π(dr,t /λ) cos θr,t ,
– dr and dt are the Rinter-element spacing
at the receive/transmit arrays
– h11 t, Ωt , Ωr , h11 t, τ, Ωt , Ωr dτ .
• Correlation between channels hnm and hqp
Z 2π Z 2π
E hnm h∗qp = E h11 t, Ωt , Ωr 2 e−j(m−p)ϕt (θt ) e−j(n−q)ϕr (θt ) dθt dθr
0 0
Z 2π Z 2π n 2 o −j(m−p)ϕt (θt ) −j(n−q)ϕr (θt )
= E h11 t, Ωt , Ωr e e dθt dθr ,
0 0
Z 2π Z 2π
= A (θt , θr ) e−j(m−p)ϕt (θt ) e−j(n−q)ϕr (θt ) dθt dθr ,
0 0
where A (θt , θr ) is the joint direction power spectrum restricted to the azimuth
angles.
• The channel correlation is related to both the antenna spacings and the joint
direction power spectrum! 81 / 494
Spatial Correlation
• When the energy spreading is very large at both sides and dt /dr are sufficiently
large, elements of H become uncorrelated, and R becomes diagonal.
Example
Consider two transmit antennas spaced by dt . The transmit correlation writes
as Z 2π
t= ej2π(dt /λ) cos θt At (θt )dθt ,
0
which only depends on the transmit antenna spacing and the transmit direction
power spectrum.
– isotropic scattering : very rich scattering environment around the transmitter with
a uniform distribution of the energy, i.e. At (θt ) ∼
= 1/2π
Z 2π Z 2π
1 1
t= ejϕt (θt ) dθt = ej2π(dt /λ) cos θt dθt
2π 0 2π 0
!
dt
= J0 2π .
λ
The transmit correlation only depends on the spacing between the two antennas.
82 / 494
Spatial Correlation
Example
– highly directional scattering : scatterers around the transmit array are
concentrated along a narrow direction θt,0 , i.e., At (θt ) → δ(θt − θt,0 )
0.5
– directional scattering (κ = ∞):
κ=2 correlation never reaches 0
– in practice, decorrelation in rich
0 scattering is reached for
dt ≈ 0.5λ
κ=0 – The more directional the
azimuthal dispersion (i.e. for κ
−0.5
0 0.5 1 1.5 2 2.5 3 3.5 4 increasing), the larger the
Transmit antenna spacing relative to wavelength
antenna spacing required to
obtain a null correlation. 83 / 494
Analytical Representation of Rayleigh MIMO Channels
where Rt and Rr are respectively the transmit and receive correlation matrices.
• Strictly valid only if r1 = r2 = r and t1 = t2 = t and s1 = rt and s2 = rt∗ (for 2 × 2)
1 t∗1 r1∗ s∗1 1 t∗ r ∗ r ∗ t∗
t1 1 s∗2 r2∗ t 1 r∗ t r∗ 1 r∗ 1 t∗
R= r1 s2 1 t∗2 r rt∗
= = ⊗
1 t∗ r 1 t 1
s 1 r 2 t2 1 rt r t 1 | {z } | {z }
Rr Rt
85 / 494
Correlated Rayleigh Fading Channels
Example
Question: Assume a MISO system with two transmit antennas. The channel
gains are identically distributed circularly symmetric complex Gaussian but can
be correlated and are denoted as h1 and h2 . Write the expression of the
transmit correlation matrix Rt and derive the eigenvalues and eigenvectors of
Rt as a fonction of the transmit correlation coefficient t.
Answer: We write
∗
h1 E |h1 |2 E{h∗1 h2 } 1 t∗
Rt = E h1 h2 = =
h∗2 E {h1 h∗2 } E |h2 |2 t 1
where t = E {h1 h∗2 } is the transmit correlation coefficient. The SVD leads to
H
1 1 1 + |t| 0 1 1
Rt = .
t/|t| −t/|t| 0 1 − |t| t/|t| −t/|t|
The eigenvalues are only function of the magnitude of t while the eigenvectors
are only function of the phase of t.
86 / 494
Correlated Rayleigh Fading Channels
Example
Question: Assume the previous example with |t| → 1. Compute the weights of
the matched beamformer (or maximum ratio transmission/transmit MRC).
Answer: With matched beamforming, w = hH / khk where
1/2
h = h w Rt
p H
1 1 1 + |t| p 0 1 1
= hw
t/|t| −t/|t| 0 1 − |t| t/|t| −t/|t|
H
1 1
= 2hw
t/|t| t/|t|
where the last equality comes from the fact that |t| = 1. This shows
that for
high correlation, the channel direction (h/ khk) is aligned with 1 t∗ /|t| .
Hence
H 1
w = h / khk = .
t/|t|
Transmission is performed in the direction where all scatterers are located.
87 / 494
Analytical Representation of Ricean MIMO Channels
• In the presence of a strong coherent component which does not experience any
fading over time
– e.g. a line-of-sight field, one or several specular contributions, coherent addition of
reflected and diffracted contributions (in fixed wireless access only).
• All these situations lead to a Ricean distribution of the received field amplitude.
– The relative strength of the dominant coherent component is characterized by the
K-factor K. As the channel contains a coherent component, its amplitude can be
written as
|h(t)| = h̄ + h̃(t) ,
where h̄ is the coherent component, and h̃(t) is the non coherent part, whose energy is
denoted as 2σs2 .
– The K-factor is defined as 2
h̄
K= .
2σs2
– |h(t)| , s′ is Ricean distributed, and its distribution is given, as a function of K, as
" !# !
′ 2s′ K s′2 2s′ K
ps′ (s ) = 2 exp − K 2 + 1 I0 .
h̄ h̄ h̄
– For K = 0, the Ricean distribution boils down to the Rayleigh distribution while for
K = ∞, the channel becomes deterministic (no fading).
88 / 494
Ricean MIMO Channels
R = E{vec(H̃H )vec(H̃H )H }
– With only one coherent contribution with given DoD and DoA (Ωt,c and Ωr,c ),
H̄ = ar (Ωr,c ) aT
t (Ωt,c )
– For broadside arrays with a pure line-of-sight component, H̄ = 1nr ×nt .
89 / 494
Capacity of point-to-point MIMO Channels
90 / 494
Reference Book
– Chapter 5
Section: 5.1, 5.2, 5.3, 5.4.1, 5.4.2
(except “Antenna Selection Schemes”),
5.5.1 - “Kronecker Correlated Rayleigh
Channels”, 5.5.2, 5.7, 5.8.1 (except
Proof of Proposition 5.9 and Example
5.4)
91 / 494
Introduction - Previous Lectures
• Transmission strategies
– Space-Time Coding when no Tx channel knowledge
– Multiple (including dominant) eigenmode transmission when Tx channel knowledge
p
z = Es GHc′ + Gn
p
= E s UHH HVH c + U n
H
p
= Es ΣH c + ñ.
Multiple parallel data pipes → Spatial multiplexing gain!
92 / 494
System Model
• The input covariance matrix is defined as the covariance matrix of the transmit
H
signal c′ (we drop the time index) and writes as Q = E c′ c′ .
• Short-term power constraint: Tr{Q} ≤ 1.
• Long-term power constraint (over a duration Tp >> T ): E {Tr{Q}} ≤ 1 where the
expectation refers here to the averaging over successive codeword of length T .
• Channel time variation: Tcoh coherence time
– slow fading : Tcoh is so long that coding is performed over a single channel realization.
– fast fading : Tcoh is so short that coding over multiple channel realizations is possible.
93 / 494
Capacity of Deterministic MIMO Channels
Proposition
Definition
The capacity of a deterministic nr × nt MIMO channel with perfect channel
state information at the transmitter is
C (H) = max log2 det Inr + ρHQHH .
Q≥0:Tr{Q}=1
94 / 494
Capacity of Deterministic MIMO Channels
Proof: Denoting the entropy by H(.), the mutual information between input and output
is given by
I(H, Q) = I(c′ ; y |H ),
= H(y |H ) − H(y c′ , H )
= H(y |H ) − H(n c′ , H ).
When the input vector has a covariance Q = E c′ c′H , we have that the covariance of y
is given by
E yyH = σn2 Inr + Es HQHH ,
since the noise is an additive white Gaussian noise (AWGN). Following similar steps as in
SISO, H(y |H ) is largest when y is zero-mean circularly symmetric complex Gaussian,
which is achieved when c′ is zero-mean circularly symmetric complex Gaussian. Because
the differential entropy H(c′ ) of a zero-mean circularly symmetric complex Gaussian
input vector c′ with covariance matrix Q is given by log2 det πeQ , we get
!
2 H
I(H, Q) = log2 det πe σn Inr + Es HQH − log2 det πeσn2 Inr ,
H
= log2 det Inr + ρHQH .
95 / 494
Capacity and Water-Filling Algorithm
• What is the best transmission strategy, i.e. the optimum input covariance matrix Q?
• First, create n = min{nt , nr } parallel data pipes (Multiple Eigenmode Transmission)
– Decouple the channel along the individual channel modes (in the directions of the
singular vectors of the channel matrix H at both the transmitter and the receiver)
H
H = U H ΣH V H ,
UH H H
H HVH = UH UH ΣH VH VH = ΣH
– Optimum input covariance matrix Q⋆ writes as
Q⋆ = VH diag {s⋆1 , . . . , s⋆n } VH
H
,
• Second, allocate power to data pipes
– ΣH = diag {σ1 , . . . , σn }, and σk2 , λk
Pn Pn ⋆
– Capacity: C(H) = max{sk }n k=1 log2 1 + ρsk λk = k=1 log2 1 + ρsk λk
k=1
Proposition
⋆ . , s⋆n } that maximizes
Pn power allocation strategy {s1 , . . . , sn } = {s1 , . .P
The
n
k=1 log2 (1 + ρλk sk ) under the power constraint k=1 sk = 1, is given by the
water-filling solution,
+
1
s⋆k = µ − , k = 1, . . . , n
ρλk
P
where µ is chosen so as to satisfy the power constraint n ⋆
k=1 sk = 1.
96 / 494
Water-Filling Algorithm
97 / 494
Water-Filling Algorithm
Example
Question: Consider the transmission y = Hc′ + n with perfect CSIT over a
deterministic point to point MIMO channel whose matrix is given by
a 0 a 0
H=
0 b 0 b
where a and b are complex scalars with |a| ≥ |b|. The input covariance matrix
is given by Q = E c′ c′H and is subject to the transmit power constraint
Tr {Q} ≤ P .
1 Compute the capacity with perfect CSIT of that deterministic channel.
Particularize to the case a = b. Explain your reasoning.
2 Explain how to achieve that capacity.
3 In which deployment scenario, could such channel matrix structure be
encountered?
98 / 494
Water-Filling Algorithm
Example
Answer:
H
1 Let us write Q = VPV with the diagonal element of P, denoted as Pk
P nt
(satisfying k=1 Pk = P ), refers to the power allocated to stream k. The
capacity with perfect CSIT over the deterministic channel H is given by
min{2,4}
X Pk
C (H) = max log2 1 + 2 λk
P1 ,...,Pk σn
k=1
Example
Answer: 2
2
σn σn
Assuming P1⋆ and P2⋆ are positive, µ = P
2
+ 4
1
|a|2
+ 1
|b|2
. If µ − 2|b|2
≤ 0,
2 2
σn σn
i.e. P
2
+ 4|a|2
− 4|b|2
≤ 0, P2⋆ = 0 and P1⋆ = P . The capacity writes as
P
C (H) = log2 1 + 2 2 |a|2 .
σn
σ2 σ2 σ2 σ2 σ2 σn2
If P2 + 4|a|n2 − 4|b|n2 > 0, P1⋆ = P2 − 4|a|n2 + 4|b|n2 and P2⋆ = P2 + 4|a|n2 − 4|b|2
.
The capacity writes as
P⋆ P⋆
C (H) = log2 1 + 12 2 |a|2 + log2 1 + 22 2 |b|2 .
σn σn
100 / 494
Water-Filling Algorithm
Example
Answer:
H
2 Transmit along V, given by the two dominant eigenvector of H H. They
are easily computed given the orthogonality of the channel matrix H as
1 0
1 0 1
V= √ .
2 1 0
0 1
The power allocated to the two streams is given by P1⋆ and P2⋆ . At the
receiver, the precoded channel is already decoupled and no further
combiner is necessary. Each stream can be decoded using the
corresponding SISO decoder.
3 Dual-polarized antenna deployment (e.g. VHVH-VH) with LoS and good
antenna XPD.
101 / 494
Capacity Bounds and Suboptimal Power Allocations
• At any SNR
– lower bound
C (H) ≥ log2 (1 + ρλmax ) ,
n
X ρ
C (H) ≥ log2 1 + λk .
k=1
n
• Fast fading:
– Doppler frequency sufficiently high to allow for coding over many channel
realizations/coherence time periods
– The transmission capability is represented by a single quantity known as the ergodic
capacity
• MIMO Capacity with Perfect Transmit Channel Knowledge
– similar strategy as in deterministic channels: transmit along eigenvectors of channel
matrix and allocate power following water-filling
– short term power constraint: water-filling solution applied over space as in
deterministic channels
( )
C̄CSIT,ST = E max log2 det Inr + ρHQHH
Q≥0:Tr{Q}=1
n
X
= E log2 1 + ρs⋆k λk .
k=1
– long term power constraint: water-filling solution applied over both time and space
X n
C̄CSIT,LT = E log2 1 + ρs⋆k λk .
k=1
– Impact on coding strategy? Use a variable-rate code (family of codes of different rates)
adapted as a function of the water-filling allocation. No need for the codeword to span
many coherence time periods.
103 / 494
MIMO Capacity with Partial Transmit Channel Knowledge
Such a rate varies over time according to the channel fluctuations. The average rate
of information flow over a time duration T >> Tcoh is
T −1 h i
1 X
log2 det Inr + ρHk QHH k .
T
k=0
Definition
The ergodic capacity of a nr × nt MIMO channel with channel distribution
information at the transmitter (CDIT) is given by
h i
C̄CDIT , C̄ = max E log2 det Inr + ρHQHH ,
Q≥0:Tr{Q}=1
Observations: C̄CSIT also scales linearly with n. The spatial multiplexing gain is
gs = n. MISO fading channels do not offer any multiplexing gain.
105 / 494
I.I.D. Rayleigh Fast Fading Channels: Partial Transmit
Channel Knowledge
• Optimal covariance matrix
Proposition
In i.i.d. Rayleigh fading channels, the ergodic capacity with CDIT is achieved
under an equal power allocation scheme Q = Int /nt , i.e.,
( ) ( n )
ρ H
X ρ
C̄CDIT = Īe = E log2 det Inr + Hw Hw =E log2 1 + λk .
nt nt
k=1
Encoding requires a fixed-rate code (whose rate is given by the ergodic capacity)
with encoding spanning many channel realizations.
• Low SNR:
( )
ρ ρ o
2
C̄CDIT ≥ E log2 1 + kHw kF ≈ E kHw k2F log2 (e) = nr ρ log2 (e)
nt nt
Observations:
– C̄CDIT is only determined by the energy of the channel.
– A MIMO channel only yields a nr gain over a SISO channel. Increasing the number of
transmit antennas is not useful (contrary to perfect CSIT). SIMO and MIMO channels
reach the same capacity for a given nr .
106 / 494
I.I.D. Rayleigh Fast Fading Channels: Partial Transmit
Channel Knowledge
• High SNR:
( n ) ( n
)
X ρ ρ X
C̄CDIT ≈ E log2 λk = nlog2 +E log2 (λk )
nt nt
k=1 k=1
Observations:
– C̄CDIT at high SNR scales linearly with n (by contrast to the low SNR regime).
– The multiplexing gain gs is equal to n, similarly to the CSIT case.
– C̄CDIT and C̄CSIT are not equal: constant gap equal to n log2 (nt /n) at high SNR.
• Expressions can be particularized to SISO, SIMO, MISO cases. At high SNR,
– SISO (N = n = 1):
n o
C̄CDIT ≈ log2 (ρ) + E log2 |h|2 = log2 (ρ) − 0.83 = CAW GN − 0.83
– SIMO (nt = n = 1, nr = N ):
C̄CDIT ≈ log2 (nr ρ)
– MISO (nr = n = 1, nt = N ):
n o nt →∞
C̄CDIT ≈ log2 (ρ) + E log2 khk2 /nt ≈ log2 (ρ) = CAW GN
107 / 494
I.I.D. Rayleigh Fast Fading Channels
• Ergodic capacity of various nr × nt i.i.d. Rayleigh channels with full (CSIT) and
partial (CDIT) channel knowledge at the transmitter.
20
2 x 2 (CSIT)
18 4 x 2 (CSIT)
2 x 4 (CSIT)
16 4 x 4 (CSIT)
2 x 2 (CDIT)
Ergodic capacity [bps/Hz]
14 4 x 2 (CDIT)
2 x 4 (CDIT)
12 4 x 4 (CDIT)
10
0
−10 −5 0 5 10 15 20
SNR [dB]
108 / 494
Correlated Rayleigh Fast Fading Channels: Uniform Power
Allocation
• Assume the channel covariance matrix is unknown to the transmitter
• Mutual information with identity input covariance matrix
( )
ρ H
Īe = E log2 det Inr + HH .
nt
• Low SNR ( )
ρ 2
Īe ≥ E log2 1 + kHkF .
nt
1/2 1/2
• High SNR in Kronecker Correlated Rayleigh Channels H = Rr Hw Rt (with full
rank correlation matrices) and nt = nr
( )
ρ H
Īe ≈ E log2 det Hw Hw + log2 det(Rr ) + log2 det(Rt ).
nt
Observations:
– det(Rr ) ≤ 1 and det(Rt ) ≤ 1: receive and transmit correlations always degrade the
mutual information (with power uniform allocation) with respect to the i.i.d. case.
– Īe still scales linearly with min{nt , nr }
109 / 494
Correlated Rayleigh Fast Fading Channels: Partial
Transmit Channel Knowledge
• Assume the channel covariance matrix is known to the transmitter.
Proposition
Q = U R t ΛQ U H
Rt ,
110 / 494
Correlated Rayleigh Fast Fading Channels: Partial
Transmit Channel Knowledge
• Mutual information of various strategies at 0 dB SNR as a function of the transmit
correlation |t| in TIMO. Beamforming refers here to the tranmsission of one stream
along the dominant eigenvector of Rt .
3.6
3.5
Mutual information/capacity [bps/Hz]
3.4
3.3
3.2
3.1
2.9
2.8
Optimal allocation
2.7 Equi−power allocation
Beamforming
2.6
0 0.2 0.4 0.6 0.8 1
Magnitude of transmit correlation
111 / 494
Outage Capacity and Probability in Slow Fading Channels
• In slow fading, the encoding still averages out the randomness of the noise but
cannot fully average out the randomness of the channel.
• For a given channel realization H and a target rate R, reliable transmission if
log2 det Inr + ρHQHH > R
If not met with any Q, an outage occurs and the decoding error probability is strictly
non-zero.
• Look at the tail probability of log2 det Inr + ρHQHH , not its average!
Definition
The outage probability Pout (R) of a nr × nt MIMO channel with a target rate
R is given by
Pout (R) = min P log2 det Inr + ρHQHH < R ,
Q≥0:Tr{Q}≤1
• More meaningful in the absence of CSI knowledge at the transmitter: the transmitter
cannot adjust its transmit strategy → hopes the channel is good enough
112 / 494
Diversity-Multiplexing Trade-Off in Slow Fading Channels
• Compound channel coding theorem: there exist “universal” codes with rate R
bits/s/Hz that achieve reliable transmission over any slow fading channel realization
which is not in outage.
– CSIT is actually not necessary in slow fading channels if the aim is transmit reliably
when the channel is not in outage.
• For a given R, how does Pout behave as a function of the SNR ρ?
Definition
A diversity gain gd⋆ (gs , ∞) is achieved at multiplexing gain gs at infinite SNR if
R ρ
lim = gs
ρ−→∞ log
2 ρ
log2 Pout (R)
lim = −gd⋆ (gs , ∞)
ρ−→∞ log2 ρ
– The multiplexing gain indicates how fast the transmission rate increases with the SNR.
– The diversity gain represents how fast the outage probability decays with the SNR.
113 / 494
Diversity-Multiplexing Trade-Off in I.I.D. Rayleigh Slow
Fading Channels
114 / 494
Diversity-Multiplexing Trade-Off in I.I.D. Rayleigh Slow
Fading Channels
115 / 494
Diversity-Multiplexing Trade-Off of a Scalar Rayleigh
Channel h
• Determine for a transmission rate R scaling with ρ as gs log2 (ρ), the rate at which
the outage probability decreases with ρ as ρ increases.
• Outage probability
Pout (R) = P log2 1 + ρ |h|2 < gs log2 (ρ)
= P 1 + ρ |h|2 < ρgs
• At high SNR,
Pout (R) ≈ P |h|2 ≤ ρ−(1−gs )
• Since |h|2 is exponentially distributed, i.e., P |h|2 ≤ ǫ ≈ ǫ for small ǫ
An outage occurs at high SNR when |h|2 ≤ ρ−(1−gs ) with a probability ρ−(1−gs ) .
• DMT for the scalar Rayleigh fading channel gd⋆ (gs , ∞) = 1 − gs for gs ∈ [0, 1].
116 / 494
Space-Time Coding over I.I.D. Rayleigh Flat
Fading Channels
117 / 494
Reference Book
– Chapter 6
Section: 6.1, 6.2, 6.3 (except “Antenna
Selection” in 6.3.2), 6.4.1, 6.4.2 (except
the Proofs), 6.5.1, 6.5.2, 6.5.3, 6.5.4,
6.5.8, Figure 7.1
118 / 494
Introduction - Previous Lectures
• Previous lecture
– Capacity of deterministic MIMO channels
C (H) = max log2 det Inr + ρHQHH .
Q≥0:Tr{Q}=1
119 / 494
Overview of a Space-Time Encoder
space-time encoder
T
temporal coding
B bits Q symbols
time interleaving space-time nt codeword C
coding
symbol mapping
nt
• First black box: combat the randomness created by the noise at the receiver.
• Second black box: spatial interleaver which spreads symbols over several antennas in
order to mitigate the spatial selective fading.
• The ratio B/T is the signaling rate of the transmission.
• The ratio Q/T is defined as the spatial multiplexing rate (representative of how
many symbols are packed within a codeword per unit of time).
120 / 494
System Model
• MIMO system with nt transmit and nr receive antennas over a frequency flat-fading
channel
• Transmit a codeword C = [c0 . . . cT −1 ] [nt × T ] contained in the codebook C
• At the kth time instant, the transmitted and received signals are related by
√
yk = Es Hk ck + nk
where
– yk is the nr × 1 received signal vector,
– Hk is the nr × nt channel matrix,
– nk is a nr × 1 zero mean complex AWGN vector with E{nk nH 2
l } = σn Inr δ (k − l),
– 2
The parameter Es is the energy normalization factor. SNR ρ = Es /σn .
• No transmit channel knowledge but we
know
it
is i.i.d. Rayleigh fading.
• Codeword average transmit power E Tr CCH = T . Assume
E kHk2F = nt nr .
• Channel time variation:
– slow fading : Tcoh >> T and {Hk = Hw }T −1
k=0 , with Hw denoting an i.i.d. random
fading matrix with unit variance circularly symmetric complex Gaussian entries.
T −1
– fast fading : T ≥ Tcoh and Hk = Hk,w , where Hk,w k=0 are uncorrelated matrices,
each Hk,w being an i.i.d. random fading matrix with unit variance circularly
symmetric complex Gaussian entries.
121 / 494
Error Probability Motivated Design Methodology
• With instantaneous channel realizations perfectly known at the receive side, the ML
decoder computes an estimate of the transmitted codeword according to
T −1
X
√
2
Ĉ = arg min
yk − Es Hk ck
C
k=0
with lC,E the effective length of the pair of codewords {C, E}, i.e., lC,E = ♯τC,E
with τC,E = {k | ck − ek 6= 0}. Q
• Diversity gain: nr lC,E , coding gain: k∈τC,E kck − ek k−2nr .
124 / 494
The Distance-Product Criterion
• At high SNR, the error probability is naturally dominated by the worst-case PEP
Design Criterion
( Distance-product criterion) Over i.i.d. Rayleigh fast fading channels,
1 distance criterion: maximize the minimum effective length Lmin of the code
over all pairs of codewords {C, E} with C 6= E
Lmin = min lC,E
C,E
C6=E
2 product criterion: maximize the minimum product distance dp of the code over
all pairs of codewords {C, E} with C 6= E
Y
dp = min kck − ek k2
C,E
C6=E k∈τC,E
lC,E =Lmin
• The presence of multiple antennas at the transmitter does not impact the achievable
diversity gain gdo (∞) = nr Lmin but improves the coding gain gc = dp .
• The diversity gain is maximized first, and the coding gain is maximized only in a
second step.
125 / 494
Slow Fading MIMO Channels
Design Criterion
127 / 494
The Rank-Determinant Criterion
Definition
A full-rank (a.k.a. full-diversity) code is characterized by rmin = nt . A
rank-deficient code is characterized by rmin < nt .
Example
Rank-deficient and full-rank codes for nt = 2
• Rank-deficient code
1 c1 1 e1
C= √ ,E = √
2 c2 2 e2
• Full-rank code
1 c1 −c∗2 1 e1 −e∗2
C= √ ,E = √
2 c2 c∗1 2 e2 e∗1
H 1 |c1 − e1 |2 + |c2 − e2 |2 0
(C − E) (C − E) =
2 0 |c1 − e1 |2 + |c2 − e2 |2
128 / 494
The Rank-Determinant Criterion
Example
Question: Relying on the rank-determinant criterion, show that delay diversity
achieves full diversity. Assume for simplicity two transmit antennas.
Answer: The codeword for delay diversity can be written as
1 c1 c2 . . . cT −1 0
C= √ .
2 0 c1 c2 ... cT −1
The diversity gain is given by the minimum rank of the error matrix over all
possible pairs of (different) codewords, i.e.
rmin = min r Ẽ = min r C − E .
C,E C,E
C6=E C6=E
129 / 494
The Rank-Determinant Criterion
Example
With delay diversity, we have
1 c1 − e1 c2 − e2 ... cT −1 − eT −1 0
C−E= √ .
2 0 c1 − e1 c2 − e2 ... cT −1 − eT −1
Obviously, r C − E ≤ 2. Actually, r C − E = 2 as long as C 6= E. Indeed
even in the case where all ck − ek = 0 except for one index k (in order to keep
C 6= E), e.g. k = 1,
1 c1 − e1 0 ... 0 0
C−E= √ ,
2 0 c1 − e1 0 ... 0
130 / 494
The Rank-Determinant Criterion
Example
Question: Assume that c1 , c2 , c3 and c4 are constellation symbols taken from a
unit average energy QAM constellation. Consider the Linear Space-Time Block
Code, characterized by codewords
1 c1 + c3 c2 + c4
C= .
2 c2 − c4 c1 − c3
What is the diversity gain achieved by this code over slow Rayleigh fading
channels?
Answer: Check the rank of its error matrix
1 d1 + d3 d2 + d4
C−E=
2 d2 − d4 d1 − d3
where dk = ck − ek for k = 1, ..., 4. This code is rank deficient. It is easily seen
that by taking two codewords C and E such that d3 = d4 = 0 and
d1 = d2 = d (which is encountered for any constellations), r (C − E) = 1.
Hence diversity gain of nr .
131 / 494
Information Theory Motivated Design Methodology: Fast
Fading MIMO Channels - Achieving The Ergodic Capacity
• Recall Lecture 5&6: ergodic capacity
n o
C̄ = max E log2 det Inr + ρHQHH .
Q:Tr{Q}=1
• Codewords are uncoded in the sense that no error correcting code is contained in the
STBC.
Example
Alamouti code: nt = 2, Q = 2, T = 2, rs = 1
1 c1 −c∗2
C= √ ∗ .
2 c2 c1
134 / 494
A General Framework for Linear STBCs
where
– Φq are complex basis matrices of size nt × T ,
– cq stands for the complex information symbol (taken for example from PSK or QAM
constellations),
– Q is the number of complex symbols cq transmitted over a codeword,
– ℜ and ℑ stand for the real and imaginary parts.
Definition
Tall (T ≤ nt ) unitary basis matrices are such that ΦH 1
q Φq = Q I T
∀q = 1, . . . , 2Q. Wide (T ≥ nt ) unitary basis matrices are such that
Φq ΦH T
q = Qnt Int ∀q = 1, . . . , 2Q.
Definition
Q
The spatial multiplexing rate of a space-time block code is defined as rs = T
.
A full rate space-time block code is characterized by rs = nt .
135 / 494
A General Framework for Linear STBCs
√
• Apply the vec operator to yk = Es Hk ck + nk and make use of STBC structure
√
Y = Es HX S + N
where
– Y[2nr T × 1] is the channel output vector
ℜ y0 ... yT −1
Y = vec ,
ℑ y0 ... yT −1
– H[2nr T × 2nt T ] is the block diagonal channel
ℜ [H] −ℑ [H]
H = IT ⊗ H ′ , where H′ = ,
ℑ [H] ℜ [H]
– X [2nt T × 2Q] is the linear code matrix
ℜ [Φ1 ] ℜ Φ2Q
X = vec ··· vec ,
ℑ [Φ1 ] ℑ Φ2Q
– S[2Q × 1] is a block of uncoded input symbols
T
S = ℜ [c1 ] · · · ℜ cQ ℑ [c1 ] ··· ℑ cQ ,
– N [2nr T × 1] is the noise vector
ℜ n0 ... nT −1
N = vec ,
ℑ n0 ... nT −1
136 / 494
A General Framework for Linear STBCs
Proposition
Φq ΦH H
p + Φp Φq = 0nt , q 6= p for wide {Φq }2Q
q=1 ,
ΦH H
q Φp + Φp Φq = 0T , q 6= p for tall {Φq }2Q
q=1 .
137 / 494
A General Framework for Linear STBCs
Proposition
138 / 494
A General Framework for Linear STBCs
• Decoding
Proposition
Φq ΦH H
p + Φp Φq = 0nt , ∀q 6= p.
Example
This code is called Spatial Multiplexing. Optimal for worst-case PEP min.
capacity-efficient, large decoding complexity.
140 / 494
A General Framework for Linear STBCs
Example
A code such that T = 2, nt = 2, Q = 2, rs = 1 with the following wide basis
matrices
1 1 0 1 0 −1
Φ1 = √ , Φ2 = √ ,
2 0 1 2 1 0
1 j 0 1 0 j
Φ3 = √ , Φ4 = √ ,
2 0 −j 2 j 0
or equivalently, with the following matrix X
1 0 0 0
0 1 0 0
0 0 1 0
1 0 0 0 1
X = √ .
2 0 −1 0 0
1 0 0 0
0 0 0 1
0 0 −1 0
This code is called Alamouti code. Optimal for worst-case PEP min. not
capacity-efficient, low decoding complexity. 141 / 494
Spatial Multiplexing/V-BLAST/D-BLAST
• Spatial Multiplexing (SM), also called V-BLAST, is a full rate code (rs = nt ) that
consists in transmitting independent data streams on each transmit antenna.
• In uncoded transmissions, we assume one symbol duration (T = 1) and codeword C
is a symbol vector of size nt × 1.
• From previous results on error probability and capacity,
Proposition
Spatial Multiplexing with basis matrices characterized by square X such that
1
XTX = I2nt
nt
is capacity-efficient (Proposition 6) and optimal from an error rate
minimization perspective (Proposition 5).
Example
nt
1 T 1 X
C= √ c1 ... c nt = √ In (:, q) ℜ [cq ] + jInt (:, q) ℑ [cq ] .
nt nt q=1 t
• Error probability
Z π/2 h i−nr
1
P (C → E) = det Int + η Ẽ dβ
π 0
Z nt
!−nr
π/2
1 η X
= 1+ |cq − eq |2 dβ
π 0 nt q=1
−nr nt
!−nr
ρ X 2
≤ |cq − eq |
4nt q=1
The SNR exponent is equal to nr . Due to the lack of coding across transmit
antennas, no transmit diversity is achieved and only receive diversity is exploited.
• Over fast fading channels, we know that it is not necessary to code across antennas
to achieve the ergodic capacity.
Proposition
Spatial Multiplexing with ML decoding and equal power allocation achieves the
ergodic capacity of i.i.d. Rayleigh fast fading channels.
143 / 494
ML decoding
Proposition
144 / 494
Zero-Forcing (ZF) Linear Receiver
• The complexity of ZF decoding similar to SISO ML decoding, but the inversion step
is responsible for the noise enhancement (especially at low SNR).
√ T
• Assuming that a symbol vector C = 1/ nt c1 . . . cnt is transmitted, the
output of the ZF filter GZF is given by
T
z = GZF y = c1 . . . cnt + GZF n
145 / 494
Zero-Forcing (ZF) Linear Receiver
146 / 494
Zero-Forcing (ZF) Linear Receiver
• In fast fading channels, the average maximum achievable rate C̄ZF is equal to the
sum of the maximum rates achievable by all layers
min{nt ,nr }
X
C̄ZF = E {log2 (1 + ρq )}
q=1
(ρր) ρ
≈ min {nt , nr } log2 + min {nt , nr } E log2 χ22(nr −nt +1) .
nt
Note the difference with
ρ X n n o
C̄CDIT ≈ nlog2 + E log2 (χ22(N −n+k) ) .
nt
k=1
147 / 494
Zero-Forcing (ZF) Linear Receiver
148 / 494
Zero-Forcing (ZF) Linear Receiver
• ZF receiver maximizes the SNR under the constraint that the interferences from all
other layers are nulled out.
– For a given layer q, the ZF combiner gq is such that this layer is detected through a
projection of the output vector y onto the direction closest to H (:, q) within the
subspace orthogonal to the one spanned by the set of vectors H (:, p), p 6= q.
• Assume the following system model with nr ≥ nt
y = Hc + n,
X
= hq c q + hp c p + n
p6=q
th
where hq is the q column of H.
• Let us build the following nr × (nt − 1) matrix by collecting all hp with p 6= q:
H−q = . . . hp . . . p6=q ,
= U′ Ũ ΛVH
where Ũ is the matrix containing the left singular vectors corresponding to the null
singular values. Similarly we define
T
c−q = . . . cp . . . p6=q .
149 / 494
Zero-Forcing (ZF) Linear Receiver
• By multiplying by ŨH , we project the output vector onto the subspace orthogonal to
the one spanned by the columns of H′
• To maximize the SNR, noting the noise is still white, we match to the effective
channel ŨH hq such that
H H
z = ŨH hq ŨH hq cq + ŨH hq ŨH n
H
and the ZF combiner is gq = ŨH hq ŨH = hH H
q ŨŨ .
150 / 494
Minimum Mean Squared Error (MMSE) Linear Receiver
• Filter maximizing the SINR. Minimize the total resulting noise: find G such that
T
2
E
Gy − c1 . . . cnt
is minimum.
• The combined noise plus interference signal ni,q when estimating symbol cq writes as
r
X Es
ni,q = hp cp + n.
nt
p6=q
• An alternative and popular representation of the MMSE filter can also be written as
r −1 r −1
nt nt nt H nt
GM M SE = H H H + I nt HH = H HHH + Inr
Es ρ Es ρ
• Bridge between matched filtering at low SNR and ZF at high SNR.
151 / 494
Minimum Mean Squared Error (MMSE) Linear Receiver
• At high SNR, the MMSE filter is practically equivalent to ZF and the diversity
achievable is thus limited to nr − nt + 1.
152 / 494
Successive Interference Canceler
• Successively decode one symbol (or more generally one layer/stream) and cancel the
effect of this symbol from the received signal.
• Decoding order based on the SINR of each symbol/layer: the symbol/layer with the
highest SINR is decoded first at each iteration.
• SM with (ordered) SIC is generally known as V-BLAST, and ZF and MMSE
V-BLAST refer to SM with respectively ZF-SIC and MMSE-SIC receivers.
• The diversity order experienced by the decoded layer is increased by one at each
iteration. Therefore, the symbol/layer detected at iteration i will achieve a diversity
of nr − nt + i.
• Major issue: error propagation
– The error performance is mostly dominated by the weakest stream.
– Non-ordered SIC: diversity order approximately nr − nt + 1.
– Ordered SIC: performance improved by reducing the error propagation caused by the
first decoded stream. The diversity order remains lower than nr .
153 / 494
Successive Interference Canceler
(∗)
2
1 Initialization: i ←− 1, y(1) = y,G(1) = GZF (H),q1 = arg minj
G(1) (j, :)
where GZF (H) is defined as the ZF filter of the matrix H.
2 Recursion:
1 step 1: extract the qith transmitted symbol from the received signal y(i)
154 / 494
Successive Interference Canceler
The loss that was observed with ZF filtering is now compensated because the
successive interference cancellation improves the SNR of each decoded layer.
Proposition
Spatial Multiplexing with ZF-SIC (ZF V-BLAST) and equal power allocation
achieves the ergodic capacity of i.i.d. Rayleigh fast fading MIMO channels at
asymptotically high SNR.
155 / 494
Successive Interference Canceler
Proposition
156 / 494
Successive Interference Canceler
157 / 494
Impact of Decoding Strategy on Error Probability
• SM with ML, ordered and non ordered ZF SIC and simple ZF decoding in 2 × 2 i.i.d.
Rayleigh fading channels for 4 bits/s/Hz.
0
10
ML
ordered ZF SIC
ZF SIC
ZF
−1
10
BER
−2
10
−3
10
−4
10
0 5 10 15 20 25
SNR [dB]
• SM with ML, ZF and MMSE in i.i.d. Rayleigh slow fading channels with
nt = nr = 4 and QPSK.
0
10
ML
MMSE
ZF
−1
10
BER
−2
10
−3
10
−4
10
0 5 10 15 20 25
SNR [dB]
159 / 494
D-BLAST
Example
Consider two layers a and b and nt = 2. Assume that layer a is made of two
streams a(1) and a(2) and layer b of two streams as well b(1) and b(2) . Each
stream can be seen as a block of symbols. The transmitted codeword C is now
written as (1)
a b(1)
C= .
a(2) b(2)
160 / 494
Orthogonal Space-Time Block Codes
• O-STBC vs. SM
– Remarkable properties which make them extremely easy to decode: MIMO ML
decoding decouples into several SIMO ML decoding
– Achieve a full-diversity of nt nr .
– Much smaller spatial multiplexing rate than SM.
• Linear STBC characterized by the two following properties
1 the basis matrices are wide unitary
T
Φq ΦH
q = In ∀q = 1 . . . 2Q
Qnt t
2 the basis matrices are pairwise skew-hermitian
Φq ΦH H
p + Φp Φq = 0, q 6= p
• Complex O-STBCs with rs = 1 only exist for nt = 2. For larger nt , codes exist with
rs ≤ 1/2. For some particular values of nt > 2, complex O-STBCs with
1/2 < rs < 1 have been developed. This is the case for nt = 3 and nt = 4 with
rs = 3/4.
161 / 494
Orthogonal Space-Time Block Codes
Example
Alamouti code: complex O-STBC for nt = 2 with a spatial multiplexing rate
rs = 1
1 c1 −c∗2
C= √ ∗ .
2 c2 c1
Example
For nt = 3 , a complex O-STBC expanding on four symbol durations (T = 4)
and transmitting three symbols on each block (Q = 3)
c −c∗2 c∗3 0
2 1 ∗ ∗
C= c2 c1 0 c3 .
3
c3 0 −c∗1 −c∗2
Proposition
O-STBCs enjoy the decoupling property.
Example
Assume a MISO transmission based on the Alamouti code
r
Es c1 −c∗2
y1 y2 = h1 h2 ∗ + n1 n2
2 c2 c1
or equivalently
r
y1 Es h1 h2 c1 n1
= + .
y2∗ 2 h∗2 −h∗1 c2 n∗2
| {z }
Hef f
163 / 494
Orthogonal Space-Time Block Codes
Example
Applying the space-time matched filter HH ef f to the received vector decouples
the transmitted symbols
r
z1 H y1 Es c1 n1
= Hef f ∗ = |h1 |2 + |h2 |2 I2 + HHef f ∗ .
z2 y2 2 c2 n2
164 / 494
Orthogonal Space-Time Block Codes
• Error Probability
Z π/2 h i−nr
1
P (C → E) = det Int + η Ẽ dβ
π 0
Z Q
!−nr nt
π/2
1 T X
= 1+η |cq − eq |2 dβ
π 0 Qnt q=1
−nr nt Q
!−nr nt
(ρր) ρ T X 2
≤ |cq − eq | .
4 Qnt q=1
165 / 494
Orthogonal Space-Time Block Codes
Proposition
For a given channel realization H, the mutual information achieved by any
O-STBC is always upper-bounded by the channel mutual information with
equal power allocation Ie . Equality occurs if and only if both the rank of the
channel and the spatial multiplexing rate of the code are equal to one.
Corollary
The Alamouti scheme is optimal with respect to the mutual information when
used with only one receive antenna.
166 / 494
Orthogonal Space-Time Block Codes
Proposition
The diversity-multiplexing trade-off at high SNR achieved by O-STBCs using
QAM constellations in i.i.d. Rayleigh fading channels is given by
gs
gd (gs , ∞) = nr nt 1 − , gs ∈ [0, rs ] .
rs
Proposition
The Alamouti code with any QAM constellation achieves the optimal
diversity-multiplexing trade-off for two transmit and one receive antennas in
i.i.d. Rayleigh fading channels.
167 / 494
Orthogonal Space-Time Block Codes
• Block error rate for 4 different rates R = 4, 8, 12, 16 bits/s/Hz in 2 × 2 i.i.d. slow
Rayleigh fading channels.
0
10
−1
10
(2x2) Block Error Rate
−2
10
−3
10
−4
10
M=4, R=4 <−−−12dB−−−>
M=16, R=8
−5 M=64, R=12
10 M=256, R=16
0 10 20 30 40 50 60
SNR [dB]
169 / 494
Global Performance Comparison
4.5
3
ZF D−BLAST
2.5
2 SM with ML
1.5
SM with ZF,
unordered ZF V−BLAST
1
0.5
0
0 0.5 1 1.5 2 2.5
Spatial multiplexing gain gs
170 / 494
Global Performance Comparison
• Bit error rate (BER) of several space-time block codes in i.i.d. slow Rayleigh fading
channels with nt = 2 and nr = 2 in a 4-bit/s/Hz transmission. ML decoding is used.
−1
10
SM
Alamouti
Dayal
TAST
B
2,φ
Tilted−QAM
−2
10
BER
−3
10
−4
10
10 12 14 16 18 20
SNR [dB]
171 / 494
Space-Time Coding in Real-World MIMO
Channels
172 / 494
Reference Book
– Chapter 8
Section: 8.1, 8.2, 8.3.1 (“Rayleigh Slow
Fading Channels”, “How Realistic is the
High SNR Regime Approximation?”),
8.3.4
173 / 494
Introduction - Previous Lectures
• Some environments may highly deviate from the i.i.d. Rayleigh fading scenario.
• Objectives:
– how codes developed under the i.i.d. Rayleigh assumption behave in more realistic
propagation conditions, i.e., how these codes are affected by non ideal propagation
conditions,
– how a more adapted design criterion might significantly improve their performance.
174 / 494
System Model
175 / 494
Radiation Patterns
PL−1 (l) PL−1 (l) (l)
• Decompose the channel Hk = l=0 Hk = l=0 Hk (:, 1) aTt θt,k , where
(l) (l)
at θt,k is the transmit array response in the direction of departure θt,k .
• PEP argument writes as
T −1 T
−1
L−1
2
X X
2
X (l) T (l)
kHk (ck − ek )kF =
Hk (:, 1)at θt,k (ck − ek )
k=0 k=0 l=0
aTt ck
• It may be thought of as an array factor function of the transmitted codewords. At
every symbol period,
– the energy radiated in any direction varies as a function of the transmitted codewords.
– for a given codeword and omnidirectional antennas, the radiated energy is not uniformly
distributed in all directions, but may present maxima and minima in certain directions.
176 / 494
Radiation Patterns
177 / 494
Radiation Patterns
Definition
In the presence of small angle spread at the transmit side, the MIMO channel
degenerates into a SIMO channel where the 1 × T transmitted codeword is given by
aTt (θt ) C
T −1 T −1
"n #
X X 2 X r
Since a space-time code designed for i.i.d. channels is only concerned with C and E,
its interaction with at (θt ) is not taken into account.
178 / 494
Radiation Patterns
Example
The Spatial Multiplexing example for nt = 2: ck = [ c1 [k] c2 [k] ]T
c2 [k] −2πj dλt cos θt
cTk at (θt ) = c1 [k] 1 + e
c1 [k]
| {z }
Gt (θt |ck )
150 1 30 150 1 30
180 0 180 0
150 1 30 150 1 30
180 0 180 0
d/λ=0.5
d/λ=0.1
210 330 210 330 −2
10
−2 −1 0 1 2 3 4
240 300 240 300
270 270 Phase shift between the two transmitted symbols [rad]
179 / 494
Derivation of the Average PEP
• Conditional PEP
v
u T −1
uρ X
P C → E| {Hk }Tk=0
−1
= Q t kHk (ck − ek )k2F
2
k=0
Z
1 π/2 1
= MΓ − dβ
π 0 2 sin2 (β)
1
≤ MΓ −
2
P −1
with MΓ (γ) moment generating function (MGF) of Γ = ρ2 Tk=0 kHk (ck − ek )k2F
Z ∞
∆
MΓ (γ) = exp (γΓ) pΓ (Γ) dΓ
0
Theorem
The moment generating function of a Hermitian quadratic form in complex
Gaussian random variable y = zFzH , where z is a circularly symmetric complex
Gaussian vector with mean z̄ and a covariance matrix Rz and F a Hermitian
matrix, is given by
exp sz̄F (I − sRz F)−1 z̄H
Z ∞
∆
My (s) = exp (sy) py (y) dy =
0 det (I − sRz F) 181 / 494
Derivation of the Average PEP
183 / 494
Average PEP in Rayleigh Slow Fading Channels
Example
Question: Assume a spatially correlated slow Rayleigh fading MIMO channel.
Derive the Average PEP for ML receiver.
Answer: The conditional PEP writes as
r
ρ
P (C → E | H) = Q kH (C − E)k2F .
2
The average PEP over Rayleigh slow fading channels is
Z
1 π/2 1
P (C → E) = EH {P (C → E|H)} = MΓ − dβ
π 0 2 sin2 (β)
where Ẽ = (C − E) (C − E)H .
184 / 494
Average PEP in Rayleigh Slow Fading Channels
Example
This is a hermitian quadratic form of complex gaussian random variables of the
H
form zFzH (with z = vec HH and F = Inr ⊗ Ẽ) and we can use Theorem
where the mean
z̄ = 0 is the zero vector
H and the covariance matrix is
Rz = R = E vec HH vec HH .
We then write (with η = ρ/(4 sin2 (β)))
Z
1 π/2
P (C → E) = (det (Inr nt + ηCR ))−1 dβ.
π 0
where CR = RF. With the Kronecker model, R = Rr ⊗ Rt , and
Z
1 π/2 −1
P (C → E) = det Inr nt + η (Rr ⊗ Rt ) Inr ⊗ Ẽ dβ
π 0
Z
1 π/2 −1
= det Inr nt + ηRr ⊗ Rt Ẽ dβ.
π 0
185 / 494
Average PEP in Rayleigh Slow Fading Channels
Example
Question: In a 2 × 2 spatially correlated Rayleigh fading MIMO channel, derive
the Average PEP for Spatial Multiplexing with ML receiver and discuss the
effect of transmit and receive correlation on the performance.
Answer: For SM, C − E is a nt × 1 vector and
P (C → E)
Z
1 π/2 −1
= det Inr nt + η (Rr ⊗ Rt ) Inr ⊗ Ẽ dβ
π 0
Z
1 π/2 −1
= det Inr nt + η (Rr ⊗ Rt ) (Inr ⊗ (C − E)) (Inr ⊗ (C − E)H ) dβ
π 0
Z
1 π/2 −1
= det Inr + η(Inr ⊗ (C − E)H ) (Rr ⊗ Rt ) (Inr ⊗ (C − E)) dβ
π 0
Z π/2 −1
1
= det Inr + η Rr ⊗ (C − E)H Rt (C − E) dβ.
π 0
187 / 494
Average PEP in Rayleigh Slow Fading Channels
Example
For SM over a 2 × 2 MIMO channel,
c0 − e0 1 t∗ 1 r∗
C−E= , Rt = , Rr = ,
c1 − e1 t 1 r 1
Example
Observations:
– performance of SM in correlated channels depends on the projection of C − E
onto the space spanned by the eigenvectors of Rt :
– worse performance when C − E is parallel to the eigenvector of Rt
corresponding to the lowest eigenvalue.
Intuition: transmitting all the information contained in the unique non-zero
eigenvalue of the error matrix Ẽ in the direction of the space offering the lowest
scatterer density.
– receive correlation induces a coding gain loss independent of the error matrix.
189 / 494
Average PEP in Rayleigh Slow Fading Channels
• Analysis can be extended to finite SNR and space-time correlated Rician channels
(see more in Chapter 8 if interested).
• Main observations:
– Rank deficient codes are very sensitive to spatial correlation.
– The maximization of the coding gain in i.i.d. Rayleigh channels is not a sufficient
condition to guarantee the good performance of a code in correlated channels at finite
SNR, even for full-rank codes.
– Robust code design exist (see Chapter 9 if interested)
190 / 494
Average PEP in Rayleigh Slow Fading Channels
−1 −1
10 10
BER
BER
−2 −2
10 10
−3 −3
10 10
SM and new SM − i.i.d.
Hassibi et al. − i.i.d.
SM − correlated
Hassibi et al. − correlated
−4 −4
new SM − correlated
10 10
4 6 8 10 12 14 16 18 20 22 4 6 8 10 12 14 16 18 20 22
SNR [dB] SNR [dB]
191 / 494
MIMO with Partial Channel State Information at
the Transmitter
192 / 494
Reference Book
– Chapter 10
Section: 10.1, 10.2.1, 10.5, 10.6.1,
10.6.2, 10.6.3, 10.6.4, 10.9
193 / 494
Introduction
• full CSIT
– array and diversity gain
– lower system complexity (parallel virtual transmissions)
– hardly achievable (especially when the channel varies rapidly), costly in terms of
feedback
• Exploiting Channel Statistics at the Transmitter
– low rate feedback link
– statistical properties of the channel (correlations, K-factor) vary at a much slower rate
than the fading channel itself
– The receiver estimates the channel stochastic properties and sends them back to the
transmitter “once in a while” (if channel reciprocity cannot be exploited)
– stationary channel: statistics do not change over time
• Exploiting a Limited Amount of Feedback at the Transmitter
– codebook of precoding matrices, i.e., a finite set of precoders, designed off-line and
known to both the transmitter and receiver.
– The receiver estimates the best precoder as a function of the current channel and feeds
back only the index of this best precoder in the codebook.
194 / 494
System Model
1/2
C S C’’ W C’
constellation shaper beamformer
Q = U R t ΛQ U H
Rt ,
Transmit a single stream along the dominant eigenvector of Rt if very large transmit
correlation. Transmit multiple streams with uniform power allocation if very low
transmit correltion.
• Error Probability motivated strategy
P⋆ = arg min max P (C → E)
P Ẽ6=0
2
where ζ = ηT δ /(Qne ) and δ = dmin .
Proposition
In Kronecker Rayleigh fading channels, the optimal precoder minimizing the
average PEP/SER is given by P = WS1/2 where
– W = U′Rt with U′Rt the nt × ne submatrix of URt containing the ne
dominant eigenvector of Rt , i.e., Rt = URt ΛRt UH
Rt ,
197 / 494
Channel Statistics based Precoding
Example
Let us consider the Alamouti O-STBC with two transmit antennas
(ne = nt = 2). Denoting S = diag {s1 , s2 }, the transmitted codewords are
proportional, at the first time instant, to
1 c1 1 √ 1 √
√ URt S1/2 = √ URt (:, 1) s1 c1 + √ URt (:, 2) s2 c2
2 c2 2 2
and, at the second time instant, to
1 −c∗2 1 √ 1 √
√ URt S1/2 = − √ URt (:, 1) s1 c∗2 + √ URt (:, 2) s2 c∗1 .
2 c∗1 2 2
Extreme cases:
• s1 = s2 = 1: Alamouti scheme
• s1 = 2, s2 = 0: beamforming in the dominant eigenbeam
The precoder allocates more power to angular directions corresponding to the
peaks of the transmit direction power spectrum.
198 / 494
Channel Statistics based Precoding
BER
−2 −2
10 10
−3 −3
10 10
0 2 4 6 8 10 12 0 2 4 6 8 10 12 14
SNR [dB] SNR [dB]
199 / 494
Quantized Precoding: dominant eigenmode transmission
with Cw set of unit-norm vectors. The best precoder is the dominant right singular
vector of H.
• Reduce the number of feedback bits: limit the space Cw over which w can be chosen
to a codebook called W. The receiver evaluates the best precoder w⋆ among all
unit-norm precoders wi ∈ W (with i = 1, . . . , np ) such that
200 / 494
Quantized Precoding: distortion function
• How to design the codebook? Need for a distortion function, i.e. measure of the
average (over all channel realizations) array gain loss induced by the quantization
process n o
2
df = EH λmax − kHw⋆ k
• Upper-bound
2
H
w⋆ ,
df ≤ EH λmax − λmax vmax
2
(a) H
w⋆
= EH {λmax } EH 1 − vmax
| {z }
quality of the channel | {z }
quality of the codebook
where vmax is the dominant right singular vector of H. Equality (a) is only valid for
i.i.d. Rayleigh fading channels.
201 / 494
Quantized Precoding: Lloyd
Algorithm
For the given codebook, find the optimal quantization cells using the nearest
neighbor rule. For the so-obtained quantization cells, determine that optimal
quantized precoders using the centroid condition. Iterate till convergence.
• Essential conditions:
– Assume MISO channel.
– centroid condition: the optimal quantized precoder w k of any quantization
cell Rk is
to be chosen as the dominant eigenvector of Rk = E hH h | h ∈ Rk .
– nearest neighbor rule: all channel vectors that are closer to the quantized precoder wk
are assigned to quantization cell Rk , i.e. h ∈ Rk if khk2 − |hwk |2 ≤ khk2 − |hwj |2 ,
• Optimal codebook design for arbitrary fading channels
202 / 494
Quantized Precoding: Grassmannian
Design Criterion
is maximized.
203 / 494
Quantized Precoding: How many bits?
• How many feedback bits B = log2 (np ) are required? In i.i.d. channels
n o
− B
C̄quant ≈ Eh log2 1 + ρ khk2 1 − 2 nt −1 ,
− B
leading to an SNR degradation of 10 log10 1 − 2 nt −1 dB relative to perfect
CSIT.
Proposition
In order to maintain a constant SNR or capacity gap between perfect CSIT and
quantized feedback, it is not necessary to scale the number of feedback bits as
a function of the SNR. The multiplexing gain gs is not affected by the quality
of CSIT.
• Achievable diversity gain?
– Antenna selection (AS) is a particular case of a quantized precoding whose codebook is
chosen as the columns of the identity matrix Int .
– AS achieves a diversity gain of nt .
– Sufficient to take a full rank codebook matrix with np ≥ nt to extract the full diversity
204 / 494
Quantized Precoding: Evaluations
• SER of a 3 × 3 MIMO system using 2-bit and 6-bit quantized BPSK-based DET.
−2
10
2-bit D-DET
6-bit Q-DET
ant. selection Example
P-DET
W = √13 , √
3
,
√1 −j
−4
10
√
3 3
−1 −j
√ √
√13 √ 3
−1
3 , 3
−1
√ √j
0 1 2 3 4 5 6 3 3
SNR [dB]
205 / 494
Quantized Precoding: spatially correlated channels
0.4
0.35
0.3
f,n
Average distortion d
0.25
0.2
0.15
206 / 494
Quantized Precoding: some extensions
The codebooks Wne are defined for ranks ne = 1, . . . , min {nt , nr }. Rate is
(n )
computed on the equivalent precoded channel HWi e .
– Uniform power allocation and joint ML decoding
ρ (ne ) H H (n )
R = log2 det Ine + Wi H HWi e .
ne
– With other types of receivers/combiner
ne
X
(n )
R= log2 1 + ρq HWi e .
q=1
where ρq is the SINR of stream q on at the output of the combiner for the equivalent
(n )
channel HWi e .
207 / 494
Frequency-Selective MIMO Channels -
MIMO-OFDM
208 / 494
Reference Book
– Chapter 11
Section: 11.1, 11.4.1, 11.5.2
209 / 494
Introduction
210 / 494
Single-Carrier Transmissions
211 / 494
Virtual Transmit Antenna Array
212 / 494
Multi-Carrier Transmissions: MIMO-OFDM
• Basic idea of OFDM: Turn the channel matrix into a circulant matrix via the
addition of a cyclic prefix to the transmitted sequence
– A circulant matrix has the property that its left and right singular vector matrices are
respectively DFT and IDFT matrices.
– The multiplication by an IDFT matrix at the transmitter and by a DFT matrix at the
receiver transforms the frequency selective channel into a diagonal matrix, whose
elements are the singular values of the circulant matrix.
– The original frequency-selective channel in the time domain becomes a set of parallel
flat fading channels in the frequency domain
c0 y0
c1 y1
• •
IDFT CP CP DFT
• •
• •
cT-1 yT-1
• Fundamental Steps 1 to 6
th
1 Apply an IDFT to the codeword C. We obtain as output at the n time interval
(n = 0, . . . , T − 1),
T −1
1 X 2π
xn = √ ck ej T kn ,
T k=0
or equivalently in a matrix form,
T T
x0 . . . xT −1 = DH c0 . . . cT −1 ,
T T T
T H
x0 . . . xT −1 = D ⊗ I nt c0 . . . cTT −1
T
.
where
H [L − 1] ... H [1] H [0] 0nr ×nt ... 0nr ×nt
..
0nr ×nt H [L − 1] . H [1] H [0] ... 0nr ×nt
Hg =
.. .. .. .. .. .. ..
. . . . . . .
0nr ×nt ... 0nr ×nt H [L − 1] H [L − 2] ... H [0]
215 / 494
Orthogonal Frequency Division Multiplexing (OFDM)
5 Observe that the choice of the CP creates a blockwise circulant matrix Hcp of size
nr T × nt T
T T T T T
r0 . . . rTT −1 = Hcp xT0 . . . xTT −1 + n0 . . . nTT −1
with
H [0] 0nr ×nt ... 0nr ×nt H [L − 1] ... H [1]
H [1] H [0] 0nr ×nt ... 0nr ×nt ... H [2]
. .. .. .. .. .. .
. . . . . . .
. .
H [L − 2] ... H [0] 0nr ×nt ... 0nr ×nt H [L − 1]
Hcp = .
H [L − 1] ... H [1] H [0] 0nr ×nt ... 0nr ×nt
.. .. .. .. .. .. ..
. . . . . . .
0nr ×nt ... H [L − 1] H [L − 2] ... H [0] 0nr ×nt
0nr ×nt ... 0nr ×nt H [L − 1] ... H [1] H [0]
SVD decomposition Hcp = DH ⊗ Inr Λcp (D ⊗ Int ) is such that Λcp is a block
diagonal
matrix whose blocks are obtained by a blockwise DFT of
H [0] H [1] . . . H [L − 1] , i.e., for the (k, k)th block
L−1
X
(kk) 2π kl
Λcp = H [l] e−j T , k = 0, . . . , T − 1,
l=0
The original frequency selective channel has been converted into a set of T parallel
flat fading channels in the frequency domain, the channel gains being given by the
diagonal blocks of Λcp .
217 / 494
Orthogonal Frequency Division Multiplexing (OFDM)
with
L−1
X
(kk) 2π kl
H(k) = Λcp = H [l] e−j T .
l=0
218 / 494
Orthogonal Frequency Division Multiplexing (OFDM)
OFDM
interleaver S/P modulator
encoder
de- OFDM
decoder interleaver demodulator
• Strong analogy with the input-output relationship over a flat fading MIMO channel:
– Temporal dimension replaced by frequency dimension
– Hk ↔ H(k)
– Commonly known as Space-Frequency Coded MIMO-OFDM.
– If the coherence bandwidth of the channel is small, the channel gains H(k) vary
significantly from tone to tone. The channel in the frequency domain can then be
considered as a fast fading channel in the frequency domain.
219 / 494
Virtual Transmit Antenna Array
220 / 494
Unified Representation for Single and Multi-Carrier
Transmissions
• Unique virtual nr × Lnt MIMO channel
√
Y = y0 · · · yT −1 = Es H C + n0 ··· nT −1
• Equivalent transmitted codewords in the virtual nr × Lnt MIMO representation
C(0)
C=
..
.
C(L−1)
T −1 X
n
1 X
= max log2 1 + ρs(k),l λ(k),l .
T PT −1 Pn
s =T k=0 l=1
k=0 l=1 (k),l
Example
Send c0 , . . . , cT −1 over two antennas in a 2-tap Rayleigh fading channel.
• classical delay-diversity scheme
1 c0 c1 ... cT −1 0 0
C= √ ,
2 0 c0 c1 ... cT −1 0
Diversity of
c0 c1 ... cT −1 0 0
3nr only!
1 0 c0 c1 ... cT −1 0
C= √ .
2 0 c0 c1 ... cT −1 0
0 0 c0 c1 ... cT −1
• Generalized delay-diversity scheme
1 c0 c1 ... cT −1 0 0 0
C= √ ,
2 0 0 c0 c1 ... cT −1 0
Diversity of
c0 c1 ... cT −1 0 0 0
4nr !
1 0 0 c0 c1 ... cT −1 0
C= √ .
2 0 c0 c1 ... cT −1 0 0
0 0 0 c0 c1 ... cT −1
224 / 494
Code Design for Space-Frequency Coded MIMO-OFDM
• Diversity gain
– Define lC,E (l) = ♯τC,E (l) (l = 1, . . . , nt ) with τC,E (l) = {k | ck (l) − ek (l) 6= 0} .
– Define lC,E = ♯τC,E with τC,E = {k | ck − ek 6= 0}.
Proposition
For full rank space-tap correlation matrix R, a pair of space-frequency
codewords {C, E} with an effective length lC,E , effective lengths {lC,E (l)}n t
l=1
and a rank r(Ẽ) achieves the full diversity nt nr L if
r(Ẽ) = nt ,
lC,E (l) ≥ L, ∀l = 1, . . . , nt ,
lC,E ≥ nt L.
225 / 494
Code Design for Space-Frequency Coded MIMO-OFDM
• Coding gain
– Assume each tap l is i.i.d. Rayleigh distributed with an average power βl and no
correlation between taps
h i
CR = Rf ⊙ 1nr ×nr ⊗ (C − E)H (C − E) ,
R f = I nr ⊗ R F , (space-frequency correlation matrix)
L−1
X
RF = βl d H
(l) d(l) with d(l) = 1 ... e−j2πkl/T ... e−j2π(T −1)l/T
l=0
Example
PL−1 2π
SISO channel h(k) = l=0 h [l] e−j T kl . Frequency correlation between channel on
subcarrier k and k + K
n o L−1
X n o 2π
E h(k) h∗(k+K) = E |h[l]|2 ej T lK
l=0 | {z }
βl
226 / 494
Code Design for Space-Frequency Coded MIMO-OFDM
227 / 494
Code Design for Space-Frequency Coded MIMO-OFDM
• With interleaver, use codes with large effective length and product distance for
OFDM transmissions as in fast fading channels.
• FER of the 16-state space-time trellis code for L = 2, 3 and 4 in uniformly
distributed i.i.d. Rayleigh channels with and without interleaver.
0
10
−1
10
FER
−2
10 L=2, with interleaver
L=3, with interleaver
L=4, with interleaver
fast fading
L=2, no interleaver
L=3, no interleaver
L=4, no interleaver
−3
10
7.5 10 12.5 15 17.5
SNR [dB]
228 / 494
Code Design for Space-Frequency Coded MIMO-OFDM
∆1 CP
c OFDM x
∆2 CP
modulator
∆ nt CP
– Analogous to GDD, CC converts a MIMO channel into a SIMO channel with enhanced
frequency selectivity. The subsequent frequency diversity is extracted by appropriate
outer codes.
– A cyclic shift in the time domain corresponds to the multiplication by a phase shift in
the frequency domain. Therefore, the received signal in the frequency domain reads as
s
Es
yk = heq,(k) ck + nk
nt
where the equivalent SIMO channel matrix on the kth tone, denoted as heq,(k) , is
given by
nt
X 2π
heq,(k) = H(k) (:, m) e−j T k∆m
m=1
and H(k) is the DFT of the impulse response evaluated on the kth subcarrier
(kk) PL−1 2π
H(k) = Λcp = l=0 H [l] e−j T kl .
230 / 494
Code Design for Space-Frequency Coded MIMO-OFDM
231 / 494
Multi-User MIMO - Multiple Access Channels
(Uplink) & Broadcast Channels (Downlink)
232 / 494
Reference Book
– Chapter 12
Section: 12.1, 12.2, 12.3, 12.4
233 / 494
Introduction
where
– yul ∈ nr
– Hul,q ∈ nr ×nt,q models the small scale time-varying fading process and Λ−1
q refers
to the large-scale fading accounting for path loss
and shadowing
– nul is a complex Gaussian noise CN 0, σn 2I
nr .
• User q’s input covariance matrix
is defined as the covariance matrix of the transmit
H
signal of user q as Qul,q = E c′ul,q c′ul,q .
• Power constraint: Tr{Qul,q } ≤ Es,q .
235 / 494
MIMO MAC System Model
• By stacking up the transmit signal vectors and the channel matrices of all K users,
h ′ ′
iT
c′ul = cul,1
T T
, . . . , cul,K ,
h i
−1/2 −1/2
Hul = Λ1 Hul,1 , . . . , ΛK Hul,K ,
• Note on the notations: the dependence on the path loss and shadowing is made
explicit in order to stress that the co-scheduled users experience different path losses
and shadowings and therefore receive power.
• We assume that the receiver (i.e. the BS in a UL scenario) has always perfect
knowledge of the CSI, but we will consider strategies where the transmitters have
perfect or partial knowledge of the CSI.
236 / 494
Capacity Region of Deterministic Channels
• In a multi-user setup, given that all users share the same spectrum, the rate
achievable by a given user q, denoted as Rq , will depend on the rate of the other
users Rp , p 6= q → Trade-off between rates achievable by different users!
• The capacity region C formulates this trade-off by expressing the set of all user rates
(R1 , . . . , RK ) that are simultaneously achievable.
Definition
The capacity region C of a channel Hul is the set of all rate vectors
(R1 , . . . , RK ) such that simultaneously user 1 to user K can reliably
communicate at rate R1 to rate RK , respectively.
Any rate vector not in the capacity region is not achievable (i.e. transmission at
those rates will lead to errors).
Definition
• For given input covariance matrices Qul,1 , . . . , Qul,K , the achievable rate region is
defined by
1 The rate achievable by a given user q with a given transmit strategy Qul,q cannot be
larger than its achievable rate in a single-user setup, i.e.
Λ−1
q
Rq ≤ log2 det Inr + 2 Hul,q Qul,q HH ul,q , q = 1, . . . , K
σn
where Qul,q = E c′q c′q H is subject to the power constraint Tr{Qul,q } ≤ Es,q .
2 The sum of the rates achievable by a subset S of the users should be smaller than the
total rate achievable
P when those users “cooperate” with each other to form a giant
array with nt,S = q∈S nt,q transmit antennas subject to their respective power
constraints, i.e.
X 1
Rq ≤ log2 det Inr + 2 Hul,S Qul,S HH ul,S
q∈S
σn
1 X −1
= log2 det Inr + 2 Λq Hul,q Qul,q HH ul,q ,
σn q∈S
h i
−1/2 −1/2
with Hul,S = Λi Hul,i , . . . , Λj Hul,j ,
i,j∈S
Qul,S = diag Qul,i , . . . , Qul,j i,j∈S , subject to the constraints Tr{Qul,q } ≤ Es,q .
• The rate region looks like a K-dimensional polyhedron with K! corner points on the
boundary.
238 / 494
Rate Region of a Two-User MIMO MAC
• Remarkably, at point A, user 1 can transmit at a rate equal to its single-link MIMO
rate and user 2 can simultaneously transmit at a rate R2′ > 0 equal to
Λ−1 Λ−1
R2′ = log2 det Inr + 12 Hul,1 Qul,1 HHul,1 +
2
2
Hul,2 Qul,2 HH
ul,2
σn σn
Λ−1
− log2 det Inr + 12 Hul,1 Qul,1 HH ul,1
σn
!−1
−1
Λ Λ−1
= log2 det Inr + 22 Hul,2 Qul,2 HHul,2 I nr + 1
2
H Q H H
ul,1 ul,1 ul,1 .
σn σn
239 / 494
Capacity Region of MIMO MAC
Proposition
240 / 494
Capacity Region of a Two-User MIMO MAC
• Due to the union of pentagons, the capacity region of the two-user MIMO MAC
does not look like a pentagon in general.
• However, with a single antenna (nt,q = 1), the capacity region remains a pentagon
because a single data is transmitted per user at the full power, i.e. Es,q .
241 / 494
Capacity Region of SISO MAC
Corollary
X X
CM AC = (R1 , . . . , RK ) : Rq ≤ log2 1 + ηq |hul,q |2 , ∀S ⊆ K
q∈S q∈S
where ηq = Λ−1 2
q Es,q /σn .
Example
Two-user SISO: CM AC is the set of all rates pair (R1 , R2 ) satisfying to
Rq ≤ log2 1 + ηq |hul,q |2 , q = 1, 2
R1 + R2 ≤ log2 1 + η1 |hul,1 |2 + η2 |hul,2 |2 .
R2′ = log2 1 + η1 |hul,1 |2 + η2 |hul,2 |2 − log2 1 + η1 |hul,1 |2
2
η2 |hul,2 |2 Λ−1
2 |hul,2 | Es,2
= log2 1 + = log2 1 + .
1 + η1 |hul,1 |2 σn2 + Λ−1 2
1 |hul,1 | Es,1
242 / 494
Capacity Region of SIMO MAC
Corollary
( P )
(R1 , . . . ,hRK ) : q∈S Rq ≤ i
CM AC = P
log2 det Inr + q∈S ηq hul,q hH
ul,q , ∀S ⊆ K
where ηq = Λ−1 2
q Es,q /σn .
Example
Two-user SIMO: CM AC is the set of all rates pair (R1 , R2 ) satisfying to
Rq ≤ log2 1 + ηq khul,q k2 = log2 det Inr + ηq hul,q hH ul,q , q = 1, 2
R1 + R2 ≤ log2 det Inr + η1 hul,1 hH H
ul,1 + η2 hul,2 hul,2 .
R2′ = log2 det Inr + η1 hul,1 hH H H
ul,1 + η2 hul,2 hul,2 − log2 det Inr + η1 hul,1 hul,1
−1
= log2 det Inr + η2 hul,2 hH H
ul,2 Inr + η1 hul,1 hul,1
−1
= log2 1 + η2 hH H
ul,2 Inr + η1 hul,1 hul,1 hul,2
243 / 494
Achievability of the Capacity Region
Proposition
MMSE-SIC is optimal for achieving the corner points of the MIMO MAC rate
region.
• The exact corner point that is achieved on the rate region depends on the stream
cancellation ordering:
– Point A, user 2 is canceled first (i.e. all streams from user 2) such that user 1 is left
with the Gaussian noise and can achieve a rate equal to the single-link bound.
– Assuming nt = 1, R2′ = log2 (1 + ρq ) where ρq is the SINR of the MMSE receiver for
user 2’s stream treating user 1’s stream as colored Gaussian interference.
244 / 494
Comparisons with TDMA
• TDMA allocates the time resources in an orthogonal manner such that users are
never transmitting at the same time (line D-C in the rate region).
• SISO: both TDMA and SIC exploit a single degree of freedom but TDMA rate
region is strictly smaller than the one achievable with SIC.
• SIMO: TDMA incurs a big loss compared to SIMO MAC (with MMSE-SIC) as it
only exploits a single degree of freedom despite the presence of min {nr , K} degrees
of freedom achievable with SIMO MAC at high SNR.
• MIMO: As nt increases, the gap between the TDMA and MIMO MAC rate regions
decreases.
245 / 494
Ergodic Capacity Region of Fast Fading Channels: Perfect
CSIT
• The ergodic capacity region is the set of achievable long-term average rates
R1 , . . . , RK where the averaging is taken w.r.t. all channel realizations.
• The rate region can therefore be extended to fast fading channels as
( )
X 1 X −1 H
Rq ≤ E log2 det Inr + 2 Λq Hul,q Qul,q Hul,q , ∀S ⊆ K
q∈S
σn q∈S
246 / 494
Fast Fading - Perfect CSIT
247 / 494
Fast Fading - Partial Transmit Channel Knowledge
• Hul,q is not known to the transmitter q ∀Q → we cannot adapt Qul,q at all time
instants
• Rate of information flow between a subset of users S and Rx at time instant k over
channels Hul,k,q ∀q ∈ S
1 X −1
log2 det Inr + 2 Λq Hul,k,q Qul,q HH
ul,k,q .
σn q∈S
Such a rate varies over time according to the channel fluctuations. The average rate
of information flow over a time duration T >> Tcoh is
T −1
1 X 1 X −1
log2 det Inr + 2 Λq Hul,k,q Qul,q HH
ul,q .
T σn q∈S
k=0
248 / 494
Fast Fading - Partial Transmit Channel Knowledge
• The ergodic capacity region is obtained as the union of all the K-dimensional
polyhedrons whose corresponding input covariance matrices satisfy the power
constraints.
Proposition
The ergodic capacity region C¯M AC of the Gaussian fast fading MIMO MAC is
the set of all achievable rate vectors (R1 , . . . , RK ) given by
P
[ (R1 , . . . , RK) : q∈S Rq ≤
P Λ−1
q H .
E log2 det Inr + q∈S σ2 Hul,q Qul,q Hul,q , ∀S ⊆ K
Tr{Qul,q }≤Es,q n
Qul,q ≥0,∀q
249 / 494
Outage Capacity and Probability in Slow Fading Channels
• The transmitters have only partial transmit channel knowledge in the form of the
channel distribution information.
S
• MAC outage event O = S OS where
( X )
1 H
OS = Hul : log2 det Inr + 2 Hul,S Qul,S Hul,S < Rq .
σn q∈S
• Outage probability of the MAC is defined as the probability that the target rate
vector (R1 , . . . , RK ) lies outside the achievable rate region.
Definition
The outage probability Pout (R1 , . . . , RK ) of a MIMO MAC with target rate
vector (R1 , . . . , RK ) is given by
!
[
Pout (R1 , . . . , RK ) = min P OS .
{Qul,q ≥0,Tr(Qul,q )≤Es,q }∀q S
250 / 494
Diversity-Multiplexing trade-off of i.i.d. Rayleigh Slow
Fading Channels
• Assume that all users have the same transmit power constraint Es,q = Es ∀q and
experience independent and identically distributed channels with Λq = Λ (so that
ηq = η ∀q) and Hq being i.i.d. Rayleigh fading.
• Asymptotic (i.e. large η) diversity-multiplexing trade-off of the K-user MIMO MAC
Definition
⋆
A diversity gain gd,M AC (gs,1 , . . . , gs,K , ∞) is achieved for the set of K-tuple
multiplexing gains (gs,1 , . . . , gs,K ) if
Rq η
lim = gs,q , ∀q
η−→∞ log
2 η
log2 Pout (R1 , . . . , RK ) ⋆
lim = −gd,M AC (gs,1 , . . . , gs,K , ∞)
η−→∞ log2 η
⋆
The curve gd,M AC (gs,1 , . . . , gs,K , ∞) as function of (gs,1 , . . . , gs,K ) is known
as the asymptotic diversity-multiplexing trade-off of the MIMO MAC.
• The DMT in the MAC differs from that of the single-link MIMO channel by the fact
that coding can only P be performed across antennas belonging to the same user and
not jointly across all nt,q antennas.
251 / 494
Diversity-Multiplexing trade-off of i.i.d. Rayleigh Slow
Fading Channels
⋆
• Asymptotic symmetric (nt,q = nt and gs,q = gs ∀q) DMT gd,M AC (gs , ∞) of MIMO
nr
MAC for nt > K+1
⋆ nr
• Asymptotic symmetric DMT gd,M AC (gs , ∞) of MIMO MAC for nt ≤ K+1
is the
same as single-link MIMO.
252 / 494
MIMO BC System Model
yq = Λq−1/2 Hq c′ + nq
where
– yq ∈ nr,q
– Hq ∈ nr,q ×nt models the small scale time-varying fading process and Λ−1
q refers to
the large-scale fading accounting for path loss and
shadowing
– nq is a complex Gaussian noise CN 0, σn,q2 I
nr,q .
• The input covariance
matrix
is defined as the covariance matrix of the transmit
signal as Q = E c′ c′H .
• Power constraint: Tr{Q} ≤ Es .
253 / 494
MIMO BC System Model
• By stacking up the received signal vectors, the noise vectors and the channel
matrices of all K users,
h iT
y = y1T , . . . , yK
T
,
h iT
n = nT1 , . . . , nTK ,
h iT
−1/2 T −1/2
H = Λ1 H1 , . . . , ΛK HTK ,
the system model also writes as
y = Hc′ + n.
H is assumed to be full-rank as it would be the case in a typical user deployment.
• SNR of user q defined as ηq = Es Λ−1 2
q /σn,q .
• Perfect instantaneous channel state information (CSI) at the Tx and all Rx.
• Generally speaking, c′ is written as the superposition of statistically independent
signals c′q
XK
c′ = c′q .
q=1
The input covariance matrix of user q is defined as Qq = E c′q c′H
q .
254 / 494
Capacity Region of two-user SISO Deterministic BC
• In two-user SISO MAC, point A was obtained by canceling user 2’s signal first such
that user 1 is left with Gaussian noise.
• Let us apply the same philosophy to the SISO BC:
– transmit c′ = c′1 + c′2 , with power of c′q denoted as sq
– user 1 cancels user 2’s signal c′2 so as to be left with its own Gaussian noise
– user 2 decodes its signal by treating user 1’s signal c′1 as Gaussian noise.
• Achievable rates of such strategy (with sum-power constraint s1 + s2 = Es )
Λ−1
1 s1 2
R1 = log2 1 + 2 |h1 |
σn,1
!
2
Λ−12 |h2 | s2
R2 = log2 1 + 2 2 .
σn,2 + Λ−1
2 |h2 | s1
• Careful! For user 1 to be able to correctly cancel user 2’s signal, user 1’s channel has
to be good enough to support R2 , i.e.
!
2
Λ−1
1 |h1 | s2
R2 ≤ log2 1 + 2 2 .
σn,1 + Λ−11 |h1 | s1
• The channel gains normalized w.r.t. their respective noise power should be ordered
2
Λ−1
2 |h2 | Λ−1 |h1 |2
2
≤ 1 2 .
σn,2 σn,1 255 / 494
Capacity Region of two-user SISO Deterministic BC
• If the ordering condition is satisfied, the above strategy achieves the boundary of the
capacity region of the two-user SISO BC for any power allocation s1 and s2
satisfying s1 + s2 = Es .
• The capacity region is given by the union of all rate pairs (R1 , R2 ) over all power
allocations s1 and s2 satisfying s1 + s2 = Es .
256 / 494
Capacity Region of K-user SISO Deterministic BC
−1/2
• Define hq = Λq hq /σn,q . Assume |h1 |2 ≥ |h2 |2 ≥ . . . ≥ |hK |2 .
Proposition
With the ordering |h1 |2 ≥ |h2 |2 ≥ . . . ≥ |hK |2 , the capacity region CBC of the
Gaussian SISO BC is the set of all achievable rate vectors (R1 , . . . , RK ) given
by
2
[ |h q | s q
(R1 , . . . , RK ) : Rq ≤ log2 1 + hP i , ∀q .
PK
sq :
1 + |h |2 q
q−1
s p
q=1 sq =Es p=1
Proposition
The sum-rate capacity of the SISO BC is achieved by allocating the transmit
power to the strongest user
CBC = log2 1 + Es max |hq |2 = log2 1 + max ηq |hq |2 .
q=1,...,K q=1,...,K
Recall that the MAC sum-rate capacity is obtained with all users simultaneously
transmitting at their respective full power.
257 / 494
Achievability of the SISO BC Capacity Region
Proposition
If Tx has full (non-causal) knowledge of the interference, the capacity of the dirty
paper channel is equal to the capacity of the channel with the interference completely
absent.
– By encoding users in the increasing order of their normalized channel gains, DPC
achieves the capacity region of the SISO BC.
Example
Assume |h1 |2 ≥ |h2 |2 . By treating user 2’s signal c′2 as known Gaussian interference
at Tx and encoding user 1’s signal c′1 using DPC, user 1 can achieve a rate as high as
if user 2’s signal was absent. User 2 treats user 1’s signal as Gaussian noise.
258 / 494
Achievability of the SISO BC Capacity Region
Proposition
With the appropriate cancellation/encoding ordering, Superposition Coding
with SIC and Dirty-Paper Coding are both optimal for achieving the SISO BC
capacity region.
Proposition
The SISO BC sum-rate capacity is achievable with dynamic TDMA (to the
strongest user), Superposition Coding with SIC (with the appropriate
cancellation ordering) and Dirty-Paper Coding (with the appropriate encoding
ordering).
259 / 494
Capacity Region of MIMO BC and its Achievability
261 / 494
SISO BC-MAC Duality
−1/2 −1/2 T
• Assuming a SISO BC over a deterministic channel h = Λ1 h1 , . . . , Λ K hK
2 2
with receiver noise powers σn,1 , . . . , σn,K , we express the SISO BC in the equivalent
system model with unit variance receiver noises and normalized channel gains
−1/2
h q = Λq hq /σn,q such that h = [h1 , . . . , hK ]T .
• The system model y = hc′ + n for SISO then writes equivalently as
262 / 494
SISO BC-MAC Duality
where CM AC (Es,1 , . . . , Es,K , h) is capacity region of the SISO MAC with the
−1/2
channel gains Λq hul,q replaced by the normalized channel gain hq and the
2
noise power σn = 1. 263 / 494
SISO BC-MAC Duality
• Two-user SISO BC capacity region characterized in the terms of the capacity region
of its dual MAC
264 / 494
MIMO BC-MAC Duality
where c′ul is the vector of transmitted signals from the K users, yul is the received
signal vector
at the nt receive antennas and ñul is a complex Gaussian noise
CN 0, Int .
265 / 494
MIMO BC-MAC Duality
Proposition
The capacity region of the MIMO BC with power constraint Es over a
deterministic channel H is equal to the union of the capacity region P of the dual
MIMO MAC with individual power constraints Es,q such that K q=1 Es,q = Es
[
CBC (Es , H) = CM AC Es,1 , . . . , Es,K , HH
{Es,q }∀q : K
P
q=1 Es,q =Es
P
[ (R1 , . . . ,RK ) : q∈S Rq ≤
= P H
log2 det Int + q∈S Hq Qul,q Hq , ∀S ⊆ K
{Qul,q ≥0}∀q ,
q=1 Tr{Qul,q }≤Es
PK
where CM AC Es,1 , . . . , Es,K , HH is teh MIMO MAC capacity region with the
channel matrix Hul replaced by HH and the noise power σn2 = 1.
266 / 494
MIMO BC-MAC Duality
Proposition
The sum-rate capacity of the MIMO BC is equal to the sum-rate capacity of
the sum power dual MIMO MAC
CBC (H, Es ) = CM AC HH , Es
K
X
= max log2 det Int + HH
q Qul,q Hq .
{ }
Qul,q ≥0
∀q
,
q=1
{ }
PK
q=1 Tr Q ul,q ≤E s
267 / 494
Bounds on Sum-Rate Capacity of MIMO BC
• Define
– ñ = min {nt , Knr } (nr,q = nr ∀q is assumed),
– λmax,q as the dominant eigenvalue of HH q Hq ,
• Achievable multiplexing gain of ñ
Proposition
• SISO
– Similarly to MAC, TDMA rate region is contained in the BC capacity region.
– The gap between the BC capacity region and the TDMA rate region increases
proportionally with the asymmetry between users normalized channel gains.
– TDMA achieves the sum-rate capacity of SISO BC.
• MIMO
– The maximum sum-rate CT DM A (H) is the largest single-link capacity among K users
CT DM A (H) = max CCSIT (Hq ) .
q=1,...,K
– Define λmax,q and λmax as the largest eigenvalue of HH H
q Hq and H H, λk Hq Hq
H
H
as the non-zero eigenvalues of Hq Hq and n = min {nt , nr }. Assume nr,q = nr ∀q.
Proposition
The maximum TDMA sum-rate, CT DM A (H), is lower bounded as
CT DM A (H) ≥ log2 1 + max ηq λmax,q ,
q=1,...,K
n
X ηq
CT DM A (H) ≥ CCSIT (Hq ) ≥ log2 1 + λk H q H H
q ,
k=1
n
Proposition
Intuition:
TDMA exploits at least one spatial dimension with the largest effective SNR among all
users.
– DPC exploits up to nt dimensions. Since the quality of each of those nt dimensions
cannot be larger than the single dimension used in the TDMA lower bound, DPC
cannot achieve a rate larger than nt times the TDMA capacity.
Proposition
For any nt , nr and K, at high SNR (Es → ∞, i.e. ηq → ∞ ∀q),
Perfect CSIT
P
• short-term power constraint q sq = Es : similar to deterministic channels
P
• long-term power constraint EH q sq (H) = Es :
– Power control policy that maps a channel realization to a set of transmit power.
– Ergodic capacity region is given by the union of all achievable rate regions over all
power control policies that satisfy the long-term power constraint.
– Sum-rate maximization: the sum-rate capacity can also be achieved by transmitting to
the strongest user in each fading state
Λ−1
q |hq |
2
q ⋆ = arg max |hq |2 = arg max 2
q=1,...,K q=1,...,K σn,q
and the power in each fading state can be optimized following the time domain
water-filling solution.
– Observe the similarity with SISO MAC. The user to be selected is slightly different.
• In MISO and MIMO BC, not sufficient to transmit to a single user at a time to
achieve the sum-rate capacity (similarly to SIMO/MIMO MAC).
Partial CSIT
• No channel ordering and no way know the interference to other users.
• TDMA is an appropriate strategy. Huge loss compared to perfect CSIT!
271 / 494
Outage Capacity and Probability in Slow Fading Channels
Proposition
The diversity gain of MU-MIMO precoding in MISO BC is not larger than the
diversity gain of transmit beamforming in a single link.
272 / 494
Multi-User MIMO - Scheduling and Precoding
(Downlink)
273 / 494
Reference Book
– Chapter 12
Section: 12.1,12.5,12.6,12.8
274 / 494
Introduction
• BC: K >> nt , MAC: K >> nr → All users cannot be scheduled at the same time.
– Which users to schedule?
– How to account for fairness?
• What if we do not have perfect channel knowledge at the transmitter to design the
precoders in MIMO BC?
275 / 494
System Model
yq = Λq−1/2 Hq c′ + nq
where
– yq ∈ nr,q
– Hq ∈ nr,q ×nt models the small scale time-varying fading process and Λ−1
q refers to
the large-scale fading accounting for path loss and
shadowing
– nq is a complex Gaussian noise CN 0, σn,q2 I
nr,q .
• Long term SNR of user q defined as ηq = Es Λ−1 2
q /σn,q .
• Generally speaking, c′ is written as the superposition of statistically independent
signals c′q
XK
c′ = c′q .
q=1
• Power constraint: Tr{Q} ≤ Es with Q = E c′ c′H .
276 / 494
System Model - Linear Precoding
• scheduled user set, denoted as K ⊂ K, is the set of users who are actually scheduled
(with a non-zero transmit power) by the transmitter at the time instant of interest.
• The transmitter serves users belonging to K with ne dataP streams and user q ∈ K is
served with nu,q data streams (nu,q ≤ ne ). Hence, ne = q∈K nu,q .
• Linear Precoding
X X
c′ = Pc = WS1/2 c = Pq c q = Wq S1/2
q cq
q∈K q∈K
where
– c is the symbol vector made of ne unit-energy independent symbols.
– P ∈ nt ×ne is the precoder subject to Tr{PPH } ≤ Es , made of two matrices: a
power control diagonal matrix denoted as S ∈ ne ×ne and a transmit beamforming
matrix W ∈ nt ×ne .
– Pq ∈ nt ×nu,q , Wq ∈ nt ×nu,q , Sq ∈ nu,q ×nu,q , and cq ∈ nu,q are user q’s
sub-matrices and sub-vector of P, W, S, and c, respectively.
• The received signal yq ∈ nr,q is shaped by Gq ∈ nu,q ×nr,q
and the filtered
received signal zq ∈ nu,q at user q writes as
z q = G q yq ,
X
= Λq−1/2 Gq Hq Wq S1/2
q cq + Λq−1/2 Gq Hq Wp S1/2
p c p + Gq nq .
p∈K, p6=q
277 / 494
Multi-User Diversity
• Provided that the BS is able to track the user channel fluctuations (based on
feedback), it can schedule transmissions to the users with favorable channel fading
conditions, i.e. near their peaks, to improve the total cell throughput.
• Recall that MU diversity was already identified as part of the sum-rate maximization
in SISO BC.
278 / 494
Multi-User Diversity Gain in SISO
• Assume that the fading distribution of the K users are independent and identically
(Λ−1
q = Λ−1 and channel gains hq are drawn from the same) Rayleigh distributed
2
and that users experience the same average SNR ηq = η (σn,q = σn2 ) ∀q:
yq = Λ−1/2 hq c′ + nq .
• Assume MU-SISO where one user is scheduled at a time in a TDMA manner: select
the user with the largest channel gain.
• Mathematically same as antenna selection diversity.
• Average SNR gain
– Average SNR after user selection ρ̄out
K
X 1
ρ̄out = E η max |hq |2 =η .
q=1,...,K
q=1
q
– SNR gain provided by MU diversity gm
K
ρ̄out X 1 K→∞
∼
gm = = = log(K).
η q=1
q
gm is of the order of log(K) and hence the gain of the strongest user grows as log(K)!
• Heavily relies on CSIT (partial or imperfect feedback impacts the performance) and
independent user fading distributions (correlated fading or LOS are not good for MU
diversity)
279 / 494
Multi-User Diversity Gain in SISO
• Sum-rate capacity
2
C̄T DM A = E {CT DM A } = E log2 1 + η max |hq | .
q=1,...,K
– low SNR
C̄T DM A ≈ E max |hq |2 η log2 (e) ≈ gm Cawgn .
q=1,...,K
Observations: capacity of the fading channel log(K) times larger than the AWGN
capacity.
– high SNR (Use Jensen’s inequality: Ex {F (x)} ≤ F (Ex {x}) if F concave)
C̄T DM A ≈ log2 (η) + E log2 max |hq |2 ,
q=1,...,K
≈ Cawgn + E log2 max |hq |2 ,
q=1,...,K
(a)
≤ Cawgn + log2 E max |hq |2 ,
q=1,...,K
• In MU-MIMO, the performance is function of the channel magnitude but also of the
spatial directions and properties of the channel matrices.
• MU diversity offers abundant spatial channel directions and allows to appropriately
choose users with good channel matrix properties or spatial separations.
• Opportunistic Beamforming: precode multiple streams along the unitary precoding
matrix W (orthogonal beams). For a large number of users, thanks to MU diversity,
each beam matches one user channel with a high probability and orthogonality of
beams prevents users from experiencing multi-user interference
K→∞
yq = Λq−1/2 hq WS1/2 c + nq = Λq−1/2 khq k s1/2
q cq + n q .
– The terminal only measures the effective channel, i.e. the channel precoded by each
beam, and reports the SNR (or CQI) for one or multiple beam(s).
– Works well only for very large K.
281 / 494
Multi-User Diversity
282 / 494
Resource Allocation, Fairness and Scheduling Criteria
where
– rate-maximization approach: wq = 1
γ
– proportional fair approach: wq = R̄q (R̄q is the long-term average rate of user q and
q
γq is the Quality of Service (QoS) of each user).
283 / 494
Practical Proportional Fair Scheduling
• The long-term average rate R̄q of user q is updated using an exponentially weighted
low-pass filter such that the estimate of R̄q at time k + 1, denoted as R̄q (k + 1), is
function of the long-term average rate R̄q (k) and of the current rate Rq (k) at
current time instant k as outlined by
(
(1 − 1/tc ) R̄q (k) + 1/tc Rq (k), q ∈ K⋆
R̄q (k + 1) =
(1 − 1/tc ) R̄q (k) , q∈/ K⋆
where tc is the scheduling time scale and K⋆ refers to the scheduled user set at time
k. The resources should thus be allocated at time instant k as
′⋆ ⋆ ⋆ X Rq (k)
c , G , K = arg ′ max γq .
c ,G,K⊂K
q∈K
R̄q (k)
• The scheduling time scale tc is a design parameter of the system that highly
influences the user fairness and the performance
– Very large tc : assuming all users experience identical fading statistics and have the
same QoS, the PF scheduler is equivalent to the rate-maximization scheduler, i.e. users
contributing to the highest sum-rate are selected.
– Small tc : assuming all users have the same QoS, the scheduler divides the available
resources equally among users (Round-Robin scheduling). No MU diversity is exploited.
284 / 494
Proportional Fair Scheduling
ε=0
TDMA sum−rate [bps/Hz]
2.2
1.8
tc=50
1.6
1.4
1.2
t =1.1
c
1
0.8
0 10 20 30 40 50
Number of Users K
285 / 494
User Grouping
• Given the presence of K users in the cell, the scheduler for MU-MIMO aims at
finding the best scheduled user set among all possible candidates within K.
requires to consider a large number of different sets and has a complexity that
quickly becomes cumbersome as K increases.
286 / 494
User Grouping
287 / 494
User Correlation
288 / 494
User Correlation
• Average value of u as a function of |t| and K, where the average value here refers to
the averaging over different sets of transmit correlation matrices Rt,q (|tq | = |t| ∀q,
phases of the correlation coefficients tq are randomly generated).
n =4
t
1
t=0
0.9 |t|=0.5, 360°
|t|=0.95, 360°
Average User Correlation u
0.8
|t|=0.95, 70°
0.7
0.6
0.5
0.4
0.3
0.2
4 6 8 10 12 14 16 18 20
Number of Users K
290 / 494
Achievable rate
with pq,l = wq,l sq,l (resp. gq,l ) the precoder (resp. combiner) attached to stream l
of user q, Il the inter-stream interference and Ic the intra-cell interference (also
called multi-user interference)
X −1 X −1
Il = Λq |gq,l Hq pq,m |2 = Λq |gq,l Hq wq,m |2 sq,m ,
m6=l m6=l
nu,p
X X X nX
u,p
Ic = Λ−1 2
q |gq,l Hq pp,m | = Λ−1 2
q |gq,l Hq wp,m | sp,m .
p∈K m=1 p∈K m=1
p6=q p6=q
2
Λ−1
q |hq wq | sq
• If nr = 1, the SINR of user q simply reads as ρq = −1 2 .
| | 2
P
p∈K Λq hq wp sp +σn,q
p6=q
291 / 494
Zero-Forcing Beamforming (ZFBF)
• Most popular MU-MIMO precoder. Assume single receive antenna per user.
• Channel Direction Information (CDI) of user q: h̄q = hq / khq k.
• Idea is to force the intra-cell interference Ic to zero: the precoder of a user q, wq , is
chosen such that hp wq = 0 ∀p ∈ K \ q. Only possible if ne ≤ nt !
• Define h iT
−1/2 T −1/2 T
H = Λi hi , . . . , Λ j hj = DH̄
i,j∈K
with
n o
−1/2 −1/2
D = diag Λi khi k , . . . , Λj khj k ,
i,j∈K
h iT
H̄ = h̄Ti , . . . , h̄Tj .
i,j∈K
Transmit precoder wq for user q ∈ K: wq = F(:, q)/kF(:, q)k = F̄(:, q)/kF̄(:, q)k
where F(:, q) is to be viewed as the column of F corresponding to user q.
292 / 494
Zero-Forcing Beamforming (ZFBF)
T
• Assuming that c = ci , . . . , cj i,j∈K
, the received signal of user q ∈ K is
yq = Λq−1/2 hq wq s1/2
q cq + n q = d q cq + n q ,
−1/2 1/2 −1/2 khq k 1/2
with dq = Λq h q wq s q = Λ q s .
kF̄(:,q)k q
Observations: MU-MIMO channel with ZFBF is split into ne parallel
(non-interfering) channels.
• The rate achievable by user q is given by
2
Rq = log2 1 + d2q /σn,q .
d2q is low if H is badly conditioned but would get larger if users’ CDI are orthogonal
or quasi-orthogonal.
– reminiscent of the loss caused by noise enhancement incurred by the linear ZF
• For large K, better conditioning of matrix H through the use of user grouping.
• By uniformly allocating the power across user streams sq = Es /ne and by choosing
ne = min {nt , K}, d2q /σn,q
2
= αq2 ηq /ne with αq2 = |hq wq |2 = khq k2 /kF̄(:, q)k2
min{nt ,K}
X ηq
CBF (H) = log2 1 + αq2 .
q=1
ñ
At high SNR with ηq = η, CBF (H) ≈ min {nt , K} log2 (ηq ). The multiplexing gain
min {nt , K} is achieved (same as with DPC). 293 / 494
Zero-Forcing Beamforming (ZFBF)
• Illustration of ZFBF precoding for a two-user scenario: (a) non-orthogonal user set,
(b) quasi-orthogonal user set.
294 / 494
Block Diagonalization (BD)
• Extension of ZFBF to multiple receive antennas and multiple streams per user.
• Constraints on the transmit filters targeting user q ∈ K
Λp−1/2 Hp Wq = 0, ∀p 6= q, p ∈ K
nr K̃q ×nt
• Denoting K̃q = K \ q of size K̃q = ♯K̃q , the interference space H̃q ∈ is
h iT
H̃q = . . . Λp−1/2 HTp . . . .
p∈K̃q
• BD filter design forces Wq tolie in the null space of H̃q : null space of H̃q to be
strictly larger than 0 → r H̃q < nt .
• An orthogonal basis of the null space of H̃q is obtained by taking its SVD
H
H̃q = Ũq Λ̃q Ṽq Ṽq′
where Ṽq′ refers to the eigenvectors of H̃q associated with the null singular values.
• Assuming the zero-interference constraint is possible for all users in K̃q and that
r Hq Ṽq′ = nu,q , Wq writes as a linear combination of columns of Ṽq′ as
Wq = Ṽq′ Aq
with some nu,q × nu,q unitary matrix Aq .
295 / 494
Block Diagonalization (BD)
• For a SISO channel, the intended signal at time instant k, h [0] ck , is affected by the
PL−1
ISI ik = l=1 h [l] ck−l (ignoring the noise)
• If h [l] ∀l known to Tx and given that the previous transmitted symbols ck−l are
known to Tx, ISI ik is known and the transmitter can make use of that knowledge.
– DFE or SIC prone to error propagation, but THP not affected as the previously
transmitted symbols ck−l are perfectly known to Tx.
297 / 494
THP
• If the Rx treats i as noise, achievable rate given by log2 1 + Es /(σi2 + σn2 ) .
298 / 494
THP
299 / 494
THP
300 / 494
THP
c̃ = Qc (αi) − αi
i.e. the transmitter finds the point in the equivalence class of c closest to αi and
transmits the quantization error between that point and αi.
αy = α (c̃ + n) + αi
• A suitable value for α is equal to the MMSE scaling factor Es /(Es + σn2 ). By doing
so, αy is a linear MMSE estimate of c̃ but shifted by αi.
301 / 494
n-dimensional THP and DPC
• System model
y = c̃ + i + n
• Choose
c̃ = Qc (αi) − αi
• With high dimensional coding, such precoding technique achieves the same capacity
as AWGN channel log2 1 + Es /σn2 , i.e. as in the absence of interference.
302 / 494
THP for MU-MIMO
• Transmit vector as
c′ = Pc̃ = WS1/2 c̃.
where
P −1/2 1/2
– Known interference at Tx: iq = p<q Λq hq wp sp c̃p (recall that c̃p , p < q, have
been previously computed).
– The interference from p > q is treated as an additional noise.
303 / 494
THP for MU-MIMO
• Choose the transmit beamformer W such that the interference due to p > q is
eliminated, i.e. zero-forced.
• Let H = RQ be the QR decomposition of H
– R is a ne × ne lower triangular
– Q is a ne × nt matrix with orthonormal rows.
yq = Λq−1/2 hq wq s1/2
q c̃q + iq + nq .
305 / 494
Sum-Rate Scaling Laws
Sum−rate [bps/Hz]
40 40
30 30
20 20
10 10
0 0
−5 0 5 10 15 20 25 30 35 40 −5 0 5 10 15 20 25 30 35 40
SNR [dB] SNR [dB]
Observations: ZFBF without user selection (ZFWF) performs poorly. ZFBF with
user selection (greedy-ZFWF) is a competitive strategy for MU-MIMO broadcast
channels, in terms of both performance and complexity.
Keep in mind the assumptions: perfect CSIT, the same average SNR for all users
and a max-rate scheduler (i.e. there is no fairness issue involved here).
307 / 494
Multi-User MIMO (Downlink) with Imperfect
CSIT
308 / 494
Reference Book
– Chapter 12
Section: 12.9
309 / 494
Additional References
310 / 494
Introduction
• In practice, perfect CSIT is hard to obtain in both FDD and TDD systems.
– TDD could make use of reciprocity
• Only partial CSIT is available at the transmitter.
• There are two major impairments that prevent from obtaining perfect CSIT:
• inaccurate CSI measurement and feedback (due to channel estimation errors and
limited feedback overhead)
• feedback delay (due to processing delay at the mobile and the BS and the frame
structure).
311 / 494
Quantized Feedback-Based Precoding
• ZFBF with uniform power allocation, single receive antenna per mobile and
quantized feedback ĥq
wq = F̂(:, q)/kF̂(:, q)k
where −1
F̂ = ĤH ĤĤH D−1
with h iT
Ĥ = ĥTi , . . . , ĥTj
i,j∈K
• Every user q quantizes its channel using a Bq -bits codebook Wq of codevectors vq,i ,
i = 1, . . . , np,q = 2Bq . the best codevector vq⋆ for user q is selected as
The quantized version of the channel direction writes as the nt × 1 row vector
ĥq = (vq⋆ )H .
• With imperfect CSIT, multi-user interference cannot be canceled out perfectly by the
ZFBF filter.
312 / 494
Sum-Rate Analysis
• Expected rate achieved by user q with perfect CSIT and uniform power allocation
(assume ne co-scheduled users)
ηq
R̄CSIT,q = EH log2 1 + |hq wZF,q |2
ne
• Expected rate achieved by user q with quantized CSIT and uniform power allocation
where ηq
n
|hq wq |2
ρq = ηq Pe
1+ ne p∈K,p6=q |hq wp |2
• The rate loss for user q incurred by the quantized feedback
Proposition
314 / 494
Precoding with Partial Transmit Channel Knowledge
DFT, B=4
25 DFT, B=3
DFT, B=2
20
15
10
0
0 5 10 15 20 25 30
SNR [dB]
315 / 494
Scalable Feedback
• Number of feedback bits necessary to maintain a rate loss of ∆R̄q ≤ log2 (b) bps/Hz
for user q
Proposition
In order to maintain a rate loss ∆R̄q between limited feedback ZFBF and
perfect CSIT-based ZFBF smaller than log2 (b) bps/Hz for user q, the number
of feedback bits Bq should scale according to
– i.i.d. Rayleigh fading channels
Bq ≈ (nt − 1) log2 (ηq ) − (nt − 1) log2 (b − 1).
– spatially correlated Rayleigh fading channels (with CDIT-based codebook)
316 / 494
Scalable Feedback
• Assuming ne = nt , ηq = η and rq = r ∀q
Deployment DL throughput UL overhead
i.i.d. nt log2 (η) nt (nt − 1) log2 (η)
Spatially Correlated nt log2 (η) nt (r − 1) log2 (η)
• Performance of ZFBF with perfect CSIT and ZFBF with channel statistics-based
codebook and scalable feedback without user selection (nt = 4, ne = 4)
Λ =diag{2,1,1,0}
t
20
Sum−rate [bps/Hz]
B=11
Perfect CSIT B=9
20 scalable feedback
B=7
B=6
10
B=4
B=1 B=2
0
0 5 10 15 20 25 30
SNR [dB]
317 / 494
Outdated Feedback-Based Precoding
318 / 494
Outdated Feedback-Based Precoding
• A long-term average transmit power (where averaging is also taken over the channel
realizations) of ρlt = 4/3ρ is consumed and twice as much power is spent on
coherence time 3 as in coherence time 1 and 2. We will refer to ρ as the SNR.
319 / 494
Outdated Feedback-Based Precoding
• The equivalent system model for user 1 at time instant t = 1, . . . , T can then be
written as
y1,t h1,1 h1,2 0 0 n1,t
√
y2,t = ρ √ c′t +
0 0 ct + ρ h2,1 h2,2 n2,t .
y3,t h3,1 g1,1 h3,1 g1,2 h3,1 h2,1 h3,1 h2,2 n3,t
| {z } | {z }
rank two rank one
• Interference elimination
y1,t √ h1,1 h1,2 n1,t
ỹt = = ρ ct + .
y3,t − h3,1 y2,t h3,1 g1,1 h3,1 g1,2 n3,t − h3,1 n2,t
• Similar for user 2. Hence, per user DoF of 2/3 and sum DoF of 4/3.
K K
• Extendable to K-user MISO BC: sum DoF of 1 +...+ 1
1+ 2
≈ ln K
.
K
320 / 494
Imperfect CSIT-Based Precoding
Rx1 Subband
…
Tx
Subband
Rx2
…
…
Subband
321 / 494
System Model
zj =gjH xj +ǫj2 ,
where ǫj1 and ǫj2 are unit power AWGN noise.
• hj and gj are the CSI in subband i of user 1 and user 2, respectively. The CSI are
i.i.d. across users and subbands.
• Imperfect CSIT: ĥj of user 1 and ĝj of user 2. Error vectors h̃j = hj − ĥj and
g̃j = gj − ĝj with the covariance matrix E[h̃j h̃H 2 H 2
j ]=σj1 I2 and E[g̃j g̃j ]=σj2 I2 .
2 ∼P −aj and σ 2 ∼P −bj . a and b are respectively interpreted as the quality of the
– σj1 j2 j j
CSIT of user 1 and user 2 in subband j, given as follows
2
logσj1 2
logσj2
aj = lim − , bj = lim − .
P →∞ logP P →∞ logP
– aj and bj vary within the range of [0,1]. aj =1 (resp. bj =1) is equivalent to perfect
CSIT and aj =0 (resp. bj =0) is equivalent to no CSIT.
• DoF per user and per channel use (assuming S channel uses)
Rk
dk , lim , k = 1, 2,
P →∞ S log P
322 / 494
System Model
• Note E[|hH ⊥ 2 H ⊥ 2 H ⊥ 2 H ⊥ ⊥H
j ĥj | ]=E[|(ĥj +h̃j ) ĥj | ]=E[|h̃j ĥj | ]=E[h̃j ĥj ĥj h̃j ]∼P −aj . and
E[|gjH ĝj⊥ |2 ]∼P −bj .
• The average
P CSIT quality ofPLuser 1 and user 2 are respectively expressed as
ae = L1 L j=1 aj and be = L
1
j=1 bj .
Definition
PL Problem: Find transmission strategies that maximize the DoF region in a
scenario such that ae =be .
– L = 1: aj = bj ∀j.
– L = 2: a1 +a2 =b1 +b2
a1 =b1 and a2 =b2 : two P1 on each subband
a1 6=b1 and a2 6=b2 : P2 where the transmitted signal in each subband is correlated to each other
323 / 494
ZFBF
y2 = h∗21 u0 + hH ⊥
2 ĝ2 u2 +ǫ21 , ∗
z2 = g21 u0 + g2H ĝ2⊥ u2 +ǫ22 ,
| {z } | {z } | {z } | {z }
P P P P0
325 / 494
3/2
S3
• If imperfect CSIT: a1 = b2 = β, b1 = a2 = α
y2 = h∗21 u0 + hH ⊥
2 ĝ2 u2 +ǫ21 , ∗
z2 = g21 u0 + g2H ĝ2⊥ u2 +ǫ22 ,
| {z } | {z } | {z } | {z }
P P P P 1−β
leading to a sum DoF of 1/2(β[rate ofu0 ] + 1[rate ofv1 ] + 1[rate ofu2 ]) = 1 + β/2.
326 / 494
P1
3/2
• Note ZFBF performs a space-only precoding while S3 performs a
space-frequency/time precoding
• P1 : Can we do better than ZFBF when aj =bj ∀j?
• Transmit the signal in each subband by superposing a common message with
ZFBF-precoded private messages. Focus on subband 1 for simplicity
where c1 is the common message broadcast to both users and u1 and v1 are symbols
intended for user 1 and user 2 respectively.
• Integration of broadcasting (ZFBF) and multicating/FDMA.
• Received signal at each user
where the private symbols u1 and v1 are drowned by the noise respectively at user 2
and user 1 due to partial ZFBF.
327 / 494
P1
• Decodability:
– Both users decode the common message first with rate (1−a1 )logP by treating the
private message as noise,
– Using SIC, each user can decode their private message with rate a1 logP only subject
to noise, after removing the common message,
328 / 494
P2
where
– Common messages u0 , c1 and c2 to be decoded by both users (intended for user 1 and
user 2 respectively or exclusively for user 1 or user 2 or for both users).
– Note that we do not precode common messages as it does not impact the DoF.
– u1 and u2 are symbols intended for user 1, while v1 and v2 are symbols intended for
user 2.
3/2
• Integrate ZFBF, S3 and FDMA/multicasting.
• Power and rate allocation
subband 1 Power Rate subband 2 Power Rate
c1 P −P β 1−β c2 P −P β 1−β
u1 P α /2 α u2 P β /2 β
u0 (P β − P α )/2 β−α u0 (P − P α )/2
β
β−α
v1 P β /2 β v2 P α /2 α
329 / 494
P2
• Decodability:
– c1 and c2 are respectively decoded first by treating all the other terms as noise.
– Afterwards, user 1 decodes u0 and u1 from y1 using SIC. With the knowledge of u0 ,
u2 can be recovered from y2 .
– Similarly, user 2 decodes u0 and v2 from z2 via SIC. v1 can be decoded from z1 by
eliminating u0 .
α+β
• Sum DoF of 1 + 2
– The DoF pair (1, α+β
2
and ( α+β
) 2
,1) are achieved if we consider the common messages
are intended for user 1 and user 2 respectively.
• When β=α, P2 degrades to 2 parallel P1 and no common message u0 is sent. 330 / 494
P2
• Received signal and decoding procedure of the optimal scheme for the P2
Received Signal
User 1 User 2
Subband 1
Subband 2
• The key point: the transmitter broadcasts u0 twice, i.e. subband 1 and 2.
– User 1 (resp. user 2) observes u0 with higher power than u1 (resp. v2 ) in subband 1
(resp. 2) and receives u0 with the same power level as u2 (resp. v1 ) in subband 2
(resp. 1).
– The common message u0 can be decoded by both users but in different subbands.
– Can be generalized to solve PL ,L≥3 problem by generating multiple streams of u0 and
sending each of them twice.
331 / 494
Weighted-Sum DoF Interpretation of P2 region
α
A 1 1
β-α
 1 0
1-β
A% 0 0
B 1 1 B̂ 0 1 A% 0 0
where
– Ã, B̃: no CSIT, each with channel use 1−β;
– Â (B̂): perfect CSIT of user 1 (2), with channel use β−α;
– Ā, B̄: perfect CSIT of both users, with channel use α.
332 / 494
Weighted-Sum DoF Interpretation of P2 region
d2
D Ā = D B̄ = D̄ : d1 ≤ 1, d2 ≤ 1. 0.6 (β −α) D̂ (β −α) D̂
0.4 (1, β +α
2 )
– Subchannel  and B̂ have an alternating
CSIT setting with two states: I1 I2 =P N 0.2 α D̄
3/2
and I1 I2 =N P (as in S3 ) (β −α) D̂ ( 1−β ) D̃
0
 B̂
(D +D )/2 = D̂ :d1 + d2 ≤ 1.5, 0 0.2 0.4 0.6 0.8 1
d1
d1 ≤ 1, d2 ≤ 1.
333 / 494
Weighted-Sum DoF Interpretation of P1 region
0.8
(1− α+β
2 ) D̃ (1− α+β
2 ) D̃
d2
0.6
0.4 (1, β +α
2 )
α+β
2 D̄
0.2
(1− α+β
2 ) D̃
0
0 0.2 0.4 0.6 0.8 1
d1
334 / 494
Mode switching among sub-optimal strategies
3/2
• P2 integrates FDMA/multicast, ZFBF, S3 . What about a simple switching
strategy?
– FDMA only: sum DoF dFΣ =1
– ZFBF only: sum DoF dZ
Σ =β+α
3/2 β
– S3 only: sum DoF dS
Σ =1+ 2
• The best strategy among the 3 sub-optimal strategies can achieve at least 80% of
the optimal sum DoF performance as
max(dF Z S opt
Σ , dΣ , dΣ ) ≥ 0.8 × dΣ , ∀β, α ∈ [0, 1].
1 1
ZFBF
0.8 0.8 ZFBF
0.6 3 Optimal 0.6
S32
α
α
3
0.4 0.4
S 32
0.2 3 0.2 FDMA 3
FDMA & S 3
3/2 S3
2
& S 32
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
β β
3/2
Figure: Unmatched case, switching among FDMA, ZFBF and S3 .
335 / 494
Mode switching among sub-optimal strategies
max(dF Z opt
Σ , dΣ ) ≥ 2/3 × dΣ , ∀β, α ∈ [0, 1].
1 1
α
α
336 / 494
Introduction to Multi-Cell MIMO
337 / 494
Reference Book
– Chapter 13
Section: 13.1, 13.2, 13.3
338 / 494
Introduction
• Current wireless networks primarily operate using a frequency reuse 1 (or close to 1),
i.e. all cells share the same frequency band
• Interference is not only made of intra-cell (i.e. multi-user interference), but also of
inter-cell (i.e. multi-cell) interference.
• Cell edge performance is primarily affected by the inter-cell interference.
339 / 494
Wideband/long-term SINR
90
80
70
60
CDF [%]
50
40
30
20
10 3D
2D
0
−10 −5 0 5 10 15 20 25
Downlink wideband SINR [dB] 340 / 494
Classical Inter-Cell Interference Mitigation
• Divide-and-conquer approach:
– fragmenting the network area into small zones independently controlled from each other
– making progressively use of advanced error correction coding, link adaptation, frequency
selective scheduling and lately single-user and multi-user MIMO in each of those zones.
341 / 494
Towards Multi-Cell MIMO: Coordination and Cooperation
• Jointly allocate resources across the whole network (and not for each cell
independently) and use the antennas of multiple cells to improve the received signal
quality at the mobile terminal and to reduce the co-channel interferences.
• Two categories:
– Coordination: No data sharing (user data is available at a single transmsitter) - CSI
sharing. Modelled by an Interference Channel and Interfering Broadcast/Multiple
Access Channel
– Cooperation: Data sharing (user data is available at multiple transmsitters) - CSI
sharing. Modelled by a Broadcast Channel (for Downlink) and Multiple Access Channel
(for Uplink)
342 / 494
Towards Multi-Cell MIMO: Coordination and Cooperation
343 / 494
Network Deployments
344 / 494
System Model - Interference Channel
where
– yq ∈ nr,q ,
– 2 I
nq is a complex Gaussian noise CN 0, σn,q nr,q ,
−1
– Λq,i refers to the path-loss and shadowing between transmitter i and user q,
– Hq,i ∈ nr,q ×nt,i models the MIMO fading channel between transmitter i and user q.
345 / 494
Linear Precoding
• scheduled user set of cell i, denoted as Ki , as the set of users who are actually
scheduled by BS i at the time instant of interest
where
– ci is the symbol vector made of ne,i unit-energy independent symbols
– Pi ∈ nt,i ×ne,i is the precoder made of two matrices, namely a power control
diagonal matrix denoted as Si ∈ ne,i ×ne,i and a transmit beamforming matrix
Wi ∈ nt,i ×ne,i .
– Pq,i ∈ nt,i ×nu,q , Wq,i ∈ nt,i ×nu,q , Sq,i ∈ nu,q ×nu,q , and cq,i ∈ nu,q are
user q’s sub-matrices and sub-vector of Pi , Wi, Si , and
ci , respectively.
– The input covariance matrix at cell i is Qi = E c′i c′H
i subject to the transmit power
constraint Tr{Qi } ≤ Es,i .
346 / 494
Linear Precoding
nr,q
• The received signal yq ∈ of user q ∈ Ki
−1/2
X −1/2
yq = Λq,i Hq,i Pq,i cq,i + Λq,i Hq,i Pp,i cp,i
p∈Ki , p6=q
| {z }
intra-cell (multi-user) interference
X X −1/2
+ Λq,j Hq,j Pl,j cl,j +nq .
j6=i l∈Kj
| {z }
inter-cell interference
347 / 494
Achievable Rate
• By treating all interference as noise, the maximum rate achievable by user q in cell i
with linear precoding is
nu,q
X
Rq,i = log2 (1 + ρq,l ) .
l=1
• The quantity ρq,l denotes the SINR experienced by stream l of user-q and writes as
S
ρq,l = .
Il + Ic + Io + kgq,l k2 σn,q
2
where S refers to the received signal power of the intended stream, Il the
inter-stream interference, Ic the intra-cell interference (i.e. interference from
co-scheduled users) and Io the inter-cell interference and they write as
S = Λ−1 2
q,i |gq,l Hq,i pq,i,l | ,
X −1
Il = Λq,i |gq,l Hq,i pq,i,m |2 ,
m6=l
nu,p
X X
Ic = Λ−1 2
q,i |gq,l Hq,i pp,i,m | ,
p∈Ki ,p6=q m=1
X
Io = Λ−1 2
q,j kgq,l Hq,j Pj k .
j6=i
348 / 494
Achievable Rate
Example
Given the precoders in all cells, what is the SINR of stream l of user-q in cell i?
• Noise plus interference: Il + Ic + Io + kgq,l k2 σn,q
2 H
= gq,l Rni gq,l where
X
Rni = Λ−1
q,i Hq,i pq,i,m (Hq,i pq,i,m )
H
m6=l
nu,p
X X
+ Λ−1
q,i Hq,i pp,i,m (Hq,i pp,i,m )
H
350 / 494
System Model - Broadcast and Multiple Access Channel
• If transmitters (resp. receivers) in different cells are allowed to cooperate and can
share any information through an ideal backhaul, the MIMO IC effectively becomes
the MIMO BC (resp. MIMO MAC).
– Giant MIMO BC in the downlink and MIMO MAC in the uplink.
• Focus on DL. By stacking up the transmit signal vectors, the received signal at user
q is
yq = H q c ′ + n
where
h ′ ′
iT h iT h iT
c′ = c1T , . . . , cnTc , y = y1T , . . . , yKT
T
, n = nT1 , . . . , nTKT ,
h i
−1/2 −1/2
Hq = Λq,1 Hq,1 . . . Λq,nc Hq,nc ,
• The transmit signal vector in the DL multi-cell cooperation writes as
KT
X
c′ = c′q
q=1
h ′ ′
iT
where c′q = cq,1
T T
, . . . , cq,n c
.
′
T = 0 ∀j 6= i where cell i is the serving cell of user q (i.e. q ∈ K ).
– In the MIMO IC, cq,j i
• Careful: power constraint per base Tr QjPn≤c Es,j ∀j and not sum-power
Pncstation
constraint across base stations j=1 Tr Qj ≤ j=1 Es,j ! 351 / 494
Network Architecture: Multi-Cell Measurement, Clustering
and Transmission
• All interfering links do not affect equally user q’s performance.
– Dominant interfering links with small path losses/shadowing contribute to a high
interference while other interfering links are almost invisible to user q.
– Only the CSI of the dominant interfering link should actually be measured and made
available to the transmitters (the CSI of other links may be ignored).
• The MC measurement set of user q ∈ Ki whose serving cell is i is defined as the set
of cells about which channel state/statistical information related to their link to the
MT is reported and is expressed based on long-term channel properties as
( )
Λ−1 Es,i
q,i
Mq = j −1 ≤δ
Λq,j Es,j
352 / 494
Multi-Cell Measurement, Clustering and Transmission
• The MC-requested user set of cell i is defined as the set of MC users that have cell i
in their MC measurement set, i.e.
Ri = { l| i ∈ Ml , ♯Ml > 1} .
Note that the MC-requested user set can also be viewed as the victim user set of cell
i as it is the set of users who could be impacted by cell i in the absence of multi-cell
cooperation/coordination.
• The MC clustering set of user q ∈ Ki on subcarrier/time instant k is defined as the
set of cells (BS) participating in the multi-cell coordination/cooperation.
• The MC transmission set Tk,q is a subset of the MC clustering set and is defined by
the BS or set of BSs actively transmitting data to MT q.
353 / 494
Distributed and Centralized Architecture
354 / 494
User-Centric and Network-Predefined Clustering
• user-centric clustering: each UE/MT has its owm clustering set. Clustering sets
dynamically selected and may overlap.
• network predefined clustering: cells are statically clustered and MTs are only served
by one cluster. Clusters do not overlap.
355 / 494
Capacity of the Interference Channel
356 / 494
Reference Book
– Chapter 13
Section: 13.4
357 / 494
SISO Interference Channel
• ηq,i = Λ−1 2
q,i Es,i /σn,q
– long-term SNR when user q is served by cell i
– long-term INR (interference to noise ratio) when q is a victim user of cell i
• Conditions: η̃2,1 << η̃1,1 and η̃1,2 << η̃2,2 (or simply by η̃c << η̃d in the symmetric
case)
• The interfering signal is treated as noise and encoding/decoding as in the absence of
interference is sufficient
– divide-and-conquer approach mentioned earlier (e.g., with frequency-reuse in cellular
systems)
359 / 494
Weak Interference Regime
• Conditions: η̃2,1 < η̃1,1 and η̃1,2 < η̃2,2 (or simply by η̃c < η̃d in the symmetric case)
• Capacity unknown in general but the best known achievable region has been
proposed by Han-Kobayashi.
• Capacity outer-bound lies within 1 bit of the capacity inner-bound achieved by the
Han-Kobayashi (HK) scheme
• Main idea behind Han-Kobayashi scheme:
– split each transmitter information into two parts, i.e., a common and a private message.
– A codebook shared between both transmitters is used to construct independently the
common messages at each transmitter.
– The private messages are constructed from independent codebooks.
– Each receiver jointly decodes the common messages (and therefore partially cancel off
part of the interference) by treating the private messages as interference, cancels the
common messages from the received signal and then decodes the intended private
message.
360 / 494
Weak Interference Regime
• Hence
η̃d (1 − x)
Rsym = Rp + Rc = log2 1 +
1 + η̃c (1 − x)
η̃c x 1 η̃d x + η̃c x
+ min log2 1 + , log2 1 +
1 + ηI 2 1 + ηI
• x chosen so that the interference level caused by the private message has the same
level as the other user’s noise level.
– interference caused by the private message has little impact on the other user’s
performance.
– does not prevent each user from experiencing a relatively large private message rate as
long as η̃d > η̃c .
– η̃c (1 − x) ≈ 1, i.e. 1 − x ≈ 1/η̃c and x ≈ η̃c − 1 /η̃c
• Assuming η̃d >> 1 and η̃c >> 1 and η̃d > η̃c
1 1
Rsym ≈ min log2 (η̃d ) + [log2 (η̃d ) − log2 (η̃c )] ,
2 2
max {log2 (η̃c ) , log2 (η̃d ) − log2 (η̃c )} .
362 / 494
Mixed Interference Regime
• Conditions: η̃2,1 ≥ η̃1,1 and η̃1,2 < η̃2,2 or η̃2,1 < η̃1,1 and η̃1,2 ≥ η̃2,2 .
• Not meaningful for the symmetric case
• The capacity is also unknown but the best known achievable region relies on the
Han-Kobayashi scheme.
363 / 494
Strong Interference Regime
• Conditions: η̃2,1 ≥ η̃1,1 and η̃1,2 ≥ η̃2,2 (or simply, η̃c ≥ η̃d in the symmetric case).
• The capacity region has been identified:
– The interfering signal can be decoded along with the desired signal, i.e. each user is
able to decode both messages.
– By decoding first the interfering signal, the rate of the desired signal is improved.
Unfortunately, the decodability of the interfering signal puts a constraint on the other
users’ rates, therefore resulting in a tradeoff between the interfering signal rate and the
desired signal rate.
– The two-user SISO IC capacity region is expressed as the intersection of the capacity
regions of the two SISO MAC formed by the two transmitters and each receiver q = 1, 2
Ri ≤ log2 (1 + η̃q,i ) , i = 1, 2
R1 + R2 ≤ log2 (1 + η̃q,1 + η̃q,2 ) .
• Given the strong interference regime η̃2,1 > η̃1,1 and η̃1,2 > η̃2,2 , the intersection
simply writes
Proposition
The capacity region CIC of the Gaussian two-user SISO IC with strong
interference is the set of all achievable rate pair (R1 , R2 ) such that
Ri ≤ log2 (1 + η̃i,i ) , i = 1, 2
R1 + R2 ≤ min {log2 (1 + η̃1,1 + η̃1,2 ) , log2 (1 + η̃2,2 + η̃2,1 )} .
364 / 494
Strong Interference Regime
• Symmetric
Corollary
The capacity region CIC of the symmetric Gaussian two-user SISO IC with
strong interference is the set of all achievable rate pair (R1 , R2 ) such that
Ri ≤ log2 (1 + η̃d ) , i = 1, 2
R1 + R2 ≤ log2 (1 + η̃d + η̃c ) .
• In the strong interference regime, the capacity region of the two-user SISO IC is a
pentagon (as in two-user SISO MAC).
• The symmetric rate (it is actually the symmetric capacity) simply writes as
1
Rsym = log2 (1 + η̃d + η̃c ) ,
2
1
≈ max {log2 (η̃d ) , log2 (η̃c )} ,
2
1
≈ log2 (η̃c ) .
2
365 / 494
Very Strong Interference Regime
• Can the capacity region, under some interference conditions, become a square only
determined by the inequalities Ri ≤ log2 (1 + η̃i,i ) , i = 1, 2?
– i.e. each transmitter can communicate with its receiver at a rate equal to the one
achievable without any interference
• Possible whenever
• The very strong interference conditions can be viewed from another angle that is
reminiscent of the SIC behavior in SISO BC.
• When user 1 decodes user 2’s signal in the very strong interference regime, it treats
its own signal as noise. Hence, for user 1 to be able to cancel correctly user 2’s
signal, the interfering channel between transmitter 2 and user 1 has to be strong
enough to support R2 , i.e.
2
!
Λ−1
1,2 |h1,2 | Es,2 η̃1,2
R2 ≤ log2 1 + 2 2 = log 2 1 + .
σn,1 + Λ−1 1,1 |h1,1 | Es,1
1 + η̃1,1
Given that user 2 wants to receive its message at a rate R2 = log2 1 + η̃2,2 , this
puts the constraints
η̃1,2
log2 1 + η̃2,2 ≤ log2 1 + ,
1 + η̃1,1
which equivalently writes as η̃1,2 ≥ η̃2,2 + η̃1,1 η̃2,2 . The other condition is obtained
similarly by looking at user 2’s requirement to decode user 1’s message correctly.
367 / 494
Degrees of Freedom - Multiplexing Gain
Definition
Proposition
Figure: Achievable multiplexing gain per user of the two-user Gaussian SISO IC (α = INR/SNR).
369 / 494
More than Two-User SISO Interference Channels
• The extension to more than two users is far from being clear.
– In the very weak interference regime, the optimality of treating the interference as
noise has been established for scenarios with more than two users.
– In the strong interference regime, the extension of the two-user strategy to more than
two users is not straightforward in general.
– In the weak and mixed interference regimes, the situation is even less clear.
• In the nc -user case (or nc -cell case), is the degree of freedom be of the order of
1/nc ?
• Fortunately not! In a nc -user interference channel where the intended and interfering
signals are of comparable strength (i.e. medium interference regime), it is possible
with Interference Alignment to achieve a multiplexing gain per user of 1/2 despite
the presence of nc interfering users!
– As the transmit power of each base station increases, every user will be able to
simultaneously achieve half of the capacity he could achieve in the absence of the
interference from other users.
370 / 494
More than Two-User SISO Interference Channels
where ηi = Es Λ−1 2
i,i /σn,i . The total achievable multiplexing gain at the network
level is defined as
nc
X Rj
lim = gs,sum .
Es −→∞
j=1
log 2 ηj
• Meaningful at asymptotically high SNR and INR, not necessarily at finite SNR and
INR!
371 / 494
More than Two-User SISO Interference Channels
– time-varying: channel coefficients vary from one channel use to the another
– beamforming over multiple symbol extensions of the time-varying channel.
– every user must be able to partition its observed signal space into two subspaces of
equal size: 1) one for the desired signals, 2) one for the waste basket for all the
interference terms, under the constraint that the vector spaces corresponding to the
interference must exactly align at every user receiver within the waste basket.
372 / 494
MIMO Interference Channels
• Use multiple antennas over static channels rather than time-varying channels
• Two-user MIMO IC
Proposition
The two-user MIMO IC with nt,1 , nt,2 antennas at the two transmitters and
nr,1 , nr,2 antennas at their respective receivers has a maximum multiplexing
gain
gs,sum = min {nt,1 + nt,2 , nr,1 + nr,2 , max {nt,1 , nr,2 } , max {nt,2 , nr,1 }} .
The two-user MIMO MAC with nr receive antennas and nt,1 , nt,2 transmit
antennas at the two transmitters has a total multiplexing gain
– If nt,1 = nt,2 = nr,1 = nr,2 = n, gs,sum = 2n with BC and MAC. Twice as much as
MIMO IC!
– The way antennas are distributed at both ends does not affect the multiplexing gain in
MIMO BC and MIMO MAC, contrary to the MIMO IC!
• Note: In multi-cell cooperation, no sum power constraint anymore but per BS power
constraint
374 / 494
Coordinated Scheduling and Power Control
375 / 494
Reference Book
– Chapter 13
Section: 13.5, 13.6
376 / 494
Multi-Cell Multi-User Diversity
• In the symmetric configuration, the users within a cell i are assumed to be located at
the same distance from the base station i, i.e. Λq∈Ki ,i = Λ ∀i, such that they
experience the same average SNR η.
Proposition
• Same scaling law for ub and lb: degradation created by inter-cell interference
becomes negligible when the number of users is large!
• With a rate maximization scheduler, the network is not interference limited as long
as K is large enough.
378 / 494
Multi-Cell Multi-User Diversity
• In the asymmetric configuration, the users are uniformly distributed in each cell such
that the path-loss is determined by the distance between the user and its serving cell.
Λ−1
q,i is a random variable i.i.d. across users and cells.
Proposition
• Same scaling law for ub and lb but the presence of unequal path-losses among users
enhances the multi-user diversity, therefore resulting in a larger growth rate in the
asymmetric case (log K) compared to the symmetric case (log log K).
379 / 494
Multi-Cell Multi-User Diversity
• Similar behavior was already observed in single-cell MU-MIMO where the intra-cell
interference is shown to have a negligible impact on the capacity when random
beamforming and opportunistic scheduling are performed under the assumption that
the number of users in the cell is large enough.
• Possible to achieve the optimal network capacity under a totally distributed network
architecture that does not require any exchange of CSI and coordination among cells.
Indeed, each cell can perform single-cell scheduling simply relying on the report of a
CQI ρq,lb .
380 / 494
Multi-Cell Resource Allocation
• Narrowband transmission
nc X
X
{S⋆ , W⋆ , G⋆ , K⋆ } = arg max wq Rq,i
S,W,G,K⊂K
i=1 q∈Ki
381 / 494
Coordinated Power Control
• Same scaling laws for lower- and upper-bounds on the network capacity
(a)symmetric configurations for large K and with a rate maximization approach.
– → The use of power control, even though optimal, does not further increase the
network capacity, and transmitting at full power is optimal in the asymptotic case of a
large number of users.
Proposition
Assuming a fading channel hq,i to be independent and identically Rayleigh
distributed across users, for a fixed number of cells nc and an asymptotically
large number of users per cell Ki = K, ∀i, the network capacity with optimal
power control and rate-maximization based scheduling scales as
Kր
C̄n ∼ nc log log K in a symmetric network configuration, and as
Kր
C̄n ∼ nc 2ǫ log K in an asymmetric network configuration.
382 / 494
High and Low SINR Regimes
• Imagine that the users to be scheduled K have been selected, and focus on optimal
power allocation.
nc
X
{S⋆ } = arg max wq Rq,i
S∈S
i=1,q∈Ki
• The original problem can be converted into the following user scheduling and power
control exhaustive search problem
nc
X
{S⋆ , K⋆ } = arg max
n
wq Rq,i .
S⊂Sb c ,K⊂K
i=1,q∈Ki
An exhaustive search is conducted over the sets Sbnc and K to find the optimal S⋆
and K⋆ .
383 / 494
Two-Cell Clusters
• Interestingly, in the specific two-cell network (nc = 2), binary power control is not
only optimal at low SINR but in the whole SINR range for a network relying on a
rate maximization policy
Proposition
In the two-cell case, the network sum-rate maximizing power allocation s⋆1 , s⋆2
is binary and always takes one of the following three power allocation
candidates: (Es,1 , es,2 ), (es,1 , Es,2 ) and (Es,1 , Es,2 ).
• Implications:
– The transmit power can be quantized to two values without loss of capacity.
– This makes the power allocation strategy particularly simple and of very low overhead.
– The decision cannot be taken based on local CSI only as the optimal decision requires
simultaneous CSI from both cells, therefore requiring some form of centralized
scheduler.
– Popular in LTE-A in the name of on-off power control or coordinated silencing. Robust
to CSI measurement and CSI feedback impairments once implemented at the subframe
level in OFDMA networks
• The binary allocation is not optimal anymore for a network-wide proportional
criterion for which the weights w1 and w2 are different, and for a more general
set-up containing more than two cells.
384 / 494
OFDMA Networks
• Assume SISO
• User assignments for all sub-carriers and all cells: K = {Ki }n c
i=1 where
Ki = {Kk,i }∀k .
• Power allocation: S = {Si }n c
i=1 where Si = {sk,i }∀k
385 / 494
Objective function
where
R(k),q,i = log2 (1 + ρk,q )
and 2
Λ−1
q,i h(k),q,i
sk,i
ρk,q = P
−1
2 ,
j6=i Λq,j h(k),q,j sk,j + σ 2
n,k,q
• Non-convex problem. The globally optimal solution may not be found but
near-optimal solutions can be obtained using iterative algorithms. The key idea relies
on an iterative optimization of scheduling and power allocation as discussed below.
386 / 494
Optimality Conditions
• For a predefined set of scheduled users, the optimal power allocation problem must
satisfy the Karush-Kuhn-Tucker (KKT) conditions
• Lagrangian of the optimization problem
nc T −1 nc T −1
!
X X X X
L (S, K, ν) = wq R(k),q,i + νi Es,i − sk,i
i=1 k=0,q∈Kk,i i=1 k=0
where ν = {νi }n c
i=1 is the set of Lagrange multipliers associated with the power
constraint in each cell.
• The solution should satisfy
∂L
=0
∂sk,i
and !
T
X −1
νi Es,i − sk,i = 0,
k=0
PT −1
under the constraints νi ≥ 0, sk,i ≥ 0 and k=0 sk,i ≤ Es,i , for i = 1, . . . , nc and
k = 0, . . . , T − 1.
387 / 494
Optimality Conditions
∂L
• ∂sk,i
= 0 leads to
∂R(k),q,i X ∂R(k),q′ ,m
wq + wq′ ,m = νi ,
∂sk,i ∂sk,i
m6=i
′
with q ∈ Kk,i and q ∈ Kk,m . Equivalently we can write
∂R(k),q,i
wq − Πk,i = νi
∂sk,i
P
where we define Πk,i = m6=i Πk,i,m with
∂R(k),q′ ,m ∂R(k),q′ ,m ∂Ik,q′ ,m
Πk,i,m = −wq′ ,m = −wq′ ,m
∂sk,i ∂Ik,q′ ,m ∂sk,i
| {z }
−πk,q ′ ,m
(a)
= wq′ ,m πk,q′ ,m Λ−1 h(k),q′ ,i 2
q ′ ,i
P
−1
2
where Ik,q′ ,m = l6=m Λq′ ,l h(k),q′ ,l sk,l is the total interference received by user
q ′ in cell m and πk,q′ ,m is defined as the non-negative quantity that represents the
marginal increase in rate of user q ′ in cell m per unit decrease in total interference
on subcarrier k.
388 / 494
Optimality Conditions
• Power allocation
P 2
1 wq j6=i Λ−1
q,j h(k),q,j
2
sk,j + σn,k,q
= sk,i + 2
log 2 (νi + Πk,i ) Λ−1 h(k),q,i
q,i
P
for i = 1, . . . , nc and k = 0, . . . , T − 1, where Πk,i = m6=i Πk,i,m and
!
1 1 ρ2k,q′
πk,q′ ,m = .
log 2 Λ−1
′
h(k),q′ ,m 2 sk,m 1 + ρk,q′
q ,m
389 / 494
Interference Pricing
• View πk,q′ ,m as a price charged to other cells for generating interference to user q ′
•
∂R(k),q,i
wq − Πk,i = νi
∂sk,i
is a necessary and sufficient optimality condition for the problem in which each cell i
specifies a power level sk,i on subcarrier k to maximize the following surplus function
390 / 494
Iterative Scheduler
• In the downlink, the inter-cell interference is only function of the power levels and is
independent of the user scheduling decisions.
• This suggests that the user scheduling and the power allocation can be carried out
separately.
• An iterative scheduler can be derived so that the best user to schedule are first found
assuming a fixed power allocation, then the best power allocation are computed for
the fixed scheduled users.
• Assuming a fixed power allocation, given the independence of the inter-cell
interference on the scheduled users,
391 / 494
Iterative Scheduler
• Initialization step: We first fix the maximum number of iteration Nmax and fix
n = 0. We initialize S(0) using e.g. a binary power control strategy, and compute
K(0) and Π(0) .
(n)
• Iteration-n: For each cell i = 1, . . . , nc , we update the power allocation Si based
(n−1) (n−1) (n−1)
on K ,Π and Sj for ∀j 6= i (i.e. assuming the transmit powers in the
other cells remain fixed) as follows
+
2 2
(n) 1 w q σI,k,q + σn,k,q (n−1)
sk,i = − 2 , q ∈ Kk,i
log 2 ν + Π(n−1)
i
−1
Λ h(k),q,i
q,i
k,i
where X 2 (n−1)
2
σI,k,q = Λ−1
q,j h(k),q,j sk,j
.
j6=i
P −1 (n)
The parameters νi are obtained from the power constraint Tk=0 sk,i ≤ Es,i . After
obtaining the power allocation S(n) , the user selection K(n) and finally Π(n) are
obtained. The procedure is repeated till convergence or till the number of iterations
reaches Nmax .
392 / 494
Modified Iterative Water-Filling
393 / 494
Feedback and Message Passing Requirements
• Centralized implementation:
– Each user q in cell i to report nc T channel measurements as the channel from user q
to any base station j over all subcarriers must be known.
– Each cell i forwards the CSI to a centralized controller.
– The central controller performs the modified iterative water-filling and informs each cell
about the scheduled user and the transmit power on each subcarrier.
• Distributed implementation:
– Cells exchange with each other messages and rely on the feedback information from
the users
– Each cell is assumed to be aware of local CSI, i.e. CSI that can be measured by its user
and reported, plus the messages exchanged between cells.
– From cell i perspective:
(n−1) 2
2 2
and Λ−1
the report to cell i from users q ∈ Kk,i of σI,k,q + σn,k,q q,i h(k),q,i
the reception by cell i from each cell m 6= i in the MC clustering set of user q of the tax
(n−1)
information Πk,i,m at iteration n − 1
(n−1)
the transfer of the tax information Πk,j,i at iteration n − 1 to cooperating cell j.
394 / 494
Coordinated Beamforming and Interference
Alignment
395 / 494
Reference Book
– Chapter 13
Section: 13.7
396 / 494
Coordinated Beamforming
where S refers to the received signal power of the intended stream, Il the
inter-stream interference, Ic the intra-cell interference (i.e. interference from
co-scheduled users) and Io the inter-cell interference
S = Λ−1 2
q,i |gq,l Hq,i wq,i,l | sq,i,l ,
X −1
Il = Λq,i |gq,l Hq,i wq,i,m |2 sq,i,m ,
m6=l
nu,p
X X
Ic = Λ−1 2
q,i |gq,l Hq,i wp,i,m | sp,i,m ,
p∈Ki ,p6=q m=1
X
2
1/2
Λ−1
Io = q,j
gq,l Hq,j Wj Sj
.
F
j6=i
397 / 494
Zero-Forcing Beamforming and Block Diagonalization
The zero forcing constraint forces Wq,i to lie in the null space of H̃q,i .
• Serving cell channel (between the user and the serving BS) but also the interfering
cells channels (between the user and the interfering BSs in the MC measurement
set) need to be known at the transmitter.
398 / 494
Interference Alignment
399 / 494
Conditions for Interference Alignment
• Assumptions:
– nc cells fully connected with nc users with nt transmit antennas at each BS (nt,i = nt
∀i) and nr ≤ nt (nr,i = nr ∀i) receive antennas at each MT.
– Each MT receives ne < nr data streams from its serving BS.
where C (A) is the column space of a matrix, i.e., the vector space spanned by the
column vectors of matrix A.
400 / 494
Conditions for Interference Alignment
ne < nr , C a1 a2 = C
• Given b1 b2 is satisfied if ∃ G such that
G a1 a2 = G b 1 b 2 = 0
• Interference is aligned before receive shaping in such a way that after receive
shaping, it is completely canceled out and the received signal yi of the scheduled
user in cell i lies in the d-dimensional signal space.
• The transmit filter Wi and the receive shaping Gl are obtained as solutions of the
set of nc (nc − 1) equations
−1/2
Λl,i Gl Hl,i Wi = 0, l 6= i, ∀i, l = 1, ..., nc .
401 / 494
Closed Form Solutions
• Assume
– nc = 3, n = nt = nr with n even for simplicity.
– The precoder Wi is of dimension n × n/2.
– The channel matrices are full rank.
• Show that there exist n/2 non-interfering paths between transmitter i and receiver i
for each i = 1, 2, 3, i.e. a total multiplexing gain of 3n/2.
• The interference can be zero-forced if the dimension of the interference space is
≤ n/2
– e.g. at receiver 1, r H1,3 W3 H1,2 W2 = n/2. Recall that
r (H1,2 W2 ) = n/2 and r (H1,3 W3 ) = n/2!
• The conditions for IA
C (H1,2 W2 ) = C (H1,3 W3 ) ,
C (H2,1 W1 ) = C (H2,3 W3 ) ,
C (H3,1 W1 ) = C (H3,2 W2 ) ,
• Given that channel matrices are invertible,
C (W2 ) = C H−1
3,2 H3,1 W1 ,
C (W3 ) = C H−1
2,3 H2,1 W1 .
C (W1 ) = C (TW1 ) ,
where T = H−1 −1 −1
3,1 H3,2 H1,2 H1,3 H2,3 H2,1 . 402 / 494
Closed Form Solutions
• Set W1 = eig (T) whereeig (T) refers to n/2 dominant eigenvectors of T. Hence
W1 = t1 . . . tn/2 .
• Stricter conditions
H2,1 W1 = H2,3 W3 ,
H3,1 W1 = H3,2 W2 ,
leding to W2 = H−1 −1
3,2 H3,1 W1 and W3 = H2,3 H2,1 W1 .
403 / 494
Closed Form Solutions
• Given the alignment of H1,2 W2 and H1,3 W3 , U(0) refers to the n/2 singular
H
vectors corresponding to zero singular values. Hence by selecting G1 = U(0) ,
we obtain
−1/2 1/2
X −1/2 1/2
y1 = Λ1,1 G1 H1,1 W1 S1 x1 + Λ1,j G1 H1,j Wj Sj xj + G1 n1 ,
j=2,3
−1/2 1/2
= Λ1,1 G1 H1,1 W1 S1 x1 + G1 n1 .
−1/2
User 1 perceives an equivalent channel given by Heq,1,1 = Λ1,1 G1 H1,1 W1 and is
not affected by multi-cell interference.
• The interference alignment presented so far creates an interference-free subspace but
does not attempt to maximize the desired signal strength within the desired signal
subspace.
– IA solution is not a function of the direct channels Hi,i
– sub-optimal at low and medium SNR
– Enhancement: precoding in a second stage (IA being the first stage) along the
eigenvectors of Heq,1,1 and applying water-filling based on its singular values.
404 / 494
IA Illustration
405 / 494
Iterative Solution
• Closed form solutions for IA have been found for specific settings only.
• In general, with nc > 3, nt 6= nr and ne,i streams for transmitter i, analytical
solutions to the IA problem are difficult to obtain.
• Iterative solution:
(n−1) (n)
– Assuming the transmit precoders Wi at iteration n − 1, the receiver shaping Gi
are first computed at iteration n.
(n) (n)
– The updated transmit precoders Wi can then be computed based on all Gi .
– The process iterates until convergence.
• The iterative algorithm alternates between the original and reciprocal networks.
– The reciprocal network consists in switching the roles of transmitter and receiver.
– We denote a variable in the reciprocal network with a bar on top.
– In the reciprocal network, the channel matrix writes as H̄j,i = HH i,j .
– IA conditions in the reciprocal network write as Ḡj H̄j,i W̄i = 0, ∀j 6= i.
– W̄i = GH i and Ḡi = Wi .
H
406 / 494
Iterative Solution
• Within each network only the receive filters are updated to minimize the total
inter-cell leakage interference.
– In the original network, the total inter-cell interference leakage at receiver i due to all
interfering transmitters is given by
n o
Io,i = Tr Gi Qi GH i
with X
Qi = Λ−1 H H
i,j Hi,j Wj Sj Wj Hi,j .
j6=i
• Design the receive shaping Gi such that it lies in the space spanned by the ne,i
eigenvectors corresponding to the ne,i smallest eigenvalues of Qi . Writing
Q i = U Q i ΛQ i U H
Qi with the entries of ΛQi ranked by increasing order of
magnitude, Gi = UH Qi (:, 1 : ne,i ).
407 / 494
Iterative Solution
(0)
• Initialization step: Start with arbitrary precoding matrices Wi with
(0) H (0)
Wi Wi = Ine,i .
• Iteration-n: Alternate between the original and the reciprocal networks:
1 In the original network, compute the interference covariance matrix at each receiver i
(n)
X (n−1) (n−1) H H
Qi = Λ−1
i,j Hi,j Wj S j Wj Hi,j ,
j6=i
and fix the receive shaping in the original network and the transmit beamformer in the
reciprocal network respectively as
(n) (n) (n) H
Gi = UH(n) (:, 1 : ne,i ) , W̄i = Gi .
Qi
2 In the reciprocal network, compute the interference covariance matrix at each receiver
j, X
(n) (n) (n) H H
Q̄j = Λ̄−1
j,i H̄j,i W̄i S̄i W̄i H̄j,i ,
i6=j
and fix the receive shaping in the reciprocal network and the transmit beamformer in
the original network respectively as
(n) (n) (n) H
Ḡj = UH(n) (:, 1 : ne,i ) , Wj = Ḡj .
Q̄j
For simplicity, uniform power allocation (Sj = S̄j = Es,j /ne,j Ine,j ) is often
assumed.
408 / 494
Iterative Solution
60
nc=1, 2x2, ne=2
55
n =1, 4x4, n =4
50 c e
n =2, 2x2, n =1
45 c e
nc=2, 2x2, ne=2
Sum−rate [bps/Hz]
40
n =2, 4x4, n =2
c e
35
n =3, 2x2, n =1
c e
30 nc=3, 4x4, ne=2
25
20
15
10
5
0
0 5 10 15 20 25 30
SNR [dB]
H1,2 v 2
w1 ~
y1 H1,3 v3 = H1, 4 v 4
H1,1 v1
~
y2 w2
w2
H1, 2 v 2
H1,3v3 = H1,4 v4
w3 ~
y3
~
y4
w4
Figure: The system model of the two-cell interfering MIMO-MAC when nt = 2, nr = 3, and
two cells with two users in each cell.
410 / 494
CSI Feedback and Message Exchange
40
Perfect feedback
Scaled B as in Theorem 3
35
Perfect feedback − 4log (2)
2
Number of feedback bits: 10
Achievable sum rate [bit/sec/Hz]
30
Number of feedback bits: 8
Number of feedback bits: 6
25
20
15
10
0
0 5 10 15 20 25 30
SNR [dB]
411 / 494
Other Beamformers
• Matched Beamforming
– important when it comes to Massive MIMO
412 / 494
Coordinated Scheduling, Beamforming and Power
Control
413 / 494
Reference Book
– Chapter 13
Section: 13.8
414 / 494
Coordinated Scheduling, Beamforming and Power Control
where
R(k),q,i = log2 (1 + ρk,q )
with 2
Λ−1
q,i h(k),q,i wk,q,i
sk,q,i
ρk,q = Pn P 2 ,
c
j=1 u∈Kk,j Λ−1
q,j h(k),q,j wk,u,j
sk,u,j + σ 2
n,k,q
(u,j)6=(q,i)
415 / 494
Optimality Conditions
where ν = {νi }n c
i=1 is the set of Lagrange multipliers associated with the power
constraint in each cell.
• The solution should satisfy
∂L
=0
∂sk,q,i
and
T
X −1 X
νi Es,i − sk,q,i = 0,
k=0 q∈Kk,i
PT −1 P
under the constraints νi ≥ 0, sk,q,i ≥ 0 and k=0 q∈Kk,i sk,q,i ≤ Es,i , for
i = 1, . . . , nc and k = 0, . . . , T − 1.
416 / 494
Iterative Scheduler
where
nc
X X
2
2
σI,k,q = Λ−1
q,j h(k),q,j wk,u,j
sk,u,j .
j=1 u∈Kk,j
(u,j)6=(q,i)
• Scheduler:
– For a fixed user schedule and transmit power per beam, optimize the beamforming
vectors.
– For fixed beamforming vectors and power allocation, the user scheduling is done per
beam by finding the user that maximizes wq R(k),q,i .
– For a fixed beamformers and user schedule, the power levels are updated.
• Recall that SINR of each user needs to be accurately computed by the BS at every
iteration!
– Inaccurate SINR prediction hampers the appropriate selection of the users, the
transmission ranks and the beamformers at every iteration of scheduler and ultimately
the whole link adaptation and the convergence of the scheduler
418 / 494
A General Framework of Coordination
with
T −1
(0) 1 X X (0)
Ui = wq R(k),q,i Pk,q,i
T
k=0 q∈K
(0)
k,i
419 / 494
A General Framework of Coordination
• Iteration-n: Each cell revisits its decision regarding the users to be scheduled and
their transmit precoders, based on the decisions taken by other cells in iteration
n − 1.
– The scheduling decisions in a given cell i are not only function of the utility metric of
users scheduled by that cell but also of the utility metric of victim users that have been
tentatively scheduled by other cells in iteration n − 1.
– Cell i allocates resources such that
n o
(n) (n) (n) (n) (n) (n−1)
S i , W i , Gi , K i = arg max Ui Ki , R i .
Ki ∈ Ki
(n−1)
Pk,i Ki , Ri
(n−1)
Ki , Ri : function of its served user set and its victim user set at iteration n − 1!
– Utility metric of cell i at iteration n
T −1
(n) 1 X X (n) (n−1)
Ui = wq R(k),q,i Pk,q,i , P (n−1)
T k=0 (n)
k,j∈Mq
q∈Kk,i
| {z }
Single-cell weighted sum-rate
(n−1)
− Π i Ri .
| {z }
Tax to be paid due to the interference created to vicitm users in adjacent cells
420 / 494
Massive MIMO
421 / 494
Reference Book
– Chapter 5,12,13
Section: 5.4, 12.2, 12.6, 12.8, 13.7
422 / 494
Introduction
423 / 494
Point-to-Point i.i.d. Channels
Proposition
In i.i.d. Rayleigh fading channels, the ergodic capacity with CDIT is achieved
under an equal power allocation scheme Q = Int /nt , i.e.,
( )
ρ
C̄CDIT = Īe = E log2 det Inr + Hw HH w ,
nt
or equivalently,
( n )
X ρ
C̄CDIT = Īe = E log2 1 + λk ,
nt
k=1
424 / 494
Point-to-Point i.i.d. Channels
Proposition
The ergodic capacity of i.i.d. Rayleigh fast fading channels with CDIT is given
by Z ∞
C̄CDIT = Īe = n log2 (1 + ρλ/nt )pλ (λ)dλ,
0
425 / 494
Point-to-Point i.i.d. Channels
0
0 5 10 15 20 25 30
Number of antennas N
• Observations: Asymptotically,
– C̄CDIT scales linearly with N for squared MIMO systems at any SNR,
– For MISO and SIMO systems, C̄CDIT /N decreases toward zero with increasing N
(and the decrease is faster for MISO than for SIMO).
– If the number of antennas is increased only at one side, the asymptotic rate of the
capacity growth is equal to zero. 427 / 494
Large Antenna Array Regime in Point-to-Point i.i.d.
Channels
• For nt = nr = N , we may take the limit for N → ∞ of C̄CDIT /N , and obtain
• Observations:
– limN →∞ C̄CDIT /N is a constant, which only depends on the SNR.
– The capacity scales with N = n at any SNR in the large antenna array regime!
428 / 494
Large Antenna Array Regime in Point-to-Point i.i.d.
Channels
• N = nr → ∞, with n = nt fixed:
– W/N = HH H/N converges to In as N → ∞.
– This implies that, for a fixed value of n, the n eigenvalues of W/N approach one, i.e.
the empirical distribution pλ′ (λ′ ) (where λ′ , λ/N ) approaches δ(λ′ − 1).
– Hence,
C̄CDIT N
lim = log2 1 + ρ .
N =nr →∞ n n
• N = nt → ∞, with n = nr fixed:
– The empirical distribution of the eigenvalues of W/N = HHH /N also converges
almost surely to δ(λ′ − 1),
– Hence
C̄CDIT
lim = log2 (1 + ρ),
N =nt →∞ n
which is equal to the capacity of a SISO AWGN channel!
• N = nt → ∞, n = nr → ∞, in a constant ratio N/n > 1:
– pλ′ (λ′ ) (where λ′ , λ/N ) can be computed
– Hence
C̄CDIT n N N
lim = log2 1 + ρ + ρ − ρβ + 1 − log2 (1 − β) − log2 (e) β
n→∞ n N n n
N
with β a function of n
and ρ.
429 / 494
Large Antenna Array Regime in Point-to-Point i.i.d.
Channels
• The capacity in the large antenna regime scales linearly with n at any SNR
– Recall that in the non-asymptotic case, the linear increase in n is observed only at high
SNR!
– The growth rate is only function of the SNR and the ratio N/n.
• The convergence is very fast, so that the large antenna array regime is reached
already for values of n as small as 3.
• CCDIT is Gaussian distributed, with a mean value given by in previous slide, and a
variance decreasing as 1/N in the first two scenarios. In the third scenario, the
variance dependence towards ρ and N/n is more complex.
• Under the large antenna array regime, the channel becomes much more deterministic
and the channel matrices better conditioned (as opposed to random),
– see e.g. the distribution of the eigenvalues of W/N in scenarios 1 and 2.
430 / 494
Large Antenna Array Regime in Multi-User Channels
• Recall
Proposition
For channels H1 , . . . , HK , SNR ηq , number of receive antennas nr , the gain of
DPC over TDMA is upper-bounded by the minimum between the number of
transmit antennas nt and the number of users K
CBC (H)
≤ min {nt , K} .
CT DM A (H)
• In the limit of large nt and with a fixed K and nr , the DPC gain over TDMA
equals K:
Proposition
For any fixed nr , K and effective SNR ηq ∀q, the gain of DPC over TDMA for
a large number of transmit antennas (nt → ∞) in fading channels that are
independent and identically Rayleigh distributed across antennas and
independent across users writes as
C̄BC nt ր
∼ K.
C̄T DM A
431 / 494
Intuition
• Fading channels are independent and identically Rayleigh distributed across antennas
and independent across users.
• Hence, by the law of large numbers, the nr K rows of H become mutually
orthogonal as nt becomes large,
1
lim Hl HH
p = Inr δlp , ∀l, p = 1, . . . , K,
nt →∞ nt
432 / 494
Sum-Rate Capacity of Massive MISO BC
• Assuming Λ−1 2
q /σq is the same for all users, the optimal power allocation boils down
to the uniform power allocation sul,q = Es /K ∀q.
433 / 494
Sum-Rate of Linear-Precoded Massive MISO BC
• At high and low SNR, linear beamforming (BF) techniques (based on ZFBF) can
achieve the same scaling rate as DPC.
Proposition
At high SNR and low SNR, DPC and BF have the same scaling rate
Es ր
CBC (H) ∼ CBF (H)
Es ց
CBC (H) ∼ CBF (H)
434 / 494
Sum-Rate of Linear-Precoded Massive MISO BC
– achievable sum-rate
K
X
C̄BF ≈ CBF ≈ log2 1 + nt Λ−1 2
q /σn,q sq .
q=1
– Assuming Λ−1 2
q /σq is the same for all users, the uniform power allocation sq = Es /K
∀q maximizes C̄BF .
– Strong similarity with the sum-rate capacity: Matched beamforming achieves the
sum-rate capacity in the very large transmit antenna regime if sq are chosen equal to
sul,q .
– Rate approximations are only valid in the large antenna regime for K/nt → 0. If
K, nt → ∞ with the ratio nt /K = α, CBF with matched beamforming exhibits an
error floor in the limit of large SNR.
435 / 494
Massive SIMO MAC
• Similar observations also hold in the multiple access channels (i.e. in the uplink) with
a large number of receive antennas. Assuming a SIMO MAC (with single antenna
transmitters) with large nr ,
K
X
C̄M AC ≈ CM AC ≈ log2 (1 + nr ηq ) .
q=1
• Under the large receive antenna regime, this sum-rate is achievable with a simple
receive matched filter.
436 / 494
Large Antenna Array Regime in Multi-User Channels
• The transmit (and also receive) beamforming gain approximates as nt for large nt .
As nt increases, the value of khq k2 , being a χ22nt distributed random variable,
concentrates indeed more and more around its mean.
• The SINR and the sum-rates become exclusively a function of Λq and not of the
fading (that is so useful to benefit from MU diversity)!
• For large nt , C̄BF /C̄T DM A = K!
Proposition
For any fixed nr , K and effective SNR ηq ∀q, the gain of DPC/BF over TDMA
for a large number of transmit antennas (nt → ∞) in fading channels that are
independent and identically Rayleigh distributed across antennas and
independent across users writes as
C̄BC/BF nt ր
∼ K.
C̄T DM A
– For large nt , C̄T DM A ≈ log2 1 + nt maxq=1,...,K {ηq } and
PK nt η q
C̄BF ≈ q=1 log2 1 + K (assuming uniform power allocation).
• Transmit/Receive beamforming and MU diversity are somehow not complementary.
A large nt benefits the array gain and multiplexing gain but restricts the MU
diversity gain. 437 / 494
Channel Hardening and Scheduling
• This behavior is called channel hardening and originates from the fact that, with
2
transmit and/or receive beamforming, the variance of rate σR decreases with nt and
nr as nt and nr tend to infinity, respectively.
• In general, any transmission scheme that exploits spatial diversity reduces the
multi-user diversity gain because of the channel hardening effect.
• Scheduling in Massive MIMO becomes much simpler!
438 / 494
Linear Precoding: Matched Beamforming
with
−1
F = HH HHH ,
−1
= H̄H H̄H̄H D−1 .
| {z }
F̄
440 / 494
Sum-Rate Evaluations
70
4 − MBF
4 − ZFBF
60
4 − IF
16 − MBF
50 16 − ZFBF
Sum−rate [bps/Hz]
16 − IF
64 − MBF
40 64 − ZFBF
64 − IF
30 4 − DPC
20
10
0
0 5 10 15 20 25 30 35 40
SNR [dB]
441 / 494
Sum-Rate Evaluations
• MBF on the other hand performs relatively poorly (except at low SNR)
– sum-rate performance fundamentally limited by intra-cell interference and his SINR is
limited at high SNR by the ratio α = nt /K.
– MBF requires a much larger number of antennas to reach the same performance as
ZFBF.
• In summary, for K >> nt and nt >> K, simple linear precoding schemes provide
very competitive alternatives to more complex (non-linear) strategies.
442 / 494
Inter-Cell Interference
• Inter-cell interference is naturally mitigated and the need for multi-cell coordination
or cooperation therefore vanishes as the number of transmit antennas increases!
• Same for Uplink
443 / 494
Practical issues
• More antennas packed in a limited volume, increased spatial correlation and antenna
coupling.
– But recall that spatial correlation/LoS can be beneficial to MU-MIMO.
• Accurate CSIT
– FDD?
– Reciprocity in TDD (if perfect calibration is performed), but the presence of pilot
contamination originating from the reuse of the same pilots by users in different cells
degrades uplink channel estimates and therefore limits the performance of Massive
MIMO
444 / 494
Real-World MIMO Wireless Networks
445 / 494
Reference Book
– Chapter 14
446 / 494
System Requirements
• peak rate
– highest theoretical throughput achievable with SU-MIMO spatial multiplexing but are
typically not achieved in practical deployments.
– e.g. 8x8 Spatial multiplexing with 8 streams transmission
447 / 494
Frame Structure
• Multiplexing/Access
– DL: OFDM
– UL: DFT-Spread OFDM (SC-FDM)
• Frame structure
– OFDMA/SC-FDMA create a
time-frequency grid composed of
time-frequency resources
– A resource block (RB) is formed by 12
consecutive REs in the frequency
domain for a duration of 7 OFDM
symbols in the time domain.
– A subframe is formed of 14
consecutive OFDM/SC-FDM symbols.
– Scheduling and data transmission is
performed at the RB-level with the
minimum scheduling unit consisting of
two RBs within one subframe.
– First 3 symbols used to carry control
information.
448 / 494
Key Downlink Technologies
450 / 494
Antenna Deployments
451 / 494
Reference Signals
453 / 494
Channel State Information (CSI) feedback
• Closed-Loop (Spatial Multiplexing and MU-MIMO) rely on RI, CQI and PMI
– If Spatial Multiplexing, the actual precoder is the same as the one selected by the user
(PMI)
– If MU-MIMO based on CRS, the actual precoder is the same as the one selected by the
user (PMI)
– If MU-MIMO based on DM-RS, the actual precoder (e.g. ZFBF) is computed based on
the one selected by the user (PMI).
454 / 494
Network Deployments
455 / 494
Network Deployment Scenarios
456 / 494
Macro-Pico Heterogeneous Deployment
• Cell association
– Connect a terminal (for both DL and UL links) to the cell with the strongest received
DL power.
– In HetNet, owing to the transmit power difference between the high power and low
power node, it is preferable to connect to the cell with the lowest path-loss in the UL
and connect to the cell with the strongest received DL power in the DL.
– The UL coverage area therefore becomes larger than the DL coverage area, leading to
different cell associations in the DL and UL and particularly complexifying the system.
• Cell loading
– Balance the load among macro- and picocells in highly loaded cells in order to
maximize the resource reuse between cells.
457 / 494
Macro-Pico Heterogeneous Deployment
• To allow flexible load balancing and keep the same cell association for DL and UL,
LTE-A supports cell range expansion (CRE)
– The range of the low-power node is controlled by a cell association bias.
– Expansion so that a UE may be associated with a cell which does not provide the
strongest received signal power.
– Unfortunately, range expansion brings inter-cell interference whose severity increases as
the cell association bias increases.
458 / 494
Macro-Femto Heterogeneous Deployment
Aggressors
Alt 1. ICIC via UE relaying
Home eNB
Home UE
Macro eNB Macro UE Victim UE
Dead-zone
DSL gateway
• When the probability of being located in the vicinity of HeNB and the deployment
ratio of HeNB are both low, static ICIC is sufficient.
– silencing resources by e.g. ON-OFF, blank subframes
• When the deployment ratio of HeNB is high and the outage probability becomes
noticeable, dynamic ICIC is useful.
459 / 494
Beyond LTE-A: Massive Multi-Cell and Massive
Multi-Antenna Networks
460 / 494
System-Level Performance Evaluations
461 / 494
Reference Book
– Chapter 15
462 / 494
System Level Assumptions
463 / 494
Single-User MIMO
Antenna deployment (with 6RB) Antenna configuration (for ULA)
3 4
ULA 2x2 (4,15)
[bits/s/Hz/cell]
2.75 DP 3.5
37% 4x2 (4,15)
[bits/s/Hz/cell]
40% 48%
3 4x4 (4,15)
2.5 43% 50% 65%
58% 2x2 (0.5,8)
2.5
77% 4x2 (0.5,8)
2.25 2 4x4 (0.5,8)
2 1.5
8x2 (0.5,15) 4x2 (0.5,15) 4x2 (4,15) 1 RB 3 RB 6 RB 50 RB
0.15 0.2
cell edge throughput
DP
[bits/s/Hz/user]
0.15 4x2 (4,15)
0.1
4x4 (4,15)
0.1
2x2 (0.5,8)
0.05 4x2 (0.5,8)
0.05
4x4 (0.5,8)
0 0
8x2 (0.5,15) 4x2 (0.5,15) 4x2 (4,15) 1 RB 3 RB 6 RB 50 RB
Observations: Observations:
• 4 × 2 ≥ 2 × 2, 4 × 4 >> 4 × 2 (symmetric
• 8 × 2 > 4 × 2 (larger transmit array gain conf. better).
and SM gain).
• gain of large nr higher for less correlated
• DP > ULA for cell average if nt large (SM scenarios and large subband size.
gain & large array gain on each pol).
• ULA>DP for cell edge (large array gain
due to spatial correlation).
• Large AS < small AS (large array gain >
large SM gain). 464 / 494
Single-User MIMO
Statistics of the transmission rank
°
Channel estimation errors (ULA,3RB)
3 RB, ULA, 4λ, 15 AS
1
2x2
4x2 7%
0.5 3.5
(4,15) (0.5,8)
[bits/s/Hz/cell]
4x4 6%
3
0
1 2 3 4 4% 4%
2.5
3 RB, ULA, 0.5λ, 8°AS 7% 6%
1
probability
2x2 2
2x2 4x2 4x4 2x2 4x2 4x4
0.5 4x2
4x4
0.2
[bits/s/Hz/user]
6 RB, 0.5λ, 15°AS 0.15 non−ideal CSI−RS, non−ideal DM−RS
1
probability
4x2 ULA
0.5 8x2 ULA 0.1
8x2 DP
0
1 2 3 4 0.05
2x2 4x2 4x4 2x2 4x2 4x4
number of transmitted streams
Observations: Observations:
• loss of 6 to 7% in the cell average and cell
• higher ranks as nt , nr , spacing and AS edge throughputs.
increase.
• higher ranks for DP than ULA.
• rank-1 transmission most encountered.
• symmetric antenna set-up: full rank
transmission negligible.
465 / 494
Single-User MIMO
[bits/s/Hz/cell]
[bits/s/Hz/cell]
3 unquantized PMI
3 11.5%
14% 13%
2.5 16% 11% 6%
31% 6.5%
42% 39% 30%
2 2.5
1.5
2x2 4x2 4x4 2x2 4x2 4x4 2
4x2 ULA (0.5,15) 4x2 DP (0.5,15) 8x2 DP (0.5,15)
0.2
cell edge throughput
0.12
0.15
[bits/s/Hz/user]
1 RB
3 RB 0.1
0.1
6 RB
0.05 50 RB
0.08 13.5%
0
2x2 4x2 4x4 2x2 4x2 4x4 0.06
4x2 ULA (0.5,15) 4x2 DP (0.5,15) 8x2 DP (0.5,15)
Observations: Observations:
• as the subband size increases, the • loss incurred by codebook quantization
performance decreases (due to channel larger in DP compared to ULA deployments
frequency selectivity) and in 8 × 2 compared to 4 × 2.
• less pronounced in spatially correlated
environment (0.5, 8).
466 / 494
Multi-User MIMO
Antenna deployment Antenna configuration
cell average throughput
[bits/s/Hz/cell]
3.5 DP 5 8x2 (0.5,15)
8x4 (0.5,15)
3 4
2.5 3
2 2
8x2 (0.5,15) 4x2 (0.5,15) 4x2 (4,15) DP ULA
0.15 0.2
cell edge throughput
[bits/s/Hz/user]
DP
0.1 0.15
0.05 0.1
0 0.05
8x2 (0.5,15) 4x2 (0.5,15) 4x2 (4,15) DP ULA
Observations: Observations:
• 8x4 provides significant gain over 8x2
• ULA > DP for cell average for any nt
– 8Tx ZFBF is far from nulling out MU
• ULA > DP at the cell edge interference
• 0.5λ > 4λ – more pronounced in DP
467 / 494
Multi-User MIMO
MU-MIMO dimensioning without overhead CSI measurement and feedback
non−id. CSI/DM−RS
non−id. CSI/DM−RS
non−id. CSI/DM−RS
cell average throughput
4.5
max−2 streams 3 30%
[bits/s/Hz/cell]
unq. CDI/CQI
unq. CDI/CQI
unq. CDI/CQI
2.5
non−id. CSI
3.5
id. DM−RS
q. CDI/CQI
3 2
2 1
4x2, q. CDI 4x2, unq. CDI 8x2, q. CDI 8x2, unq. CDI
0.125
max−2 streams
[bits/s/Hz/user]
[bits/s/Hz/user]
0.125 max−4 streams
16% 23% 42%
0.1 0.1
9%
0.075
0.05 0.075
4x2, q. CDI 4x2, unq. CDI 8x2, q. CDI 8x2, unq. CDI
Observations: Observations:
• ranking of losses in increasing order of
• 4 × 2: 4 streams > 2 streams with accurate
severity: CSI-RS < q. CQI < DM-RS <<
feedback, 2 streams > 4 streams with
q. CDI (assuming 6RB subband size and
quantized feedback
feedback delay)
• 8 × 2: 4 streams > 2 streams if accurate
and quantized feedback
468 / 494
Multi-User MIMO
Dynamic switching based on quantized Dynamic switching based on unquantized
feedback feedback
cell average throughput
[bits/s/Hz/cell]
3.5 MU − rank−1 report MU − rank−1 report
[bits/s/Hz/cell]
2 2
4x2 DP (4) 4x2 DP (0.5) 4x2 ULA 8x2 DP 8x2 ULA 4x2 DP (0.5,15) 4x2 ULA (0.5,15) 8x2 DP (0.5,15)
0.15 0.15
cell edge throughput
0.1 0.1
11%
0.075 8% 0.075
0.05 0.05
4x2 DP (4) 4x2 DP (0.5) 4x2 ULA 8x2 DP 8x2 ULA 4x2 DP (0.5,15) 4x2 ULA (0.5,15) 8x2 DP (0.5,15)
Observations: Observations:
• MU-MIMO and SU/MU-MIMO dynamic • dynamic SU/MU-MIMO ≥ MU-MIMO
switching bring negligible gain over SU in based on rank-1 report
4 × 2 DP (4,15)
• MU-MIMO and SU/MU-MIMO dynamic
switching bring only 5% gain over SU in
4 × 2 DP (0.5,15)
• MU with rank-1 report > MU w/
469 / 494
Multi-User MIMO
Multi-User Diversity Multi-User Diversity
user average throughput cell average throughput
2.8 50
2.6 40
MU − rank−1 report
relative gain
2.4 30 SU/MU − SU report
2.2 SU − SU report 20
MU − rank−1 report
2 10
SU/MU − SU report
1.8 0
2 4 6 8 10 12 14 2 4 6 8 10 12 14
1.25 5
SU − SU report
1 0
MU − rank−1 report
0.75 SU/MU − SU report
−5
0.5
−10 MU − rank−1 report vs. SU − SU report
0.25 SU/MU − SU report vs. SU − SU report
0 −15
2 4 6 8 10 12 14 2 4 6 8 10 12 14
Average number of users per cell Average number of users per cell
Observations:
• SU-MIMO > MU-MIMO for small K (less than 4) while MU-MIMO > SU-MIMO for large
K.
• MU-MIMO relies more heavily on multi-user diversity than SU-MIMO
• SU/MU-MIMO dynamic switching outperforms both SU-MIMO and MU-MIMO for any
number of users.
470 / 494
Multi-Cell MIMO
9 9
400 400
11 10 5 4 11 10 5 4
200 12 6 200 12 6
y [m]
y [m]
0 2 1 0 2
2 1
3 3
−20014 13 20 19 −20014 13 20 19
15 21 15 21
−400 −400
17 16 17 16
18 18
−600 −600
−500 −400 −300 −200 −100 0 100 200 300 400 500 −500 −400 −300 −200 −100 0 100 200 300 400 500
x [m] x [m]
600
8 7
9
400
11 10 5 4
200 12 6
y [m]
0 2
2 1
3
−20014 13 20 19
15 21
−400
17 16
18
−600
−500 −400 −300 −200 −100 0 100 200 300 400 500
x [m] 471 / 494
Multi-Cell MIMO
400 400 9
9
11 10 5
5 4 11 10 5
5 4
200 12 6 200 12 6
y [m]
y [m]
0
2 1 0
2 1
3 3
−20014 13 20 19 −20014 13 20 19
15 21 15 21
−400 −400
17 16 17 16
18
−600 −600
−500 −400 −300 −200 −100 0 100 200 300 400 500 −500 −400 −300 −200 −100 0 100 200 300 400 500
x [m] x [m]
600
8 7
400 9
11 10 5
5 4
200 12 6
y [m]
0
2 1
3
−20014 13 20 19
15 21
−400
17 16
18
−600
−500 −400 −300 −200 −100 0 100 200 300 400 500
x [m] 472 / 494
Multi-Cell MIMO: Intra-site vs. Inter-site Clustering
• Percentage of users whose CoMP (MC) measurement set size is 1 to 6 for inter-site
and intra-site (10dB triggering threshold).
CoMP measurement set size 1 2 3 4 5 6
Inter-site CoMP 53% 23% 18% 3% 2% 1%
Intra-site CoMP 75% 19% 6% 0% 0% 0%
Observations:
473 / 494
Multi-Cell MIMO: User-centric vs. Network Predefined
Clustering
• Percentage of users whose CoMP (MC) measurement set size is 1 to 6 for
user-centric and network predefined cooperating (or clustering) set (10dB triggering
threshold).
CoMP measurement set size 1 2 3 4 5 6
user-centric cooperating set 53% 23% 18% 3% 2% 1%
inter-site network predef. cooperating set 80% 14% 6% 0% 0% 0%
• Percentage of users whose CoMP (MC) measurement set size is 1 to 6 for two
different network predefined cooperating (or clustering) sets (10dB triggering
threshold).
CoMP measurement set size 1 2 3
intra-site network predefined cooperating set 75% 19% 6%
inter-site network predefined cooperating set 80% 14% 6%
Observations:
70 70
percentage of users
percentage of users
60 60
50 50
40 40
30 30
20 20
10 10
0 0
0 5 10 15 20 25 30 0 5 10 15 20 25 30
triggering threshold [dB] triggering threshold
Observations:
• With a 10dB triggering threshold
Deployment Absolute overhead Overhead increase
intra-site K ∗ B ∗ (0.75 ∗ B + 0.19 ∗ 2 + 0.06 ∗ 3) = 1.31KB 31%
inter-site K ∗ B ∗ (0.53 ∗ B + 0.23 ∗ 2 + 0.24 ∗ 3) = 1.71KB 71%
• User centric clustering requires higher overhead than network predefined clustering.
• With network predefined clustering, inter-site requires less overhead than intra-site. 475 / 494
Coordinated Scheduling and Beamforming
• Assumptions:
– Coordinated SU-MIMO in homogeneous network: one user scheduled at a time in each
cell on a given time/frequency resource
– Network level iterative coordinated scheduling and beamforming based on interference
pricing, Signal-to-Leakage-and-Noise-Ratio (SLNR) filter design and user-centric
clustering
476 / 494
Coordinated Scheduling and Beamforming
Iterative CSCB SU-MIMO vs. SU-MIMO Iterative CSCB SU-MIMO vs. SU-MIMO
nt × nr = 4 × 2 ULA (4,15) and (0.5,8) nt × nr = 4 × 4 ULA (4,15)
cell average throughput
[bits/s/Hz/cell]
2.6 CSCB SU 16 iterations CSCB SU
3.8
2.9%
2.4 9.5% 2%
3.6 10.5%
2.2 3.4
4.5%
2 3.2
(4,15) 1RB (0.5,8) 1RB (4,15) 3RB (0.5,8) 3RB 1 3 6
0.11
cell edge throughput
0.14
[bits/s/Hz/user]
CSCB SU
0.09 31% 0.13
26.5% 0.12
42% 31.2% 18.3%
0.07 19% 0.11
6.6%
0.1
0.05 0.09
(4,15) 1RB (0.5,8) 1RB (4,15) 3RB (0.5,8) 3RB 1 3 6
subband size [RB]
Observations:
• Gain of about 30% coordination at the cell edge for a CSI overhead increase of 71%
• Big loss as the CSIT accuracy decreases
– ... and this assumed unquantized CSI, user receiver implementation assumed known at
the Tx, perfect CSI measurement, no delay
477 / 494
Coordinated Scheduling and Beamforming
[bits/s/Hz/cell]
0.6 CSCB SU 2.6 8 iterations
probability
16 iterations
0.4 2.4
0.2 2.2
0 2
1 2 (4,15) 1RB (0.5,8) 1RB (4,15) 3RB (0.5,8) 3RB
°
3 RB, 4x2 ULA, 0.5λ, 8 AS
0.11
[bits/s/Hz/user]
6%
0.6
probability
CSCB SU 0.09
11%
0.4
0.07 13%
0.2
0 0.05
1 2 (4,15) 1RB (0.5,8) 1RB (4,15) 3RB (0.5,8) 3RB
number of transmitted streams
Observations: Observations:
478 / 494
Coordinated Scheduling and Beamforming
0.1
cell edge throughput
[bits/s/Hz/user]
0.09
0.08
32% 24% 43%
0.07
0.06
0.05
(4,15) 3RB (0.5,8) 3RB
Observations:
• Most of the potential gain lost due to inaccurate LA.
• Inaccurate CQI prediction hampers the appropriate selection of the users, the transmission
ranks and the beamformers at every iteration of scheduler and ultimately the whole link
adaptation and the convergence of the scheduler
479 / 494
Coordinated Scheduling and Beamforming
CS vs. CS/CB
[bits/s/Hz/cell]
2.6 CSCB
2.4
2.2
2
(4,15) 1RB (0.5,8) 1RB (4,15) 3RB (0.5,8) 3RB
0.1
cell edge throughput
[bits/s/Hz/user]
0.09
0.08
0.07
0.06
(4,15) 1RB (0.5,8) 1RB (4,15) 3RB (0.5,8) 3RB
Observations:
480 / 494
Robust Joint Scheduling and Rank Coordination
Improve cell edge user experience by Rank coordination: Each cell edge UE
enabling robust multi-streams transmission recommends the interfering cells to use a
to cell edge users transmission rank that is the most beneficial
to its performance
481 / 494
Robust Joint Scheduling and Rank Coordination
[bits/s/Hz/cell]
3.7 dyn. RR SU (B)
4.5%
stat. RR SU
3.6
3.5
0.14
[bits/s/Hz/user]
0.13
dyn. RR SU (B) 4%
6.5%
0.12 stat. RR SU
0.11 25.5%
0.1
Observations:
• Better performance gain with a significantly lower feedback overhead and scheduler
complexity
• About 20% gain at the cell edge with only 2-bit additional feedback compared to
uncoordinated SU-MIMO
482 / 494
Heterogeneous Networks
483 / 494
Heterogeneous Networks: Femto
Macro MT (×) inside and outside the apartment complex, femto MT (o), femto BS (∗).
484 / 494
Heterogeneous Networks: Femto
Observations:
485 / 494
Heterogeneous Networks: Femto
0.6 0.6
DR increases
0.4 0.4
Observations:
486 / 494
Heterogeneous Networks: Femto
100
Macro cell +
80 24 Femto cells
60 Macro cell +
12 Femto cells
40
20
Macro cell only
0
0% 5% 10% 20% 30%
deployment ratio (DR) [%]
Observations:
• The cell area throughput is boosted and benefits from the increase in the cell-splitting gain
of femto cells.
• As the DR (i.e., the number of femtocells) is doubled, the area throughput is however not
doubled due to the inter-cell interference increase.
487 / 494
Heterogeneous Networks: Femto
3
[bits/s/Hz/cell]
2.5
2
1.5
0% 10% 20% 30%
15
[bits/s/Hz/user] probability [%]
PF−TDMA
10 PF−FDMA
5
0
0% 10% 20% 30%
cell edge throughput
0.12
0.08
0.04
0
0% 10% 20% 30%
deployment ratio (DR) [%]
Observations:
• PF-FDMA helps the macro user performance (Outage still large but decreased).
• Exploit frequency selective scheduling in interference-limited environments.
488 / 494
Static Binary ON/OFF Power Control in Heterogeneous
Networks
Common Resource Silencing for non reliable backhaul with PF-FDMA
• Femto resource
utilization factor α
• Common silencing
resource is allocated
exclusively for macro
MT protection.
• No femto MT can be
allocated on that
common resource.
489 / 494
Static Binary ON/OFF Power Control in Heterogeneous
Networks
Femtocell throughput Macro and femto throughput
1 3
DR=10%, α=0.5
Observations:
2.6 2.6
[bits/s/Hz/cell]
[bits/s/Hz/cell]
macro cell av.
w/ dynamic PC
2.4 6.1% 2.5
14.9% w/ static PC, α=0.9
2.2 2.4 w/ static PC, α=0.5
2 2.3
0% 10% 20% 10% 20%
[bits/s/Hz/user] [bits/s/Hz/user]
w/o dynamic PC w/ dynamic PC
w/ dynamic PC 0.09 w/ static PC, α=0.9
5
0.08 w/ static PC, α=0.5
0 0.07
0% 10% 20% 10% 20%
cell edge throughput
0.12 3
w dynamic PC
0.08 69% w/ static PC, α=0.9
1281% 2
0.04 w/ static PC, α=0.5
0 1
0% 10% 20% 10% 20%
deployment ratio (DR) [%] deployment ratio (DR) [%]
Observations:
491 / 494
Heterogeneous Networks - DAS
• Potential gain of single-cell and multi-cell MIMO in theory but benefits may vanish
in practical scenarios
– Sensitivity to CSI measurement: channel estimation errors particularly large for cell
edge users
– CSI feedback inaccuracy: Limited feedback, Subband feedback with strong frequency
selectivity within subband, Particularly problematic in dual-polarized antenna
deployments
– Latency of the feedback and the backhaul
– Feedback and message exchange overhead: Target cell edge users
– Inaccurate link adaptation: due to feedback inaccuracy, BS does not know the receiver
at the mobile terminal, Traffic model, Fast variation of the inter-cell interference
– Scheduler convergence and complexity
– Many other issues left: time/frequency synchronization, antenna calibration, ...
• Sensitivity different depending on SU-MIMO, MU-MIMO, Muti-Cell MIMO
493 / 494
Some Conclusions
• Gap between theory and practice gets much bigger as we move from single-cell to
(cooperative/coordinated) multi-cell designs
• Account for impairments
494 / 494