Physical Layer Deep Learning of Encodings For The MIMO Fading Channel
Physical Layer Deep Learning of Encodings For The MIMO Fading Channel
Abstract— We introduce a novel physical layer scheme for back from the receiver to the transmitter). Space-Time Block
Multiple Input Multiple Output (MIMO) communications Codes (STBC) are regularly used to increase robustness and
based on unsupervised deep learning using an autoencoder. extend coverage in open-loop systems [1], and in dense
This method extends prior work on the joint optimization
of physical layer representation and encoding and decoding multi-user environments, closed-loop MIMO with precoding
processes as a single end-to-end task by expanding the trans- techniques based on CSI are used to improve single and
mitter and receiver to the multi-antenna case. We introduce a multi-user throughput [2], [3]. While current day MIMO
domain appropriate wireless channel impairment model (the systems provide a substantial benefit, they are quite complex
multi-input multi-output Rayleigh fading channel), into the and not known to be optimal, leading to the obvious question:
autoencoder optimization problem in order to directly learn
a system which optimizes for it. This approach demonstrates ”can we do better?” Recent results in physical layer learning
significant potential for learning schemes which achieve and for the Single Input Single Output (SISO) channel [11]
exceed performance of current day methods which are widely have shown that autoencoders [7] can readily match the
used in existing wireless MIMO systems. We discuss how the performance of near-optimal existing baseline modulation
scheme can be easily adapted for open-loop and closed-loop and coding schemes by using an autoencoder to jointly learn
operation in spatial multiplexing modes as well as spatial
diversity modes. Each of these modes is learned and realized the system.
using the same simple and compact approach.
I. I NTRODUCTION
Multiple Input Multiple Output (MIMO) wireless systems
are widely used today in 4G cellular and wireless local area
network systems to increase throughput and coverage by
exploiting the multipath characteristics of the channel. By
encoding information across multiple antenna elements using Fig. 1. SISO channel autoencoder system
spatial-multiplexing or spatial-diversity schemes throughput
or range can be improved in various channel conditions. The idea of the channel autoencoder [8], [11] applies deep
Current schemes rely on rigid analytically obtained encod- unsupervised learning with a reconstruction loss function to
ing/beamforming and decoding techniques for these tasks jointly optimize encoding, decoding, and signal representa-
and are in general not known to be optimal. Moreover, spatial tion over some impaired communications channel. A brief
multiplexing techniques mostly rely on Channel State Infor- visualization of the channel autoencoder system is shown in
mation (CSI) estimation, quantization and feedback which Figure 1. This approach is appealing as it finds solutions
further complicates the ability of these schemes to perform for modulation and forward error correction which rival
optimally. In contrast, we introduce a scheme which can today’s best designs for small code word sizes over existing
combine many of these tasks into a single end-to-end learned channel impairment models, and offers a method to learn
estimation, feedback, encoding, and decoding process which solutions over channel impairments for which no optimal
can be jointly optimized to maximize throughput and min- solution is known [10] or a compact analytical representa-
imize bit error rate (BER) for specific channel conditions. tion may be difficult to express and optimize considering
We believe this joint system optimization process has the real world effects. Another benefit of such systems is that
potential to provide significant gains in comparison with the computational complexity of the learned encoder and
current day systems which are optimized in a more modular decoder modules can in many cases be lower compared to the
and disjoint fashion. existing methods, leading to potential power savings when
such systems are deployed.
II. BACKGROUND
Traditional MIMO communication schemes are divided III. T ECHNICAL A PPROACH
into two categories which are either open-loop (without CSI In the SISO channel autoencoder, one symbol s of k bits
at the transmitter), or closed-loop systems (with CSI fed is encoded into n sequential time samples to be transmitted
77
• mul: Complex matrix multiplication of x with H ML autoencoder implementation (Figure 4). The symbols
• norm: Normalize average power transmitted at the antenna level are then optimized during
• awg: Additive white gaussian noise N (0, σ) the training. Note that the throughput is the same for both
• dec: Learned Decoder: r → ŝ Alamouti coding and ML implementation since both take two
In terms of these basic operations, we can express the full time slots to send two symbols. In both cases it is assumed
network f as follows for the open loop MIMO encoding that no CSI is available at the transmitter (i.e. open-loop
case: MIMO).
78
the sort of constellation derived throughout the optimization As a result, transmit symbol encodings are learned learned
process, and constraints such as equal power per antenna during training which leverage CSI effectively to increase
could additionally be included as constraints if desired. throughput under specific channel conditions.
In Figure 7 we provide the performance curve comparing
BER from the learned encoding scheme compared against
the conventional system. For the baseline, average BER
results over all antennas are provided. Here, the ML approach
provides very promising results, with better performance
compared to baseline method for all SNR levels.
Full CSI
−1 Baseline
10
10−3
10−4
0 10 20 30
Signal to Noise Ratio (dB)
79
We will further investigate what these constellations would [3] W. Yu, W. Rhee, S. Boyd, and J. M. Cioffi, “Iterative
look like for all symbols over a constant MIMO channel water-filling for gaussian vector multiple access chan-
matrix since it can be difficult to interpret all ”precodings” nels,” IEEE Transactions on Information Theory, vol.
merged together as shown here. 50, no. 1, pp. 145–151, 2004.
Our next step will be to improve this simulation by [4] D. Kingma and J. Ba, “Adam: A method for stochastic
introducing channel estimation and feedback error to re- optimization,” arXiv preprint arXiv:1412.6980, 2014.
flect practical implementation challenges. Moreover, since [5] M. Abadi and et al., TensorFlow: Large-scale machine
feedback of CSI (e.g., from a handset device to a base learning on heterogeneous systems, Software avail-
station) for closed-loop schemes requires protocol overhead able from tensorflow.org, 2015. [Online]. Available:
in real world systems, we will consider the case where CSI https://www.tensorflow.org/.
is constrained to be compact discrete valued information [6] F. Chollet, Keras, https : / / github . com /
encoded into compact p-bit fields. fchollet/keras, 2015.
[7] I. Goodfellow, Y. Bengio, and A. Courville, Deep
V. C ONCLUSION
learning. MIT press, 2016.
The architecture and initial results shown here provide [8] T. J. O’Shea, K. Karra, and T. C. Clancy, “Learn-
an exciting new approach to physical layer design and ing to communicate: Channel auto-encoders, domain
optimization for MIMO wireless communications schemes specific regularizers, and attention,” in 2016 IEEE
using an autoencoder. International Symposium on Signal Processing and
This is a significant departure from current day systems Information Technology (ISSPIT), 2016, pp. 223–228.
which still need to be understood and made to operate DOI : 10.1109/ISSPIT.2016.7886039.
efficiently with real world physical constraints. Preliminary [9] S. Cammerer, T. Gruber, J. Hoydis, and S. t. Brink,
results show that performance can potentially be quite com- “Scaling deep learning-based decoding of polar codes
petitive with existing schemes, especially at high SNRs. via partitioning,” arXiv preprint arXiv:1702.06901,
Further work is needed to compare these schemes to 2017.
existing baselines with error correction coding considered [10] N. Farsad and A. J. Goldsmith, “Detection algorithms
as well. Since channel autoencoders tend to learn encod- for communication systems using deep learning,”
ings equivalent to traditional joint modulation and coding CoRR, vol. abs/1705.08044, 2017. [Online]. Available:
schemes in terms of capacity, this would be a more fair http://arxiv.org/abs/1705.08044.
comparison, but would require non-matching modulation [11] T. J. O’Shea and J. Hoydis, “An introduction to
rates (with matching un-coded bit information rates). We deep learning for the physical layer,” CoRR, vol.
believe that this approach represents an exciting vein of abs/1702.00832, 2017. [Online]. Available: http://
future research considering that this approach has shown arxiv.org/abs/1702.00832.
promise in learning efficient encodings without any prior
knowledge about how to encode for such a channel, and
given that such a scheme could be learned for potentially
very different channel conditions which depart from the
MIMO Rayleigh model used here.
Lastly, these kinds of models are potentially quite at-
tractive due to the efficient computational realization which
leverages concurrent NN structure and minimal iteration,
as compared to the iterative algorithms used for detection
and error correction in many systems today. Significant
promise has been shown in the area of pure error correction
decoding in this area [9], and we believe that optimizing the
joint decoding process with MIMO symbol estimation and
decoding will in the future hold increasingly more promising
results than through isolated modular optimization of either
independently.
R EFERENCES
[1] S. M. Alamouti, “A simple transmit diversity tech-
nique for wireless communications,” IEEE Journal on
Selected Areas in Communications, vol. 16, no. 8,
pp. 1451–1458, 1998.
[2] E. Telatar, “Capacity of multi-antenna gaussian chan-
nels,” European Transactions on Telecommunications,
vol. 10, no. 6, pp. 585–595, 1999.
80