Digital Predistortion of 5G Multiuser MIMO Transmitters Using Low-Dimensional Feature-Based Model Generation
Digital Predistortion of 5G Multiuser MIMO Transmitters Using Low-Dimensional Feature-Based Model Generation
Abstract— In this article, we present a novel digital predistor- behavior of power amplifiers (PAs) in MIMO arrays can lead
tion (DPD) system which can be updated quickly and efficiently to significant distortion that deteriorates the signal quality.
in response to the dynamic reconfiguration of multiuser multiple- Digital predistortion (DPD) has been widely recognized as
input multiple-output (MIMO) transmitters. By identifying the
shared properties of different power amplifiers (PAs) with two one of the most effective linearization techniques in modern
feature extraction stages, nonlinear behaviors of the PAs are communication systems [2], [3]. Compared with the conven-
encoded into low-dimensional features. Using the extracted fea- tional single-input single-output (SISO) transmitters (TXs),
tures as input, a novel DPD generator is employed to synthesize the MIMO TXs in 5G exhibit more complex nonlinear char-
DPD model coefficients directly. In this way, when the power acteristics [4]. Recently, many DPD techniques have been
levels or beam angles change, the DPD model can be updated
fast and accurately without capturing the output data or applying proposed to mitigate distortions generated by MIMO TXs.
linear system identification algorithms. To capture slow-varying Various DPD models have been proposed to suppress the
dynamics of the PAs, the features can also be calibrated efficiently nonlinear distortions and crosstalk induced by PAs in fully
in the background. Computational complexity and operational digital (FD) MIMO TXs [5]–[7]. In hybrid beamforming (HB)
latency can thus be reduced significantly. Simulation and exper- TXs, the linearization target may be set to the main beam
imental results demonstrate that the proposed DPD approach
achieves excellent linearization performance with low complexity, via over-the-air (OTA) test, rather than the output of each
making itself a promising linearization solution for 5G massive PA. The conventional SISO DPD models can thus still be
MIMO transmitters. adopted [8], [9]. To update the DPD model coefficients,
Index Terms— 5G, digital predistortion (DPD), feature extrac- different data acquisition strategies have been proposed. The
tion, model adaptation, multiuser, multiple-input multiple-output desired target signal for DPD can be captured using embedded
(MIMO), power amplifier (PA). antennas [10], [11], dedicated coupling structures [12] or
using OTA antennas [13], [14]. To avoid interruption of data
transmission, Wang et al. [15], [16] proposed the OTA-based
I. I NTRODUCTION
indirect identification technique. By monitoring system behav-
Fig. 2. Update of system configuration and DPD model when the supported
UE moves in conventional systems.
separate couplers and data acquisition receivers [15]. It is thus
desirable to characterize the entire array with a single data
As shown in Fig. 2, when the supported UE moves, channel acquisition receiver. One way to achieve it is to capture each
state information (CSI) of UEs is updated. The system needs PA one by one in a time-division manner. Another method is
to change the power and phase in the corresponding subarray to capture the combined feedback signals and then separate the
to steer the beam to the new location of the UE. Owing to components of each PA. In both cases, multiple measurements
the changed system configuration, characteristics of the PAs, are required. As the input signals will change during this
and the subarray as the whole, change as well. In this case, process, the captured signals cannot be directly combined to
new feedback information is needed to update DPD model form the target signal, i.e., far-field signal at UE. Therefore,
coefficients. it is necessary to build a forward model for each PA in the
In MU scenarios, since each user may have different power array and estimate the target signal by feeding a common input
and angle settings, the number of possible system configura- signal to the forward models.
tions can be very large, making it impossible to pre-compute Generally, we assume the PAs share the same model struc-
the DPD coefficients for all conditions in advance. This sug- ture but have independent coefficients. If the PA model is
gests that the DPD coefficients must be updated fast enough to linear in its parameters, it can be expressed in matrix format
track the varying operating condition. The new situation is very as follows:
different from the conventional systems where DPD adaptation
algorithms only need to cope with long-term variation, e.g., ynt = Uλnt (6)
temperature or device aging.
In the new application scenarios, the existing DPD adapta- where ynt is the nth PA output under tth power level consisting
tion schemes face two critical issues: latency and complexity. of N S data samples
The latency of the conventional adaptation schemes may not
be acceptable, because they need to capture enough data and ynt = [ ỹnt (N S ), ỹnt (N S − 1), . . .]T (7)
execute complex model extraction algorithms before they can
update the DPD models. The system may fail to meet the λnt is a vector including all PA model coefficients of nth PA
linearity requirement until the DPD update is finished. The operating under tth power level
issue will be worse with a large number of PAs in MIMO
arrays. Furthermore, because of the differences between PAs λnt = [λnt (1), λnt (2), . . . , λnt (PA )]T (8)
in the array, each PA needs to be characterized separately [27].
The latency from data acquisition and model extraction will
and U is the regression matrix containing all PA model basis
increase rapidly with the array size. The other serious con-
functions constructed with the PA input signal. For example,
cern is the computational complexity in model identification.
if the memory polynomials (MP) model [28] is used, the basis
To avoid interruption of data transmission, we need to estimate
functions have the form of |u(n − m)|k u(n − m), where k is
the PA model coefficients for each PA in the array [15], [16],
the nonlinearity order and m is the delay number, and U can
which can have high complexity when the array size is large.
be constructed as (9), shown at the bottom of the page.
Since the moving UEs can change the PA operating condi-
III. C HARACTERIZATION OF MU MIMO TX W ITH PA
tion, a separate model is needed for each operating condition.
F EATURE E XTRACTION
As a result, a large number of PA models are needed to cover
Considering the large number of PAs in 5G MIMO trans- all potential conditions, leading to very high-computational
mitters, it is impractical to capture the output of PAs using complexity to operate the system.
⎡ ⎤
u(N S ) u(N S − 1) ··· |u(N S )|u(N S ) ···
⎢ u(N S − 1) u(N S − 2) ··· |u(N S − 1)|u(N S − 1) ···⎥
U=⎣ ⎦. (9)
.. .. .. .. ..
. . . . .
1512 IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, VOL. 70, NO. 3, MARCH 2022
Fig. 7. Generation of DPD coefficients for one subarray with DPD generator
under lth power level.
Fig. 6. Extraction of general power-induced information B. signal of the vth subarray in the space can be represented by
Nv
zv = Uv λntn e j (n−1)(β+θv )
PAs under T different power levels can be rearranged as n=1
⎡ ⎤
Nv
ω11 ω21 ··· ωN 1 = Uv Aωnt n e j (n−1)(β+θv )
⎢ ω12 ω22 ··· ωN 2 ⎥
⎢ ⎥ n=1
W = ⎢ . .. .. .. ⎥ (14)
⎣ .. . . . ⎦
Nv
where each column is the coefficients of one PA under T From (17), the PA features of each PA in the subarray
different power levels. Perform SVD on matrix W are combined to model the spatial combination of transmitted
signals. The combined features are referred to as array fea-
W = UW W VW H . (15) tures. As shown in Fig. 7, the PA features are added together
using beam direction information as weighting factors and then
By keeping the PC most significant principal components, normalized. Therefore, the subarray to be linearized can be
that is the first PC columns of UW , we obtain modeled in a similar way as a single PA, using normalized
array features ω instead of PA features ω. From a behavioral
T modeling point of view, the subarray can simply be viewed as
B = B1 T , B2 T , . . . , BTT = UW (:, 1 : PC ). (16)
a special PA with a different set of parameters. Thus, to derive
the relationship between the array model and the desired
If the most representative cases are covered in offline
DPD coefficients, it is sufficient to examine such relationship
training, the transformation matrices can also be generalized
on individual PAs in the array, so we derive and train the
to cover untested cases by interpolation.
DPD generator by collecting the PA features and the DPD
coefficients for the corresponding PAs.
IV. D IRECT S YNTHESIS OF DPD M ODEL C OEFFICIENTS
B. DPD Generator
In the conventional DPD, the model coefficients are usually
To derive the model structure of DPD generator, we first
extracted using linear system identification algorithms, such
consider a post-inverse model where a DPD model is cas-
as LS in (5), where a group of input–output data are gathered
caded after the PA. After model extraction, the post-inverse
and matrix operations are conducted. As mentioned earlier,
coefficients can be directly copied to the pre-inverse in DPD.
this type of model extraction is computationally complex and
Representing the PA using the extracted PA features, the output
time consuming, which is not suitable for MIMO TXs. In this
of post-inverse DPD model is
section, we present a novel DPD generator to find the value
of the model coefficients directly without using LS. The DPD Yc = f DPD (y) = f DPD (UAω) (18)
generator is a model built between PA features ω and the
corresponding DPD model coefficients c. where Y includes DPD basis functions built by PA output
signal y and f DPD represents the DPD model.
When PA characteristics (ω) vary, to ensure the cascaded
system is linear, all DPD coefficients (c) need to change
A. Array Model and DPD Formulation
accordingly. Thus, every DPD coefficient is a function of PA
Before deriving the DPD generator, let us check the array features ω. Without loss of generality, we consider i th DPD
model equation. Consider the case of HB TXs, where one DPD coefficient, c(i ), in the following derivation.
is assigned to each subarray and the DPD linearizes the signals Due to the use of SVD technique in feature extraction,
of the subarray combined in space. According to (3), if we the importance of different PA features decays exponentially,
model the PA behavior with the PA features, the transmitted and the first feature ω(1) is far more important than the rest
1514 IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, VOL. 70, NO. 3, MARCH 2022
TABLE I
C OMPLEXITY C OMPARISON OF DPD M ODEL A DAPTATION C ONSIDERING
C OMPLEX M ULTIPLICATION O PERATIONS
TABLE II
S IMULATED P ERFORMANCE OF O FFLINE E XTRACTION
TABLE III
S IMULATED L INEARIZATION P ERFORMANCE OF P ROPOSED DPD M ETHOD
ON 4 × 8 A RRAY U NDER F IVE P OWER L EVELS S UPPORTING F OUR UE S
B. Linearization Performance
The linearization performance of the proposed DPD method
was then simulated. Core PA model coefficients were updated
with the OTA-based indirect PA identification method [16]. Furthermore, only the coupling from the adjacent RF chains
Data blocks from 68 directions were used to update the core at each side was considered. The coupling factor was set as
PA models and calculate the PA features. It is noted that 0.5, which means that half of the signals from the adjacent RF
PA outputs were not used in DPD adaptation but they were chain was coupled. From the simulation results, we can see
estimated to verify the accuracy of the forward modeling with that, even with a high coupling factor, the mutual coupling has
OTA data acquisition. The worst case NMSE of PA output little impact on the reconstructed PA outputs. With the core
reconstruction is −43.6 dB. PA models, the worst case NMSE of the 32 reconstructed PA
After that, DPD model coefficients were obtained directly outputs is −43.7 dB. The linearization performance is listed
by DPD generator. The linearization performance of the pro- in Table IV, demonstrating that the main beam signals are
posed DPD method for all the four UEs is listed in Table III. effectively linearized with the proposed DPD method under
From the results, the proposed DPD method successfully mutual coupling effect. This is because the received signal at
linearized all UEs. The worst case NMSE and ACPR of four the main beam is still a linear combination of the PA outputs
UEs are −45.9 dB and −51.6 dBc, respectively. Figs. 11 and if only linear mutual coupling occurs. For nonlinear coupling
12 show the spectrum, AM-AM and AM-PM results of signals where the coupling affects the PA nonlinearity, the situation
received by the 1st UE under 2nd power level and 3rd UE is more complex. We will investigate that in our future work.
under fifth power level without and with the proposed DPD.
VI. E XPERIMENTAL R ESULTS
C. Effect of Mutual Coupling In this section, we validate the efficacy of the proposed
Effect of mutual coupling between antenna elements was method on two different MIMO array platforms. In both cases,
considered in the MATLAB simulation. For simplicity, only each PA element has three discrete power levels.
linear coupling between the RF chains was considered in the
simulation. In other words, the output of each RF chain with A. Linearization of Hybrid Beamforming Transmitter
coupling was generated by adding the output of the PA in its 1) Experimental Setup: The test bench for 4 × 4 URA
own RF chain with the outputs from the adjacent RF chains. was set up as shown in Fig. 13. The device used in the
WANG et al.: DIGITAL PREDISTORTION OF 5G MULTIUSER MIMO TRANSMITTERS 1517
TABLE V
E XPERIMENTAL L INEARIZATION P ERFORMANCE OF THE P ROPOSED
DPD U SING OTA D ATA A CQUISITION FOR HB TX W ITH M ODEL
E XTRACTED U NDER −6.0 D B M
[29] Y. Li, X. Wang, and A. Zhu, “Complexity-reduced model adaptation Zhiqiang Yu (Member, IEEE) received the B.S.
for digital predistortion of RF power amplifiers with pretraining-based degree from the Nanjing University of Science and
feature extraction,” IEEE Trans. Microw. Theory Techn., vol. 69, no. 3, Technology, Nanjing, China, in 2002, and the Ph.D.
pp. 1780–1790, Mar. 2021. degree from Southeast University, Nanjing, in 2013.
[30] J. Chani-Cahuana, P. N. Landin, C. Fager, and T. Eriksson, “Iterative From 2002 to 2007, he was a Research Staff in
learning control for RF power amplifier linearization,” IEEE Trans. airborne radar transmitter with the Nanjing Institute
Microw. Theory Techn., vol. 64, no. 9, pp. 2778–2789, Sep. 2016. of Electronics, China Electronics Technology Group
[31] D. R. Morgan, Z. Ma, J. Kim, M. G. Zierdt, and J. Pastalan, “A gener- Corporation, Nanjing. He is currently a Lecturer
alized memory polynomial model for digital predistortion of RF power with the School of Information Science and Engi-
amplifiers,” IEEE Trans. Signal Process., vol. 54, no. 10, pp. 3852–3860, neering, Southeast University. His current research
Oct. 2006. interests include microwave and millimeter-wave
transceiver systems, beamforming networks, and phased arrays for mobile
communication.