Resource Allocation For RIS Assisted CoMP-NOMA Networks Using Reinforcement Learning Final Year Design Project
Resource Allocation For RIS Assisted CoMP-NOMA Networks Using Reinforcement Learning Final Year Design Project
net/publication/385446024
CITATIONS READS
0 146
2 authors:
All content following this page was uploaded by Muhammad Ahmed Mohsin on 01 November 2024.
By
Muhammad Umer (CMS – 345834)
Muhammad Ahmed Mohsin (CMS – 333060)
submitted by Muhammad Ahmed Mohsin and Muhammad Umer have been found
satisfactory for the requirements of the degree.
i
Abstract
This thesis delves into the forefront of wireless communication by exploring the syn-
ergistic integration of three transformative technologies: Simultaneously Transmit-
ting and Reflecting Reconfigurable Intelligent Surfaces (STAR-RIS), Coordinated
Multi-Point transmission (CoMP), and Non-Orthogonal Multiple Access (NOMA).
Driven by the ever-increasing demand for higher data rates, improved spectral ef-
ficiency, and expanded coverage in the evolving landscape of 6G development, this
research investigates the potential of these technologies to revolutionize future wire-
less networks.
The thesis analyzes the performance gains achievable through strategic deploy-
ment of STAR-RIS, focusing on mitigating inter-cell interference, enhancing signal
strength, and extending coverage to cell-edge users. Resource sharing strategies for
STAR-RIS elements are explored, optimizing both transmission and reflection func-
tionalities. Analytical frameworks are developed to quantify the benefits of STAR-
RIS assisted CoMP-NOMA networks under realistic channel conditions, deriving
key performance metrics such as ergodic rates and outage probabilities. Addition-
ally, the research delves into energy-efficient design approaches for CoMP-NOMA
networks incorporating RIS, proposing novel RIS configurations and optimization
algorithms to achieve a balance between performance and energy consumption. Fur-
thermore, the application of Deep Reinforcement Learning (DRL) techniques for
intelligent and adaptive optimization in aerial RIS-assisted CoMP-NOMA networks
is explored, aiming to maximize network sum rate while meeting user quality of ser-
vice requirements. Through a comprehensive investigation of these technologies and
their synergistic potential, this thesis contributes valuable insights into the future of
wireless communication, paving the way for the development of more efficient, reli-
able, and sustainable networks capable of meeting the demands of our increasingly
connected world.
ii
Acknowledgements
We extend our sincere gratitude to Almighty Allah for granting us the strength,
guidance, and perseverance throughout the journey of completing this project. We
would like to express our heartfelt appreciation to our parents for their unwaver-
ing support, encouragement, and belief in our abilities. Their love, sacrifices, and
continuous encouragement have been the cornerstone of our success.
We are deeply grateful to our friends for their camaraderie, motivation, and support,
which made this journey memorable and enjoyable. We owe a debt of gratitude to
our supervisor, Dr. Syed Ali Hassan, for his invaluable guidance, support, and exper-
tise. His insightful feedback, encouragement, and patience have been instrumental
in shaping this thesis.
Special thanks are due to Dr. Haejoon Jung and Dr. Huma Ghafoor for their
assistance, valuable insights, and encouragement throughout this project. Their
expertise, encouragement, and willingness to help have been truly appreciated. We
are indebted to all those who have contributed in any way, no matter how small,
to the completion of this project. Thank you for your support, encouragement, and
belief in us.
iii
Publication List
The main contributions of this research are either published or accepted or are
presently submitted for acceptance in journals and conferences as mentioned in the
following list:
Journal Articles
[1] M. Umer, M. A. Mohsin, M. Gidlund, H. Jung, and S. A. Hassan, “Analysis
of STAR-RIS Assisted Downlink CoMP-NOMA Multi-Cell Networks under
Nakagami-m Fading,” IEEE Communications Letters, 2024.
Conference Papers
[1] M. Umer, M. A. Mohsin, S. A. Hassan, H. Jung, and H. Pervaiz, “Performance
Analysis of STAR-RIS Enhanced CoMP-NOMA Multi-Cell Networks,” in 2023
IEEE Globecom Workshops (GC Wkshps), pp. 2000–2005, IEEE, 2023.
[2] M. Umer, M. A. Mohsin, S. A. Hassan, and H. Jung, “On the Energy Efficiency
and Passive Beamforming Design of RIS Assisted CoMP-NOMA Networks,”
Submitted to IEEE GLOBECOM 2024
iv
Table of Contents
List of Tables x
1 Introduction 1
1.1 Project Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.4 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.5 Project Outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.6 Organization of the Report . . . . . . . . . . . . . . . . . . . . . . . . 3
2 Background 4
2.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.1.1 Challenges in Modern Wireless Communication . . . . . . . . 4
2.1.2 Emerging Technologies . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2.1 Non-Orthogonal Multiple Access (NOMA) . . . . . . . . . . . 6
2.2.2 Coordinated Multi-Point (CoMP) . . . . . . . . . . . . . . . . 7
2.2.3 Reconfigurable Intelligent Surfaces (RIS) . . . . . . . . . . . . 7
2.2.4 UAV-assisted Wireless Networks . . . . . . . . . . . . . . . . . 8
2.3 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.3.1 STAR-RIS Enhanced CoMP-NOMA Networks . . . . . . . . . 8
2.3.2 DRL for Aerial RIS in CoMP-NOMA Networks . . . . . . . . 9
2.3.3 Energy Efficiency . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.4 Research Gaps and Opportunities . . . . . . . . . . . . . . . . . . . . 10
2.4.1 Performance Challenges in CoMP-NOMA Networks . . . . . . 10
2.4.2 Open Research Areas in STAR-RIS Technology . . . . . . . . 11
2.4.3 Statistical Gaps . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.4.4 DRL in Aerial RIS Networks . . . . . . . . . . . . . . . . . . . 12
v
Table of Contents
vi
Table of Contents
6 Conclusion 56
6.1 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
6.1.1 Performance Analysis . . . . . . . . . . . . . . . . . . . . . . . 56
6.1.2 Optimization Strategies . . . . . . . . . . . . . . . . . . . . . 57
6.1.3 Deep Reinforcement Learning . . . . . . . . . . . . . . . . . . 57
6.2 Limitations and Future Work . . . . . . . . . . . . . . . . . . . . . . 57
References 62
vii
List of Figures
viii
List of Figures
5.5 Top view of the UAV trajectory obtained by the MO-PPO algorithm
sampled every 25 time slots and averaged over 10 evaluation episodes 54
ix
List of Tables
x
Chapter 1
Introduction
This chapter provides an overview of the research project, outlining its motivation,
objectives, methodology, and expected outcomes. It also details the organization of
the report for clarity and navigation.
1
Chapter 1. Introduction
shifts, and NOMA power control for maximizing network sum rate and meeting
user quality of service requirements.
1.2 Motivation
The ever-increasing demand for higher data rates, improved spectral efficiency, and
expanded coverage in wireless communication systems has pushed the boundaries
of existing technologies. Traditional approaches are struggling to keep pace with
these demands, necessitating the exploration of novel solutions. The limitations of
conventional RIS designs, the challenges of interference management in dense het-
erogeneous networks, and the need for more efficient spectrum utilization motivate
the investigation of STAR-RIS, CoMP, and NOMA as potential enablers for next-
generation wireless networks. By combining these technologies, we aim to achieve
significant performance improvements and pave the way for a more connected and
sustainable future.
1.3 Objectives
The primary objectives of this research project are:
• To analyze and quantify the performance gains achievable through the inte-
gration of STAR-RIS in CoMP-NOMA networks, particularly in terms of data
rates, coverage, and spectral efficiency.
1.4 Methodology
The research methodology will involve a combination of theoretical analysis, simu-
lation modeling, and algorithm development. Analytical tools will be employed to
derive performance metrics and gain insights into the behavior of STAR-RIS assisted
CoMP-NOMA networks under various channel conditions. Simulation models will
2
Chapter 1. Introduction
3
Chapter 2
Background
2.1 Preliminaries
The relentless expansion of wireless communication necessitates continuous inno-
vation to address escalating demands for data rates, spectral efficiency, and cover-
age. This thesis delves into the exploration of cutting-edge technologies, namely
Non-Orthogonal Multiple Access (NOMA), Coordinated Multi-Point (CoMP), and
Reconfigurable Intelligent Surfaces (RIS), with a particular focus on Simultaneously
Transmitting and Reflecting RIS (STAR-RIS), to tackle the limitations of conven-
tional wireless systems.
This introductory section establishes the foundation for understanding these tech-
nologies and their potential impact. It begins by outlining the key challenges faced
by modern wireless communication systems and subsequently introduces the afore-
mentioned solutions, highlighting their operating principles and potential benefits.
• Data Rate Demands: The need for faster data transfer speeds to support
bandwidth-hungry applications is ever-increasing.
4
Chapter 2. Background
NOMA departs from the traditional orthogonal multiple access (OMA) paradigm by
leveraging power-domain multiplexing to serve multiple users on the same resource
block. By allocating higher power levels to users experiencing weaker channel con-
ditions, NOMA ensures successful decoding at the receiver while simultaneously
improving spectral efficiency and user fairness. This technique offers significant ad-
vantages over OMA, including increased system capacity and enhanced service for
cell-edge users.
5
Chapter 2. Background
The subsequent sections of this thesis will delve deeper into these technologies,
exploring their theoretical foundations, practical implementations, and potential
applications in the context of future wireless communication systems.
Further exploring NOMA, the survey in [2] offers a detailed analysis of its progress in
5G systems, focusing primarily on power-domain NOMA employing superposition
coding at the transmitter and successive interference cancellation at the receiver.
The paper delves into various aspects of NOMA, including capacity analysis, power
6
Chapter 2. Background
allocation strategies, user fairness considerations, and integration with other wireless
technologies such as cooperative communications, MIMO, beamforming, and net-
work coding. Implementation challenges and potential avenues for future research
are also discussed, providing a holistic understanding of NOMA and its potential
impact on future wireless networks.
Further exploring RIS technology, the work in [6] provides a comprehensive overview
of its potential for revolutionizing wireless communication networks. The paper
delves into the applications, advantages, hardware architecture, and signal model of
RIS, highlighting its ability to intelligently manipulate the wireless propagation en-
vironment using passive reflecting elements. The authors also discuss the challenges
7
Chapter 2. Background
Furthermore, the work in [8] proposes a novel framework for designing trajectories
of multiple UAVs by predicting users’ mobility information. This framework aims
to maximize the instantaneous sum transmit rate while satisfying user rate require-
ments. Integrating such trajectory design and user mobility prediction techniques
with RIS, CoMP, and NOMA could pave the way for highly efficient and adaptable
wireless networks capable of dynamically responding to user demands and environ-
mental changes.
8
Chapter 2. Background
based (SSECB) design that leverages a large number of RIS elements to concur-
rently eliminate inter-cell interference and boost desired signals. Simulation results
demonstrate the effectiveness of SSECB in outperforming conventional designs and
achieving superior performance in CoMP-NOMA networks.
Further exploring the potential of STAR-RIS in NOMA systems, the study in [10] in-
vestigates its application in enhancing coverage quality and spectral efficiency. The
authors analyze the secrecy performance of a STAR-RIS aided downlink NOMA
system employing the energy splitting protocol. Analytical expressions for secrecy
outage probability (SOP) are derived, and asymptotic performance analysis is con-
ducted to gain insights into system behavior. The results demonstrate the superior
secrecy performance of STAR-RIS-NOMA compared to conventional OMA systems.
Further exploring the use of RIS in UAV-assisted communication, the study in [12]
investigates the joint optimization of UAV placement and RIS phase-shift to max-
imize data transfer rates while minimizing UAV energy consumption. DRL algo-
rithms, specifically DDQN and DDPG, are employed to address the optimization
challenges. Numerical results demonstrate the effectiveness of these algorithms in
improving the energy efficiency of RIS-assisted UAV systems compared to bench-
mark solutions.
The work in [13] introduces an energy harvesting (EH) scheme for UAV-RIS sys-
tems operating in communication-disabled areas. The proposed EH-RIS scheme
utilizes SWIPT technology to simultaneously transport information and harvest
energy through a split passive reflecting array. A robust DRL-based algorithm is
developed for efficient resource allocation in dynamic environments with pedestrian
mobility and rapid channel changes. Simulation results showcase the effectiveness
and efficiency of the proposed EH-RIS system, surpassing existing solutions and
9
Chapter 2. Background
Expanding on the concept of EE, the work in [15] explores secure energy efficiency
maximization in an RIS-assisted uplink wireless communication system involving a
UAV acting as a mobile relay. The authors propose an algorithm for joint opti-
mization of UAV trajectory, RIS phase shift, user association, and transmit power
to maximize system efficiency. Simulation results demonstrate significant gains in
secure energy efficiency compared to traditional schemes without RIS integration.
10
Chapter 2. Background
Channel Modeling: Accurate channel models that capture the unique characteristics
of STAR-RIS, including its simultaneous transmitting and reflecting capabilities,
are crucial for system design and performance evaluation. These models should
account for mutual coupling between elements, the interaction between transmitted
and reflected signals, and the impact of different RIS configurations on channel
characteristics.
11
Chapter 2. Background
12
Chapter 2. Background
13
Chapter 2. Background
14
Chapter 2. Background
• Privacy Concerns: The dynamic nature of aerial RIS networks raises privacy
concerns. Studies should explore privacy-preserving mechanisms that protect
user information while maintaining energy efficiency and network performance.
2.5 Summary
This chapter has laid the foundation for exploring the potential of cutting-edge
technologies in addressing the ever-growing demands of modern wireless communi-
cation. The escalating need for higher data rates, improved spectral efficiency, and
expanded coverage has exposed the limitations of traditional approaches, necessi-
tating the investigation of novel solutions.
15
Chapter 2. Background
16
Chapter 3
In parallel, the increasing deployment of small, low-power base stations within cel-
lular networks has led to challenges related to cross-tier interference and increased
energy consumption. Coordinated Multi-Point (CoMP) techniques offer a solution
by enabling base stations to coordinate transmissions through high-speed fronthaul
links, thereby mitigating interference and enhancing overall network performance.
Integrating CoMP with Non-Orthogonal Multiple Access (NOMA) further improves
spectral efficiency by allowing multiple users to share the same resource block.
This chapter explores the synergistic potential of combining STAR-RIS with CoMP-
NOMA networks to address the challenges of future wireless communication systems.
By leveraging the unique capabilities of each technology, we aim to pave the path
towards achieving the ambitious goals of 6G networks, providing enhanced coverage,
improved spectral efficiency, and a more sustainable and connected future.
17
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
Performance Analysis
Uf
STA
R-R
IS
Cell
edge
BS1 U1,c
NOMA Pair
of BS1
U2,c
Con
nLoS links boun verage
dary
of BS
Interfering links
2 BS2
We define the index set I = {1, 2} for the two BSs, whereas C = {1, 2, . . . , C}
is defined for the cell-center users, and F = {1, 2, . . . , F } for the cell-edge users,
respectively. Additionally, let U = C ∪ F, denote the set of all system users. For
the sake of simplicity, we assume C = 2, where each center user Uci is served by its
corresponding BSi , and F = 1, with C and F representing the cardinality of C and
F, respectively.
The BSs employ power-domain NOMA techniques to communicate with the users.
Specifically, ∀i ∈ I, c ∈ C, and f ∈ F, BSi forms the NOMA pair (Uci , Uf ), where
Uci is the cell-center user of BSi . Consequently, the Uf is part of two NOMA
pairs, each cluster served by a different BS. To mitigate the strong ICI experienced
by Uf , CoMP is adopted between the two BSs. In addition, it is assumed that
the BSs are connected to a central processing unit (CPU) via high-speed fronthaul
links, facilitating seamless information sharing and coordinated transmissions among
them.
In this work, perfect CSI is assumed to be available at the BSs. While this is a
18
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
ρo
r
hi,u = vi,u , (3.1)
P L(di,u )
where vi,u is a complex Gaussian random variable that follows a Rayleigh distribution
with zero mean and unit variance, ρo is the reference path-loss at a distance of 1
α
m, P L(di,u ) is the large scale path-loss, modeled as P L(di,u ) = di,u i→u , where
di,u is the distance and αi→u is the path-loss exponent between the BSi and Uu ,
respectively.
On the contrary, the link between the STAR-RIS, hereafter represented by R, and
BSi is assumed to exhibit a dominant line-of-sight (LoS) path [18]. Therefore, these
links are subject to the Rician fading, where their channel coefficients are expressed
as: s
ρo κi,R 1
r r
hi,R = vˆi,R + vi,R , (3.2)
P L(di,R ) κi,R + 1 κi,R + 1
where di,R is the distance between the BSi and R, κi,R represents the Rician factor,
vˆi,R represents the deterministic LoS components, and vi,R denotes the complex
Gaussian random variables, each following a Rayleigh distribution with zero mean
and unit variance, thus representing the NLoS components. The links between R
and Uu , ∀u ∈ U, are also modeled similarly.
t t t
p
Θr = β r diag(ejθ1 , ejθ2 , . . . , ejθK ), (3.3)
jθ1r jθ2r r
p jθK
Θt = β t diag(e ,e ,...,e ), (3.4)
19
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
where β t , β r ∈ [0, 1] and θkt , θkr ∈ [0, 2π), ∀k ∈ K ≜ {1, 2, . . . , K}. The phase shifts
for transmission and reflection (i.e., θkt and θkr ) can generally be chosen indepen-
dently of each other [20]. However, the amplitude adjustments for transmission and
reflection are coupled by the law of conservation of energy. Assuming the STAR-RIS
does not impose any power loss, the relation between the amplitude coefficients (i.e.,
β t and β r ) is expressed as β t + β r = 1. To reduce the signaling overhead between
the STAR-RIS and the BSs, all elements are adjusted to have the same transmission
and reflection coefficients.
where ζi,c and ζi,f are the power allocation (PA) factors assigned by BSi to users
Uci sand Uf , respectively. It is important to note that Uci experiences stronger
channel conditions compared to Uf , making it the dominant NOMA user in the pair
(Uci , Uf ) formed by BSi . Following the principle of NOMA, Uci should be capable
of detecting and decoding the message intended for Uf . This principle also implies
that ζi,c < 0.5, or 0.5 < ζi,f < 1 [22, 23].
For brevity, we only define the rate achieved by Uc1 from the set of cell-center users
C, as the same steps could be extended to define the rate of Uci , ∀i ∈ I and c ∈ C.
The received signal at Uc1 can be written as:
20
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
signal of Uf (i.e., xf ) and then removes it from yc1 to decode its own message (i.e.,
x1,c ). Based on this approach, the signal-to-interference-and-noise ratio (SINR) and
the corresponding achievable rate at Uc1 for decoding the message of Uf can be
expressed as:
2
ζ1,f P1 H1,c
γ1,c→f = 2 2 , (3.7)
ζc1 P1 H1,c + P2 h2,c′ + σ2
R1,c→f = log2 1 + γ1,c→f , (3.8)
On the contrary, Uf , belonging to two NOMA pairs, receives its signal through the
broadcasts from each BSi , ∀i ∈ I. Thus, the received signal at Uf can be expressed
as:
yf = H1,f x1 + H2,f x2 + N0 , (3.11)
where H1,f and H2,f represent the combined channels from BS1 to Uf and from
BS2 to Uf , and can be expressed as H1,f = h1,f + hH R,f Θt h1,R and H2,f = h2,f +
H
hR,f Θt h2,R , respectively. Given that non-coherent JT-CoMP is taken into consid-
eration, the SINR and the corresponding achievable rate at Uf can be expressed
as [24, 25]:
2 2
ζ1,f P1 H1,f + ζ2,f P2 H2,f
γf = 2 2 , (3.12)
ζc1 P1 H1,f + ζ2,c P2 H2,f + σ2
Rf = log2 1 + γf . (3.13)
21
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
Pi,c = Pr (γi,c→f < γthf ) + Pr (γi,c→f > γthf , γc < γthc ), (3.14)
where γthf and γthc represent the outage thresholds for Uf and Ui,c , respectively.
Similarly, in regard to Uf , an outage occurs when it fails to decode xf , and the
corresponding outage probability is expressed as:
Analytical Analysis
(2)
µ2Z µZ −µ2Z
i,u i,u i,u
where kZi,u = (2) and θZi,u = µZi,u
are the shape and scale parameters of
µZ −µ2Z
i,u i,u
√
2K βΩiu ΩiR ΩRu Γm ( 12 , 12 , 21 )
the Gamma distribution, with µZi,u = √
miR mRu miu
+βK 2 ΩiR ΩRu +Ωiu and
√ 3/2 √
(2) 4β 3/2 K 3 Ωiu (ΩiR ΩRu )3/2 Γm ( 21 , 32 , 32 ) 2 4KΩiu βΩiR ΩRu Γm ( 32 , 12 , 12 )
µZi,u = √
miu (miR mRu )3/2
+6βK Ω Ω Ω
iR iu Ru + √ 3/2 +
miR mRu miu
β 2 K 4 Ω2iR (miR +1)(mRu +1)Ω2Ru (m +1)Ω2
miR mRu
+ iumiu iu as the first and second moments of Zi,u , respec-
tively, and Γm (a, b, c) = Γ(miuΓ(m + a)Γ(miR + b)Γ(mRu + c)
iu )Γ(miR )Γ(mRu )
.
√ P
Proof. For brevity, let the combined channel be Gi,u = βn K k=1 |hR,u ||hi,R |. By
applying CLT, and noting that it is a scaled double-Nakagami random variable (RV),
22
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
(2)
where µGi,u and µGi,u are the first and second moments of Gi,u , respectively, with
the p-th moment of Gi,u given by [27]
√
(K βn )p (ΩiR ΩRu )p/2 Γ mRu + p2 Γ miR + p2
(p)
µGi,u = , (3.17)
(mRu miR )p/2 Γ (miR ) Γ (mRu )
and as |hi,u | ∼ N akagami(mi,u , Ωi,u ), the p-th moments are known to be given by
(p) Γ(miu + p2 )Ωiu p/2
µ|hi,u | = Γ(miu )m p/2 . Since |hi,u | and Gi,u are independent, the p-th moment of
iu
|Hi,u | can be obtained via the moments of its summands, i.e., |hi,u | and Gi,u , by
applying the binomial theorem. Hence, the p-th moment of |Hi,u | is given by
p
(p)
X p (q) (p−q)
µ|Hi,u | = µ|hi,u | µGi,u . (3.18)
q=0
q
(2) (2)
µZi,u = µ|hi,u | + 2µ|hi,u | µGi,u + µGi,u , (3.19)
(2) (4) (3) (2) (2) (3) (4)
µZi,u = µ|hi,u | + 4µ|hi,u | µGi,u + 6µ|hi,u | µGi,u + 4µ|hi,u | µGi,u + µGi,u . (3.20)
The final expression of moments can thus be obtained through the means of substi-
tution in (3.19) and (3.20).
23
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
(2)
µ2 (a,b) µ (a,b)
−µ2 (a,b)
B B B
i,u i,u i,u
where kB(a,b) = (2) and θB(a,b) = µ
, with µB(a,b) = aµZi,u + bΩi,u
i,u µ (a,b)
−µ2 (a,b) i,u B
(a,b) i,u
B B i,u
i,u i,u
(2) (2)
and µ (a,b) = a2 µZi,u + 2abµZi,u Ωi,u + b2 Ω2i,u (1 + m1i,u ) as the first and second moments
Bi,u
(a,b)
of Bi,u , respectively.
ρζi,f Zi,c
Proof. The expression in (3.7) can be rewritten as γi,c→f = Wi,c,f
, where Wi,c,f =
(ρζ , ρ) (ρζ , ρ)
Bi,c i,c+ 1, with Zi,c and Bi,c i,c
both being Gamma RVs based on the statistics
derived earlier. Then, the distribution of Wi,c,f can also be approximated by an
equivalent Gamma RV, i.e., Wi,c,f ∼ Γ kWi,c,f , θWi,c,f . As Zi,c and Wi,c,f are two
independent Gamma RVs, the ratio of two Gamma RVs is known to follow a Beta
ρζ Z
prime distribution, i.e., Wi,fi,c,fi,c ∼ β ′ kZi,c , kWi,c,f , 1, ρζi,f θZi,c /θWi,c,f , corresponding
24
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
Corollary 1. As γi,c→f and γi,c are closely related, differing only by weighting con-
stants, the PDF of γi,c is given by
αi,c νi,c
xθWi,c xθWi,c
θWi,c ρζi,c θZi,c ρζi,c θZi,c
+1
fγi,c (x) = , x > 0, (3.23)
ρζi,c θZi,c B(kZi,c , kWi,c )
(2)
µ2W µW −µ2W
i,c i,c i,c
where νi,c = −(kZi,c +kWi,c ), kWi,c = (2) , and θWi,c = µWi,c
, with µWi,c =
µW −µ2W
i,c i,c
(2) (2)
µB(0, ρ) + 1 and µWi,c = µ (0, ρ) + 2µB(0, ρ) + 1 as the first and second moment of
i,c Bi,c i,c
(0, ρ)
Wi,c = Bi,c + 1, respectively.
µ2V
f
where B (· , ·) is the Euler Beta function, νf = −(kVf + kWf ), kVf = (2) , θVf =
µV −µ2V
f f
(2) (2)
µV −µ2V µ2W µW −µ2W
f f f f f
µVf
, kWf = (2) , and θWf = µ Wf
, with µVf = ρ(ζi,f µZi,f + ζi′ ,f µZi′ ,f ),
µW −µ2W
f f
(2) (2) (2)
µVf = ρ2 (ζi,f
2
µZi,f + 2ζi,f ζi′ ,f µZi,f µZi′ ,f + ζi2′ ,f µZi′ ,f ), µWf = ρ(ζi,c µZi,f + ζi′ ,c µZi′ ,f ) + 1,
(2) 2 2
and µWf = 2µZi,f (ρ2 ζi,c ζi′ ,c µZi′ ,f + ρζi′ ,c + ρζi,c ) + ρ2 ζi2′ ,c µ2Zi′ ,f + ρ2 ζi,c µZi,f + 1.
V
Proof. The expression in (3.12) can be rewritten as γf = Wff , where Vf = ρζi,f Zi,f +
ρζi′ ,f Zi′ ,f and Wf = ρζi,c Zi,f + ρζi′ ,c Zi′ ,f + 1, with Zi,f and Zi′ ,f both being Gamma
RVs. The rest of the proof is similar to that of Figure 3.
In Figure 3.2, we compare the analytical and simulated PDFs and CDFs of SINRs
of network users. The close alignment between the analytical approximations and
Monte Carlo (MC) simulations affirms the accuracy of the derived expressions. Ad-
ditionally, we conduct the Kolmogorov-Smirnov (KS) goodness-of-fit test, however,
the details are omitted due to space constraints.
25
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
Figure 3.2: The PDFs and CDFs of the SINRs at the center and edge user, with K = 34
elements, mi,u = mi′ ,u = 1, and mi,R = mR,u = 2, ∀i ∈ I, i′ ∈ I \ {i}, ∀u ∈ U.
where fγf (x) is the PDF of γf in (3.24). Noting that the PDF of γf is a Beta prime
distribution, the ER of the edge user is derived in the following theorem.
Theorem 1. In the proposed system model, the ER of the edge user is given by
1 θWf 0, 1 − kWf , 1
Rf = G3,2
3,3 , (3.26)
ln(2)Λf θVf 0, 0, kVf
a1 , . . . , ap
where Gm,n
p,q z is the Meijer G-function, Λf = B(kVf , kWf ) Γ(κf ) and
b1 , . . . , b q
κf = kVf + kWf .
From [28, Eq. (11)] and [28, Eq. (10)], the logarithmic and power functions can be
26
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
1, 1 .
expressed in terms of a Meijer G-function, i.e., log2 (1 + z) = G1,2
2,2 z ln(2)
1, 0
1 − ν .
and (1 + z)ν = G1,1
1,1 z Γ(ν), respectively. Furthermore, using the analyt-
0
ical continuation of the Meijer G-function, the integral in (3.27) can be rewritten
as
Z ∞
θWf 1,2 1, 1
Rf = G2,2 x
ln(2)θVf B(kVf , kWf )Γ(κf ) 0
1, 0
xθWf −kWf
× G1,1
1,1 dx. (3.28)
θVf kVf − 1
Finally, using the integral representation of the Meijer G-function, we obtain (3.26).
To gain further insight, we express the high-SNR approximation for the ER of the
edge user as
1 3,2 0, 1 − kṼf , 1
R∞
f ≈ G3,3 θf˜ , (3.29)
ln(2)Λf˜ 0, 0, kVf
θṼ
where Λf˜ = B(kVf , kṼf ) Γ(κf˜), κf˜ = kVf + kṼf , and θf˜ = θVf . The approximate
f
parameters, denoted as kṼf and θṼf , can be computed using the first and second mo-
(2) 2 (2)
ment, that is, µṼf = ρ(ζi,c µZi,f +ζi′ ,c µZi′ ,f ) and µṼ = ρ2 (ζi,c µZi,f +2ζi,c ζi′ ,c µZi,f µZi′ ,f +
f
(2)
ζi2′ ,c µZi′ ,f ), respectively.
where fγi,c (x) is the PDF of γi,c in (3.23). The ER of the center users can then be
derived as follows.
27
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
Similar insights can be derived for the ER of the center users as of the edge user,
however, the details are omitted due to space constraints.
Γ(kVf + kWf )
Pf = Bψ (kV , kWf ), (3.32)
Γ(kVf )Γ(kWf ) f f
λth θW
where ψf = θV +λfth θfW , and Bz (· , ·) is the incomplete Beta function. As the thresh-
f f f
old (λthf ) tends towards infinity, the incomplete Beta function in (3.32) converges
to the Euler Beta function, i.e., Bψf (kVf , kWf ) → B(kVf , kWf ).
Similarly, with regards to center users, the OP is defined as the probability that
the instantaneous SINR for decoding the user’s own message or the message of the
edge user falls below a certain threshold. Mathematically, it can be expressed as
Pi,c ≈ Pr (γi,c→f < γthf ) + Pr (γi,c→f > γthf , γc < γthc ), where γthc = 2Rthc − 1 is the
target SINR with Rthc being the target rate for center users, and the approximate
symbol is due to the fact that the detection sequence is not of fully independent
(1)
events. The first term in the sum expression, denoted here onwards as Pi,c , takes
on the same form as that of OP for the edge user in (3.32), except for parameters,
i.e., kVf → kZi,c , kWf → kWi,c,f , θVf → ρζi,f θZi,c , and θWf → θWi,c,f . Furthermore,
(2)
let Pi,c = Pr (γi,c→f > γthf , γc < γthc ), then, the second term in the sum expression
becomes
(2)
Pi,c = Iψi,c→f (kWi,c,f , kZi,c ) Iψi,c (kZi,c , kWi,c ), (3.33)
ρζi,f θZ λth θW
where ψi,c→f = ρζi,f θZ +λthi,cθW , ψi,c = ρζi,c θZ c+λthi,c θW , and Iz (· , ·) is the reg-
i,c f i,c,f i,c c i,c
ularized incomplete Beta function. The OP for the center user is then given by
(1) (2)
Pi,c ≈ Pi,c + Pi,c . Further improvement in approximation can be made by mak-
ing use of the fact that outage performance cannot be better than that of the
28
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
Numerical Results
In the three-dimensional Cartesian coordinate system, the locations of BS1 and BS2 ,
each with a coverage radius of 60 m, are set to (-50m, 0m, 25m) and (50m, 0m, 25m)
respectively. The STAR-RIS is strategically placed at the intersection of the two
cells, near Uf , specifically, at the coordinates (0m, 25m, 5m). Additionally, the
cellular users U1,c , U2,c , and Uf are positioned at (-40m, 18m, 1m), (30m, 22m, 1m),
and (0m, 35m, 1m), respectively. Some specific parameters used for simulation are
outlined in Table 4.1.
29
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
14
K = 120 Elements
K = 40 Elements
12 Without RIS
Non-CoMP, without RIS
0
-25 -20 -15 -10 -5
Transmit power per BS, Pt (dBm)
Figure 3.3: The PDFs and CDFs of the SINRs at the center and edge user, with K = 34
elements, mi,u = mi′ ,u = 1, and mi,R = mR,u = 2, ∀i ∈ I, i′ ∈ I \ {i}, ∀u ∈ U.
100
10!1
Outage probability
Figure 3.4: Outage probability of network users versus Pt for equal amplitude coefficients
(β t = β r ), and element assignments (K1R = K2R ), when K > 0.
30
Chapter 3. Synergy of STAR-RIS, CoMP, and NOMA
tently high outage probabilities across all power levels. Moreover, we attribute the
convergence of outage probability of Uf towards the asymptotes to the problem of
saturation inherent in NOMA.
0.8/0.2 9.5
9.54
9.56
9.54
1
9.51
9.5
0.7/0.3
9.453
9
9.49
9.4
9.45
0.6/0.4
9.4
7
46
9.
67
0.5/0.5 9.438
9.4
0.4/0.6
9.412
0.3/0.7 9.35
9.39
0.2/0.8 9.36
9.3
0.1/0.9 9.32
9.26
0/1
0/72 9/63 18/54 27/45 36/36 45/27 54/18 63/9 72/0
Element splitting ratio K1A / K2A
Figure 3.5: Ergodic rate for varying RIS element assignments (K1R , K2R ) and amplitude
adjustments (βt , βr ), with Pt = −10 dBm.
31
Chapter 4
The convergence of CoMP and NOMA techniques has emerged as a promising so-
lution to enhance both coverage and capacity within cellular networks. Recent
research has further explored the integration of Reconfigurable Intelligent Surfaces
(RIS) into CoMP-NOMA systems, demonstrating the potential for significant per-
formance gains. By strategically deploying RIS, particularly at the cell edge, signal
quality for edge users can be dramatically improved, leading to enhanced network
coverage and capacity. However, despite these promising advancements, a crucial
question remains: how do varying RIS configurations, the number of cooperating
base stations (BSs), and the number of RIS elements influence the overall energy
efficiency of the network? This chapter delves into this critical aspect, investigating
the energy efficiency and passive beamforming (PBF) design within RIS-assisted
CoMP-NOMA networks. We propose and analyze two distinct RIS configurations:
Enhancement-only PBF (EO) and Enhancement & Cancellation PBF (EC). The EO
configuration focuses on optimizing RIS phases to solely enhance the desired signal
quality for edge users, while the EC configuration aims to optimize RIS phases for
both signal enhancement and interference suppression.
32
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
Towards Generalizability
We define two user classes based on their location: the cell-center users and the
edge users. The cell-center users reside within the disk of their associated cell, while
the edge users fall outside it. Let C (i) ≜ {1, 2, . . . , Ci } represent the set of indices
for cell-center users associated with BSi , and F ≜ {1, 2, . . . , F } represent the set of
indices for the shared edge users cooperatively served by multiple BSs. We denote
cell-center users as Uic (superscript i indicates the serving BS) and edge users as Uef
(superscript e represents the edge class), with c ∈ C (i) and f ∈ F. Furthermore,
U = i∈I C (i) ∪ F represents the index set for all users in the network. For ease of
S
exposition and without loss of generality, we consider a single edge user and a single
cell-center user per cell in this work, i.e., Ci = F = 1, ∀i ∈ I.
For coordinated operation, the BSs are assumed to be interconnected via a high-
∗
Due to processing complexity, latency of SIC at the receivers, and the practical limitations
resulting in SIC error propagation, two-user NOMA pairs are considered in this work. Moreover,
two-user configurations are of practical interest and have been standardized in 3GPP Release
15 [29].
33
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
In contrast to the Rayleigh fading experienced on direct links, the channels between
BSs and RIS are modeled as Rician fading channels due to the presence of a dominant
line-of-sight (LoS) component. The channel between BSi and Ri is denoted hi,Ri and
can be expressed as
r r !
ρo κ 1
r
hi,Ri = gLoS + gNLoS , (4.2)
P L(di,Ri ) 1 + κ i,Ri 1 + κ i,Ri
LoS
where κ is the Rician factor, gi,Ri
∈ CK×1 is the LoS channel vector given by
h iT
LoS j(k−1)π sin(ωi ) j(K−1)π sin(ωi )
gi,Ri
= 1, . . . , e , . . . , e ,
For the sake of simplicity, we assume perfect channel state information (CSI) at
the BSs. While achieving perfect CSI in practice can be challenging, recent ad-
vancements in channel estimation techniques for RIS-assisted wireless networks have
demonstrated the potential for accurate CSI acquisition with a reasonable over-
34
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
X p X q X
yfe = e
Hj,f ζj Pj se + e
Hj,f (1 − ζj )Pj scj + e
Hm,f xi +no , (4.3)
j∈J j∈J m∈I\J
| {z } | {z } | {z }
CoMP gain intra-cluster interference inter-cell interference
e
where Hj,f = hej,f + hTRj ,f Θj hj,Rj and Hm,f
e
= hem,f + hTRi ,f Θi hm,Ri are the effec-
tive channels between BSj and Uef and between BSi and Uef , respectively, and
no ∼ CN (0, σ 2 ) is the additive white Gaussian noise. To minimize synchroniza-
tion overhead, we employ non-coherent JT-CoMP, where the edge user Uef combines
signals from cooperative BSs without CSI exchange [24]. Therefore, the signal-to-
†
While practical RIS implementations are subject to phase quantization errors, this work pri-
oritizes establishing a proof-of-concept for the benefits of RIS-assisted CoMP-NOMA in terms of
energy efficiency and performance.
35
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
e 2
P
ζj Pj |Hj,f
j∈J |
γfe = P e 2 2
, (4.4)
j∈J (1 − ζj )Pj |Hj,f | + Ye + σ
e
|2 represents the inter-cell interference term.
P
where Yf = m∈I\J Pm |Hm,f
p X p
yci = hii,c (1 − ζi )Pi sci + hij,c ζj Pj se
j∈J
X q X
+ hij,c (1 − ζj )Pj scj + him,c xm + no . (4.5)
j∈J ,j̸=i m∈I\J
Based on the SIC principle, the SINR at Uic for decoding the signal intended for Uef
is given by
i 2
P
i j∈J ζj Pj |hj,c |
γc→f = P i 2 2
, (4.6)
j∈J (1 − ζj )Pj |hj,c | + Yi + σ
(1 − ζi )Pi |hii,c |2
γci = P i 2 2
. (4.7)
j∈J ,j̸=i (1 − ζj )Pj |hj,c | + Yi + σ
It is worth noting that due to their placement at the cell edge, the impact of RIS
on the channels experienced by Uic is negligible. Thus, the SINR expressions for Uic
only consider the direct links between the BSs and the users. Finally, the achievable
rates for Uef and Uic can be calculated as
and
An outage event occurs for cell-center users if Uic fails to decode se or is capable of
decoding se but fails to decode sci . The corresponding outage probability can be
e i
defined as Pic = 1 − P(γc→f
i
> γˆf , γci > γˆc ), where γˆf = 2Rth − 1 and γˆc = 2Rth − 1
represent the target SINR thresholds for Uef and Uic , respectively, corresponding to
their target rates Reth and Rith . For edge user Uef , an outage occurs if it fails to
36
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
Reoutf
P
i
P
c∈C (i) Routc f ∈F
X X
ηEE = 1 + 1 , (4.11)
i∈I
P + PQ
λ i
P
j∈J λ j
+ PQ + PR
where Rioutc = (1−Pic )Ric and Reoutf = (1−Pef )Ref represent the effective outage rate
for Uic and Uef , respectively. Moreover, PQ represents the static power consumption
of a cell, PR = KPele denotes the total power consumption of Ri , where Pele is
the power consumption of k-th element, and λ ∈ (0, 1] signifies the power amplifier
efficiency.
where Φ = {Θ1 , Θ2 , . . . , ΘI } represents the set of phase shift matrices for all RIS.
For cooperative BSs, the phase shifts of Ri in constraint (4.12b) are optimized to
e 2
maximize the effective channel gain |Hj,f | . From [33], the optimal phase shift for
each element of Rj is given by
(k) (k)
θkj = arg(hej,f ) − arg(hRj ,f · hj,Rj ), (4.13)
37
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
(k) (k)
where arg(·) denotes the argument function, and hRj ,f and hj,Rj represent the k-th
elements of the channel vectors hRj ,f and hj,Rj , respectively.
It should be noted that incorporating cell-center users into the optimization problem,
or increasing the number of users per cell, significantly increases the complexity of
the beamforming design. To maintain tractability, this work focuses on a single edge
user and a single cell-center user per cell.
Numerical Results
The path loss exponents are set to αR = 2.7, αi = 3, αe = 3.5, and αici = 4
for RIS, BS, edge user, and inter-cell interference links, respectively. The network
operates at a carrier frequency of fc = 2.4 GHz, and the noise power is defined as
σ 2 = −174 + 10 log10 (B), with a bandwidth B = 10 MHz. Table 4.1 summarizes
the remaining simulation parameters.
38
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
17
EC-based Phase Shifts
Energy e/ciency (bps/Hz/Joule)
16.4
16.2
16
15.8
1 2 3 4 5 6
Number of cooperative BSs
Figure 4.2: Energy efficiency vs. number of cooperative BSs J at Pt = 0 dBm and under
various RIS configurations with K = 70 elements.
include: no RIS, RIS with random phase shifts, EO-based RIS, and EC-based RIS.
We observe that for the EO and EC configurations, the energy efficiency initially
increases with a growing number of cooperating BSs, reaching a peak at J = 4.
Beyond this point, the efficiency experiences a decline due to the saturation of both
the achievable rate and the outage probability, leading to diminishing returns. It
should be noted that the EC configuration consistently outperforms other scenarios
∀J, except when J = I. In this particular case where all BSs are cooperative, inter-
ference cancellation becomes redundant, leading to equivalent performance between
the EO and EC configurations.
39
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
56
52
50
8 9 10 11 12 13 14
Transmit power per BS, Pt (dBm)
Figure 4.3: Outage sum rate vs. transmit power Pt with J = 4 cooperative BSs and
K = 70 elements.
1.8
Energy e/ciency (bps/Hz/Joule) 1.4 30
1.6 2.6
4.
1
Rate threshold Rth (bps/Hz)
5.
1.4 25
6
7.
8
10
13
.9
1.2 16 .6
.4 20
19
1 .1
21
.8
0.8 15
24.4
0.6 26.8
10
0.4 29.2
31.6 5
0.2
0
-10 -5 0 5 10 15
Transmit power per BS, Pt (dBm)
Figure 4.4: Energy efficiency contour plot for varying transmit power Pt and rate threshold
Rth with J = 4 cooperative BSs and K = 70 elements.
40
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
17
16
15
14
13
18 54 90 126 162
Number of RIS elements K
Figure 4.5: Energy efficiency vs. number of RIS elements K at Pt = 0 dBm and J = 4
cooperative BSs.
28.6
28.4
28.2
28
27.8
27.6
27.4
Figure 4.6: Outage sum rate vs. CO/EO split-ratio for different cooperative BSs J with
Pt = 0 dBm with K = 72 elements.
41
Chapter 4. Energy Efficient Design for CoMP-NOMA Networks
42
Chapter 5
While the previous chapter explored the potential of static RIS configurations in
CoMP-NOMA networks, this chapter delves into the dynamic adaptation of these
networks through the integration of Unmanned Aerial Vehicles (UAVs) and Aerial
RIS (ARIS). This combination of technologies presents a unique opportunity to
further enhance network performance by dynamically adjusting the propagation en-
vironment, enabling flexible coverage, and optimizing resource allocation. Existing
research has demonstrated the potential of incorporating UAVs within RIS, CoMP,
and NOMA frameworks, showcasing notable improvements in various performance
metrics as evidenced in [34, 35].
However, many of these studies rely on static RIS deployments, limiting the adapt-
ability and flexibility of the network. Although some works have investigated ARIS-
assisted CoMP-NOMA networks and optimized UAV trajectory and RIS phase shifts
for sum rate maximization [36], the optimization methods employed, such as double-
layer alternating optimization, often face scalability issues due to their complexity
and potential convergence challenges.
43
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
Towards Optimization
Aerial
RIS
NOMA pairs
For coordinated operation, the BSs are assumed to be interconnected via a high-
speed backhaul network to a central processing unit (CPU). Moreover, to improve
the signal quality for edge users, an ARIS, denoted as R, is deployed at a fixed
altitude H over area A to create reflection links between the BSs and the users,
and is equipped with K passive elements. For tractability, we discretize the entire
system operation into time slots of equal length τ , where each time slot is indexed by
t ∈ T ≜ {1, 2, . . . , T }, such that T is the total flight time of the UAV. Furthermore,
we assume the presence of O obstacles in the network, denoted as O ≜ {1, 2, . . . , O},
where each obstacle Oo , o ∈ O has its own forbidden zone represented as a circular
disk of radius dmin , centered at the obstacle’s location.
44
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
Before proceeding with the channel and signal model, we define the positions of
the various entities in the network. Specifically, ∀i ∈ I, u ∈ U, and o ∈ O, the
positions of BSi , Uu , and Oo are represented by pi = (xi , yi , HB ), pu = (xu , yu , 0),
and po = (xo , yo , HO ), respectively, where HB and HO are the heights of the BSs
and obstacles, respectively. Moreover, the position of R at time slot t is denoted
as pR [t] = (xR [t], yR [t], HR ). In this work, we assume that the users are stationary,
and the UAV is capable of adjusting its horizontal position in the xy-plane, while
maintaining a fixed altitude HR .
Contrary to the direct links, the reflection links between BSi and R are modeled
as Rician fading channels, denoted as hi,R [t], due to the presence of a dominant
line-of-sight (LoS) component. At time slot t, the channel hi,R [t] is given by
r r !
ρo κ 1
r
hi,R [t] = gLoS [t] + gNLoS [t] , (5.2)
P L(di,R [t]) 1 + κ i,R 1 + κ i,R
where κ is the Rician factor representing the ratio of the power of the LoS component
to the power of the scattered components, and di,R [t] = pi − pR [t] is the distance
LoS
between BSi and R. Moreover, the deterministic LoS represented, i.e., gi,R [t] ∈
K×1
C , is given by
h iT
LoS j(k−1)π sin(ωi ) j(K−1)π sin(ωi )
gi,R = 1, . . . , e ,...,e ,
45
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
For the RIS configuration, we assume that the phase shift of the k-th element can
be set independently of other elements and that both the UAV trajectory and the
phase response are controlled by the CPU. Furthermore, the phase shift (PS) matrix
at time slot t is expressed as
jθ1 [t] jθ2 [t] jθK [t]
Θ[t] = diag a1 e , a2 e , . . . , ak e , (5.3)
where ak ∈ (0, 1] is the amplitude coefficient and θk [t] ∈ [−pi, pi) is the phase shift
of the k-th element. In this work, we assume an ideal RIS with perfect phase control
and all reflection elements having a unit amplitude, i.e., ak = 1, ∀k. Furthermore,
we assume the availability of perfect channel state information (CSI) at the CPU.
While this is a challenging assumption in practice, recent advancements in chan-
nel estimation techniques for RIS-assisted wireless networks have demonstrated the
feasibility of achieving accurate CSI with reasonable overhead [16, 30, 32].
where i′ ∈ I \ {i}, no [t] ∼ CN (0, σ 2 ) is the additive white Gaussian noise (AWGN),
and Hi,f [t] = hTR,f [t]Θ[t]hi,R [t] represents the effective channels between BSi and Uf
through R, respectively. To minimize synchronization overhead, we employ non-
coherent JT-CoMP, thus, the signal-to-interference-plus-noise ratio (SINR) is given
by
2 2
λi Hi,f [t] + λi′ Hi′ ,f [t]
γf [t] = 2 2 , (5.5)
(1 − λi ) Hi,f [t] + (1 − λi′ ) Hi′ ,f [t] + ρ1
46
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
yci [t] = Hi,ci [t]xi [t] + hi′ ,ci [t]xi′ [t] + no [t], (5.6)
where Hi,ci [t] = hi,ci [t] + hTR,ci [t]Θ[t]hi,R [t] represents the effective channels between
BSi and Uci through R, respectively. Also, the term hi′ ,ci [t]xi′ [t] represents the ICI
caused by the transmission of BSi′ at Uci . Based on the SIC principle, Uci first
decodes xi,f [t] and then cancels it from yci [t] to decode xi,ci [t]. The SINR at Uci for
decoding xi,f [t] is given by
2
λi Hi,ci [t]
γci →f [t] = 2 2 , (5.7)
1
(1 − λi ) Hi,ci [t] + hi′ ,ci [t] + ρ
2
(1 − λi ) Hi,ci [t]
γci [t] = 2 . (5.8)
1
hi′ ,ci [t] + ρ
Finally, the achievable sum rate of the network at time slot t can be expressed as
X X
Rsum [t] = Rci [t] + Rf [t]. (5.9)
i∈I f ∈F
where Rci [t] = log2 (1 + γci [t]) and Rf [t] = log2 (1 + γf [t]) are the achievable rates of
Uci and Uf , respectively.
47
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
where constraint (5.10b) restricts the UAV trajectory to lie within A, and con-
straint (5.10c) enforces a minimum safety distance between the UAV and any obsta-
cles present, thus guaranteeing the UAV’s safety. Constraint (5.10d) limits the phase
shifts applied by the RIS elements. To meet the quality of service (QoS) require-
ments, constraints (5.10e) and (5.10f) impose minimum rate thresholds, denoted by
Rmin
ci and Rmin
f , for Uci and Uf , respectively. Lastly, constraint (5.10g) defines the
permissible range for power allocation factors, ensuring successful SIC. The opti-
mization problem in (5.10) is non-convex due to the coupled variables {P, Θ, Λ}.
To address this, we propose a DRL-based solution in the next section.
1. State Space S: The environment state at time slot t consists of the UAV’s cur-
rent position pR [t], the distance from the UAV to the center of obstacles dR [t] =
{ pR [t] − po , ∀o ∈ O}, the power allocation factors Λ, and the achievable rates
48
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
R[t] = {Rci [t], Rf [t], ∀i, f }. Thus, the state space can be expressed as
2. Action Space A: The action space of the formulated MDP consists of the UAV’s
movement in the horizontal xy-plane, the phase shifts of the RIS elements, and the
power allocation factors. Specifically, the action space at time slot t contains the
manuevering actions aR [t] ∈ {(−1, 0), (1, 0), (0, −1), (0, 1), (0, 0)}, representing left,
right, down, up, and hover, respectively, the phase shifts aΦ [t] = {ϕk [t], ∀k}, and
the power allocation factors aΛ = {λi , ∀i}. Thus, the action space can be expressed
as
at = {aR [t], aΦ [t], aΛ } ∈ RdimA . (5.12)
3. Reward Function R: The reward function plays a crucial role in shaping the
learning behavior of the RL agent. Our design encourages maximizing the sum rate
while ensuring UAV safety and meeting QoS requirements by penalizing constraint
violations. The reward function is defined as
P !
u∈U ζu [t]
R(st , at ) = Rsum [t] 1 − − ξR [t]Kviol , (5.13)
|U|
where Kviol is the penalty factor for constraint violation, and ζu [t] = I{Ru [t] ≤ Rumin }
is the indicator function for the QoS constraints, i.e., ζu [t] = 1 if QoS constraints
are violated, and 0 otherwise. Similarly, ξR [t] = I{xR [t], yR [t] ∈
/ A ∧ pR [t] − po <
dmin , ∀o ∈ O} is the indicator function for UAV’s safety constraints.
49
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
T̂ −1
X
Ât = γ k rt+k + γ T̂ V (st+T̂ ) − V (st ), (5.14)
k=0
To generate the stochastic policy πθd (at |st ) for the discrete actions, the correspond-
ing actor network outputs |aR | logits, which are then passed through a softmax
function to obtain a probability distribution over the available discrete actions. Con-
versely, the continuous actor network generates the continuous actions aΦ and aΛ by
sampling from Gaussian distributions parameterized by the mean and standard de-
50
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
viation outputs of the network, as dictated by the stochastic policy πθc (at |st ). Both
πθd (at |st ) and πθc (at |st ) are optimized independently using their respective clipped
surrogate objective functions. For the discrete actions, the objective function is
given by h i
LCLIP
d (θd ) = Êt min(rt
d
(θd )Ât , ℵ(rt
d
, θd , ϵ)Ât , (5.15)
where ℵ(rtd , θd , ϵ) = clip(rtd (θd ), 1 − ϵ, 1 + ϵ), rtd (θd ) = πθd (at |st )/πθold
d
(at |st ) is the
importance sampling ratio, and ϵ is the clipping parameter. The objective function
for the continuous actions can be expressed in a similar manner but is left out for
brevity.
It is important to note that while both policies collaborate within the environment,
their optimization objectives remain decoupled, i.e., πθd (at |st ) and πθc (at |st ) are
treated as independent distributions during policy optimization, rather than a joint
distribution encompassing both action spaces. The MO-PPO algorithm is summa-
rized in Algorithm 1.
51
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
Numerical Results
52
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
35
MO-PPO, CoMP-NOMA Optimal PS
30 MO-PPO, CoMP-OMA Optimal PS
convergence
20
15
10
0
100 200 300 400 500 600 700
Episodes
Figure 5.2: Average cumulative reward vs. number of training episodes with Pt = 20 dBm
and K = 120 elements.
15 NOMA gain
RIS gain
gain
C oM P
10
5
0 2 4 6 8 10 12 14 16 18 20
Transmit power per BS, Pt (dBm)
Figure 5.3: Sum rate vs. transmit power for different algorithms and configurations with
K = 120 elements.
53
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
22
Exhaustive Search
20 MO-PPO, Optimal PS
16
14
12
10
18 54 90 126 162
Number of RIS elements K
Figure 5.4: Impact of the number of RIS elements on the achievable sum rate with Pt = 10
dBm.
45
Obstacles Start
30
BS1
15 dmin
R
Uc 1
y (m)
-15
Uf
-30
BS2 Uc 2 Hover
-45
-45 -30 -15 0 15 30 45
x (m)
Figure 5.5: Top view of the UAV trajectory obtained by the MO-PPO algorithm sampled
every 25 time slots and averaged over 10 evaluation episodes
54
Chapter 5. Deep Reinforcement Learning for Intelligent NOMA Networks
55
Chapter 6
Conclusion
6.1 Summary
This thesis has investigated the potential of integrating STAR-RIS, CoMP, and
NOMA technologies to address the challenges of future wireless communication
systems. The research explored various aspects of these technologies, including
performance analysis, optimization strategies, and practical implementation consid-
erations.
56
Chapter 6. Conclusion
57
Chapter 6. Conclusion
vironments.
58
References
[7] L. Zhang, H. Zhao, S. Hou, Z. Zhao, H. Xu, X. Wu, Q. Wu, and R. Zhang,
“A survey on 5g millimeter wave communications for uav-assisted wireless net-
works,” IEEE Access, vol. 7, pp. 117460–117504, 2019.
[8] X. Liu, Y. Liu, Y. Chen, and L. Hanzo, “Trajectory design and power control
for multi-uav assisted wireless networks: A machine learning approach,” IEEE
Transactions on Vehicular Technology, vol. 68, no. 8, pp. 7957–7969, 2019.
[9] T. Hou, J. Wang, Y. Liu, X. Sun, A. Li, and B. Ai, “A joint design for STAR-
RIS enhanced NOMA-CoMP networks: A simultaneous-signal-enhancement-
59
References
[10] S. Xu, C. Liu, H. Wang, M. Qian, and J. Li, “On secrecy performance analysis
of multi-antenna star-ris-assisted downlink noma systems,” EURASIP Journal
on Advances in Signal Processing, vol. 2022, no. 1, p. 118, 2022.
[11] J. Chen, K. Zhai, Z. Wang, Y. Liu, J. Jia, and X. Wang, “Comp and ris-
assisted multicast transmission in a multi-uav communication system,” IEEE
Transactions on Communications, 2024.
[12] H. Mei, K. Yang, Q. Liu, and K. Wang, “3d-trajectory and phase-shift design for
ris-assisted uav systems using deep reinforcement learning,” IEEE Transactions
on Vehicular Technology, vol. 71, no. 3, pp. 3020–3029, 2022.
[13] H. Peng and L.-C. Wang, “Energy harvesting reconfigurable intelligent surface
for uav based on robust deep reinforcement learning,” IEEE Transactions on
Wireless Communications, vol. 22, no. 10, pp. 6826–6838, 2023.
[18] Y. Guo, Z. Qin, Y. Liu, and N. Al-Dhahir, “Intelligent reflecting surface aided
multiple access over fading channels,” IEEE Transactions on Communications,
vol. 69, no. 3, pp. 2015–2027, 2020.
60
References
[26] X. Yue, J. Xie, Y. Liu, Z. Han, R. Liu, and Z. Ding, “Simultaneously transmit-
ting and reflecting reconfigurable intelligent surface assisted noma networks,”
IEEE Transactions on Wireless Communications, vol. 22, no. 1, pp. 189–204,
2023.
61
References
[34] J. Zhao, L. Yu, K. Cai, Y. Zhu, and Z. Han, “RIS-aided ground-aerial NOMA
communications: A distributionally robust DRL approach,” IEEE J. Sel. Areas
Commun., vol. 40, no. 4, pp. 1287–1301, 2022.
[36] S. Lv, X. Xu, S. Han, and P. Zhang, “UAV-RIS assisted coordinated multipoint
finite blocklength transmission for MTC networks,” IEEE Internet Things J.,
2023.
[37] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv
preprint arXiv:1412.6980, 2014.
62