Enhancing Heterogeneous Network Performance: Advanced Content Popularity Prediction and Efficient Caching
Abstract
:1. Introduction
- Establishing an efficient caching strategy that minimizes communication latency and maximizes cache hit rates by addressing the challenges of dynamic content popularity and resource allocation in heterogeneous networks.
- Introducing an Long Short Term Memory(LSTM)-based content-popularity-prediction model for a caching placement strategy, using low-dimensional denoising encoders to capture content-request information and LSTM modules to analyze temporal features. This is coupled with a fully connected network for feature fusion, thereby enhancing prediction accuracy and effectively addressing the diversity of user requests in heterogeneous network environments.
- Representing the heterogeneous-edge-content-delivery problem as a Markov decision process (MDP) and providing a collaborative caching solution based on multi-agent actor–critic networks. This approach utilizes multi-head attention technology to account for the variability in a diverse network environment.
- Conducting a series of tests to validate the effectiveness of the proposed solution in improving network performance and reducing resource inefficiency, thus demonstrating significant improvements in content-popularity-prediction capability, cache hit rate, and system latency reduction compared to existing solutions and other reinforcement-learning methods.
2. Related Works
3. System Model
3.1. Network Model
3.2. Communication Model
3.3. Content-Caching Model
3.3.1. Content Placement Phase
3.3.2. Content Delivery Phase
3.4. Problem Description
4. LSTM-Based Content Placement Strategy
4.1. Content Popularity Prediction
4.2. Content-Popularity-Prediction Methods in Varying Environments
Algorithm 1 Long short-term memory–content popularity prediction (LSTM–CPP) algorithm. |
Require: H: Historical content request data, : Set of existing contents : Historical data observation-window length Ensure: : Predicted popularity for each content 1: Initialize the stacked auto-encoder (SAE) 2: for each content f in do 3: Extract the low-dimensional representation using an SAE trained on H 4: 5: end for 6: Initialize the LSTM network 7: for each representation do 8: Process with LSTM to capture the time-dimension features 9: 10: end for 11: Initialize the feature-fusion module 12: for each time slot t do 13: Fuse multi-dimensional features and to form 14: 15: Predict content popularity using a fully connected (FC) network on 16: 17: end for 18: for new content in each time slot do 19: Calculate similarity between and each content in 20: 21: Estimate the popularity of based on weighted similarity 22: 23: end for 24: return |
5. Collaborative Caching Strategies Based on Multi-Intelligent Actor–Critic Networks
5.1. Multi-Intelligent Actor–Critic Framework
- State Space:We defined the state space , where contained a composition of the local and global information for each agent. The local information indicated the state of the cached list, and the global information focused on the content requests of the MTs, including historical and current data.
- Action Space:The action space was denoted as , where and denoted the process where the caching decision is the agent’s decision on which content is to be stored or removed. The decision is based on the predicted content popularity and the current cache state. Moreover, denoted the bandwidth-allocation decision, which involves how the agent allocates the available bandwidth resources among the MTs it serves. Each element indicates the proportion of bandwidth allocated to the corresponding MTs or tasks.
- Reward Mechanism:We designed the reward function to evaluate the effect of an agent’s action according to the definition of the system model. The reward mechanism includes aspects such as reducing network latency, increasing cache hit rate, and improving the efficiency of bandwidth allocation. For behaviors that are effective in reducing network latency and improving cache hit rate, the agent will be positively rewarded. At the same time, we also considered the optimization of the bandwidth-allocation efficiency.For behaviors that reduce network latency, the agent should receive positive rewards:An increase in cache hit rate should also be rewarded, especially if it matches the popular content predicted by the LSTM–CPP algorithm:Efficient bandwidth allocation should be equally rewarded:
- Observation Mechanism:The observation mechanism enables each agent to obtain the state of the local cache in its service area, as well as a sense of the overall network state. In addition, the agents can obtain the state information of neighboring SBSs through the established communication links.
- Actor Networks:An actor network is a parameterized policy network that defines the probability of choosing a particular action given an observation. Each agent observes the local state of its service region, and it then makes a caching decision based on the current observation of its local state. For agent i, the actor network can be represented by the function , which outputs the probability distribution of taking action under a given observation as follows:Here, denotes the parameters of the actor network.
- Critic Network:The determination to cache actions is made by the actor network implemented in each agent, and it relies on local knowledge. After the action selection in the actor network, the critic network selects actions based on the states of all the BSs and actions , which guides the update of the actor network by evaluating the expected returns of the strategies through a value function. In addition, we introduced an attention mechanism to more accurately model the influence of other agents, thus making the evaluation process more accurate and efficient.
5.2. Critic Networks with Attention Mechanisms
5.2.1. Optimization of the Embedding Layer
5.2.2. Attention Mechanism Refinement
5.2.3. Integration of the Output Layer
5.3. Multi-Intelligence Actor–Critic Collaborative Caching Policy Algorithm
5.3.1. Strategy Evaluation and Iterative Optimization
5.3.2. Utilization of Dominance Functions
5.3.3. Update Mechanisms for the Critic and Actor Networks
5.4. Algorithm Complexity Analysis
Algorithm 2 Multi-intelligence actor–critic collaborative caching policy (MACP) algorithm. |
Input: Set of SBSs, each as an intelligent agent Initial set of states S Predicted content popularity as determined via the LSTM model Observation mechanism O Set of parameters for the actor networks Set of parameters for the critic networks Output: Updated set of parameters for the actor networks Updated set of parameters for the critic networks
|
6. Simulation Analysis
6.1. Simulation Parameter Configuration
6.2. Content-Popularity-Prediction Method
- LSTM: The content-popularity-prediction model of an LSTM uses a K-mean clustering algorithm to group different contents [38]. Moreover, the content popularity of each group was predicted using a long short-term memory network.
- DNN-LSTM: An end-to-end network from [39]. A two-layer fully connected network with [20, 10] neurons, as well as two-layer long and short-term memory network with [24, 24] neurons.
- CNN-LSTM: This method is different from DNN-LSTM. This model uses a one-dimensional convolutional neural network instead of a fully connected network [40].
6.3. Efficient Caching Strategy
- Actor–Critic (AC): This method employs a distinct memory structure that explicitly represents the policy, and this is true independent of the value function. Such a setup allows for a more nuanced policy development that is not directly tied to the value estimations [41].
- Deep Q Networks (DQN): DQN differs not only in its network structure, but also in its approach to feature handling. This method is grounded in local features and does not incorporate the influences from neighboring nodes’ environments. This limitation could impact the algorithm’s effectiveness in more interconnected or dynamic settings [42].
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Reiss-Mirzaei, M.; Ghobaei-Arani, M.; Esmaeili, L. A review on the edge caching mechanisms in the mobile edge computing: A social-aware perspective. Internet Things 2023, 22, 100690. [Google Scholar] [CrossRef]
- Somesula, M.K.; Rout, R.R.; Somayajulu, D. Cooperative cache update using multi-agent recurrent deep reinforcement learning for mobile edge networks. Comput. Netw. 2022, 209, 108876. [Google Scholar] [CrossRef]
- Wang, Q.; Chen, S.; Wu, M. Incentive-Aware Blockchain-Assisted Intelligent Edge Caching and Computation Offloading for IoT. Engineering 2023, in press. [Google Scholar] [CrossRef]
- Zyrianoff, I.; Gigli, L.; Montori, F.; Sciullo, L.; Kamienski, C.; Felice, M.D. Cache-it: A distributed architecture for proactive edge caching in heterogeneous iot scenarios. Ad Hoc Netw. 2024, 156, 103413. [Google Scholar] [CrossRef]
- Liu, W.-X.; Zhang, J.; Liang, Z.-W.; Peng, L.-X.; Cai, J. Content popularity prediction and caching for icn: A deep learning approach with sdn. IEEE Access 2017, 6, 5075–5089. [Google Scholar] [CrossRef]
- Zhou, H.; Jiang, K.; He, S.; Min, G.; Wu, J. Distributed deep multi-agent reinforcement learning for cooperative edge caching in internet-of-vehicles. IEEE Trans. Wirel. Commun. 2023, 22, 9595–9609. [Google Scholar] [CrossRef]
- Lee, D.D.; Pham, P.; Largman, Y.; Ng, A. Advances in neural information processing systems 22. In Proceedings of the 26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012, Lake Tahoe, NV, USA, 3–6 December 2009. [Google Scholar]
- Sultan, M.T.; Sayed, H.E. QoE-aware analysis and management of multimedia services in 5 g and beyond heterogeneous networks. IEEE Access 2023, 11, 77679–77688. [Google Scholar] [CrossRef]
- Salim, M.M.; Elsayed, H.A.; Elaziz, M.A.; Fouda, M.M.; Abdalzaher, M.S. An optimal balanced energy harvesting algorithm for maximizing two-way relaying d2d communication data rate. IEEE Access 2022, 10, 114178–114191. [Google Scholar] [CrossRef]
- Fang, S.; Tang, R.; Guo, X. An adaptive adjusting density scheme of small cell base stations in heterogeneous cell networks. In Proceedings of the 5th International Conference on Communication and Information Processing, Chongqing, China, 15–17 November 2019; pp. 221–225. [Google Scholar]
- Abdalzaher, M.S.; Moustafa, S.S.; Hafiez, H.A.; Ahmed, W.F. An optimized learning model augment analyst decisions for seismic source discrimination. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–12. [Google Scholar] [CrossRef]
- Choi, Y.; Lim, Y. Deep reinforcement learning-based edge caching in heterogeneous networks. J. Inf. Process. Syst. 2022, 18, 803. [Google Scholar]
- Li, Y.; Ma, H.; Wang, L.; Mao, S.; Wang, G. Optimized content caching and user association for edge computing in densely deployed heterogeneous networks. IEEE Trans. Mob. Comput. 2020, 21, 2130–2142. [Google Scholar] [CrossRef]
- Tang, J.; Tang, H.; Zhang, X.; Cumanan, K.; Chen, G.; Wong, K.-K.; Chambers, J.A. Energy minimization in d2d-assisted cache-enabled internet of things: A deep reinforcement learning approach. IEEE Trans. Ind. Inform. 2019, 16, 5412–5423. [Google Scholar] [CrossRef]
- Tang, J.; Tang, H.; Zhao, N.; Cumanan, K.; Zhang, S.; Zhou, Y. A reinforcement learning approach for d2d-assisted cache-enabled hetnets. In Proceedings of the 2019 IEEE Global Communications Conference (GLOBECOM), Waikoloa, HI, USA, 9–13 December 2019; pp. 1–6. [Google Scholar]
- Sadeghi, A.; Wang, G.; Giannakis, G.B. Deep reinforcement learning for adaptive caching in hierarchical content delivery networks. IEEE Trans. Cogn. Commun. Netw. 2019, 5, 1024–1033. [Google Scholar] [CrossRef]
- Hou, J.; Xia, H.; Lu, H.; Nayak, A. A graph neural network approach for caching performance optimization in ndn networks. IEEE Access 2022, 10, 112657–112668. [Google Scholar] [CrossRef]
- Hassine, N.B.; Milocco, R.; Minet, P. ARMA based popularity prediction for caching in content delivery networks. In Proceedings of the 2017 Wireless Days, Porto, Portugal, 29–31 March 2017; pp. 113–120. [Google Scholar]
- Ale, L.; Zhang, N.; Wu, H.; Chen, D.; Han, T. Online proactive caching in mobile edge computing using bidirectional deep recurrent neural network. IEEE Internet Things J. 2019, 6, 5520–5530. [Google Scholar] [CrossRef]
- Jiang, Y.; Feng, H.; Zheng, F.-C.; Niyato, D.; You, X. Deep learning-based edge caching in fog radio access networks. IEEE Trans. Wirel. Commun. 2020, 19, 8442–8454. [Google Scholar] [CrossRef]
- Lin, Z.; Sun, X.; Ji, Y. Landslide displacement prediction model using time series analysis method and modified lstm model. Electronics 2022, 11, 1519. [Google Scholar] [CrossRef]
- Mannepalli, K.; Singh, S.P.; Kolli, C.S.; Raj, S.; Bojja, G.R.; Rajakumar, B.; Binu, D. Popularity prediction model with context, time and user sentiment information: An optimization assisted deep learning technique. Int. J. Uncertain. Fuzziness-Knowl.-Based Syst. 2023, 31, 283–302. [Google Scholar] [CrossRef]
- Hu, Z.; Fang, C.; Wang, Z.; Tseng, S.-M.; Dong, M. Many-objective optimization based-content popularity prediction for cache-assisted cloud-edge-end collaborative iot networks. IEEE Internet Things J. 2023, 11, 1190–1200. [Google Scholar] [CrossRef]
- Shekhar, S.; Singh, A.; Gupta, A.K. A deep neural network (dnn) approach for recommendation systems. In Advances in Computational Intelligence and Communication Technology: Proceedings of CICT 2021; Springer: Singapore, 2022; pp. 385–396. [Google Scholar]
- Zhang, R.; Yu, F.R.; Liu, J.; Huang, T.; Liu, Y. Deep reinforcement learning (drl)-based device-to-device (d2d) caching with blockchain and mobile edge computing. IEEE Trans. Wirel. Commun. 2020, 19, 6469–6485. [Google Scholar] [CrossRef]
- Zhong, C.; Gursoy, M.C.; Velipasalar, S. Deep multi-agent reinforcement learning based cooperative edge caching in wireless networks. In Proceedings of the ICC 2019-2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; pp. 1–6. [Google Scholar]
- Yan, H.; Xu, X.; Bilal, M.; Xia, X.; Dou, W.; Wang, H. Customer centric service caching for intelligent cyber-physical transportation systems with cloud-edge computing leveraging digital twins. IEEE Trans. Consum. Electron. 2023. [Google Scholar] [CrossRef]
- Lu, Y.; Zhang, P.; Duan, Y.; Guizani, M.; Wang, J.; Li, S. Dynamic scheduling of iov edge cloud service functions under nfv: A multi-agent reinforcement learning approach. IEEE Trans. Veh. Technol. 2023. [Google Scholar] [CrossRef]
- Li, D.; Zhang, H.; Ding, H.; Li, T.; Liang, D.; Yuan, D. User preference learning-based proactive edge caching for d2d-assisted wireless networks. IEEE Internet Things J. 2023, 10, 11922–11937. [Google Scholar] [CrossRef]
- Garetto, M.; Leonardi, E.; Traverso, S. Efficient analysis of caching strategies under dynamic content popularity. In Proceedings of the 2015 IEEE Conference on Computer Communications (INFOCOM), Hong Kong, China, 26 April–1 May 2015; pp. 2263–2271. [Google Scholar]
- Li, D.; Han, Y.; Wang, C.; Shi, G.; Wang, X.; Li, X.; Leung, V.C. Deep reinforcement learning for cooperative edge caching in future mobile networks. In Proceedings of the 2019 IEEE Wireless Communications and Networking Conference (WCNC), Marrakesh, Morocco, 15–18 April 2019; pp. 1–6. [Google Scholar]
- Wang, C.; Liang, C.; Yu, F.R.; Chen, Q.; Tang, L. Computation offloading and resource allocation in wireless cellular networks with mobile edge computing. IEEE Trans. Wirel. Commun. 2017, 16, 4924–4938. [Google Scholar] [CrossRef]
- Al-Shehari, T.; Alsowail, R.A. An insider data leakage detection using one-hot encoding, synthetic minority oversampling and machine learning techniques. Entropy 2021, 23, 1258. [Google Scholar] [CrossRef]
- Haarnoja, T.; Zhou, A.; Abbeel, P.; Levine, S. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International Conference on Machine Learning; PMLR: London, UK, 2018; pp. 1861–1870. [Google Scholar]
- Wang, X.; Wang, C.; Li, X.; Leung, V.C.; Taleb, T. Federated deep reinforcement learning for internet of things with decentralized cooperative edge caching. IEEE Internet Things J. 2020, 7, 9441–9455. [Google Scholar] [CrossRef]
- ElSawy, H.; Sultan-Salem, A.; Alouini, M.-S.; Win, M.Z. Modeling and analysis of cellular networks using stochastic geometry: A tutorial. IEEE Commun. Surv. Tutor. 2016, 19, 167–203. [Google Scholar] [CrossRef]
- Li, D.; Zhang, H.; Yuan, D.; Zhang, M. Learning-based hierarchical edge caching for cloud-aided heterogeneous networks. IEEE Trans. Wirel. Commun. 2023, 22, 1648–1663. [Google Scholar] [CrossRef]
- Xu, H.; Sun, Y.; Gao, J.; Guo, J. Intelligent edge content caching: A deep recurrent reinforcement learning method. Peer-to-Peer Netw. Appl. 2022, 15, 2619–2632. [Google Scholar] [CrossRef]
- Zheng, H.; Lin, F.; Feng, X.; Chen, Y. A hybrid deep learning model with attention-based conv-lstm networks for short-term traffic flow prediction. IEEE Trans. Intell. Transp. Syst. 2021, 22, 6910–6920. [Google Scholar] [CrossRef]
- Zhang, L.; Cai, Y.; Huang, H.; Li, A.; Yang, L.; Zhou, C. A cnn-lstm model for soil organic carbon content prediction with long time series of modis-based phenological variables. Remote Sens. 2022, 14, 4441. [Google Scholar] [CrossRef]
- Peters, J.; Schaal, S. Natural actor-critic. Neurocomputing 2008, 71, 1180–1190. [Google Scholar] [CrossRef]
- Li, R.; Zhao, Y.; Wang, C.; Wang, X.; Leung, V.C.; Li, X.; Taleb, T. Edge caching replacement optimization for d2d wireless networks via weighted distributed dqn. In Proceedings of the 2020 IEEE Wireless Communications and Networking Conference (WCNC), Seoul, Republic of Korea, 25–28 May 2020; pp. 1–6. [Google Scholar]
Symbol | Description |
---|---|
Set of MBS and SBSs | |
Set of all MTs within the coverage area of SBS m | |
t | Time slot t |
Data transmission rate | |
Channel gain between SBS m and MT n in time slot t | |
Transmit power | |
Background-noise power | |
Bandwidth of SBSs | |
Bandwidth percentage allocated by SBS m to corresponding MT n | |
Set of contents stored in the content server | |
Size of each content | |
and | Cache capacity of servers equipped in SBSs and MBS |
H | Historical request data of contents |
Length of historical-data observation window | |
Categories of content popularity for different service ranges | |
Interest category of each base station | |
Request probability of content f | |
Popularity ranking of content f in interest category k | |
Reflects the skewness of the Zipf distribution | |
Content-caching decision of BS m in time slot t | |
D | Delay function |
Cache hit rate in time slot t | |
Number of requests for content f in time slot t | |
Number of requests for all content within the coverage area of base station m | |
Weight matrix | |
Bias vector | |
Input gate of the LSTM network | |
Forget gate of the LSTM network | |
Output gate of the LSTM network | |
Cell state of the LSTM network | |
Hidden-layer output variable of the LSTM network | |
Similarity between new content f and existing content | |
State space | |
Action space | |
Reward function | |
O | Observation mechanism |
Update rule typically involves the advantage function | |
Key vector | |
Value vector | |
Critic network of each BS estimates the action-value function through attention mechanism | |
MLP layer | |
Decay factor | |
Balances between the maximum entrance and reward | |
Network-update parameter | |
Learning rate |
Symbol | Setting | Parameter |
---|---|---|
4 | Number of base stations | |
40 | Number of mobile terminals | |
10 MHz | Size of bandwidth | |
1 w | Transmission power of SBS | |
10−14 | Power of background noise | |
10 ms | Delay of content delivery from neighboring SBS to SBS | |
10 ms | Delay of content delivery from MBS to SBS | |
100 Mb/s | Average data rate of the core network | |
0.003 | SBS density | |
10 Mb | Capacity of SBS-equipped server cache | |
20 Mb | Capacity of MBS-equipped server cache | |
1000 | Number of contents | |
1 Mb | Size of each content | |
7 | Step size of LSTM |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sun, Z.; Chen, G. Enhancing Heterogeneous Network Performance: Advanced Content Popularity Prediction and Efficient Caching. Electronics 2024, 13, 794. https://doi.org/10.3390/electronics13040794
Sun Z, Chen G. Enhancing Heterogeneous Network Performance: Advanced Content Popularity Prediction and Efficient Caching. Electronics. 2024; 13(4):794. https://doi.org/10.3390/electronics13040794
Chicago/Turabian StyleSun, Zhiyao, and Guifen Chen. 2024. "Enhancing Heterogeneous Network Performance: Advanced Content Popularity Prediction and Efficient Caching" Electronics 13, no. 4: 794. https://doi.org/10.3390/electronics13040794