Machine Learning With Computer Networks: Techniques, Datasets, and Models
ABSTRACT Machine learning has found many applications in network contexts, including solving optimisation problems and managing network operations. Conversely, networks are essential for facilitating machine learning training and inference, whether performed centrally or in a distributed fashion. To conduct rigorous research in this area, researchers must have a comprehensive understanding of fundamental techniques and specific frameworks, and they need access to relevant datasets, which can serve as benchmarks or as springboards for further investigation. This article summarizes these techniques, frameworks, and datasets, serving as a primer and hopefully providing an efficient start for anybody doing research on machine learning for networks or on using networks for machine learning.
I. INTRODUCTION
Artificial Intelligence (AI) refers to machines or systems that can perform tasks typically requiring human intelligence, such as learning, reasoning, problem-solving, perception, language understanding, and decision-making. ML is a subfield of AI that concentrates on developing algorithms and statistical models. These models enable computers to perform tasks without explicit programming. In other words, it involves using statistical techniques to enable machines to learn from data and improve their performance over time. ML models repeatedly show their potential for delivering high-quality output (e.g. classifications/decisions, regression values, and generated artifacts) in highly complex environments with non-trivial decision boundaries. Generally, for that sort of environment, the proposed ML model greatly reduces the compute resources needed to generate an adequate response and/or generates outputs that are much "better" than what existing models could deliver. That being said, for more complex problem domains, most ML approaches require substantial amounts of compute resources and training data. Since most ML models aim at generalizing from specific records of data, the quality of these data samples is essential to the overall model performance. This often means that large amounts of data records are required to depict a sufficiently representative portion of the problem's data domain. Also, more sophisticated models can quickly explode in terms of parameter/compute operation count and thus often require specialized training hardware (i.e. memory and compute). Nevertheless, the continuous improvement of hardware as well as the increased attention towards training data acquisition, preparation, and generation has paved the way for ML to enter more and more application domains.

Computer networking is a highly complex problem domain with a plethora of tasks and problems that, to this day, are solved predominantly through hand-crafted, algorithmic, or heuristic methods. These methods have to respect a wide range of topologies, network types and scopes, configurations, hardware and protocol stacks, traffic patterns, and other sources of variation. Furthermore, there are many different ways to assess network performance, and in many cases, minimum performance guarantees and security policies add special constraints to the optimization problem. Additionally, contemporary networks use specialized hardware to deliver optimized performance, e.g. for forwarding packets at line speed. Oftentimes, this hardware does not easily allow ML models to replace existing functionality, e.g. because certain types of computations are not supported or because the storage is not available for more complex ML models. Finally, while network administrators and networking researchers do monitor their networks in action, the amount of useful ML training data in networking – data that is not noisy or incomplete, is publicly available, and is diverse enough to cover large parts of the problem's underlying data domain – is only a fraction of what other problem domains have at their disposal. As a consequence, optimizing network performance has so far been largely beyond the reach of ML research. However, given the increased visibility of ML, researchers are beginning to take on the aforementioned challenges of the networking community with ML, and combining ML and networking in research seems more attractive than ever. Furthermore, computer network infrastructures have recently been used to improve the performance of existing ML approaches, e.g. by distributing the training process or the data collection to improve resource utilization or training speed.

ML is a very active and rapidly expanding research field that includes an abundance of learning techniques, model types, tools and frameworks, practices, and application possibilities. Although we focus here on ML models, some applications require considering the whole running system, i.e., the AI system, to properly evaluate and understand the output, instead of focusing solely on the ML models [4]. This paper is intended as a primer/practical guide for researchers who are keen on quickly applying ML to problems in computer networking and/or leveraging networking techniques to improve the performance of their ML systems but feel overwhelmed by the possibilities the intersection of ML and computer networking provides. The key points of the paper are the following:
• It first introduces the most relevant concepts and model architectures of ML and then puts them into the context of the different networking problem domains and the latest advancements therein,
• It exposes the currently open problems within computer networking and introduces a selection of different tools, data sets, and approaches that have been popular among the research community and might serve as a starting point for future work,
• It covers several techniques for utilizing networks to improve ML efficiency, such as reducing resource requirements via Split Learning (SL) and distributed training via Federated Learning (FL) or incorporating the right inductive biases into ML models to improve their ability to generalize from limited data,
• It discusses challenges related to networks for ML, such as resource constraints, security concerns, and the lack of understanding of how ML models make decisions (and how techniques such as Explainable Artificial Intelligence (XAI) may help in gaining understanding),
• It comprehensively provides pointers for further study on related surveys and research.

The organization of the paper is visualized in Figure 1, and the remainder is organized as follows: Section II explains the basic concepts and categories of ML and relates common networking problems to them. Section III introduces the ML subfield of deep learning, which has been responsible for most of the recent ML breakthroughs, elaborating on the most common model architectures and how and why they are suited to specific tasks within computer networking. Thereafter, Section IV sheds light on the variety of accessible data sets, tools, and frameworks that ease the development and training of ML-powered networking systems. Section V discusses explainability in Artificial Intelligence (XAI), which is rightfully gaining traction because many recently tapped application domains (including computer networks) come with amounts of complexity and risk that disqualify fully black-box ML models for widespread adoption. Section VI broadens the scope presented up until now and introduces ML techniques and paradigms such as distributed and parallel learning. These techniques leverage existing networking concepts and technology and seem useful, if not mandatory, for many problems in the networking domain. Section VII and Section VIII give an overview of related survey papers and open challenges in the concerned areas, and finally, Section IX concludes this paper by summarizing the presented content and providing perspectives on the open challenges and questions of ML in networking and vice versa.

II. MACHINE LEARNING
ML is a subfield of AI [5]. ML models are statistically and computationally derived from evidence in the form of historical data or experience instead of explicitly programming a machine for a task. The three traditional ML paradigms are supervised, unsupervised, and Reinforcement Learning (RL). Methods can be categorized into these paradigms by the type of feedback the learning system receives. In supervised learning, exact feedback is available in the form of data labels. In unsupervised learning, on the other hand, data is only partially labeled or completely unlabeled. Finally, in RL, implicit feedback is available for observed data in terms of a so-called reward function that labels data by a numerical value. We will now discuss the three main ML paradigms with a focus on the most popular ones. We then briefly touch on some additional branches of ML that are relevant to computer networking.
A. SUPERVISED LEARNING
The starting point of every supervised learning problem is a data set that consists of input-output data points D = (x1, y1), (x2, y2), . . . , (xN, yN). The goal is to learn a function h mapping from the input domain to the target domain such that ŷi = h(xi) for all data points. Both input and output domains can take various shapes, such as boolean or scalar values, Euclidean vectors, or more complex representations such as graphs. Depending on the type of output domain, supervised learning is generally divided into classification and regression problems. Examples of popular network applications that use supervised learning are traffic prediction [6] and classifying security attacks [7].

1) CLASSIFICATION
In classification problems, the output domain is finite, e.g. true/false, sunny/cloudy/rainy, or the set of digits 0-9. Examples from the networking domain include anomaly detection [8] ("Given the current network monitoring data, is the network showing abnormal behavior?") and failure prediction [9] ("Is this network node going to fail?"). The most fundamental models for classification are explained in the following paragraphs.
• Support Vector Machines (SVMs) [10] aim at constructing a so-called maximum margin separator - a decision boundary that divides samples of two different classes with the maximum possible distance to the boundary. This situation is depicted in Figure 2a. The solid black line represents the maximum margin separator and the two dashed lines visualize the margins to both classes. The data points nearest to the separator are called the support vectors (red circles), as they support the position of the decision boundary. Generally, the larger the margin, the better the generalization of the model, as it reduces the risk of misclassifying new, unseen data. Since the decision boundary is a separating hyperplane, the classification task fails for data that is not linearly separable. However, SVMs can also be used for non-linearly separable data by applying the kernel trick. It transforms the data into a higher-dimensional space where it becomes linearly separable and a separating hyperplane is calculated. When that linear hyperplane is transformed back into the original space, it becomes a non-linear or even incoherent hypersurface.
• Decision Trees [11] are structured like an inverted tree, with a root node at the top, branching out into internal nodes, and ending in leaf nodes at the bottom. The data is split at the root node and the internal nodes based on a threshold value for a feature. The splitting process continues until a stopping criterion is met, such as reaching a maximum depth or a minimum number of samples in a leaf node. Leaf nodes represent the final predictions of the decision tree; the majority class in each leaf node is used as the prediction. Figure 2b visualizes a simple example decision tree (right side) with a root node and two leaf nodes for the same data set as in the SVM example (left side). Data points where Feature2 ≤ −0.103 are assigned Class 1, all others are assigned Class 2. This decision boundary is visualized as the color step from green to blue in the left plot. Decision Trees are a simple yet powerful tool to reach conclusions from input data with a high degree of human explainability (see Section V).
• Random Forests [12] create a collection of decision trees, each trained on a different subset of the training data. To achieve as little inter-tree correlation as possible, a random subset of features is considered for each split at each node when constructing the decision trees. The results of all individual decision trees are aggregated to a final prediction: the class with the majority vote among the trees is chosen. Figure 2c visualizes the decision boundary of a random forest with three individual trees for the same data set used for the SVM and decision tree examples. Compared to single decision trees, random forests are known to improve the prediction accuracy as well as to reduce overfitting [12].
• The k-Nearest Neighbors (KNN) [13] algorithm is a simple technique to assign class labels to new data points by examining the class labels of their k nearest neighbors with known labels. Given the features of the input data point, these k nearest neighbors are determined by calculating a distance metric in the input space, e.g., the Euclidean distance, Manhattan distance, or Minkowski distance. The class label of the majority of these neighbors is then inherited for the new data point. Figure 2d visualizes the decision boundary for the known data set using the Minkowski distance and k = 5.

FIGURE 2. Visualization of different classification methods based on a data set containing 30 samples and two features: 15 samples of each of two classes that are perfectly linearly separable, with added Gaussian noise (mean 0, standard deviation 0.8) that makes classification errors likely.
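To make these methods concrete, the following minimal sketch trains all four classifiers on a synthetic two-feature data set loosely resembling the one in Figure 2. It assumes the scikit-learn library is installed; the data set and all parameter choices are illustrative, not prescriptive:

```python
# Sketch: the four classifiers above on a synthetic two-class data set
# with two features (illustrative stand-in for the data in Figure 2).
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=30, n_features=2, n_informative=2,
                           n_redundant=0, class_sep=1.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

models = {
    "SVM (RBF kernel)": SVC(kernel="rbf"),  # kernel trick for non-linear data
    "Decision Tree": DecisionTreeClassifier(max_depth=3),
    "Random Forest": RandomForestClassifier(n_estimators=3),
    "KNN (k=5)": KNeighborsClassifier(n_neighbors=5, metric="minkowski"),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    print(name, "accuracy:", model.score(X_test, y_test))
```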
2) REGRESSION
In regression problems, the output domain is continuous, e.g., Rn (n ≥ 1). Examples from the networking domain include network performance prediction [14] ("How will the network perform in the future, given certain network conditions and traffic?") and traffic prediction [15] ("How much / which type of traffic will be generated in the near future?"). In principle, any function fw with learnable parameters w can serve as a regression model. However, the structure of fw and the optimization procedure used to update the learnable parameters are crucial to finding good function parameters efficiently. The most fundamental regression methods will be explained in the following paragraphs. All of the aforementioned classification methods can also be used for regression with slight modifications.
• Support Vector Regression (SVR) [16] is an extension of SVMs for regression tasks. It aims to find a function f that approximates the relationship between input features and continuous target values with a certain degree of error tolerance. The error tolerance (ϵ) defines an ϵ-tube around f. Inside this tube, errors from the regression model are not penalized. The algorithm maximizes the number of training data points inside this tube. ϵ is a parameter defined by the user. Similar to SVMs, the kernel trick can be applied to create non-linear SVRs.
• Decision Trees [11] can be used for regression tasks by using the average value of the samples in each leaf node as the prediction value.
• Random Forests [12] for regression use the average of the individual trees' predictions as the final prediction value.
• The KNN [13] method for regression calculates the label for the new data point by calculating the average target value of its k nearest neighbors.
• The most popular regression method is least-squares fitting, in which the model is updated to minimize the squared L2 norms of the difference between the predicted values and their associated labels. This is known as the Mean Squared Error (MSE). In linear regression, the fitted function is a linear function, while in logarithmic regression, it is a logarithmic function. In other words, the least-squares method fits a line to the data points in a way that minimizes the sum of the squared vertical distances between the line and the points.
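The least-squares idea can be written down in a few lines. The following sketch fits a line in closed form via the normal equations; the noisy "load vs. throughput" relationship is a purely illustrative assumption, not taken from any of the cited datasets:

```python
# Sketch: least-squares linear regression, minimizing the MSE described
# above, using numpy's built-in least-squares solver.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)            # e.g., offered load (illustrative)
y = 2.5 * x + 1.0 + rng.normal(0, 1, 50)   # noisy linear relationship

X = np.column_stack([np.ones_like(x), x])  # design matrix with bias column
w, *_ = np.linalg.lstsq(X, y, rcond=None)  # minimizes ||Xw - y||^2
mse = np.mean((X @ w - y) ** 2)
print(f"intercept={w[0]:.2f}, slope={w[1]:.2f}, MSE={mse:.3f}")
```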
B. UNSUPERVISED LEARNING
As opposed to supervised learning, in unsupervised learning, the data comes without output/target values. Consequently, ML models are tasked with finding the underlying regularities in the data domain by inferring them from the given training data. The two main types of unsupervised learning, namely clustering and dimensionality reduction, differ in their use case. Unsupervised learning has been used for tasks such as anomaly detection, intrusion detection [17] and data traffic analyses [18].

1) CLUSTERING
Clustering approaches use the data points' feature values to find regularities in the data domain and thus divide them into multiple semantically meaningful categories. Clustering approaches such as k-means or Density-Based Spatial Clustering of Applications with Noise (DBSCAN) [19] differ in the way cluster affiliation is calculated, for example, through data density or neighbor connectivity via a measurable distance between the data points. Within the networking domain, data grouping can serve as a useful starting point for further analysis and action in a variety of problem settings, such as anomaly detection and resolution [20], task classification for scheduling [21], or traffic characterization for traffic engineering [22].

In general, there are different metrics to evaluate the performance of ML algorithms. Table 3 shows the most common metrics appearing in the literature for supervised learning (with an emphasis on classification metrics that are typically used for evaluating traffic prediction) and unsupervised learning (with an emphasis on clustering metrics as seen in intrusion detection as well as node(s) selection for data collection).
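The following sketch groups hypothetical flow records by feature similarity with the two clustering methods just mentioned. The feature choice and all parameters are illustrative assumptions, e.g. for traffic characterization as a first analysis step:

```python
# Sketch: k-means and DBSCAN (scikit-learn) on synthetic flow features.
import numpy as np
from sklearn.cluster import KMeans, DBSCAN
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)
# Hypothetical flow features: [mean packet size, flow duration]
flows = np.vstack([rng.normal([200, 0.5], [20, 0.1], (50, 2)),
                   rng.normal([1400, 30.0], [50, 5.0], (50, 2))])
X = StandardScaler().fit_transform(flows)   # normalize feature scales

kmeans_labels = KMeans(n_clusters=2, n_init=10).fit_predict(X)
dbscan_labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(X)  # -1 = noise
print(np.bincount(kmeans_labels), set(dbscan_labels))
```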
2) DIMENSIONALITY REDUCTION
This type of learning analyzes the statistical properties of the data in order to reduce the number of dimensions that sufficiently describe the data. This is particularly useful when dealing with more complex learning problems, as theoretical results show that the amount of data points needed to learn an accurate model scales exponentially with the dimensionality of the input data domain [23] (this phenomenon has been coined the "curse of dimensionality"). While approaches like Decision Trees or Random Forests can reduce the dimensionality of the relevant portions of the data by considering only the most meaningful features, approaches like Principal Component Analysis (PCA) [24] find a reduced-cardinality combination of new features. Like clustering, this type of unsupervised learning is beneficial as a preparative step before further analysis or model training, especially since, in many real-world scenarios, it has been observed that the given data lies on manifolds of much lower dimensionality than the actual input space (the presumed general rule for this is called the manifold hypothesis [25]).

Table 1 summarizes supervised methods, while Table 2 summarizes unsupervised methods. Further details can be found in [26] and [27]. Regardless of which method is used, it is important to watch out for over- and underfitting. Overfitting is a condition where a statistical model begins to describe the random error in the data rather than the relationships between variables. This problem occurs when the model is too complex. Underfitting, on the other hand, is the inverse of overfitting. It means that the statistical or ML model is too simplistic to accurately capture the patterns in the data. A sign of underfitting is high bias and low variance in the current model.

C. FURTHER ML METHODS
There are various other branches of ML that are of use in computer networks, see [28] and [29]. Here, we discuss two additional ML frameworks that are presumably relevant in the networking domain.

1) PROBABILISTIC ML
Oftentimes, neither all relevant information is known or attainable prior to making a decision, nor is the environment that reacts to the taken decision purely deterministic [5]. Uncertainty may exist in the input data, in the decision model parameters and output values, and even in the architecture of the decision model itself [30]. In all of these cases, probability theory provides a unified framework to cope by using probability distributions to model uncertain quantities. This framework is, in principle, applicable to all ML learning paradigms, model architectures, and problem domains that come with some notion of uncertainty. Since its comprehensive introduction would exceed this paper's scope, we point the interested reader to [30] for a high-level overview, and [31] for an extended overview of the core concepts of probabilistic ML.

2) HYBRID LEARNING APPROACHES
Many ML contributions do not fully fall into one of the aforementioned learning paradigms but rather combine their ideas and create new sources for learning signals. Some of these "hybrid" learning approaches are popular enough to earn their own description. In semi-supervised learning, typically, only parts of the training data are labeled [27]. To train a model in a supervised or unsupervised manner, auxiliary information is extracted by respectively using the other learning type. Self-supervised learning, on the other hand, tackles shortcomings of supervised learning approaches (i.e., the need for large amounts of data and vulnerability to adversarial inputs) by using parts or representations of the input data as labels [32]. For example, in [33], a model is trained to predict future video frames by only feeding it the first few frames of a video and using the remaining frames as "comparison" labels.

D. REINFORCEMENT LEARNING
In the spectrum of traditional learning paradigms for intelligent agents, Reinforcement Learning (RL) is located between the two extreme domains of fully supervised and unsupervised learning. RL is particularly suitable for decision, control, and optimization problems where data and observations are received sequentially [34]. As such, RL can be applied to various challenging problems in network science [35], [36], [37]. Especially Deep RL (DRL) methods, to be discussed in Section III-C, have seen tremendous success in solving resource allocation problems in computer networking [38].

The implementation of RL is based on an RL agent that receives performance feedback called rewards as the agent interacts with an environment over time [39]. The algorithm designer typically crafts the reward as a function of the agent's sequential observations. The rewards, however, do not provide exact instructive feedback on how to change the agent's behavior, hence RL's place in the spectrum of learning paradigms. In this section, we will describe the basics of RL and the most fundamental algorithms. Throughout, we will directly refer to applications in computer networking for almost all mentioned algorithms.

The interaction of an RL agent with its environment is described by a Markov Decision Process (MDP) as illustrated in Figure 3. Whenever one seeks to solve a problem using RL, the first step (arguably the most important) is to define the problem as an MDP. Based on this MDP, one then chooses or designs a suitable RL algorithm to find a solution to the MDP. In general, an MDP can be considered as a system that can assume states s from a state space S. The MDP transitions to a new state s′ according to a controlled transition probability p(s′ | s, a) that depends on the action a chosen by the agent.
In discounted MDPs, more weight is given to rewards in the near future, and the weights of future rewards decay geometrically. For background on average and total cost MDPs, see [40, Chapter 4 & 5].

Given a policy π, the associated action-value function (also called Q-function) is defined as

Q^π(s, a) := E_π[R | s1 = s, a1 = a].    (1)

The Q-function is a fundamental object in RL and describes what accumulated reward R one can expect if we are in state s, take action a, and follow the policy π for all future states. Furthermore, for finite action spaces, the Q-function can directly be used to implement a policy by setting π(s) = argmax_{a∈A} Q(s, a). This makes RL algorithms that seek to find or approximate the optimal Q-function³ attractive since they immediately lead to simple, implementable policies.

³ The optimal Q-function is given by the solution to Bellman's equation [41, Section 5.6].

1) BASIC RL ALGORITHMS
RL algorithms can be roughly divided into three groups: value-based methods, policy-based methods, and actor-critic methods. Value-based methods seek to find or approximate value functions like the Q-function. Policy-based methods instead seek to optimize a policy π directly. Value-based methods, therefore, yield an implicit policy, whereas policy-based methods yield an explicit policy. Actor-critic methods combine learning in value- and policy-space and use a learned value function to "guide" the training of an explicit policy. See [41, p. 36] for an illustration of the actor-critic feedback loop.

Before stating some examples of popular RL algorithms, we have to distinguish some typical MDP and RL settings:
1) Continuous state (e.g. S = R^n) vs. finite state MDPs (e.g. S = {1, . . . , d}).
2) Continuous action vs. finite action MDPs.
3) Model-based vs. model-free RL problems.
Model-based RL usually focuses on offline planning of value functions and policies, where either the transition function p is given or where p will be approximated [42]. Model-free RL methods instead seek to determine what action to take in a given state without knowledge of p, e.g., solely by observing MDP transitions (s, a, r, s′).

The traditional algorithms for model-based RL in finite state and action MDPs are value and policy iteration [40, Chapter 2]. Some recent applications of modern value iteration algorithms in the context of networking are age-of-information minimization in wireless broadcast networks [43] and multi-agent routing [44]. The most well-known model-free RL algorithm is tabular (simulation-based) Q-Learning, which seeks to find the optimal Q-function. Under simple conditions, Q-Learning is guaranteed to converge to the optimal Q-function if all states and actions are explored infinitely often [40, Section 6.6.1]. Q-Learning has been successfully applied to various problems in computer networks, e.g., network self-organization [45], network slicing [46] or virtual network embedding [47].
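A minimal sketch of tabular Q-Learning may make this concrete. The environment object env (assumed here to expose a Gymnasium-style reset()/step() interface with integer-indexed states) and all hyperparameters are assumptions of the sketch:

```python
# Sketch: tabular Q-Learning for a finite MDP, e.g. a small routing or
# self-organization problem with integer-indexed states and actions.
import numpy as np

def q_learning(env, n_states, n_actions, episodes=1000,
               alpha=0.1, gamma=0.99, eps=0.1):
    Q = np.zeros((n_states, n_actions))
    for _ in range(episodes):
        s, _ = env.reset()
        done = False
        while not done:
            # epsilon-greedy trade-off between exploration and exploitation
            if np.random.rand() < eps:
                a = np.random.randint(n_actions)
            else:
                a = int(np.argmax(Q[s]))
            s_next, r, terminated, truncated, _ = env.step(a)
            # update towards the Bellman target r + gamma * max_a' Q[s', a']
            Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
            s, done = s_next, terminated or truncated
    return Q  # greedy policy: pi(s) = argmax_a Q[s, a]
```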
2) RL WITH FUNCTION APPROXIMATION
The rise of RL as a powerful tool for decision-making is largely due to the effective use of function approximation. When the state space S becomes large or continuous, the traditional algorithms become impractical. Function approximation solves this problem by enabling RL agents to infer information about unseen state-action pairs from observed state-action pairs. The approximation may be used in policy space, in value space, or in both. For example, a Q-function Q(s, a) can be approximated by a function Qθ(s, a) with parameters θ. This is the basis of deep Q-learning, to be explained in Section III-C. Traditionally, function approximation was an important part of RL even before the rise of deep neural networks [48].

In Section III-C, we will discuss RL with deep neural networks as function approximators. Here, we highlight the traditional class of stochastic policy gradient algorithms with policy function approximation for MDPs with finite action space. Define a stochastic policy πθ(s, a) with parameters θ; πθ(s, a) maps states to a distribution on A.

The stochastic policy gradient theorem [49] has given rise to a large class of algorithms, where Q^{πθ}(s, a) is replaced by a suitable estimator. E.g., the REINFORCE algorithm [50] uses a Monte-Carlo estimator; actor-critic algorithms add approximation in value space with an additional function approximator Qw(s, a) with parameters w in place of Q^{πθ}(s, a) [51]; and the famous Advantage-Actor-Critic (A2C) algorithm [52] uses an approximation Aw(s, a) of the advantage function A^{πθ}(s, a) := Q^{πθ}(s, a) − V^{πθ}(s), where V^{πθ}(s) := E_{a∼πθ}[Q^{πθ}(s, a)]. These algorithms have been used for various scheduling and resource allocation tasks in data centers [53], wireless networks [54], edge computing [55] or vehicular networks [56].
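As an illustration, a sketch of one REINFORCE update with a softmax policy follows. The policy network, the optimizer, and the collection of episode data (states, actions, and Monte-Carlo returns) are assumed to exist; this is one simple variant, not the only formulation:

```python
# Sketch: one REINFORCE policy gradient step for a finite action space,
# with a softmax policy pi_theta implemented in PyTorch.
import torch

def reinforce_update(policy_net, optimizer, states, actions, returns):
    """Gradient step on -E[log pi_theta(a|s) * R] (Monte-Carlo estimator)."""
    logits = policy_net(states)                        # shape (T, |A|)
    log_probs = torch.log_softmax(logits, dim=-1)
    taken = log_probs.gather(1, actions.unsqueeze(1)).squeeze(1)
    loss = -(taken * returns).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```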
3) EXPLORATION AND CURIOSITY IN RL
RL methods trade off exploration vs. exploitation during training, i.e., agents either explore some random action or exploit their current best guess of the optimal action for the current state. On the other hand, some methods purely focus on exploration as a metric for learning. Such methods may seek to explore as many unseen states during training as possible. Another approach is to explore promising states, e.g., those parts of the state space where the current approximation of a certain value function is particularly bad. Such methods are known as curiosity-driven RL [57], [58].

4) MULTI-AGENT RL
Multi-Agent RL (MARL) problems are formulated as Markov games [59], where depending on the local reward structure, the agents may cooperate or compete. An illustration is given in Figure 4 from the perspective of some agent i in a Markov game environment.
Most notably, the environment typically transitions to a new state as a function of all local actions and all local states. There are several additional properties to classify MARL settings, such as whether the setting is decentralized, or whether or to what extent agents transition independently [60]. We will discuss the two most common deep MARL algorithms in Section III-C. For a survey of MARL algorithms and various applications and challenges in computer networks, see [61] and [62].

FIGURE 4. Markov game feedback loop over discrete time n.

5) RL WITH CONSTRAINTS
Constrained RL (CRL) [63] is a paradigm for constrained MDPs. The goal is to ensure that the agent's actions do not violate any environmental constraints. A set of constraints can be specified as hard (absolute and must always be satisfied) or soft (desired but can be violated if necessary) constraints. Safe RL (SRL) [64], on the other hand, aims to learn policies that minimize the likelihood of unsafe actions while maximizing the long-term expected reward for safe actions. Accordingly, CRL and SRL both focus on ensuring that the agent's actions do not violate certain constraints, but the formulation of constraints takes slightly different approaches. Both approaches are used when resources (such as bandwidth, computation, and energy) are limited [65], [66], [67] or when some applications may impose additional constraints (such as throughput and latency) [68], [69].

III. DEEP LEARNING—THE COOL KID OF ML
Deep Learning is a subfield of ML that aims at facilitating the learning of complex data representations by learning hierarchies of simpler intermediate representations [5], [70]. The resulting "stacking" of model blocks (predominantly Neural Network (NN) layers) is what gives Deep Learning its name. While the term Deep Learning has been around for decades, it only started to gain widespread traction in 2012 with the widely visible success of AlexNet [71], which won a widely popular image classification challenge with a deep Convolutional Neural Network (CNN) (see Section III-B). Since then, the rate of progress concerning deep NN architectures, paradigms and learning techniques has skyrocketed, and the development of specialized hardware such as high-end GPUs or TPUs has led to deep learning models with millions or even billions of parameters [72]. As a consequence, a growing proportion of ML applications is now built on deep learning models.

A. NEURAL NETWORKS
Given its central role in almost any cognitive process, neuroscientists have long tried to understand the inner workings and mechanisms of the brain. In [73], a mathematical model for a neuron was introduced that has since inspired an emerging class of ML model architectures: Artificial NNs [5]. In NNs, a neuron j receives inputs ai from nodes i = 1, . . . , n and a bias input a0 = 1, and first computes a weighted sum using link weights wij: a′j = Σ_{i=0}^{n} wij ai. Then, it computes the output (also called activation) oj = g(a′j) using an activation function g [74]. If multiple such neurons (mostly called perceptrons in the ML community) are connected in a directed and acyclic manner, they form a so-called feed-forward network that is usually arranged in layers. In such layered feed-forward networks, each neuron receives the outputs of the neurons of the previous layer, with the first layer receiving the overall model input and the last layer providing the overall model output. The resulting compound function expressed with the network can be highly complex; in fact, it is shown in [75] that with as little as one intermediate neuron layer and by choosing any squashing activation function (e.g., a non-decreasing function converging towards 0 and 1 on its respective ends), NNs can theoretically approximate any continuous function uniformly on any compact set to an arbitrary degree of accuracy. This statement is also known as the Universal Approximation Theorem (UAT).

The UAT becomes even more interesting once we view the entire NN as a function hw(x) of the input vector x and the NN weights w [5]. The UAT implies that there exists a weight parameter configuration that sufficiently approximates the function which describes the desired solution to a problem. Hence, many learning problems can be viewed as a problem of function approximation to find the right NN weights. The most common technique to update the NN weights is gradient descent in combination with the so-called backpropagation algorithm [76]. For example, to update a NN using an input-output tuple (x, y), backpropagation calculates the derivative ∂/∂w |y − hw(x)|² of the output error with respect to the NN weights. This calculation is done sequentially starting from the output layer by applying the chain rule to the above derivative. The calculated gradients are then used to update the NN weights iteratively. Various algorithms have been proposed throughout the last decade to use these gradients most effectively. The most well-known algorithm is the ADAM optimizer [77], which adaptively selects the stepsize for individual NN weights based on the calculated gradient information. See [78] and the references therein for various other gradient-based methods. Note that most tools (such as PyTorch and TensorFlow) offer these optimizers as black boxes without having to deal with the implementation details.
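As a minimal illustration of this training mechanism, the following sketch performs one backpropagation/ADAM update of a small NN on a single input-output tuple (x, y) using PyTorch; the layer sizes, learning rate, and random data are illustrative assumptions:

```python
# Sketch: one gradient descent step via backpropagation and ADAM,
# mirroring the derivative d/dw |y - h_w(x)|^2 described above.
import torch
import torch.nn as nn

h_w = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 1))
optimizer = torch.optim.Adam(h_w.parameters(), lr=1e-3)  # ADAM optimizer [77]

x, y = torch.randn(1, 8), torch.randn(1, 1)  # one input-output tuple
loss = ((y - h_w(x)) ** 2).mean()            # squared output error
optimizer.zero_grad()
loss.backward()    # backpropagation: chain rule from the output layer back
optimizer.step()   # adaptive, gradient-based weight update
```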
B. DEEP NEURAL NETWORK ARCHITECTURES
The UAT seems to advocate that rather simple feed-forward NN architectures can be used for any problem that might be solvable with ML. In practice, however, the findings of the UAT are greatly humbled by the excessive amounts of training data, the size of the NN models, and the training time necessary to achieve satisfactory results on a complex task. Furthermore, for many tasks, it can be observed that the members of the underlying data domain are semantically composable into simpler entities, spanning a hierarchy of concepts. As a consequence, researchers have started to add more structure to their models.

Different model architectures have proven effective for different tasks. An overview of common deep learning model architectures is given in Table 4. In the following subsections, we briefly present the most popular model archetypes and refer to the provided references for further reading. Interestingly, all of the model archetypes introduced below are derivable from the same basic mathematical framework and only differ in the shape of the data and the assumptions made about regularities in the data [79].

1) MULTILAYER PERCEPTRONS (MLP)
Standard Deep Neural Networks (DNNs) consisting of multiple hidden layers of neurons are also called Multilayer Perceptrons (MLPs). Together with non-linear activation functions, they have long become a standard tool for processing vector-shaped inputs, as their hierarchy of non-linear function approximators is widely applicable across many problem domains [80]. However, given that MLPs make rather few assumptions about the input and output data besides them being shaped as vectors, for many tasks these models often perform unfavorably compared to more specialized models of similar size. A detailed introduction to MLPs is given in [70]. MLPs have already been used in computer and wireless networks, e.g., for channel decoding [81], in resource allocation [82], and in intrusion detection [83].

2) CONVOLUTIONAL NEURAL NETWORK (CNN)
In many problem domains, data exists on a grid-like structure where spatial patterns carry the same semantic information regardless of their location in the grid (also referred to as translation invariance). Examples include images (2D grids) but also time-series data (1D grids). To exploit this symmetry, Convolutional Neural Networks (CNNs) utilize spatial convolution, which applies the same learnable spatial parametric kernels (i.e., small matrices with learnable individual entries) on evenly spaced patches of the input grid [70]. The re-usage of a set of such kernels across multiple image positions is called weight sharing and greatly reduces the number of parameters needed to learn and extract the patterns of the input data.

3) RECURRENT NEURAL NETWORK (RNN)
For dealing with sequential data such as time series, Recurrent Neural Network (RNN) elements such as the Long Short-Term Memory (LSTM) [84] or the Gated Recurrent Unit (GRU) [85] have proven very useful. The commonality between all RNNs is feeding a portion of the output back into the RNN block for subsequent computations, enabling NN architectures with recurrent elements to capture sequential dependencies within the data [70].
4) GRAPH NEURAL NETWORK (GNN)
Recently, Graph Neural Networks (GNNs) have emerged as powerful architectures for handling graph-structured data. They utilize permutation-invariant aggregation/pooling operations and permutation-equivariant message passing operations to learn patterns in the data while respecting the graph topology rather than assuming any specific ordering of its nodes and edges [86].

5) GENERATIVE ADVERSARIAL NETWORK (GAN)
Generative Adversarial Networks (GANs) [87] have emerged as a powerful tool for generating realistic data samples, including images [88], videos [89], and audio [90], but also network traffic [91]. GANs consist of two NNs: a generator that creates synthetic samples, and a discriminator that tries to distinguish between real and fake samples. These two networks are trained simultaneously in an adversarial setting, where the generator tries to fool the discriminator, while the discriminator tries to correctly identify the real samples. In the context of computer networks, GANs are useful for generating synthetic network traffic patterns that mimic real-world traffic. This is useful for testing and evaluating network performance metrics, intrusion detection systems, and network security protocols. See Section IV-B3 for example applications of GANs being used to generate data.

6) TRANSFORMERS
In many ML domains with complex long-range dependencies within data points, the attention mechanism [92] and its implementation in the Transformer architecture [93] have proven to be extremely powerful. Works like [94] and [95] show that Transformers can outperform both CNNs and RNNs in problems with spatial and temporal data, as each token/component of the input can relate to any other component. While we refer to [93] for a detailed explanation of the attention mechanism, it is worth noting that transformers are a special case of GNNs operating on a fully-connected computation graph [96]. This implies that for large inputs, using transformers is computationally intensive.

7) LARGE LANGUAGE MODEL (LLM)
Large Language Models (LLMs) have recently gained significant attention in the field of Natural Language Processing (NLP). These models are trained on vast amounts
of text data and can generate human-like text based on their input. One popular type of LLM is the generative pre-trained transformer (GPT), which uses a transformer-based architecture and is pre-trained on large amounts of text data using a self-supervised learning approach. During pre-training, the model learns to predict the next word in a sentence, which enables it to generate coherent and contextually relevant text. GPTs can be fine-tuned on specific NLP tasks, such as text classification, summarization, and translation, by adding a task-specific output layer and training on a smaller dataset. Unlike traditional NLP models that rely on hand-crafted features, LLMs learn to represent the meaning of words and phrases in a continuous vector space, enabling them to perform a wide range of NLP tasks. In the context of computer networks, LLMs and GPTs have been used, for example, to generate synthetic network traffic [97], to explain decisions in intrusion and anomaly detection systems [98], [99], and for managing networks [100]. For an overview of applications, techniques, and challenges, we refer to [101].

8) GENERATIVE AI (GENAI)
Generative AI (GenAI) is a broader concept that can apply to any type of data [102]. It uses ML models, such as GPTs, GANs, and/or others, to learn the patterns and structure of the given training data, and can then be used to generate realistic and novel outputs that are similar but not identical to the data. Additionally, and closely related, GenAI can be used with retrieval-augmented generation (RAG) [103] to automate collecting information from the network, analyzing it, and pushing new configurations if necessary [104], [105], [106]. This removes the pain of learning new documentation or writing new scripts, and simplifies the user interaction. Recent GenAI models have shown impressive and/or human-like capabilities in an unprecedented range of downstream tasks. As a consequence, several industrial networking companies have started to develop or adjust commercial products that leverage GenAI, e.g. to generate threat intelligence reports [107], security policies, and incident response plans [104], and to proactively identify and fix network issues [105].

C. DEEP REINFORCEMENT LEARNING (DRL)
Deep Reinforcement Learning (DRL) refers to the use of DNNs as function approximators for RL algorithms. The general idea of RL with function approximation has been briefly described in Section II-D2. With the advent of deep learning libraries such as Keras and TensorFlow (see Section IV), as well as standardized APIs such as Gymnasium (formerly known as OpenAI Gym), training of DRL algorithms has become very accessible. However, the success of RL with DNNs⁴ relies on some key techniques. This subsection focuses on the most important DRL algorithms and the tools and techniques to train DRL models.

⁴ Historically, it was known that training RL with a NN could potentially lead to systematic overestimation of utility values (such as the Q-function) and thus to failed learning [108].

First, we will explain two key techniques for DRL based on Deep Q-Learning, also known as Deep Q-Networks (DQN) [109]. DQN is a DRL algorithm for MDPs with finite action space. DQN seeks to approximate the optimal Q-function by a DNN Qθ(s, a) with parameters θ. Specifically, a DQN takes a state s as input and outputs Qθ(s, a) for every action a of the finite number of actions. The key techniques introduced for DQN are an experience replay buffer and a so-called target network. During training, DQN interacts with its environment, generating data tuples (s, a, r, s′). These data tuples are stored in an experience replay buffer. During training, DQN samples a mini-batch from this memory and applies a stochastic gradient descent step on the average squared Bellman error of the samples from the mini-batch. This rather simple technique reduces the bias of Q-Learning towards its recent interaction with the environment and thereby helps to stabilize training. In NN terminology, the right-hand side of the Bellman loss, i.e., r + γ max_{a′} Qθ(s′, a′), is the training target for Qθ(s, a) given the data tuple (s, a, r, s′). In other words, the DQN itself is used to compute its training targets. The idea behind target networks is to use a separate target network Qθ′(s, a) to compute the aforementioned training targets. The target parameters θ′ are then chosen to slowly track the actual training parameters. With this, target networks provide more stable training targets, which has been shown to generally improve DRL training, see [109] and [110]. However, more recent theoretical and numerical studies suggest that gradient clipping is superior to the use of target networks [111].
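The following sketch illustrates both techniques in PyTorch: sampling a mini-batch from the replay buffer and computing Bellman targets with a slowly tracked target network. The networks, optimizer, and buffer contents are assumptions of the sketch, and the Polyak-averaging target update shown here is one common variant:

```python
# Sketch: one DQN training step with experience replay and a target network.
import random
import torch

def dqn_step(q_net, target_net, optimizer, replay_buffer,
             batch_size=32, gamma=0.99, tau=0.005):
    # Replay buffer assumed to be a list of (s, a, r, s_next, done) tuples.
    s, a, r, s_next, done = zip(*random.sample(replay_buffer, batch_size))
    s, s_next = torch.stack(s), torch.stack(s_next)
    a, r, done = torch.tensor(a), torch.tensor(r), torch.tensor(done)

    with torch.no_grad():  # target: r + gamma * max_a' Q_theta'(s', a')
        target = r + gamma * target_net(s_next).max(dim=1).values * (~done)
    q_sa = q_net(s).gather(1, a.unsqueeze(1)).squeeze(1)
    loss = torch.nn.functional.mse_loss(q_sa, target)  # squared Bellman error
    optimizer.zero_grad(); loss.backward(); optimizer.step()

    # Target parameters slowly track the training parameters (Polyak averaging)
    for p, p_t in zip(q_net.parameters(), target_net.parameters()):
        p_t.data.mul_(1 - tau).add_(tau * p.data)
```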
DQN is also an integral component of the Deep Deterministic Policy Gradient (DDPG) algorithm [110], which is one of the most well-known actor-critic algorithms for continuous action spaces. In DDPG, a critic is trained using the DQN algorithm, while a deterministic policy is trained to maximize the approximated Q-function. DQN and DDPG, in turn, are the basis for the two common deep MARL algorithms Independent Deep Q-Learning [112] and Multi-Agent DDPG (MADDPG) [113]. However, only DDPG has a truly distributed version that can be run with nearly arbitrary communication delays over a communication network. This is known as the Distributed DDPG (3DPG) algorithm [114].

Another important technique for successful DRL training was proposed as part of the deep actor-critic algorithm Asynchronous-Advantage-Actor-Critic (A3C) [52]. The asynchronous part refers to using several agents in parallel simulated environments to improve and speed up DRL training. In other words, the training progress of several agents on the same problem is combined to enhance the training performance. This is especially important for complex tasks since multiple parallel processors can significantly reduce the overall training time.
The success of DRL has been demonstrated across various sub-areas in computer networks like management of satellite-terrestrial networks [115], multi-objective service coordination [116], scheduling for large-scale networked control systems [117], acoustic sensor networks [118], adaptability of wireless sensor networks [119] and other applications in communications and networking [120].

1) GENERAL ADVICE FOR TRAINING DRL AGENTS
Training a DRL agent to successfully solve a given problem can be a challenging task. In this section, we provide some general advice from our experience in the hope of easing this task.
1) It is good practice to normalize the states and actions, e.g., to [−1, 1]^d, d ∈ N. Linear scaling always makes this possible when the state space S is bounded in real dimensional space. When S is unbounded, let's say R^d, one needs to use, e.g., a scaled version of the hyperbolic tangent or the inverse stereographic projection. Such nonlinear transformations, however, change the environment, and the resulting policies may perform poorly in the actual environment if the normalization is not chosen carefully. Ideally, one should aim at linear scaling throughout the state space's "expected" dominant part. As the action space is typically bounded, action normalization is less problematic. A small sketch of both scaling variants follows after this list.
2) Reward normalization should be used even more carefully than state normalization. In general, changing the reward changes the perception of an agent about the environment and results in different learned policies.
3) The design of the reward signal is an integral part of the design of an MDP. One has to craft a reward function that incentivizes the desired behavior to get an algorithm to learn the desired goal. Some additional comments in no particular order: Make it easy for an agent to distinguish good from bad scenarios; continuous or dense rewards typically make it easier for algorithms to learn; if possible, avoid sparse rewards and instead shape the rewards to give gradual feedback; strictly positive rewards incentivize agents to avoid terminal states; strictly negative rewards incentivize agents to reach terminal states.
4) Training DRL models with dropout should be avoided. Dropout is a regularization technique that was introduced in [121] to train NN models with less overfitting while improving generalization. However, it leads to increased training variance, which is generally undesirable for DRL training.
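The sketch below shows the linear scaling recommended in item 1, plus a tanh-based squashing for unbounded state spaces; the bounds low and high (and the tanh scale) are environment-specific assumptions:

```python
# Sketch: state normalization helpers for DRL training.
import numpy as np

def normalize_state(s, low, high):
    """Linear scaling of a bounded state vector to [-1, 1]^d."""
    return 2.0 * (np.asarray(s) - low) / (high - low) - 1.0

def squash_unbounded(s, scale=1.0):
    """Scaled tanh for unbounded state spaces (note: this changes the
    environment, so the scale must be chosen carefully)."""
    return np.tanh(np.asarray(s) / scale)
```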
2) ALGORITHM CATEGORIZATION
The sheer number of available DRL algorithms can be overwhelming for newcomers to the field, making it challenging to find appropriate algorithms for a given problem. To ease the algorithm selection, we provide categorizations of widely used single-agent and multi-agent DRL algorithms in Figure 5 and Figure 6, respectively. We note that the tree structures are simplified. For example, the model-based algorithms in Figure 5 can further be classified into value-based, policy-based, actor-critic and on/off-policy algorithms. Furthermore, only selected and widely used algorithms are shown. These categorizations should serve as starting points. The final algorithm selection for a specific problem should also consider additional factors such as sampling efficiency, algorithm stability and exploration strategy.

Single-Agent DRL Algorithm Categorization: Single-agent DRL algorithms can be coarsely categorized by their supported action space (discrete/continuous), whether they are model-based or model-free, and whether they are value-based, policy-based or a combination of both, called actor-critic. Considering the tree structure in Figure 5, it can be seen that some algorithms (e.g., A2C/A3C, SAC, PPO) can be used for both discrete and continuous action spaces, while others, such as DQN and DDPG, are only compatible with one of them.

MARL algorithms can generally be categorized based on the same factors as single-agent DRL algorithms. However, additional multi-agent factors can be included. These are mainly centralized/decentralized learning and cooperative/independent learning. To preserve clarity, some of the traditional single-agent factors have been omitted in Figure 6.
IV. DATASETS, TOOLS, AND FRAMEWORKS
Now that we have discussed what ML is and what its potential applications are, we introduce here the most popular datasets in the field of networks, as well as emulators and simulators that can be used to run ML experiments. Since ML model parameters are learned from data, the datasets used are crucial in accomplishing the intended task, such as network latency prediction or decision-making for traffic routes.

Additionally, ML models need to be tested before being applied in a productive environment. Thus, well-known network tools and frameworks can aid in prototyping, tracking, and evaluating these models.

A. DATASETS
Datasets are usually not plug-and-play and require preprocessing. The type of preprocessing required depends on the specific problem being addressed and the type of data being used. In general, preprocessing includes the following steps:
• Data cleaning: This involves removing any missing, inconsistent, or irrelevant data to ensure the quality of the data being used for training.
• Data normalization: This involves transforming the data onto a common scale, such as normalizing the values between 0 and 1, to ensure that no variable has an undue influence on the model.
• Data selection: This involves selecting the relevant features or variables from the dataset that are most important for the problem at hand. This step is important to reduce the dimensionality of the data and to improve the performance of the model; by removing irrelevant or redundant features, it can help to speed up the training process and reduce the computational resources required for analysis.
• Data transformation: This involves transforming the data into a format suitable for the ML algorithm being used, such as converting categorical variables into numerical values using one-hot encoding; this generates a vector whose length corresponds to the number of categories in the dataset, where data points belonging to the category are assigned 1, otherwise 0.
• Data splitting: This involves splitting the dataset into a training set to train the model, a validation set to evaluate its performance during training, and a test set to evaluate the model's performance after training.
It is important to note that the specific preprocessing steps required may vary depending on the dataset, the problem being addressed, and the type of ML model used. The preprocessing steps should be chosen carefully to ensure that the data is suitable for training and that the model can accurately represent the underlying relationships in the data. In the following, we present the most popular network domain datasets in the literature for different applications.
California, San Diego, and the University of Kansas. The Abi-
1) MOBILE NETWORK THROUGHPUT DATASETS lene dataset provides information about the communication
A common problem in networking research is replicat- patterns of a backbone network that connects several research
ing realistic network conditions, especially throughputs. institutions in the United States. It includes information
Dynamic Adaptive Streaming over HTTP (DASH) is about the network topologies, routing algorithms, and traffic
one such exemplary research area. Depending on the patterns of the network. It contains information about the
mobile network, different datasets containing traces of routes taken by packets, the number of packets sent and
real-world measurements have been created in order to received, and the size of the packets. The dataset is commonly
allow for a better comparison between different research used for research in the areas of network routing, network
approaches. management, and network performance evaluation.
For 3G mobile networks, the dataset by Riiser et al. [122] is The Global Environment for Network Innovations (GENI)
widely used [123]. It contains 86 traces from measurements dataset [132] provides network traces from real-world
conducted on commute paths in Oslo, Norway, using six deployments. GENI is a large-scale research infrastructure
different mobility patterns (cf. Table 5). Besides the download that provides a platform for conducting experiments and
throughput, it also contains the GPS latitude and longitude evaluating new network technologies and protocols. The
coordinates of the measurement device. dataset includes network traces that were collected from a
For 4G mobile networks, the dataset by Van Der Hooft variety of testbeds and networks, including campus networks,
et al. [124] (we call it 4G_a in this paper) is commonly data centers, and wide-area networks. It includes information
used [125]. It contains traces of 40 measurements with about the network topology, routing algorithms, and traffic
different mobility patterns (cf. Table 5) conducted in Ghent, patterns of the network. It also contains information about
Belgium. It is similar to the 3G dataset by containing the the routes taken by packets, the number of packets sent and
download throughput, as well as the GPS coordinates of the received, and the size of the packets. The dataset is commonly
measurement device. used for research in the areas of network routing, network
Another widely used [126] dataset for 4G networks was management, and network performance evaluation.
created by Raca et al. [127] (we call it 4G_b in this The Cooperative Association for Internet Data Analysis
paper). A total of 135 measurements were conducted in (CAIDA) anonymized internet traces dataset is a collection
Ireland. In comparison to the 4G_a dataset, this one is larger, of network traces that were collected by the CAIDA
also contains different mobility patterns (cf. Table 5), and project [133]. CAIDA is a non-profit research organization
contains significantly more metrics, such as the download that collects and analyzes data about the internet to gain
and upload throughput, additional channel-related metrics, insights into its structure and behavior. It includes data from a
context-related metrics, and cell-related metrics. variety of sources, including routers, switches, and end hosts.
In Farthofer et al. [128] an LTE dataset for the use of ML The data includes information about the network topology,
is described. The dataset is measured on an Austrian highway routing algorithms, and network traffic patterns. Additionally,
and contains over 2000 measurement points per month over it contains information about the routes taken by packets,
a time period of two years. Additionally, there are different
signal parameters measured in the dataset like SINR, RSSI, 5 https://www.crawdad.org/
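To illustrate how such traces are typically consumed, the following is a minimal sketch that loads a throughput trace and replays it as a step-wise bandwidth schedule, e.g., to throttle a simulated link. The file name and column names are illustrative assumptions, since the exact format differs between the datasets above.

```python
import csv

def load_trace(path):
    """Load a mobile throughput trace; assumes columns 'timestamp' (s) and 'throughput' (bit/s)."""
    trace = []
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            trace.append((float(row["timestamp"]), float(row["throughput"])))
    return trace

def bandwidth_at(trace, t):
    """Return the most recent throughput sample at simulation time t (step-wise replay)."""
    current = trace[0][1]
    for ts, bw in trace:
        if ts > t:
            break
        current = bw
    return current

trace = load_trace("4g_trace.csv")  # hypothetical file derived from one of the datasets above
print(bandwidth_at(trace, 12.0))
```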
2) ROUTING DATASETS
As routing is an important part of networking, having a real-world dataset for it can be beneficial for training ML models, evaluating their performance, and testing their robustness in the case of network failures and other real-world issues. In the following, we discuss some of these datasets.

The Abilene dataset [131] is a real-world network trace that captures the communication patterns of a backbone network. It was collected by the Abilene project, a collaboration between researchers from the University of California, San Diego, and the University of Kansas. The Abilene dataset provides information about the communication patterns of a backbone network that connects several research institutions in the United States. It includes information about the network topologies, routing algorithms, and traffic patterns of the network, such as the routes taken by packets, the number of packets sent and received, and the size of the packets. The dataset is commonly used for research in the areas of network routing, network management, and network performance evaluation.

The Global Environment for Network Innovations (GENI) dataset [132] provides network traces from real-world deployments. GENI is a large-scale research infrastructure that provides a platform for conducting experiments and evaluating new network technologies and protocols. The dataset includes network traces collected from a variety of testbeds and networks, including campus networks, data centers, and wide-area networks. It covers the network topology, routing algorithms, and traffic patterns of the network, including the routes taken by packets, the number of packets sent and received, and the size of the packets. The dataset is commonly used for research in the areas of network routing, network management, and network performance evaluation.

The Cooperative Association for Internet Data Analysis (CAIDA) anonymized internet traces dataset is a collection of network traces collected by the CAIDA project [133]. CAIDA is a non-profit research organization that collects and analyzes data about the internet to gain insights into its structure and behavior. It includes data from a variety of sources, including routers, switches, and end hosts: information about the network topology, routing algorithms, and network traffic patterns, as well as the routes taken by packets, the number of packets sent and received, and the size of the packets. The data is collected using active and passive measurements:

Active measurement involves actively sending test packets or requests to a network and then analyzing the resulting responses to gain insight into the network's behavior and performance. Examples of active measurement techniques include pinging, tracerouting, and bandwidth testing (see the sketch at the end of this subsection). Active measurements are often more accurate and provide more detailed information about the network, but they can also introduce more overhead into the network and disrupt normal network traffic.

Passive measurement involves observing network traffic without actively generating any test traffic. Passive measurements are typically less disruptive to the network and do not introduce any overhead, but they provide a more limited view of the network's behavior and performance. Examples of passive measurement techniques include network traffic analysis, packet capture and analysis, and log file analysis.

The RocketFuel dataset [134] is a collection of network topology data that was collected by the RocketFuel project, a research effort aimed at studying the structure and behavior of the Internet at the level of individual routers and links. The project collected data from several large Internet Service Providers (ISPs) and used it to create a high-resolution map of the Internet. It includes information about the network topology and the paths that packets take through the network, as well as the capacities of the links between routers and the location and characteristics of the routers themselves.

The Internet Topology Zoo [135] is a collection of network topology datasets that provide information about the physical structure of different networks. The datasets in the Internet Topology Zoo come from a variety of sources, including measurements of the internet, testbeds, and simulations. Its datasets provide information about the connections between nodes in a network, the capacities of the links between nodes, and the characteristics of the nodes themselves, such as their locations and capabilities. One of the main strengths of the Internet Topology Zoo is its comprehensive coverage of different types of networks, including wide-area networks, data centers, and other large-scale networks. This makes it a valuable resource for researchers and practitioners working in the field of network routing and network management, as it provides a diverse set of datasets for evaluating and comparing different algorithms and technologies.

To sum up, the GENI and Abilene datasets primarily focus on network infrastructure, providing researchers access to national research networks. Conversely, CAIDA and RocketFuel are designed to facilitate the measurement and analysis of network traffic and topology. The Internet Topology Zoo, meanwhile, is a collection of publicly available network topologies that researchers can use for various purposes. Thus, the size of the network varies depending on the scope and focus of the dataset. The GENI and Abilene datasets tend to cover larger networks compared to CAIDA and RocketFuel, which prioritize measurement and analysis tools [136]. The CAIDA and RocketFuel datasets use passive measurements, while the GENI and Abilene datasets use both active and passive measurements. The Internet Topology Zoo is a collection of network topologies and does not involve any measurements. A further comparison is shown in Table 6.
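As a concrete illustration of active measurement, the following is a minimal sketch that estimates the round-trip time to a host by timing a TCP connection attempt. The host and port are illustrative assumptions; a production tool would rather use ICMP echo (ping) or dedicated bandwidth probes.

```python
import socket
import time

def tcp_rtt(host: str, port: int = 443, timeout: float = 2.0) -> float:
    """Estimate the round-trip time (in ms) by timing a TCP handshake."""
    start = time.perf_counter()
    with socket.create_connection((host, port), timeout=timeout):
        pass  # connection established; handshake complete
    return (time.perf_counter() - start) * 1000.0

# Probe a host several times and report the median, since single probes are noisy.
samples = sorted(tcp_rtt("example.org") for _ in range(5))
print(f"median RTT: {samples[len(samples) // 2]:.1f} ms")
```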
3) DYNAMIC ADAPTIVE STREAMING OVER HTTP (DASH)
Video streaming via Dynamic Adaptive Streaming over HTTP (DASH) is a large research area in networking. Recent rate adaptation algorithms often aim to optimize the user's Quality of Experience (QoE) under the given network conditions, such as a constrained bandwidth [137]. While these algorithms were initially conventional heuristics, DRL-based approaches have recently shown excellent performance and are now considered state-of-the-art [138]. In order to benchmark different solutions, publicly available DASH datasets are often used [138]. Typically, the datasets contain videos that are encoded under a controlled set of parameters, e.g., resolution and bitrate, and split into segments of certain lengths. The solutions are commonly evaluated via simulations where the videos from the DASH datasets are streamed over simulated networks [139]. Realistic network conditions, especially the download bandwidth, are commonly simulated using the network traces from the datasets presented in Section IV-A1 [138]. In the following, we present four commonly used DASH video datasets. Table 7 provides an overview of their most important properties.

The DASH dataset [140] is an old (2012) but still widely used dataset, e.g., to test new QoE schemes [139]. It contains 6 videos of different genres, split into segments ranging from 1 to 15 seconds in length.

The Distributed DASH (D-DASH) dataset [141] was published in 2013 and is intended to be used in real-world testbeds. It contains one video that is distributed on servers in Klagenfurt, Paris, and Prague. This enables a client to choose the requested location for each segment individually.

The ultra high definition HEVC DASH dataset [142] was published in 2014 and includes one video. In contrast to the DASH and D-DASH datasets, the video is encoded with the newer and more efficient H.265 (HEVC) video codec. Furthermore, it is encoded in UHD resolution, at 30 and 60 Frames per Second (FPS), and at 8 and 10 bits.

The multi-codec DASH dataset [143] is a rather new dataset from 2018. It consists of 10 videos that are encoded with four different video codecs: H.264, H.265, VP9, and AV1. In addition, three different video FPS are included: 24, 30, and 60.

4) MOBILITY AND AUTONOMOUS VEHICLES
In the context of mobility or autonomous driving using a wireless network infrastructure (be it cellular or V2X), most of the studies in the literature discussing solutions and their results do not make the datasets publicly available for scrutiny by third parties. As such, the results are difficult to verify and validate properly. Nevertheless, many studies rely on simulated datasets. An open-source traffic simulation software called Simulation of Urban Mobility (SUMO)6 provides datasets for simulating realistic urban traffic scenarios. Among the datasets are road networks, traffic demand patterns, and vehicle behavior models, which can be customized for different traffic scenarios and urban environments [144] (see the sketch at the end of this subsection).

Although SUMO is popular among researchers and practitioners in the industry, another software called Multi-Agent Transport Simulation (MATSim)7 is often used in academic research. While SUMO focuses on macroscopic traffic flow modeling, MATSim uses an agent-based approach to model individual travel behavior [145]. As a result, MATSim can capture more complex individual decision processes, while SUMO is better suited for overall traffic flow modeling. Another open-source software is CityFlow,8 which includes a range of features that are not available in SUMO, such as real-time simulation and the ability to model pedestrian and bicycle traffic [146].

There exist other, yet commercial, alternatives that also provide mobility datasets, such as Aimsun,9 Vissim,10 and TransModeler.11 We present in Table 8 an overview of these datasets, while a comparison of the use cases for some of these datasets is shown in [147].

6 https://sumo.dlr.de/docs/Data/Scenarios.html
7 https://www.matsim.org/open-scenario-data
8 https://github.com/cybercore-co-ltd/track2_aicity_2021
9 https://www.aimsun.com/
10 https://www.ptvgroup.com/
11 https://www.caliper.com/transmodeler/default.htm
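The following is a minimal sketch of driving a SUMO scenario from Python via its TraCI interface to extract per-vehicle mobility data, e.g., as input for ML models. The configuration file name is an illustrative assumption.

```python
import traci  # shipped with SUMO (the sumolib/traci Python packages)

# Start SUMO headless with a scenario configuration (hypothetical file name).
traci.start(["sumo", "-c", "urban_scenario.sumocfg"])

positions = []
for _ in range(3600):  # simulate one hour at 1-s steps
    traci.simulationStep()
    for veh_id in traci.vehicle.getIDList():
        x, y = traci.vehicle.getPosition(veh_id)
        positions.append((traci.simulation.getTime(), veh_id, x, y,
                          traci.vehicle.getSpeed(veh_id)))

traci.close()
print(f"collected {len(positions)} mobility samples")
```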
5) (ENCRYPTED) NETWORK TRAFFIC ANALYTICS
Another common task for ML in networking is network traffic analytics. This includes the task of traffic/service classification, i.e., identifying an active service or traffic type in the network. Examples of such a task are distinguishing between video and web traffic, between services like YouTube and Netflix, or even between different Android apps. Due to pervasive encryption, for example with TLS on the application and transport layers, protocols like HTTPS, DNS over TLS (DoT), and QUIC [154] do not yield sufficient unencrypted data to reliably identify services and traffic types. Instead, new techniques have to be developed that make use of the available unencrypted data. For encrypted network traffic analytics, a common approach is therefore to extract packet sizes, directions, and inter-arrival times, as well as potential additional information like port numbers, to build features. These features describe the network traffic of a specific service or traffic type [155], [156], [157], [158]. These features are then fed to ML models to learn specific patterns exhibited by different traffic types or services (see the sketch below). Beyond traffic classification, this type of analytics is often used for security-related tasks like intrusion detection or fingerprinting of websites, browsers, devices, and operating systems, and for estimating the QoE of services [159], [160].
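A minimal sketch of this feature-based approach is shown below. The statistical features and the random forest classifier are common choices but not mandated by the works cited above, and the synthetic flows merely stand in for flows parsed from real packet captures.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

def flow_features(pkt_sizes, pkt_dirs, iats):
    """Summarize one flow: packet sizes (bytes), directions (+1 up / -1 down), inter-arrival times (s)."""
    sizes = np.asarray(pkt_sizes, dtype=float)
    iats = np.asarray(iats, dtype=float)
    return [sizes.mean(), sizes.std(), sizes.max(),
            iats.mean(), iats.std(),
            float(np.mean(np.asarray(pkt_dirs) > 0)),  # fraction of upstream packets
            float(len(sizes))]                          # packets per flow

# Stand-in for flows parsed from a capture: video-like flows have larger
# downstream packets and shorter inter-arrival times than web-like flows.
def synthetic_flow(video: bool):
    n = rng.integers(20, 60)
    sizes = rng.normal(1200 if video else 600, 150, n).clip(60, 1500)
    dirs = rng.choice([1, -1], n, p=[0.2, 0.8] if video else [0.45, 0.55])
    iats = rng.exponential(0.01 if video else 0.05, n)
    return sizes, dirs, iats

labels = rng.integers(0, 2, 400)  # 0 = web, 1 = video
X = np.array([flow_features(*synthetic_flow(bool(c))) for c in labels])

X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.2, stratify=labels, random_state=0)
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_tr, y_tr)
print("held-out accuracy:", clf.score(X_te, y_te))
```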
Due to the prevalence of those topics, there also exists a variety of datasets for the different network traffic analytics tasks.

An overview of the topic of traffic classification, along with a list of existing works (and solutions) and datasets, is provided in [161]. However, many of these datasets are quite old and, thus, outdated. A dataset for encrypted network traffic classification of YouTube and Netflix is provided in [162]. Here, the authors collected three classes of flows, namely web flows, YouTube flows, and Netflix flows, for the most popular websites and videos, while using different end devices, browsers, and operating systems. A new dataset for app traffic classification is Mirage [163]. This dataset was generated using three different mobile devices, which were used by real experimenters (students) once or twice a day. Overall, each experimenter generated 12 captures with a duration of 5 to 10 minutes. Experimenters were hereby instructed to use the app as they would usually do in their day-to-day life. The resulting dataset consists of 40 Android apps from 16 different categories.
Another new dataset for traffic classification of mobile apps is AppClassNet [164]. It was designed as an ImageNet for encrypted network traffic analytics and, therefore, is significantly larger in terms of tested apps and available samples than other datasets. The corresponding public dataset contains 500 apps with a volume of around 10 TB and stems from passive measurements.

For security tasks, in particular network intrusion detection, a comprehensive survey can be found in [165]. In this survey, the authors describe over 30 datasets and list the corresponding attacks and data types. A variety of datasets for different security tasks is also provided by the Canadian Institute for Cybersecurity, University of New Brunswick (UNB) [166]. They provide more than 25 datasets from different categories. These categories include IoT, dark web, DNS, IDS, traffic classification (web and apps using Tor or VPN), malware, and operational technology. A dataset for fingerprinting of devices and operating systems in the potential presence of VPNs is provided in [167]. The dataset contains around 20000 examples suitable for fingerprinting browsers, operating systems, and apps.

B. DATA GENERATION
Using real-world data is often challenging due to its limited availability and applicability. In this section, we explore the use of simulators, emulators, and synthetic methods, such as GANs, for generating data that can be used to train ML models. These approaches have the potential to help when datasets are not available or suited, and can enable the creation of diverse and complex datasets.

1) SIMULATION TOOLS
One of the paramount parts of designing a new scheme or protocol is the evaluation process. There are various methods, including real-world experiments, simulation, emulation, or analytical models, to perform a detailed investigation of the newly designed scheme. Nevertheless, each method has its advantages and disadvantages. When employing practical tests, the accuracy of the results can be excellent, but complexity and cost increase. On the other hand, modeling a new protocol based on conceptualization is beneficial for obtaining an analytical model, yet the complexity remains a drawback, and the accuracy can decline due to the limited capability of reflecting real-world scenarios. Considering the above challenges, using simulation and emulation environments can strike a good balance between complexity, cost, and accuracy. They might not depict real-world conditions minutely, but even so, they are eminent tools that can assist a researcher or developer in developing novel schemes. A thorough analysis of different simulators for networking can be found in [168] and [169].

NETWORK SIMULATOR 3
One of the most powerful simulation tools in networking is ns-3 (Network Simulator 3)12 [170], which is a discrete-event open-source simulator under the GNU GPLv2 license. This tool comes with various modules such as Wi-Fi, LTE, or even a recently released mmWave (millimeter wave) [171] module, giving researchers somewhat reliable test environments for newly developed approaches and reducing development time for various kinds of research interests. Indeed, ns-3 can assist researchers in network performance evaluation; however, by default it does not support ML approaches, so extending it through open-source AI frameworks makes it far more useful for ML problems. An attempt to do so was made in ns3-gym [172], an extension of ns-3 connecting the simulator to the OpenAI Gym toolkit. This connection is established using ZeroMQ sockets for inter-process communication (IPC). Moreover, the capability and adaptability of OpenAI Gym for reinforcement learning are favorable, as it is a widespread library, like TensorFlow and Scikit-Learn. ns3-gym aims at ameliorating the process of network prototyping that employs reinforcement learning. The module also improves scalability, which is important for running several ns-3 instances, and makes the conversion and deployment of ns-3 scripts in OpenAI Gym feasible. Furthermore, debugging and use of the module remain uncomplicated, as it behaves like a conventional ns-3 module consisting of two main blocks, OpenAI Gym and ns-3, that interact with each other (a minimal interaction loop is sketched below). Another interface extension that bridges ns-3 and Python-side ML implementations is ns3-ai [173], which claims to greatly increase the interaction speed by facilitating communication through a shared memory block.
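A minimal sketch of the agent side of ns3-gym, assuming the ns3gym Python package is installed and a corresponding ns-3 simulation script is running; the random policy stands in for an actual RL agent.

```python
import gym
import ns3gym  # noqa: F401 -- registers the 'ns3-v0' environment that proxies a running ns-3 simulation

env = gym.make("ns3-v0")  # connects to the ns-3 process via ZeroMQ
obs = env.reset()

done = False
total_reward = 0.0
while not done:
    action = env.action_space.sample()           # placeholder for a learned policy
    obs, reward, done, info = env.step(action)   # one message exchange with ns-3
    total_reward += reward

env.close()
print("episode return:", total_reward)
```

Observation and action spaces are defined on the ns-3 side of the bridge, so the same Python loop can be reused across different simulation scripts.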
OMNET++
Another popular discrete-event network simulator is OMNeT++,13 which can be used free of charge for academic and educational purposes under a license with rights similar to the GPL,14 but requires a paid license for commercial use. While OMNeT++ itself only contains the core simulation framework, various models can be added via external frameworks. The most important one is the INET Framework,15 which is maintained by the OMNeT++ core team and provides models for network standards like IEEE 802.3 and IEEE 802.11, as well as higher-layer protocols like IP, UDP, and TCP.

In terms of ML, Veins-Gym [174] exposes an OMNeT++ simulation as an OpenAI Gym environment, analogous to ns3-gym for ns-3. Despite its name, Veins-Gym can be used not only in combination with the Veins framework but also with any OMNeT++ simulation.

An overview with examples of how to use different ML frameworks such as TensorFlow in OMNeT++ can be found in [175].

12 https://www.nsnam.org
13 https://omnetpp.org/
14 https://omnetpp.org/intro/license
15 https://inet.omnetpp.org/
2) EMULATORS
A network emulator, unlike a simulator, creates a virtual copy of a physical device, including all hardware and software configurations, to functionally replace it. Hence, emulation is more accurate than simulation, but also more expensive in terms of computation resources. There are many network emulation tools, including but not limited to:
• Mininet16: a Python-based tool focused on emulating software-defined networks (SDNs) using OpenFlow switches.
• GNS317: supports a wide range of network devices and protocols using virtual machines and real devices.
• Mahimahi18: a lightweight network emulator that is designed to emulate low-bandwidth networks with high latency.
• WANEM19: a Linux-based tool that can be used to emulate various network conditions such as latency, packet loss, and bandwidth limitations in WANs.
• TENS20: a VM tool that can be used to generate emulated network traffic for security evaluation purposes. It can generate various types of traffic, such as HTTP, FTP, SMTP, etc.
• CORE21: similar to GNS3 but with further emulation capabilities beyond traditional networks, such as SDN and virtualization technologies.
• FlowEmu [176]22: a modular network link emulator with a flow-based programming inspired user interface that integrates TensorFlow for writing custom ML modules.

Many of these tools have been used for training and evaluating ML algorithms. For example, SDWAN-gym23 and IROKO [177] are Python-based platforms built on top of Mininet for training and evaluating reinforcement learning algorithms in software-defined WANs and data centers, respectively. It is often the case that emulated data is mixed with real data to obtain a large, reliable dataset. There exist many datasets that adopt this approach in cybersecurity, such as the Canadian Institute for Cybersecurity database.24 They provide the ''CICIDS2017'' dataset, labeled network flows with full packet payloads in PCAP format, for ML and deep learning purposes. Also, they provide the AndMal 2020 dataset to identify and classify Android malware based on ML.

3) SYNTHETIC
Synthetic data is needed because it can help to overcome the lack of up-to-date real-world data and privacy constraints, which limit the development of new models. In addition, synthetic data can provide an efficient mechanism to surmount the lack of labeled datasets and post-processing overhead. In the context of network traffic analysis, synthetic data can be used, for example, to train ML models to detect cyber-attacks and resolve network congestion as well as other performance issues.

SynGAN (Synthetic Generative Adversarial Network) [178] is a packet-level GAN designed to generate synthetic traffic data. It generates synthetic packets that closely resemble real-world traffic by simultaneously training the generator and discriminator networks (see the sketch at the end of this subsection). The generator network takes random noise as input and produces synthetic network traffic data as output, while the discriminator network distinguishes between synthetic and real data. Adversarial training ensures that the synthetic data produced by SynGAN is representative of real network traffic.

To make sure that the generated data satisfies certain constraints, PAC-GAN (Projection Adversarial Constraint GAN) [91] uses a projection operator to map the generated data onto a feasible set that satisfies the desired constraints. In addition to the standard GAN loss, PAC-GAN uses a constraint loss to ensure that the generated data is not only realistic but also satisfies the desired constraints.

Another type of traffic generator is flow-based GANs, which, unlike packet generators that focus on individual packets, generate flows of packets that share common characteristics, such as source and destination IP addresses, source and destination ports, and protocol type. Additionally, they can reduce the amount of data that needs to be generated by producing a single flow instead of multiple individual packets.

The authors in [179] propose different preprocessing approaches for transforming IP addresses of flows into a continuous feature, since GANs can only process continuous features. Then, they use domain knowledge, such as packet size, inter-arrival time, and flow duration distributions, to evaluate the quality of the generated data.

16 http://mininet.org/
17 https://docs.gns3.com/
18 https://manpages.org/mahimahi/
19 https://github.com/PJO2/wanem
20 https://github.com/vmware/te-ns
21 http://coreemu.github.io/core/
22 https://github.com/ComNetsHH/FlowEmu
23 https://github.com/amitnilams/sdwan-gym
24 https://www.unb.ca/cic/datasets/index.html
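To make the adversarial training idea above concrete, the following is a minimal PyTorch sketch of a GAN over flow-level feature vectors. The feature dimensionality, network sizes, and stand-in data are illustrative assumptions and do not reproduce SynGAN or PAC-GAN.

```python
import torch
import torch.nn as nn

FEATS, NOISE = 8, 16  # flow features (e.g., sizes, IATs, duration) and latent size -- assumptions

G = nn.Sequential(nn.Linear(NOISE, 64), nn.ReLU(), nn.Linear(64, FEATS))
D = nn.Sequential(nn.Linear(FEATS, 64), nn.ReLU(), nn.Linear(64, 1))

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

real_flows = torch.randn(1024, FEATS)  # stand-in for normalized real flow features

for step in range(1000):
    real = real_flows[torch.randint(0, len(real_flows), (64,))]
    fake = G(torch.randn(64, NOISE))

    # Discriminator: push real samples towards 1 and generated samples towards 0.
    loss_d = bce(D(real), torch.ones(64, 1)) + bce(D(fake.detach()), torch.zeros(64, 1))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # Generator: fool the discriminator into predicting 1 for generated samples.
    loss_g = bce(D(fake), torch.ones(64, 1))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()

synthetic = G(torch.randn(100, NOISE)).detach()  # 100 synthetic flow-feature vectors
```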
OpenAI Gym / Gymnasium: OpenAI Gym32 (lately continued as Gymnasium33 by the Farama Foundation) is an open-source Python library that provides a standardized API for the interaction between RL algorithms and environments. Additionally, it includes a wide range of environments of different complexities, including classic control tasks, Atari games, robotic simulations, as well as physical simulations. This allows researchers to reproducibly benchmark RL algorithms on a standardized set of environments. Furthermore, Gym can be extended by custom environments, allowing users to easily compare the performance of different RL algorithms on customized problems.

One challenge of RL research is that different implementations of the same RL algorithm can have significantly different performances in the same environment, making RL algorithms highly sensitive not only to hyperparameters but also to small implementation details [186].

Stable Baselines3: Stable Baselines3 (SB3) [187] is an open-source Python library that contains reference implementations of seven widely used DRL algorithms. Tab. 10 lists all supported algorithms. The performance of these algorithms has been thoroughly tested. The library is compatible with the OpenAI Gym/Gymnasium API, enabling users to train RL agents in just a few lines of code (see the sketch below). Moreover, the library supports custom Gym environments, custom policies for the algorithms, TensorBoard, as well as data logging customization through custom callbacks.

Additional RL algorithms are implemented in the Stable Baselines3 Contrib (SB3-Contrib)34 package. These are implementations of newly published algorithms. They are less tested and therefore considered experimental.

RL Baselines3 Zoo: RL Baselines3 Zoo35 is a Python library that provides pre-trained agents and a set of optimized hyperparameters for the algorithms from SB3 and the Gym environments. Moreover, it provides useful helper scripts for training and evaluating agents, for tuning hyperparameters, and for plotting results.

CleanRL: CleanRL [188] is a DRL framework that provides thoroughly benchmarked single-file Python implementations of eight DRL algorithms (cf. Tab. 10). Its goal is to provide researchers full control over an algorithm in a single file, making it easier to 1) fully understand all implementation details, and 2) quickly prototype novel DRL features. In addition, it provides support for TensorBoard. In comparison to SB3, CleanRL does not provide a high-level user-friendly API for model training. It is instead tailored to provide a development environment for DRL researchers with implementations that are easy to read, debug, modify, and study. The desired workflow is to first prototype new RL ideas in CleanRL and afterwards port them to a library offering a higher-level API like SB3.

32 https://www.gymlibrary.dev
33 https://gymnasium.farama.org
34 https://sb3-contrib.readthedocs.io/en/master/
35 https://github.com/DLR-RM/rl-baselines3-zoo
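The following is a minimal sketch of a custom Gymnasium environment trained with PPO from SB3. The environment (a toy link-rate selection task) and its observation and reward definitions are invented here for illustration; they are not taken from the paper.

```python
import gymnasium as gym
import numpy as np
from gymnasium import spaces
from stable_baselines3 import PPO

class LinkRateEnv(gym.Env):
    """Toy environment: pick one of 4 sending rates; reward = throughput minus an overload penalty."""

    def __init__(self):
        super().__init__()
        self.action_space = spaces.Discrete(4)                            # 4 candidate rates
        self.observation_space = spaces.Box(0.0, 1.0, (1,), np.float32)  # normalized link capacity

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.capacity = self.np_random.uniform(0.2, 1.0)
        self.t = 0
        return np.array([self.capacity], np.float32), {}

    def step(self, action):
        rate = (action + 1) / 4.0
        # Achieved throughput, minus a penalty for exceeding the (unknown next-step) capacity.
        reward = min(rate, self.capacity) - 2.0 * max(0.0, rate - self.capacity)
        self.capacity = float(np.clip(self.capacity + self.np_random.normal(0, 0.05), 0.2, 1.0))
        self.t += 1
        return np.array([self.capacity], np.float32), reward, self.t >= 200, False, {}

model = PPO("MlpPolicy", LinkRateEnv(), verbose=0)
model.learn(total_timesteps=50_000)  # training in a few lines, as promised by SB3
```

Because the environment implements the standard Gymnasium API, the same class can be plugged into any of the compatible libraries discussed in this section without modification.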
OpenAI SpinningUp: OpenAI SpinningUp36 is a great resource for aspiring researchers and practitioners who are excited to apply DRL to their problems but are overwhelmed by the implementation complexity of algorithms in frameworks like Stable Baselines3. It provides detailed explanations of the most important concepts of DRL, as well as explanations and implementations of key DRL algorithms. The algorithm implementations specifically focus on simplicity, with the aim of being easy to follow for people new to the field. This simplicity is achieved by narrowing down the implementations to the core concepts of the algorithms and by omitting more complex features that can significantly improve an algorithm's performance. As a result, OpenAI SpinningUp should primarily be seen as an educational resource and should not be used in production systems.

PettingZoo: PettingZoo37 is an open-source Python library that contains a set of environments for multi-agent reinforcement learning (MARL). While it is similar to OpenAI Gym/Gymnasium in its functionality and API, the application scenario of MARL is different from that of single-agent RL. Among others, it contains multi-agent environments of Atari games and classic games like chess and Go. Furthermore, it can be extended by custom environments.

Ray RLlib: Ray RLlib [189] is an open-source Python library for RL. Out of the RL libraries presented in this section, it is the most comprehensive one. It supports a wide range of performance-tested RL algorithms, offers a high-level user-friendly API to train agents, supports single-agent, multi-agent, and custom environments, offers high scalability by supporting both single-machine and distributed training, and offers tools for managing, tracking, and visualizing the results of experiments. Because it is built on the Ray platform, it is also seamlessly compatible with other Ray libraries and tools for distributed computing and parameter tuning.

When to use which RL library? An important question to answer in this primer is when to use which of the presented RL libraries. CleanRL is recommended either to fully understand how an algorithm is implemented or for RL researchers to quickly prototype new ideas, since its design decision to separate each algorithm into its own file lets the researcher focus on the algorithm instead of the complex software architecture of other RL algorithm libraries with intertwined modular implementations. SB3 is primarily intended to offer well-tested baseline implementations of important DRL algorithms as a benchmark baseline for new RL developments. However, along with its extensions SB3-Contrib and Zoo, it is recommended if a high-level interface for fast training of well-established and well-tested RL algorithms on single-agent environments is desired and no scalability via distributed learning is required. RLlib offers a production-ready framework for large-scale projects. It is recommended for multi-agent environments, as well as when high scalability via distributed learning is required.

36 https://spinningup.openai.com
37 https://pettingzoo.farama.org
• Optuna [219]: an open-source hyperparameter optimization framework for ML. It provides a flexible and modular platform for automating the process of selecting optimal hyperparameters for a given model architecture. Optuna uses various algorithms to search the hyperparameter space, including TPE, Covariance Matrix Adaptation Evolution Strategy (CMA-ES), Non-Dominated Sorting Genetic Algorithm II (NSGA-II), and adaptive sampling. It also supports distributed optimization across multiple nodes for faster and more efficient tuning (see the sketch after this list).
• Ray Tune [220]: the hyperparameter tuning component of the Ray framework. It handles the execution of experiments, including parameter studies with possibly multiple repetitions, as well as scheduling the runs for parallel execution. For hyperparameter tuning, it supports a wide variety of approaches. These include basic strategies such as grid or random search, but also more advanced approaches such as Bayesian optimization or Population Based Training [221]. While some algorithms are implemented internally, it relies heavily on third-party optimization libraries such as Hyperopt [222] and Optuna [219], and provides a unified interface to them.
• Keras Tuner [223]: a library customized for Keras that provides an easy-to-use API for defining a hyperparameter search space, choosing search algorithms such as random search and Bayesian optimization, and running hyperparameter search processes. Furthermore, Keras Tuner is easy to integrate with other Keras workflows and can optimize both single-node and distributed hyperparameters.
• Hyperopt [224]: a Python library for hyperparameter optimization that uses a combination of random search and Bayesian optimization to efficiently explore and exploit the hyperparameter search space. It provides an easy-to-use API for defining the hyperparameter search space, selecting optimization algorithms, and executing the hyperparameter search process. Hyperopt uses a Tree-structured Parzen Estimator (TPE) algorithm to model the relationship between hyperparameters and model performance and to guide the search for better hyperparameters. Hyperopt also allows for the parallelization of the search process, making it scalable to large hyperparameter search spaces and parallel computing environments. It can be used with a variety of machine-learning frameworks, including Scikit-learn, Keras, and PyTorch.
• Scikit-Optimize [225]: a Python library for sequential model-based optimization that aims to efficiently explore and exploit the hyperparameter search space while minimizing the number of model evaluations. It provides a simple and flexible API for defining the hyperparameter search space and selecting optimization algorithms, including Bayesian optimization and gradient-based optimization. Scikit-Optimize also supports parallel evaluation of the search process, making it scalable to large hyperparameter search spaces and parallel computing environments. In addition to hyperparameter optimization, it can be used for function optimization and global optimization tasks. Furthermore, it integrates easily with popular ML frameworks such as Scikit-learn and Keras, while including features such as early stopping and warm-starting to further improve the efficiency of the hyperparameter search process.

Note that Table 12 presents only the most commonly used algorithms for each tool. While other algorithms may be added, the mileage may vary depending on the specific use case and requirements. Overall, the choice of which tool to use depends on the specific requirements and use case. For example, if there is a need for scalability and distributed training, Ray Tune is a good choice. If there is a need for a general-purpose optimization library, then Scikit-Optimize might be a good choice.
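As an illustration of how such a tool is used, here is a minimal Optuna sketch tuning two hyperparameters of a scikit-learn classifier; the search space and model are arbitrary choices for demonstration.

```python
import optuna
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

def objective(trial):
    # Define the search space per trial; Optuna's sampler (TPE by default) proposes values.
    n_estimators = trial.suggest_int("n_estimators", 50, 400)
    max_depth = trial.suggest_int("max_depth", 2, 16)
    clf = RandomForestClassifier(n_estimators=n_estimators, max_depth=max_depth, random_state=0)
    return cross_val_score(clf, X, y, cv=3).mean()  # objective to maximize

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=30)
print(study.best_params, study.best_value)
```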
E. TESTBEDS
As previously outlined, it is hard to replicate realistic network conditions, and using existing datasets might not always fit the problem. While simulation tools can help with that, there is also the possibility of using existing testbeds or building your own. Access to existing real-world testbeds is usually open or free for researchers, but you might have to schedule your experiments and wait, depending on utilization. In the following, some popular real-world testbeds and some devices one could use to build a testbed will be introduced. There are two types of testbeds: wired ones and wireless ones. The wireless ones are wireless sensor networks without any routers or switches, and communication is broadcast. Testbeds are relatively versatile and can be used, for example, either to test ML applications that rely on networks, like distributed ML, or to test ML algorithms that perform traffic routing.

1) REAL-WORLD TESTBEDS
For a more extensive overview, [226], [227], [228], and [229] provide surveys that either include a section about testbeds or are entirely about testbeds. We present a selection of popular testbeds, starting with wireless testbeds.

FlockLab [230] is an experimental platform that enables researchers to test and evaluate the performance of wireless sensor networks (WSNs) and IoT systems. It is a flexible, open-source testbed that provides a controlled and repeatable environment for the evaluation of various applications. An advantage of FlockLab is its flexibility, as it can be used to test and evaluate a wide range of wireless sensor networks and IoT systems [230]. It supports various wireless technologies, such as Zigbee, Z-Wave, and LoRaWAN, and it can be easily extended to support new technologies. FlockLab is widely used in the field of WSNs and IoT systems [231], and it has been developed and maintained by the Communication Systems Group at ETH Zurich.

FIT IoT Lab [232] is an open-access testbed for IoT experiments provided by the French Institute of Technology. It contains over 1500 nodes offering a wide range of low-power wireless devices that can be used to test and evaluate various IoT applications, protocols, and algorithms. In addition, its large-scale infrastructure and easy-to-use web interface provide a flexible and convenient platform for IoT experimentation.

D-Cube [233] is a testbed by Graz University of Technology. It contains about 50 nodes with two platforms, nRF52840 and TelosB, and provides a set of predefined scenarios. These scenarios allow researchers to evaluate protocol performance and easily compare protocols against each other.

CLOVES [234] is a part of the IoT Testbed at the University of Trento. It contains 275 indoor devices spread over 8000 square meters. Communication is possible using ultra-wideband or narrowband, and all nodes are remotely accessible.

Next, we are going to introduce some wired testbeds. Note that some of them also provide wireless capabilities.

PlanetLab [235] was founded in 2002 by researchers from several universities, including Princeton University, the University of California at Berkeley, and Stanford University. While it was shut down in March 2020, PlanetLab Europe38 continues to operate. It is a collection of interconnected computers located at over 250 sites in more than 40 countries across Europe and beyond, available for researchers to use in their experiments. PlanetLab Europe provides researchers with virtual machines, storage, and network connectivity. In addition, researchers can deploy their software on the nodes and create custom network topologies to simulate various network scenarios.

EmuLab39 [236] is a network testbed developed by the University of Utah that provides users with a virtual network environment to test and evaluate various networking systems and applications. EmuLab allows researchers to create and configure network topologies, deploy software and network services, and generate different types of network traffic to test and evaluate various networking scenarios.

38 https://www.planet-lab.eu/
39 https://www.emulab.net/
GENI (Global Environment for Network Innovations) [237] is a US national-scale network testbed that provides researchers with a virtual laboratory for developing and testing new networking technologies and applications. It comprises a large-scale network of interconnected computing resources, including servers, routers, switches, and other network devices.

2) BUILDING YOUR OWN TESTBED
When seeking greater control over a testbed, building a customized one emerges as a viable option. Fortunately, there are several cost-effective devices available for this purpose, with some even incorporating machine learning accelerators [238]. These accelerators enable the deployment of machine learning models for training and inference within the testbed and offer a variety of communication approaches. In this section, we will provide a list of the most common and popular devices used for this purpose, along with detailed explanations of their respective advantages.

NVIDIA Jetson40 is a series of embedded computing boards designed for IoT and ML applications. They include NVIDIA GPUs and CPUs, as well as a variety of interfaces and sensors for connecting to other devices. Jetson boards are designed to be low-power and compact, making them suitable for portable and battery-powered applications. They can be used for various tasks, including image and video processing, deep learning, and robotic control.

Google Coral41 includes a range of hardware and software products, such as the Coral Dev Board, the Coral USB Accelerator, and the Edge TPU software. The Coral Dev Board is a single-board computer that is designed to be small and low-power, making it suitable for use in portable and battery-powered devices. It has a system-on-a-chip (SoC) that includes a Google Edge TPU, which is a custom-built Tensor Processing Unit (TPU) for running ML/DL models. The Coral USB Accelerator is a small USB device that can add Edge TPU capabilities to existing devices. The Edge TPU software provides a set of libraries and tools for developing and deploying ML models.

The Raspberry Pi boards are equipped with a variety of interfaces and peripherals, such as USB ports, Ethernet, HDMI, and a 40-pin expansion header. They also have high CPU and memory capacity, which makes them powerful enough to run various applications. The Raspberry Pi can run TensorFlow Lite and other ML frameworks, enabling researchers to run pre-trained models and perform basic ML tasks (see the sketch below). It can also be used as an edge device for collecting and preprocessing data before sending it to the cloud for further analysis.

The Intel Movidius Neural Compute Stick42 is a USB device that provides on-device AI inference for various applications in networked systems. It features a Myriad 2 VPU, which can run deep neural networks with low power consumption. The Neural Compute Stick can accelerate computer vision, speech recognition, and natural language processing tasks in networked devices.

40 https://www.nvidia.com/de-de/autonomous-machines/embedded-systems/
41 https://coral.ai/
42 https://www.intel.com/content/www/us/en/developer/articles/tool/neural-compute-stick
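A minimal sketch of on-device inference with TensorFlow Lite, as it might run on a Raspberry Pi in such a testbed; the model file name and its input shape are illustrative assumptions.

```python
import numpy as np
import tensorflow as tf  # on a Raspberry Pi, the lighter 'tflite-runtime' package can be used instead

# Load a pre-trained, converted model (hypothetical file, e.g., a traffic classifier).
interpreter = tf.lite.Interpreter(model_path="traffic_classifier.tflite")
interpreter.allocate_tensors()

inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

# One inference on a dummy feature vector shaped like the model's input.
x = np.random.rand(*inp["shape"]).astype(np.float32)
interpreter.set_tensor(inp["index"], x)
interpreter.invoke()
print("class scores:", interpreter.get_tensor(out["index"]))
```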
V. EXPLAINABLE ARTIFICIAL INTELLIGENCE
While ML and especially DL models are powerful tools for network service providers, they come with the major drawback that their reasoning is difficult for humans to understand due to their black-box characteristics [239]. This lack of understanding may result in stakeholders, e.g., network service providers, not deploying ML models in production environments, as they do not trust their reasoning and, thus, fear outages or revenue losses. To alleviate these concerns, Explainable AI (XAI) is well-suited, as it helps to understand the underlying reasoning of ML models. This is achieved by intelligently relating inputs and outputs; the thereby learned transformation function, or at least some parts of it, becomes interpretable. Usually, this interpretability comes in the form of mathematical functions or heatmaps describing the influence of the inputs on the model's decision. In addition, a quantification of a model's uncertainty is fundamental for risk assessment during deployment, thereby paving the way for Responsible AI.

There are plenty of use cases for applying XAI in communication networks [240]. These use cases include network planning and engineering [241], resource allocation [242], [243], performance management [128], [244], and security management [245], [246]. Most of these works use the methods presented in this chapter to make their models explainable.

A. TAXONOMY OF XAI METHODS
A general overview of XAI techniques is provided in [247], and an extensive survey on XAI methods as well as a taxonomy for XAI methods in general can be found in [248]. XAI methods can be classified into techniques that explain a model locally or globally. A local explanation technique provides model explanations for a single input, e.g., why a specific packet is routed that way, while a global explanation technique provides general explanation strategies of a model, e.g., how the model routes packets in general.

Further, XAI methods can be classified into post-hoc explainers and interpretable models. Post-hoc explainers are utilized to explain various already trained black-box models, e.g., neural networks or ensemble models. Ensemble models like Random Forest are composed of multiple smaller models jointly determining the output. This makes interpretation difficult. Interpretable, transparent, or glass-box models provide an explanation for how the model obtains the output by design. Prevalent models are, for example, the well-known linear models and decision trees, as well as the less-known generalized additive models.

Finally, model-agnostic methods and model-specific methods are distinguished. Model-agnostic methods can be used on top of every kind of model, while model-specific methods can only be used with specific model families. A prominent example of model-specific methods are saliency maps [249], which are computed from the feature maps learned by a model and can be used in computer vision to highlight the regions on which the model focuses when processing input. They are generally applicable when using CNNs. This also implies that the nature of the data directly influences the applicable XAI techniques for the different use cases; e.g., time series XAI techniques are not usable for graph data.

B. SPECIFIC XAI METHODS
Since there are many different categories of XAI techniques, there is a wide spectrum of specific XAI methods. Thus, the following explained methods are only a small selection. Because in many XAI scenarios a black-box ML model should be made intelligible, the methods introduced first focus on post-hoc explainers. While it is common to perform post-hoc explanations, the authors of [250] argue that we should stop using post-hoc explainers and instead directly use interpretable models. Interpretable models often perform worse than black-box models, but are interpretable by design.

1) POST-HOC EXPLAINERS
As a majority of advances in ML happen in computer vision, there exists a huge variety of post-hoc explainers for explaining the learnt filters of a CNN, e.g., saliency maps. As a consequence, these techniques are model-specific and usually not applicable to network data. Nevertheless, there exist approaches where network data is transformed into images beforehand, e.g., for encrypted network traffic classification in [251], and processed with a CNN, so saliency maps could be applied here.

Layer-wise Relevance Propagation (LRP) is a post-hoc method that uses the neural network's forward pass and propagates its output backwards through the layers until the input layer to derive the relevance of an input to the model's prediction.

A prevalent local model-agnostic post-hoc explainer is called SHapley Additive exPlanations (SHAP), which uses methods from game theory to judge the importance of different feature inputs. Although this method can explain the black box of an ML model very well, it comes with the drawback that it needs high computational power. Thus, it is only feasible for models with fewer input parameters [252] (see the sketch at the end of this subsection).

A well-working method for obtaining an explanation of classification models in a model-agnostic fashion is Local Interpretable Model-agnostic Explanations (LIME) [253]. LIME belongs to the class of surrogate models, where a model is used to approximate the predictions of a target black-box model to infer the reasoning of the black-box model. LIME trains a local surrogate model to explain the predictions for a specific sample by first aggregating permutations of the original feature inputs of the sample into a new dataset, weighting the samples of the dataset according to their proximity to the original sample, and then training an interpretable model on this dataset to approximate the predictions of the black-box model. After training, the local model can be interpreted to understand the black-box model's reasoning.

Another type of local model-agnostic post-hoc explainer are counterfactual explanations [247]. Counterfactual explanations are used for causal reasoning and may serve to answer what-if questions, i.e., ''would Y have occurred if X had not occurred before''. These techniques may be helpful for network operators when they try to analyze and manage their network with respect to critical situations, e.g., how to avoid congestion in a network. In a nutshell, they work by deriving causal relationships from the input features and then manipulating input features to perform specific reasoning.
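To illustrate the post-hoc explainers above, here is a minimal SHAP sketch explaining a tree-based model; the synthetic data stands in for real flow features and a QoE-style regression target.

```python
import shap
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

# Stand-in for flow features (e.g., mean packet size, mean IAT, ...) and a QoE score target.
X, y = make_regression(n_samples=500, n_features=6, random_state=0)
model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

# TreeExplainer exploits the tree structure, which keeps SHAP tractable for tree ensembles.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X[:10])  # one attribution per sample and feature

# Contribution of each feature to the first sample's prediction (sign = direction of influence).
print(shap_values[0])
```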
2) INTERPRETABLE MODELS
The easiest to interpret and most well-known interpretable models include decision trees, which are interpretable in an if-else fashion, and linear models like linear regression or logistic regression, where slope and intercept directly characterize the input mapping. Generalized linear models (GLMs) and generalized additive models (GAMs) extend linear models to better reflect non-linear functions and target distributions other than the Gaussian distribution assumed by linear regression [247]. Especially for GAMs, many different models exist by now that are directly interpretable.
C. UNCERTAINTY QUANTIFICATION
Note that many approaches quantify only epistemic or aleatoric uncertainty, but not both simultaneously. A survey of existing approaches is provided in [260]. In the following, some selected ways to quantify uncertainty are shortly introduced. A way to estimate epistemic uncertainty is, for example, the use of ensembles, i.e., training the same model with different seeds and considering the predictions of each model and, in particular, the differences in these predictions. The stronger the differences between the models' predictions, the higher the uncertainty. This approach can be used for any kind of model. Another simple approach for estimating epistemic uncertainty in neural networks is the use of Monte Carlo Dropout. With Monte Carlo Dropout, the dropout layers, which are usually used for improved model generalizability during training, are also kept active during inference. Generating multiple model predictions with active dropout can also be considered as approximate Bayesian inference. Again, the variation in the returned predictions quantifies the degree of uncertainty (see the sketch below). To learn aleatoric uncertainty, it is usually required that the model learns not only mean responses, but also the variance [261]. With neural networks and a regression task, this is, for example, easily possible by simply adding another head, i.e., output neuron, to the neural network, which learns the variance, and by adjusting the loss function accordingly. Using the negative log-likelihood of a Normal distribution (or any other distribution) as the loss function, it is thus possible to learn a Normal distribution for an input, thereby allowing the uncertainty for that input to be quantified in the form of the variance. Finally, Bayesian Neural Networks as proposed by Kendall and Gal [261] can model both aleatoric and epistemic uncertainty. With Bayesian Neural Networks, model weights are assigned a probability distribution instead of a single value. Using these probability distributions, it is then possible to quantify epistemic uncertainty. For aleatoric uncertainty, they simply use two heads, where they learn both mean and variance for a data point.
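A minimal PyTorch sketch of Monte Carlo Dropout as described above; the network architecture and the number of stochastic forward passes are illustrative choices.

```python
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(4, 64), nn.ReLU(),
    nn.Dropout(p=0.2),  # kept active at inference time for MC Dropout
    nn.Linear(64, 1),
)

x = torch.randn(1, 4)  # one input sample (e.g., normalized network features)

model.train()  # train mode keeps dropout stochastic during the forward passes
with torch.no_grad():
    preds = torch.stack([model(x) for _ in range(100)])  # 100 stochastic forward passes

mean = preds.mean()
epistemic_std = preds.std()  # spread across passes approximates epistemic uncertainty
print(f"prediction: {mean:.3f} +/- {epistemic_std:.3f}")
```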
D. RESPONSIBLE AI
Strongly related to uncertainty is the concept of Responsible AI. According to Arrieta et al. [248], XAI alone is not sufficient for an ethical and responsible usage of ML models. Responsible AI is in general a much broader topic than XAI [262]. With Responsible AI, there are additional principles which must be kept in mind when developing and deploying ML models. These principles include the prevention of discrimination against persons, groups, or races, i.e., the model must be fair. In the context of communication networks, discrimination could, for example, mean that a model disadvantages specific users by assigning them lower bandwidth shares and higher latency. Additionally, Responsible AI ensures that users or stakeholders are always aware of the usage of ML models. Specifically, it must be transparent to everybody that ML has been used and how it has been used. An example for communication networks is the adaptive change of a routing table by an ML model. The model must be able to outline why a change was required and why it has changed specific routes. Next, the use of ML models should always benefit humanity in all aspects of life. They should not be used in disruptive ways, e.g., generating downtimes in a network for specific users on purpose. Finally, privacy and security are also very important topics. ML models require data. Here, the privacy and security of sensitive data must be maintained throughout the whole lifecycle of preparing and deploying the model. Responsible AI is still a young field of research. Nevertheless, all the mentioned principles must be kept in mind when preparing and deploying ML models in practice. It is one of those topics already diligently discussed in the conceptualization of future networks, e.g., 6G [263]. Meanwhile, [264] is a more generic survey of best practices to ensure that AI environments are responsible.
E. LIBRARIES
Several XAI libraries are available for all kinds of frameworks, e.g., Scikit-learn, PyTorch, and TensorFlow (cf. Section IV). Microsoft created a Python library named InterpretML [254], which unifies black-box explainers, e.g., SHAP values, LIME, or Partial Dependence Plots, and transparent models, e.g., linear models, decision trees, decision rules, and also EBM, a tree-based generalized additive model. OmniXAI [265], AIX360 [266], and Alibi [267] also provide a collection of various post-hoc explainers and models for all kinds of data types and backends. In contrast to the libraries containing several different tools, individual explainers like SHAP or Anchors, but also interpretable models like the attention-based model TabNet,43 are available as separate Python packages.

All gradient-based methods, e.g., Integrated Gradients [268], can be directly implemented within PyTorch and TensorFlow, or additional libraries like Captum [269], TorchRay [270], and TF-Explain [271] can be used. Captum also comprises a huge number of techniques for explaining image-based data.

VI. NETWORKS FOR MACHINE LEARNING
In the previous sections, we primarily explained ML methods, architectures, and principles to develop ML models. Hence, we focused on applying ML to design and optimize networks, detect patterns and anomalies, and predict network behavior autonomously. We refer to this application as ''ML for Networks'' [272], [273], where ML models are developed from network data to, e.g., design the communication topology of a network or to balance the traffic load.

However, networks and ML form a mutual relationship in which networks support ML, e.g., by using a network as an infrastructure for ML algorithms, both for training and inference. As we will see throughout this section, networks are thus a key success factor for ML by connecting and providing computational power and data storage [274], [275]. We refer to this support and infrastructure functionality of networks as ''Networks for ML''. Important to note is that it is detached from the ML model application. Instead, any ML model can be trained or deployed in a networked system. As ML is a relatively new network task, challenges for networks arising from ML traffic and possible effects of networks on ML are still the subject of research. ''Networks for ML'' generally comprises these open research questions.

ML algorithms primarily use a network to access data from memory or to exchange model parameters/updates. The traffic load generated, the traffic shape, and the network requirements, e.g., regarding latency and robustness, are unknown for many ML methods and are likely to be application-specific and method-specific. All this can pose new challenges for networks and make a better understanding of the mutual relationship between ML and networks necessary. Thus, it is no longer sufficient to evaluate ML model performance alone, but also the network performance. Hence, one might ask the question: Which metrics should be used to evaluate model and network performance when applying ML in networks?

From Section II, we know that several metrics can be used to evaluate the performance of ML models. These metrics could depend on the specific task or application of the model [276]. Although these metrics were introduced for ML models with network applications (''ML for Networks''), it is worth noting that some metrics can also help answer the questions arising in ''Networks for ML''. The choice of metrics will depend on the specific problem and the desired outcome. Hence, ''ML for Networks'' and ''Networks for ML'' are not mutually exclusive [277].

For instance, Data Quality is a metric that can be used for evaluating both. As ML is generally data-driven, data quality is very important for model development. Thus, when ML is applied for network tasks, data quality is primarily measured by the correctness and representativeness of events/classes. This can also be utilized in ''Networks for ML''. However, as it focuses on decentralized data sources, data distribution can additionally be considered. Other metrics typically considered by the ML community are: Privacy, Robustness, Energy Efficiency, and Fairness. However, as ''Networks for ML'' also focuses on network behavior, typical network metrics are often applied, such as Throughput, Latency, Packet loss rate, and Spectral efficiency. Table 14 further explains the metrics and their impact.

So why do these network metrics influence the ML models? High latency and low throughput (as well as low spectral efficiency) can cause delays in the training process, leading to slower training times and increased iteration cycles. Packet loss can impact the accuracy and also the consistency of ML models, because it can lead to incorrect or incomplete data inputs and can cause inconsistent data transfer in the case of retransmissions. This, in turn, can affect the model's ability to generalize, converge, and make accurate predictions.

Different network topologies could affect ''Networks for ML'' performance, scalability, and security. Furthermore, when considering ML for Networks, the choice of network topology can also affect the accuracy and efficiency of the models.

For example, in a star topology, all nodes are directly connected to a central hub, which can make the network easier to manage and administer. From an ML perspective, this topology would lend itself to centralized learning, where data from all nodes is collected and processed in a central location. This approach could simplify the deployment and maintenance of the ML model, but it could also lead to a single point of failure and potential privacy concerns.

On the other hand, a mesh topology, in which nodes are connected in a decentralized fashion, can be more resilient to failures and provide more privacy, but it can also be more difficult to manage. In terms of ML, this topology can be suitable for distributed learning, where each node trains a local model and shares its knowledge with the other nodes. This approach could improve the scalability and privacy of the model, but it could also increase the synchronization overhead.

43 https://dreamquark-ai.github.io/tabnet/
TABLE 14. Examples of metrics for ML for Networks and Networks for ML.
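As a rough illustration of how these metrics interact, the following back-of-the-envelope sketch models one synchronous training iteration as local compute plus one parameter exchange over the network. All numbers, and the simple retransmission model for packet loss, are illustrative assumptions rather than measurements.

```python
# Back-of-the-envelope sketch: one synchronous data-parallel training iteration
# = local compute + parameter/update exchange (illustrative assumptions only).

def iteration_time(t_compute_s, update_bytes, throughput_bps, rtt_s, loss_rate):
    """Estimate seconds per training iteration over a network link."""
    transfer_s = 8 * update_bytes / throughput_bps   # serialize the model update
    # Crude retransmission model: a loss rate p inflates the transferred
    # volume by roughly 1 / (1 - p); real congestion control behaves worse.
    transfer_s /= (1.0 - loss_rate)
    return t_compute_s + rtt_s + transfer_s

# Hypothetical setting: 100 MB of gradients, 1 Gbit/s link, 20 ms RTT, 1% loss.
t = iteration_time(t_compute_s=0.5, update_bytes=100e6,
                   throughput_bps=1e9, rtt_s=0.020, loss_rate=0.01)
print(f"~{t:.2f} s per iteration")  # ~1.33 s: communication dominates compute
```

In this toy setting, more than half of every iteration is spent on the network, which is exactly why throughput, latency, and loss rate appear as ''Networks for ML'' metrics.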
Different network topologies can affect ''Networks for ML'' performance, scalability, and security. Furthermore, when considering ML for Networks, the choice of network topology can also affect the accuracy and efficiency of the models.

For example, in a star topology, all nodes are directly connected to a central hub, which can make the network easier to manage and administer. From an ML perspective, this topology lends itself to centralized learning, where data from all nodes is collected and processed in a central location. This approach can simplify the deployment and maintenance of the ML model, but it can also lead to a single point of failure and potential privacy concerns.

On the other hand, a mesh topology, in which nodes are connected in a decentralized fashion, can be more resilient to failures and provide more privacy, but it can also be more difficult to manage. In terms of ML, this topology is suitable for distributed learning, where each node trains a local model and shares its knowledge with the other nodes. This approach can improve the scalability and privacy of the model, but it can also increase the synchronization overhead.

There are also other network topologies, such as bus, ring, tree, and hybrid, which come with different tradeoffs in terms of network metrics. Choosing the right topology for an ML application depends on several factors, such as the size and complexity of the network, the nature of the data and task, and the available resources and constraints.

Examples of these constraints are computational resources and data availability. Regarding the former, a star topology requires a central server with sufficient computational resources to process all the data, whereas, in a mesh topology, each node can contribute computational resources, reducing the burden on any one node. Regarding the latter, a star network topology stores the data in a single location, which can limit the amount of data available for training. In contrast, a mesh topology can distribute data across multiple nodes, providing a larger and more diverse data set for training. We refer to [279] for a comprehensive survey on the convergence, robustness, and privacy of ML algorithms with respect to network architecture and implementation in the context of 5G networks.

In the following, we explain advanced ML topics with distributed implementations, exploiting both the ''ML for Networks'' and ''Networks for ML'' domains.
A. CENTRALIZED ML
Centralized ML refers to training ML models on a central node of the network using data from multiple nodes, and it is widely applied in networked systems such as the IoT [280], [281]. The data is first collected from various nodes in the network and then transmitted to a central server to train the ML model. Typically, the data is also preprocessed for training, which can happen both on the collecting nodes and on the central server. In many cases, the central server has more computing resources and larger storage space than the collection nodes. Since training and inference are independent, the resulting model can be used both centrally and decentrally for inference. In centralized inference, a central computing node (server) employs the model to infer from the data of various collection nodes. The collection nodes usually send their observed data to the central computing node and receive the model predictions. However, it is also common to distribute the centrally trained ML model to different nodes, which then independently infer from their local data.

Centralized ML takes advantage of consolidation on central servers (e.g., in the cloud) with powerful computing resources that can handle the processing and training of computationally heavy models using large datasets [282]. While the increase in training speed and better resource utilization is obvious, the benefit of more accurate predictions requires a more detailed explanation. Unlike the case where each node trains its model using its local data, centralized ML training benefits from aggregating data from multiple nodes [283]. Thus, the model not only trains on a larger dataset, but the aggregated data also better represents the overall data distribution, allowing the model to generalize better. For example, an ML model can extract significant information from data from different sensor types or locations. This is particularly useful in network applications such as smart cities, environment monitoring, and industrial IoT.

However, there are also several disadvantages associated with centralized ML in networked systems. First, the dependence on a central computing node for model training and inference introduces a single point of failure and scalability issues, potentially impacting the reliability and availability of the system [284]. The centralized approach places high demands on the central server in terms of computing and network performance, making its acquisition and maintenance expensive. Secondly, the data collected by networked devices (e.g., multimedia sensors, intelligent vehicles) is transmitted in large quantities over the network, requiring high data rates. Transferring large amounts of data to a central node can cause network congestion and degrade real-time performance [285]. Moreover, sensitive data may need to be transmitted to the central server, potentially compromising user privacy. Recently, there have been growing concerns about privacy in networked systems with data generated by networked devices, such as wearable devices or sensors, where data is often very private or sensitive [286]. This results in additional requirements for the network over which the data is transmitted, processed, and stored.
B. DISTRIBUTED ML
In various fields of application, the complexity of the tasks being tackled by ML models has led to an increase in the number of model parameters. To cope with this complexity, distributed ML techniques make use of networks of interconnected computing machines to address challenges such as handling larger and distributed datasets, accommodating heightened computing resource demands, and dealing with models that surpass the memory capacity of a single machine. Here, two approaches are prevalent, and both usually take advantage of networking to enhance model training: 1) data-parallel and 2) model-parallel. Combinations of data- and model-parallel methods are also possible.

Data parallelism corresponds to scale-out parallelization and, therefore, increases computational capacity. During training, several machines, so-called workers, train instances of the ML model. These instances operate on distinct and usually non-overlapping portions of the dataset. All instances have the same model structure, number of layers, and number of neurons per layer, but the parameter values can vary. The workers periodically communicate to exchange model parameters and aggregate their updates after processing a predefined number of samples locally. Various data-parallel methods have been formulated, differing primarily in the manner of cooperation among workers during training, encompassing how workers communicate and where update aggregations occur. From this perspective, architectures can be primarily distinguished into Client-Server and Peer-to-Peer methods. Client-Server methods use a set of decentralized workers that process model updates as clients and a centralized server. The server can be a single worker or multiple workers organized equally or in hierarchical layers. Regardless of the server's internal structure, the server maintains the shared model state and stores all model parameters. Clients receive the current model state with its parameter set from the server and communicate their updates only to it. All communication is thus handled by the server, which can become a bottleneck. In contrast, Peer-to-Peer methods entail direct communication of updates among workers without a central server managing the global model state. Which workers can communicate with each other is defined in a communication topology; here, all-to-all but also graph-based topologies such as trees and rings are possible. In addition to the cooperation relationship, data-parallel methods differ in whether workers transmit their updates synchronously or asynchronously and in the amount of communication overhead incurred. In production, where the model is usually only used for inference, the machines share the same model instance.

Model parallelism, on the other hand, splits the model and distributes it across multiple workers, allowing for model sizes larger than the memory of a single machine. Each worker trains and infers only its part of the model, which requires less memory. Consequently, the model in its entirety is upheld collectively by all workers, necessitating constant communication among them during both the training and inference phases. The data is fed to the workers that maintain the input layer of the model, and each worker forwards its computed output to the worker holding the next part of the model. In the backpropagation step during training, the workers holding the output layer first compute the updates. The updates are then propagated to the workers in reverse order and applied. A central challenge within model parallelism lies in devising an effective strategy for partitioning a given model across multiple networked machines. This partitioning determines how the model segments are distributed among workers to optimize communication and computation while maintaining overall model coherence.
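As a toy illustration of this partitioning challenge, the following sketch greedily assigns contiguous layers to two workers while balancing parameter memory. The layer names and sizes are hypothetical, and the heuristic is deliberately simple; production partitioners additionally model activation traffic and per-layer compute time.

```python
# Greedy contiguous model partitioning by parameter memory (illustrative only).
layers = [("embed", 50e6), ("block1", 120e6), ("block2", 120e6),
          ("block3", 120e6), ("head", 90e6)]          # (name, parameter count)
n_workers = 2
budget = sum(p for _, p in layers) / n_workers        # ideal memory per worker

assignment, used, worker = {}, 0.0, 0
for name, params in layers:
    # Move on to the next worker once the current one would exceed its budget;
    # keeping the split contiguous keeps the forward/backward pipeline simple.
    if used + params > budget and worker < n_workers - 1:
        worker, used = worker + 1, 0.0
    assignment[name] = worker
    used += params
print(assignment)  # {'embed': 0, 'block1': 0, 'block2': 1, 'block3': 1, 'head': 1}
```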
Common methods for distributed ML, both data-parallel and model-parallel, are explained below.

C. PARAMETER SERVER
Parameter Server [287], [288] is a data-parallel Client-Server method (cf. Figure 9a). Here, multiple decentralized clients (workers) are connected to a centralized server (the parameter server). The parameter server stores the model parameters, assigns data to workers, and aggregates the updates received from workers. Often, the parameter server is a single machine, but it can also be a set of equivalent or hierarchically structured machines [289]. Each worker maintains an instance of the model and individually computes parameter updates based on its data. Typically, SGD is used for parameter optimization during training. The processed data can either be captured and stored on the worker machine or transmitted from the parameter server. Usually, workers access only (non-overlapping) portions of the data; the complete dataset is thus distributed across multiple workers. After processing a predefined number of data samples, the workers first propose their parameter updates to the parameter server and then receive the updated model. How many other workers have contributed to the updated model, however, depends on the Parameter Server implementation. In synchronous implementations, the parameter server considers updates from all workers, and the workers do not continue processing until the updated model has been broadcast. Therefore, the slowest worker significantly impacts the time for a model update. In contrast, in asynchronous implementations, the parameter server updates and broadcasts the model immediately after receiving an update from the sending worker. Here, workers proceed on different model instances. This is a problem in heterogeneous environments with different computing resources and transmission delays: slower workers working on outdated model instances can derange SGD's solution with their updates, causing the model to converge incorrectly. In homogeneous cluster environments, this is not the case, and the asynchronous variant is often faster than synchronous systems [290]. Since synchronous and asynchronous Parameter Server implementations struggle in heterogeneous environments, time-wise and model-quality-wise, respectively, Parameter Server is typically applied in data centers.
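The following minimal sketch imitates synchronous Parameter Server training in a single process; the toy quadratic loss and the in-memory function calls stand in for the gradient/parameter exchanges that a real deployment performs over the network.

```python
import numpy as np

# Single-process sketch of synchronous Parameter Server rounds (illustrative).
rng = np.random.default_rng(1)
params = np.zeros(10)                                 # global state on the server
shards = [rng.normal(loc=1.0, size=(100, 10)) for _ in range(4)]  # worker data

def local_gradient(theta, data):
    """Worker-side gradient of a toy quadratic loss ||theta - mean(data)||^2."""
    return 2 * (theta - data.mean(axis=0))

for _ in range(50):
    # Server broadcasts `params`; every worker answers with its local gradient.
    grads = [local_gradient(params, shard) for shard in shards]
    # Synchronous server: wait for *all* workers, aggregate, update, rebroadcast.
    params -= 0.1 * np.mean(grads, axis=0)
print(params[:3])  # ≈ 1.0: the optimum of the aggregated objective
# An asynchronous server would instead apply each gradient as it arrives,
# letting fast workers proceed on slightly stale model instances.
```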
D. FEDERATED LEARNING
Federated Learning (FL) [291] is another data-parallel Client-Server distributed ML method; it enables multiple devices to collaboratively train a shared model without sharing their raw data. This approach has gained significant attention in recent years due to its ability to protect user privacy and to enable learning on edge devices with limited computational resources.

In FL, multiple devices, such as smartphones, IoT devices, or edge servers, participate in the training process by locally training a model using their own data and then sending their updated model parameters to a central server. The server aggregates the updates from all devices and uses them to update the global model. Figure 9b shows an FL scenario with three connected devices and a central server. The key idea behind FL is that the global model is trained using a large amount of data from multiple devices, while each device only needs to share its model updates. This allows FL to approach the performance of traditional centralized learning while preserving user privacy.

One of the most widely used FL algorithms is Federated Averaging (FedAvg). FedAvg is designed to address several challenges that arise in FL, including the need to preserve data privacy, mitigate bias and inconsistency across devices, reduce communication overhead, and enable model convergence. FedAvg works by having each device train its own local model using its local data; the local models are then aggregated into a global model that is distributed back to the devices for further training. To address the challenge of bias and inconsistency across devices, FedAvg uses a weighted average of the local models, with the weights determined by the amount of data each device contributes. This ensures that each device's contribution is weighted appropriately, producing a more representative and robust global model.

By training a model locally, FL allows devices to make predictions and decisions without a constant network connection to a central server. This is particularly useful in applications such as autonomous vehicles, drones, and medical devices, where data needs to be processed in real-time. Additionally, FL is beneficial in scenarios where data is sensitive and cannot be shared, such as medical imaging or financial data. Another essential benefit of FL is its ability to handle data that is non-IID (i.e., not Independent and Identically Distributed), a common characteristic of data collected from networked devices.

In traditional centralized learning, data is often assumed to be IID, meaning that it has the same distribution across all devices. In practice, however, each device can have its own data distribution, which can lead to biased or suboptimal models. FL algorithms such as Federated Averaging [291], Federated Transfer Learning [292], and Federated Meta-Learning [293] have been proposed to address these issues.
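A minimal single-process sketch of FedAvg's weighted aggregation follows; the toy quadratic loss stands in for local neural-network training, and the in-memory loop stands in for the device-server communication.

```python
import numpy as np

def fedavg_round(global_w, device_data, local_steps=5, lr=0.1):
    """One FedAvg round: local SGD on every device, then a weighted average."""
    new_weights, sizes = [], []
    for data in device_data:                 # runs on each device in practice
        w = global_w.copy()
        for _ in range(local_steps):         # local SGD on a toy quadratic loss
            w -= lr * 2 * (w - data.mean(axis=0))
        new_weights.append(w)
        sizes.append(len(data))
    # Devices holding more data contribute proportionally more to the average.
    return np.average(new_weights, axis=0, weights=np.asarray(sizes, float))

rng = np.random.default_rng(2)
devices = [rng.normal(loc=mu, size=(n, 3))   # non-IID: per-device distributions
           for mu, n in [(0.0, 200), (1.0, 50), (2.0, 50)]]
w = np.zeros(3)
for _ in range(20):
    w = fedavg_round(w, devices)
print(w)  # ≈ [0.5, 0.5, 0.5]: the size-weighted mean of the device optima
```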
E. ALL-REDUCE
The All-Reduce approach [294] is a data-parallel distributed ML method for training ML models that implements the Peer-to-Peer concept. It therefore dispenses with a central server; instead, workers communicate directly. Which workers communicate with one another is specified by the communication topology used. Multiple communication topologies are possible for the All-Reduce approach, e.g., ring [295], butterfly [296], and tree [297] topologies. The communication topologies affect the data rate and latency of the network differently. In some cases, the topology also restricts access to the data set.

In principle, each worker maintains an instance of the model and individually computes updates on its assigned portion of the data. The data is usually distributed at the beginning of the training. After processing a predefined number of data samples, the workers communicate their local updates to all their peers. Shortly after, they receive the updates of their peers and aggregate them with their own. This step of communication and aggregation can be repeated several times. When all updates are distributed to all workers, each worker adjusts its model instance parameters according to the aggregated updates and proceeds to produce the next local updates. The repetitive and expensive communication of updates guarantees that all workers work with the same model instance [294]. Figure 9c illustrates the communication topology of a ring All-Reduce approach.
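The ring topology can be made concrete: classic ring All-Reduce runs a reduce-scatter phase followed by an all-gather phase, so that after 2·(N−1) steps every worker holds the complete aggregate while each link carries only one chunk per step. The following single-process simulation is illustrative only; real implementations ship the chunks between neighboring machines.

```python
import numpy as np

def ring_allreduce(grads):
    """Simulate ring All-Reduce: every worker ends up with the element-wise sum."""
    n = len(grads)
    chunks = [list(np.array_split(g.astype(float), n)) for g in grads]
    for s in range(n - 1):                    # phase 1: reduce-scatter
        for r in range(n):
            c = (r - s - 1) % n               # chunk received from neighbor r-1
            chunks[r][c] = chunks[r][c] + chunks[(r - 1) % n][c]
    for s in range(n - 1):                    # phase 2: all-gather
        for r in range(n):
            c = (r - s) % n                   # fully reduced chunk from r-1
            chunks[r][c] = chunks[(r - 1) % n][c].copy()
    return [np.concatenate(ch) for ch in chunks]

grads = [np.full(8, float(rank)) for rank in range(4)]   # one vector per worker
out = ring_allreduce(grads)
assert all(np.allclose(o, 0 + 1 + 2 + 3) for o in out)   # everyone holds the sum
```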
F. SPLIT LEARNING AND INFERENCE
Split Learning (SL) [298], [299] is a model-parallel distributed ML method that decouples model training from the need for direct access to the raw data; the model is split into at least two sub-models. It is similar to FL, but it focuses on the case where devices have low computational power, memory constraints, or a limited energy budget. In contrast to FL, where devices typically train a model locally and send the updated model parameters to a central server, in SL the devices only forward a feature representation of their data to the central server, which performs the model updates.

In SL, the model is split into at least two parts, with one part running on the device and the other part running on the central server. Figure 9d shows the SL representation with three devices. The key idea behind SL is that the device part of the model is lightweight and can be run on devices with low computational power, instead of the entire, computationally demanding model. Thus, SL enables model training and inference on devices with low computing resources.

SL over networked devices is particularly useful in scenarios where devices have low computational power but high communication bandwidth. For example, in a network of smartphones, each smartphone may have a camera that captures images, but the device may not have the computational power to process the images. SL can be used to train a model that can classify images without needing to process the images on the device.
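A minimal numerical sketch of SL follows, assuming a single hidden layer as the device-side head and a toy regression loss. Only the smashed activations and the cut-layer gradient would cross the network; labels are assumed to be shared with the server, as in vanilla SL.

```python
import numpy as np

rng = np.random.default_rng(3)
W_head = rng.normal(size=(16, 4)) * 0.1   # stays on the device
W_tail = rng.normal(size=(4, 1)) * 0.1    # stays on the server
x, y = rng.normal(size=(64, 16)), rng.normal(size=(64, 1))
lr = 0.05

for _ in range(200):
    smashed = np.tanh(x @ W_head)     # device forward; send `smashed` (and labels)
    pred = smashed @ W_tail           # server forward
    err = (pred - y) / len(x)         # server: gradient of a 0.5*MSE loss
    grad_smashed = err @ W_tail.T     # server returns the cut-layer gradient
    W_tail -= lr * smashed.T @ err    # server updates its sub-model
    W_head -= lr * x.T @ (grad_smashed * (1 - smashed**2))  # device update
```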
1) FEDERATED SPLIT LEARNING
Federated Split Learning (FSL) [300], [301] is a distributed algorithm that combines the idea of computing a weighted average, characteristic of the FL architecture, with the neural network split between client and server of the SL architecture. It thus combines data and model parallelism. In FSL, all clients compute in parallel and independently. They send/receive their smashed data to/from the server in parallel. The client-side sub-network synchronization, i.e., forming the global client-side network, is done by aggregating (e.g., weighted averaging) all local client-side networks on a separate server.
2) SPLIT COMPUTING
Splitting a neural network for inference tasks is usually called Split Computing (SC). It is very similar to SL, as a model is split into sub-models that are then distributed on multiple devices communicating with each other. It is helpful in scenarios where sensor devices are resource-limited and cannot deploy full models. Instead of offloading the sensor data, the sensor can compute a part of the model and then transmit the compressed feature representation, resulting in a smaller end-to-end latency [302].

Most works focus on a simple client-server scenario. The model is then split into a head and a tail part. The client, a sensor, gathers sensor data, feeds it into the head of the model, and then transmits the feature representation to the server. The server receives the feature representation and completes the inference process using the tail. In this client-server scenario, the main challenges are to minimize the head with regard to computation and size on the client, as sensors have limited resources, and to minimize the amount of communication while making sure that the model does not lose too much accuracy.
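The communication saving can be illustrated with hypothetical tensor shapes (the cut-layer shape below is an assumption, not taken from any specific model):

```python
import numpy as np

# Offloading a compressed feature instead of the raw input (illustrative sizes).
raw = np.zeros((3, 224, 224), dtype=np.uint8)        # e.g., a camera frame
feature = np.zeros((16, 14, 14), dtype=np.float16)   # hypothetical bottleneck output

print(f"raw: {raw.nbytes / 1e3:.0f} kB, feature: {feature.nbytes / 1e3:.1f} kB "
      f"({raw.nbytes / feature.nbytes:.0f}x less to transmit)")
# raw: 151 kB, feature: 6.3 kB (24x less to transmit)
# Over a 10 Mbit/s uplink this is roughly 120 ms vs. 5 ms of airtime per frame.
```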
Matsubara et al. [303] provide a comprehensive survey describing many proposed methods to optimize SC; it also contains links to code where available. With sc2bench [304], there is also a pip package to test and compare several SC techniques while providing a framework for creating your own method.
VII. FURTHER READINGS
Several related survey and tutorial papers exist that cover parts of the interplay between ML and networking, to a varying extent and on varying scales of granularity. Table 16 lists the most related of these papers while highlighting their ML scope, their covered network applications, and whether they focus on ML for Networks (ML4N) or Networks for ML (N4ML).

TABLE 16. Selective surveys & tutorials on using ML for Networks (ML4N) and Networks for ML (N4ML).

Perhaps the most comprehensive survey on ML for Networks, [308] discusses ML approaches for a wide range of networking challenges and provides further references to more specialized surveys about ML approaches in certain networking domains. The work of [276] considers itself an update to [308], covering more recent developments and discussing recent IDS datasets. Additionally, several surveys consider ML approaches for a subset of networked systems, such as vehicular networks in [316] and [317], Software-Defined Networks (SDN) in [307], mobile/wireless/ubiquitous networks in [4] and [279], edge computing in [305] and [306], or network traffic monitoring and analysis in [314]. The work of [309] takes a unique stance and covers the joint application of recent ML and Blockchain technologies to networking problems. Other surveys focus on specific ML subdomains such as unsupervised learning [29], deep learning [272], or distributed ML [310].

The work presented in [311] and [312] specifically considers the role of FL in networking. While [311] discusses several FL applications in the domain of communications and networking, [312] focuses on mobile edge computing but also discusses how communication techniques influence FL methods. The studies [285], [313] provide an overview of various applications of ML methods in IoT systems and analyze various approaches for distributing and processing ML models in the cloud-to-things continuum. The survey [284] discusses the convergence of edge computing methods and ML; specifically, it provides a comprehensive view of how networking can be utilized for cooperative processing of deep learning models on edge devices. The survey [303] provides insights into how networked devices such as smartphones and autonomous vehicles are used for collaborative training of ML models, and into inference operations over the network using split computing and early-exit methods.

Concerning the role of XAI in networking, the amount of survey work is limited. The work of [319] motivates the usage of XAI methods for networking challenges but only covers a single concrete problem. While there exist survey papers on XAI [320] and Explainable Reinforcement Learning (XRL) [321] in general (i.e., not limited to networking), to the best of our knowledge only [318] surveys XAI techniques in the domain of networking, namely in challenges related to wireless/6G.
VIII. CHALLENGES AND FUTURE DIRECTIONS
The adoption of ML in networks also brings forth several challenges and opens up exciting future directions for research and development, both for ML for Networks and for Networks for ML. In this section, we touch on some of these challenges, while we refer to [322] for further discussions on the limitations and challenges of ML in general and, more specifically, of applying ML for Networks [276], [323]. The following are some of the current challenges in ML for Networks:
• Scalability: One of the critical challenges in ML for Networks is scaling up models to handle large-scale networks with millions of nodes and edges. Most ML approaches are initially developed and tested on small-scale networks to better debug them and understand their effect on individual network components. However, making them work at scale is not always trivial, because large-scale network structures might lead to computation time explosions (as has been indicated, e.g., for SDN in [324]), especially for problems where global decisions are taken in a centralized manner.
• Limited data: Another challenge in ML for Networks is the limited amount of data available for training. Collecting and labeling network data is a time-consuming and costly process, and in some cases, data may be proprietary or sensitive, making it difficult to obtain.
• Interpretability: A further challenge is the lack of interpretability of ML models. In many cases, it is difficult to understand how a model arrived at a particular decision or prediction, making it challenging to debug or troubleshoot issues.
• Heterogeneous data: Networks often contain heterogeneous data from multiple sources, such as text, images, and numerical data. Incorporating this data into ML models and designing models that can effectively handle heterogeneous data is another challenge that requires further research.
• Robustness: ML models are vulnerable to attacks and adversarial examples, especially in network environments where data may be noisy or corrupted.
• Real-time decision-making in closed-loop systems: In many Network Control System (NCS) environments, decisions must be made in real-time, requiring efficient and fast ML model inference [325], [326]. Developing algorithms that can make accurate but fast decisions in real-time is a significant challenge in ML for Networks. One of the core problems is the potential for unstable system behavior caused by a mismatch between the intended NCS sampling time and the time required for inference of an ML model. As a result, input delays affect the resulting system and must be handled carefully [327]. Hence, there is a trade-off between large models that can handle large-scale networks (the scalability challenge) and the time required for their inference. In general, the inference time required by AI and ML models will be a non-trivial function of the resulting closed-loop system in which they are embedded. For RL, delays due to model inference can be explicitly included in the modeling, resulting in the notion of real-time MDPs and real-time RL algorithms [328]. Beyond cyber-physical closed-loop systems, model inference delay impacts user experience when prompt-driven LLM, IoT, or VR services are run via edge computing networks [329]. In other words, in these cases the system loop is closed via human feedback, where unstable behavior will eventually result in performance loss.
• Energy efficiency: ML models often require significant computational resources, which can be challenging in resource-constrained network environments. As the current trend points towards ever-increasing model scales, energy efficiency might become an even more important aspect in even more situations.
• Privacy and security: Networks can contain sensitive and private data, which requires ML algorithms to be developed with strong privacy and security safeguards. ML algorithms for networks must maintain data privacy while providing accurate predictions.
• Network complexity: Computer networks can be highly complex and dynamic, with large numbers of interconnected nodes, an interplay of various different protocols, and changing operating conditions. This makes it challenging to develop accurate ML models, since formulating ML problems for complex application domains, or for sub-problems where suitable training data is available, often requires several simplifying and/or narrowing assumptions at the start [330]. Leaving out such assumptions one by one brings ML systems closer to deployment in real-world scenarios, but this is often a non-trivial task that brings unexpected challenges at every step along the way.
On the other hand, the challenges related to Networks for ML include:
• Resource constraints: ML algorithms often require significant computational resources, including processing power, memory, and storage. Moreover, the training of ML models requires large amounts of data, and transferring this data across networks can be time-consuming and resource-intensive. This can be a challenge in resource-constrained networks, such as those in IoT devices and edge computing environments, or when specialized networking hardware disallows certain compute operations. In addition, storing data in a centralized location can create a bottleneck and security issues.
• Latency: Network latency can affect the performance of ML algorithms, particularly in real-time applications where decisions must be made quickly. High latency can lead to delays in data transmission and processing, which can negatively impact the accuracy and effectiveness of the algorithm [331].
• Bandwidth: ML algorithms often require large amounts of bandwidth to transfer data, which can be a challenge in networks with limited bandwidth. High bandwidth requirements can also lead to increased costs for network infrastructure in a real-world deployment.
• Network topology: The topology of a network can impact the performance of ML algorithms. For example, networks with high levels of congestion or interference may not be suitable for real-time applications.
• Privacy and security: ML algorithms require access to data, which can create potential privacy and security risks, increasing the risk of data breaches and cyber-attacks during transmission over the network or remote processing of user data.
• Heterogeneous resources: The computing and communication resources of the devices used for processing ML algorithms over the network may vary widely, leading to unstable training processes. Furthermore, this can lead to the presence of slower devices (stragglers) that slow down the training of a global model and affect the model's efficiency.
As mentioned earlier in Section VI, some of these challenges may overlap, such as privacy and security. Overall, ML for Networks and Networks for ML are rapidly growing fields with many challenges and opportunities for future research. Addressing these challenges will require collaboration between researchers from different disciplines. In the following sections, we discuss some of the trending applications that focus on these challenges.
A. A NEW PARADIGM FOR NEXT-GENERATION WIRELESS NETWORKS
The rapid advancement of AI and ML technologies has also opened up new vistas for next-generation wireless networks like 5G Advanced and 6G. These next-generation networks essentially serve two purposes: data transport and service delivery. They comprise various types of devices, from User Equipments (UEs), base stations, switches, and routers to servers in a data center. With the integration of SDN and Network Function Virtualization (NFV), all devices can now constantly adapt to new situations, such as changing traffic patterns, better function placements, or new service demands, and incorporate AI and ML [332]. These technologies promise to revolutionize the way we design and manage wireless networks, leading to the emergence of AI-native networks and AI-native air interfaces.

On the one hand, AI-native networks are networks designed with AI integration at their core, rather than as an afterthought or add-on. Hence, AI (partially) replaces human-defined rules, models, and algorithms, which may not be optimal or scalable for complex and dynamic wireless scenarios, so that these networks can learn, adapt, and optimize themselves autonomously and intelligently.

On the other hand, an AI-native air interface is an air interface that uses AI and ML to define and configure its physical and medium access control layer parameters, such as waveforms, constellations, pilots, coding, modulation, synchronization, channel estimation, equalization, detection, decoding, and access schemes [333].

One of the main challenges here is the complexity and heterogeneity of wireless networks. This complexity makes it difficult to collect, process, and analyze data in real-time [333]. However, this can be mitigated by using distributed AI engines, which can process data closer to the source and reduce latency. Another challenge is the lack of standardized frameworks and architectures for implementing AI in networks. To address this challenge, industry and academia collaborate to develop standardized AI frameworks and tools that can be used across different networks [334], [335]. There are four aspects to addressing this challenge [336]:

1) DATA INFRASTRUCTURE
A distributed data infrastructure that can handle massive amounts of varied, distributed, and dynamic data, and that enables data ingestion, processing, and exposure across layers and domains.

2) INTELLIGENCE EVERYWHERE
A comprehensive and automated management of AI models, from training to deployment to monitoring, and the ability to handle model drift, retraining, and versioning. This would take place at every network layer and on every network device.

3) ZERO TOUCH
A high degree of automation and autonomy for the management of AI and data, and the ability to express and supervise high-level goals rather than low-level actions.

4) AI AS A SERVICE
The exposure of AI and data services to external parties, such as service providers or customers, and the creation of a platform for innovation and collaboration.

For further reading on the evaluation metrics of such networks, we refer to [337]. The authors in [338] also provide a roadmap with potential frameworks for building such networks.
B. DEEP NEURAL NETWORK MODEL COMPLEXITY AND ENERGY CONSUMPTION
The increasing complexity of DNNs has direct implications for energy consumption, a critical factor in both environmental sustainability and practical deployment [339]. The complexity of DNNs is largely driven by the depth and breadth of the network architecture. As DNNs grow deeper (with more layers) and wider (with more neurons in each layer), they can capture more intricate patterns in data. This increased capacity, while potentially beneficial for model accuracy, leads to a higher number of computations during both the training and inference phases [284]. Each computation requires a certain amount of energy, and thus, as models grow more complex, their energy requirements escalate.

The energy consumption of DNNs is a multifaceted issue. Training DNNs is an energy-intensive process that requires substantial computational resources [340]. This phase often necessitates the use of high-performance GPUs or even clusters of GPUs, which are power-hungry devices [341]. The electricity consumption during this phase is considerable, contributing to the overall energy footprint of developing DNNs. The inference phase, where DNNs make predictions on new data, also demands a considerable amount of energy [342]. This phase is critical in real-world applications where continuous or on-demand operation of DNNs is required, such as in autonomous systems or real-time analysis applications.

The substantial energy consumption of DNNs poses a significant challenge for environmental sustainability. As these networks become more prevalent across various sectors, the need for energy-efficient neural network architectures and training methods becomes increasingly important [343]. In energy-constrained environments (e.g., with battery-operated devices), the energy demands of DNNs are a crucial consideration. This has led to a focus on balancing model complexity with energy efficiency, driving innovation in optimization techniques and the development of specialized hardware to run these models more efficiently [344]. Moreover, different models and benchmarks are used to estimate and plan the energy consumption of DNNs [345], [346], [347].
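A crude way to reason about this is to multiply estimated training FLOPs by an assumed accelerator efficiency. Every number in the following sketch is an illustrative assumption rather than a benchmark result; see [345], [346], [347] for proper estimation models.

```python
# Back-of-the-envelope training-energy sketch (all values are assumptions).
params = 1e9                   # hypothetical 1B-parameter dense model
samples = 20e9                 # tokens/samples processed during training
flops = 6 * params * samples   # common rule of thumb: ~6 FLOPs/parameter/sample
flops_per_joule = 1e11         # assumed effective GPU efficiency incl. overheads

energy_kwh = flops / flops_per_joule / 3.6e6
print(f"~{energy_kwh:.0f} kWh")  # ~333 kWh for this toy configuration
```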
C. TINY MACHINE LEARNING
Tiny Machine Learning (TinyML) is an emerging field that combines ML with ultra-low-power computing, typically found in microcontrollers and small IoT devices [348]. Its goal is to deploy efficient ML models that can operate in environments with limited memory, processing power, and energy. This is particularly relevant for applications where traditional ML models would be impractical due to their size and energy requirements.

The primary motivation for TinyML is the need for localized data processing, especially in situations where privacy, speed, and power efficiency are critical, rather than transmitting the data to a centralized server or cloud [349]. This applies to many applications, spanning from smart home devices and wearable technology to healthcare monitoring and environmental sensors [285].

The core implementation of TinyML relies on ML model quantization, which reduces a model's numerical precision and size. Nevertheless, implementing TinyML in environments with limited resources presents several ongoing challenges: the low computational capabilities and storage capacities of smaller devices restrict the complexity of the models that can be deployed [350]. This constraint can adversely affect the efficacy and precision of TinyML-based applications. To address this, some research suggests the integration of cooperative ML (Section VI) and TinyML approaches [342], [351]. This strategy would enable devices with constrained resources to work collaboratively on ML tasks. Moreover, progress in hardware development, particularly in creating more efficient microcontrollers and sensors, is expected to broaden the range of possible applications for TinyML. For a recent survey of TinyML applications and techniques, we refer to [352].
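A minimal sketch of the underlying idea, using per-tensor affine quantization from float32 to 8-bit integers; real TinyML toolchains add calibration data, per-channel scales, and quantized operators.

```python
import numpy as np

# Post-training affine quantization: map float32 weights to uint8 (~4x smaller).
w = np.random.default_rng(4).normal(scale=0.2, size=1000).astype(np.float32)

scale = (w.max() - w.min()) / 255.0            # one (scale, zero_point) per tensor
zero_point = np.round(-w.min() / scale)
q = np.clip(np.round(w / scale + zero_point), 0, 255).astype(np.uint8)
w_hat = (q.astype(np.float32) - zero_point) * scale   # dequantize to check error

print(f"size: {w.nbytes} B -> {q.nbytes} B, "
      f"max abs error: {np.abs(w - w_hat).max():.5f}")  # error ≈ scale / 2
```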
IX. CONCLUSION
The aim of this paper is to provide interested but inexperienced readers with an inspiring and practical jumpstart for research at the intersection of ML and computer networking. This encompasses not only the creation of novel ML-powered solutions for the covered networking scenarios but also leveraging established networking technology to enhance existing ML approaches.

Compared to the aforementioned surveys and tutorials (Section VII), we are the first to provide a comprehensive bidirectional overview of ML and XAI techniques across different networking fields, and vice versa.44 Furthermore, in addition to an overview of the current state of the art, our work provides practical guidance for aspiring researchers to shortcut their way into meaningful research:
• Many of the mentioned related papers do not consider datasets and/or starting points to reproduce the results or even to just start experimenting. In contrast, we refer to publicly available datasets as well as to methods and tools to generate synthetic datasets (Section IV) and to design ML models suitable for the respective task.
• We categorize existing approaches as ML serving networks (ML4N) and networks serving ML (N4ML) based on the metrics used, which helps to identify research gaps and possible future directions of research.
We introduced the most popular ML techniques, model types, and tools, as well as several practical aspects to consider when practicing ML, such as obtaining high-quality data for the learning algorithm or incorporating inductive biases (more specifically, for networking data and network topologies) into ML models in order to reduce resource requirements. Secondly, we introduced the most common computer networking problem domains and pointed to existing tools and datasets to accelerate and facilitate ML research on networking problems. Thirdly, we introduced how XAI methods can improve the transparency of ML models' decisions and thus push their acceptance in the computer networks research domain and their suitability in productive environments. We also elaborated on how networking techniques can boost the performance of existing ML setups and workflows, e.g., through several approaches for distributed learning.

Lastly, we provided a large number of pointers for further reading, such as surveys on more specific ML/networking domains, example research works for some of the problems introduced in this paper, and links to many of the mentioned datasets and tools.

Despite our comprehensive coverage of established tools, approaches, and recent breakthroughs, it is important to acknowledge the dynamic nature of ML research. The field is characterized by the emergence of new algorithms, the potential availability of additional tools and features in the future, and the hopeful prospect of more open-sourced datasets. While this evolution is happening at an unprecedented pace, this paper still serves as a valuable starting point for researchers and newcomers alike and provides a timely and relevant contribution to the intersection of the fields of ML and computer networking.

44 We do not aim for a comprehensive review of state-of-the-art research in ML or its sub-disciplines, as there are numerous survey and tutorial resources that provide an excellent ML-focused overview. Rather, we view ML techniques solely in relation to networking, either as facilitators (ML for Networks) or beneficiaries (Networks for ML).
ACKNOWLEDGMENT
The authors alone are responsible for the content. This work is a result of a cooperation and continuous knowledge exchange between participants of the MaLeNe Workshop 2022.
REFERENCES
[1] J. M. Stokes, K. Yang, K. Swanson, W. Jin, A. Cubillos-Ruiz, N. M. Donghia, C. R. MacNair, S. French, L. A. Carfrae, Z. Bloom-Ackermann, V. M. Tran, A. Chiappino-Pepe, A. H. Badran, I. W. Andrews, E. J. Chory, G. M. Church, E. D. Brown, T. S. Jaakkola, R. Barzilay, and J. J. Collins, ''A deep learning approach to antibiotic discovery,'' Cell, vol. 180, no. 4, pp. 688–702, Feb. 2020. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0092867420301021
[2] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, ''High-resolution image synthesis with latent diffusion models,'' in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2022, pp. 10674–10685.
[3] A. Davies, P. Veličković, L. Buesing, S. Blackwell, D. Zheng, N. Tomašev, R. Tanburn, P. Battaglia, C. Blundell, A. Juhász, M. Lackenby, G. Williamson, D. Hassabis, and P. Kohli, ''Advancing mathematics by guiding human intuition with AI,'' Nature, vol. 600, no. 7887, pp. 70–74, Dec. 2021. [Online]. Available: https://www.nature.com/articles/s41586-021-04086-x
[4] M. Chen, U. Challita, W. Saad, C. Yin, and M. Debbah, ''Artificial neural networks-based machine learning for wireless networks: A tutorial,'' IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3039–3071, 4th Quart., 2019.
[5] S. J. Russell, Artificial Intelligence: A Modern Approach. London, U.K.: Pearson, 2010.
[6] M. F. A. Fauzi, R. Nordin, N. F. Abdullah, and H. A. H. Alobaidy, ''Mobile network coverage prediction based on supervised machine learning algorithms,'' IEEE Access, vol. 10, pp. 55782–55793, 2022.
[7] C. Ioannou and V. Vassiliou, ''Classifying security attacks in IoT networks using supervised learning,'' in Proc. 15th Int. Conf. Distrib. Comput. Sensor Syst. (DCOSS), May 2019, pp. 652–658.
[8] W. Hu, Y. Liao, and R. Vemuri, ''Robust anomaly detection using support vector machines,'' in Proc. Int. Conf. Mach. Learn., Jun. 2003, pp. 282–289.
[9] B. Mohammed, I. Awan, H. Ugail, and M. Younas, ''Failure prediction using machine learning in a virtualised HPC system and application,'' Cluster Comput., vol. 22, no. 2, pp. 471–485, Jun. 2019.
[10] M. A. Hearst, S. T. Dumais, E. Osuna, J. Platt, and B. Scholkopf, ''Support vector machines,'' IEEE Intell. Syst. Appl., vol. 13, no. 4, pp. 18–28, Aug. 1998.
[11] S. B. Kotsiantis, ''Decision trees: A recent overview,'' Artif. Intell. Rev., vol. 39, no. 4, pp. 261–283, Apr. 2013.
[12] L. Breiman, ''Random forests,'' Mach. Learn., vol. 45, pp. 5–32, Oct. 2001.
[13] G. Shakhnarovich, T. Darrell, and P. Indyk, ''Nearest-neighbor methods in learning and vision,'' IEEE Trans. Neural Netw., vol. 19, no. 2, p. 377, Feb. 2008.
[14] M. Nasri and M. Hamdi, ''LTE QoS parameters prediction using multivariate linear regression algorithm,'' in Proc. 22nd Conf. Innov. Clouds, Internet Netw. Workshops (ICIN), Feb. 2019, pp. 145–150.
[15] A. Y. Nikravesh, S. A. Ajila, C.-H. Lung, and W. Ding, ''Mobile network traffic prediction using MLP, MLPWD, and SVM,'' in Proc. IEEE Int. Congr. Big Data (BigData Congr.), Jun. 2016, pp. 402–409.
[16] A. J. Smola and B. Schölkopf, ''A tutorial on support vector regression,'' Statist. Comput., vol. 14, no. 3, pp. 199–222, Aug. 2004.
[17] C.-Y. Hsu, P.-Y. Chen, S. Lu, S. Liu, and C.-M. Yu, ''Adversarial examples can be effective data augmentation for unsupervised machine learning,'' in Proc. AAAI Conf. Artif. Intell., 2021, pp. 6926–6934.
[18] D. Kim and J. Choi, ''Unsupervised representation learning for binary networks by joint classifier learning,'' 2021, arXiv:2110.08851.
[19] E. Schubert, J. Sander, M. Ester, H. P. Kriegel, and X. Xu, ''DBSCAN revisited, revisited: Why and how you should (still) use DBSCAN,'' ACM Trans. Database Syst., vol. 42, no. 3, pp. 1–21, Sep. 2017.
[20] J. Li, H. Izakian, W. Pedrycz, and I. Jamal, ''Clustering-based anomaly detection in multivariate time series data,'' Appl. Soft Comput., vol. 100, Mar. 2021, Art. no. 106919.
[21] I. Ullah and H. Y. Youn, ''Task classification and scheduling based on K-means clustering for edge computing,'' Wireless Pers. Commun., vol. 113, no. 4, pp. 2611–2624, Aug. 2020.
[22] Z. Fan and R. Liu, ''Investigation of machine learning based network traffic classification,'' in Proc. Int. Symp. Wireless Commun. Syst. (ISWCS), Aug. 2017, pp. 1–6.
[23] R. Bellman, ''Dynamic programming,'' Science, vol. 153, no. 3731, pp. 34–37, 1966.
[24] H. Abdi and L. J. Williams, ''Principal component analysis,'' WIREs Comput. Statist., vol. 2, no. 4, pp. 433–459, Jul./Aug. 2010.
[25] C. Fefferman, S. Mitter, and H. Narayanan, ''Testing the manifold hypothesis,'' J. Amer. Math. Soc., vol. 29, no. 4, pp. 983–1049, Feb. 2016.
[26] U. Narayanan, A. Unnikrishnan, V. Paul, and S. Joseph, ''A survey on various supervised classification algorithms,'' in Proc. Int. Conf. Energy, Commun., Data Anal. Soft Comput. (ICECDS), Aug. 2017, pp. 2118–2124.
[27] J. E. van Engelen and H. H. Hoos, ''A survey on semi-supervised learning,'' Mach. Learn., vol. 109, no. 2, pp. 373–440, Feb. 2020, doi: 10.1007/s10994-019-05855-6.
[28] M. A. Alsheikh, S. Lin, D. Niyato, and H.-P. Tan, ''Machine learning in wireless sensor networks: Algorithms, strategies, and applications,'' IEEE Commun. Surveys Tuts., vol. 16, no. 4, pp. 1996–2018, 4th Quart., 2014.
[29] M. Usama, J. Qadir, A. Raza, H. Arif, K. A. Yau, Y. Elkhatib, A. Hussain, and A. Al-Fuqaha, ''Unsupervised machine learning for networking: Techniques, applications and research challenges,'' IEEE Access, vol. 7, pp. 65579–65615, 2019.
[30] Z. Ghahramani, ''Probabilistic machine learning and artificial intelligence,'' Nature, vol. 521, no. 7553, pp. 452–459, May 2015. [Online]. Available: https://www.nature.com/articles/nature14541
[31] K. P. Murphy, Probabilistic Machine Learning: An Introduction. Cambridge, MA, USA: MIT Press, 2022. [Online]. Available: https://probml.github.io/pml-book/book1.html
[32] X. Liu, F. Zhang, Z. Hou, L. Mian, Z. Wang, J. Zhang, and J. Tang, ''Self-supervised learning: Generative or contrastive,'' IEEE Trans. Knowl. Data Eng., vol. 35, no. 1, pp. 857–876, Jan. 2023.
[33] F. Ebert, C. Finn, A. X. Lee, and S. Levine, ''Self-supervised visual planning with temporal skip connections,'' in Proc. Conf. Robot Learn., 2017, pp. 1–13.
[34] S. Meyn, Control Systems and Reinforcement Learning. Cambridge, U.K.: Cambridge Univ. Press, 2022.
[35] Y. Xu, G. Gui, H. Gacanin, and F. Adachi, ''A survey on resource allocation for 5G heterogeneous networks: Current research, future trends, and challenges,'' IEEE Commun. Surveys Tuts., vol. 23, no. 2, pp. 668–695, 2nd Quart., 2021.
[36] M. M. Sadeeq, N. M. Abdulkareem, S. R. M. Zeebaree, D. M. Ahmed, A. S. Sami, and R. R. Zebari, ''IoT and cloud computing issues, challenges and opportunities: A review,'' Qubahan Academic J., vol. 1, no. 2, pp. 1–7, Mar. 2021.
[37] P. Kumar and R. Kumar, ''Issues and challenges of load balancing techniques in cloud computing: A survey,'' ACM Comput. Surveys, vol. 51, no. 6, pp. 1–35, Nov. 2019.
[38] A. Alwarafy, M. Abdallah, B. S. Ciftler, A. Al-Fuqaha, and M. Hamdi, ''Deep reinforcement learning for radio resource allocation and management in next generation heterogeneous wireless networks: A survey,'' 2021, arXiv:2106.00574.
[39] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA, USA: MIT Press, 2018.
[40] D. Bertsekas, Dynamic Programming and Optimal Control, vol. 2. Nashua, NH, USA: Athena Scientific, 2012.
[41] D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, vol. 5. Belmont, MA, USA: Athena Scientific, 1996.
[42] T. M. Moerland, J. Broekens, A. Plaat, and C. M. Jonker, ''Model-based reinforcement learning: A survey,'' Found. Trends Mach. Learn., vol. 16, no. 1, pp. 1–118, 2023.
[43] Y.-P. Hsu, E. Modiano, and L. Duan, ''Age of information: Design and analysis of optimal scheduling algorithms,'' in Proc. IEEE Int. Symp. Inf. Theory (ISIT), Jun. 2017, pp. 561–565.
[44] Q. Sykora, M. Ren, and R. Urtasun, ''Multi-agent routing value iteration network,'' in Proc. Int. Conf. Mach. Learn., 2020, pp. 9300–9310.
[45] S. S. Mwanje, L. C. Schmelz, and A. Mitschele-Thiel, ''Cognitive cellular networks: A Q-learning framework for self-organizing networks,'' IEEE Trans. Netw. Service Manage., vol. 13, no. 1, pp. 85–98, Mar. 2016.
[46] Y. Kim, S. Kim, and H. Lim, ''Reinforcement learning based resource management for network slicing,'' Appl. Sci., vol. 9, no. 11, p. 2361, Jun. 2019.
[47] H. Afifi and H. Karl, ''Reinforcement learning for virtual network embedding in wireless sensor networks,'' in Proc. 16th Int. Conf. Wireless Mobile Comput., Netw. Commun. (WiMob), Oct. 2020, pp. 123–128.
[48] A. Geramifard, ''A tutorial on linear function approximators for dynamic programming and reinforcement learning,'' Found. Trends Mach. Learn., vol. 6, no. 4, pp. 375–451, 2013.
[49] R. S. Sutton, D. McAllester, S. Singh, and Y. Mansour, ''Policy gradient methods for reinforcement learning with function approximation,'' in Proc. Adv. Neural Inf. Process. Syst., vol. 12, 1999, pp. 1–12.
[50] R. J. Williams, ''Simple statistical gradient-following algorithms for connectionist reinforcement learning,'' in Reinforcement Learning. Boston, MA, USA: Springer, 1992, pp. 5–32.
[51] I. Grondman, L. Busoniu, G. A. D. Lopes, and R. Babuska, ''A survey of actor-critic reinforcement learning: Standard and natural policy gradients,'' IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 42, no. 6, pp. 1291–1307, Nov. 2012.
[52] V. Mnih, ''Asynchronous methods for deep reinforcement learning,'' in Proc. Int. Conf. Mach. Learn., 2016, pp. 1928–1937.
[53] H. Mao, M. Alizadeh, I. Menache, and S. Kandula, ''Resource management with deep reinforcement learning,'' in Proc. 15th ACM Workshop Hot Topics Netw., Nov. 2016, pp. 50–56.
[54] C. Zhong, Z. Lu, M. C. Gursoy, and S. Velipasalar, ''A deep actor-critic reinforcement learning framework for dynamic multichannel access,'' IEEE Trans. Cognit. Commun. Netw., vol. 5, no. 4, pp. 1125–1139, Dec. 2019.
[55] S. Tuli, S. Ilager, K. Ramamohanarao, and R. Buyya, ''Dynamic scheduling for stochastic edge-cloud computing environments using A3C learning and residual recurrent neural networks,'' IEEE Trans. Mobile Comput., vol. 21, no. 3, pp. 940–954, Mar. 2022.
[56] M. Chen, T. Wang, K. Ota, M. Dong, M. Zhao, and A. Liu, ''Intelligent resource allocation management for vehicles network: An A3C learning approach,'' Comput. Commun., vol. 151, pp. 485–494, Feb. 2020.
[57] S. Still and D. Precup, ''An information-theoretic approach to curiosity-driven reinforcement learning,'' Theory Biosciences, vol. 131, no. 3, pp. 139–148, Sep. 2012.
[58] Y. Burda, H. Edwards, A. Storkey, and O. Klimov, ''Exploration by random network distillation,'' 2018, arXiv:1810.12894.
[59] M. L. Littman, ''Markov games as a framework for multi-agent reinforcement learning,'' in Proc. Mach. Learn., Jan. 1994, pp. 157–163.
[60] T. Gabel, ''Multi-agent reinforcement learning approaches for distributed job shop scheduling problems,'' Ph.D. dissertation, Dept. Math. Comput. Sci., Osnabrück Univ., Osnabrück, Germany, 2009.
[61] L. Canese, G. C. Cardarilli, L. Di Nunzio, R. Fazzolari, D. Giardino, M. Re, and S. Spanò, ''Multi-agent reinforcement learning: A review of challenges and applications,'' Appl. Sci., vol. 11, no. 11, p. 4948, May 2021.
[62] T. Li, K. Zhu, N. C. Luong, D. Niyato, Q. Wu, Y. Zhang, and B. Chen, ''Applications of multi-agent reinforcement learning in future internet: A comprehensive survey,'' IEEE Commun. Surveys Tuts., vol. 24, no. 2, pp. 1240–1279, 2nd Quart., 2022.
[63] E. Altman, Constrained Markov Decision Processes, vol. 7. Boca Raton, FL, USA: CRC Press, 1999.
[64] S. Gu, L. Yang, Y. Du, G. Chen, F. Walter, J. Wang, Y. Yang, and A. Knoll, ''A review of safe reinforcement learning: Methods, theory and applications,'' 2022, arXiv:2205.10330.
[65] A. Avranas, M. Kountouris, and P. Ciblat, ''Deep reinforcement learning for resource constrained multiclass scheduling in wireless networks,'' 2020, arXiv:2011.13634.
[66] S. Khairy, P. Balaprakash, L. X. Cai, and Y. Cheng, ''Constrained deep reinforcement learning for energy sustainable multi-UAV based random access IoT networks with NOMA,'' 2020, arXiv:2002.00073.
[67] C. Sun, C. She, and C. Yang, ''Unsupervised deep learning for optimizing wireless systems with instantaneous and statistic constraints,'' 2020, arXiv:2006.01641.
[68] Constrained Unsupervised Learning for Wireless Network Optimization. Cambridge, U.K.: Cambridge Univ. Press, 2022, pp. 182–211.
[69] D. Wu, L. Deng, Z. Liu, Y. Zhang, and Y. S. Han, ''Reinforcement learning random access for delay-constrained heterogeneous wireless networks: A two-user case,'' in Proc. IEEE Globecom Workshops (GC Wkshps), Dec. 2021, pp. 1–7.
[70] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. Cambridge, MA, USA: MIT Press, 2016. [Online]. Available: http://www.deeplearningbook.org
[71] A. Krizhevsky, I. Sutskever, and G. E. Hinton, ''ImageNet classification with deep convolutional neural networks,'' Commun. ACM, vol. 60, no. 6, pp. 84–90, May 2017, doi: 10.1145/3065386.
[72] T. B. Brown, ''Language models are few-shot learners,'' in Proc. NIPS, 2020, pp. 1877–1901. [Online]. Available: https://proceedings.neurips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html
[73] W. Mcculloch and W. Pitts, ''A logical calculus of the ideas immanent in nervous activity,'' Bull. Math. Biol., vol. 52, nos. 1–2, pp. 99–115, 1990.
[74] S. Sharma, S. Sharma, and A. Athaiya, ''Activation functions in neural networks,'' Towards Data Sci., vol. 6, no. 12, pp. 310–316, 2017.
[75] K. Hornik, M. Stinchcombe, and H. White, ''Multilayer feedforward networks are universal approximators,'' Neural Netw., vol. 2, no. 5, pp. 359–366, Jan. 1989. [Online]. Available: https://www.sciencedirect.com/science/article/pii/0893608089900208
[76] R. Hecht-Nielsen, ''Theory of the backpropagation neural network,'' in Neural Networks for Perception. Amsterdam, The Netherlands: Elsevier, 1992, pp. 65–93.
[77] D. P. Kingma and J. Ba, ''Adam: A method for stochastic optimization,'' 2014, arXiv:1412.6980.
[78] G. Lan, First-Order and Stochastic Optimization Methods for Machine Learning, vol. 1. Cham, Switzerland: Springer, 2020.
[79] M. M. Bronstein, J. Bruna, T. Cohen, and P. Veličković, ''Geometric deep learning: Grids, groups, graphs, geodesics, and gauges,'' 2021, arXiv:2104.13478.
[80] R. Eldan and O. Shamir, ''The power of depth for feedforward neural networks,'' in Proc. Conf. Learn. Theory, 2016, pp. 907–940.
[81] T. Gruber, S. Cammerer, J. Hoydis, and S. T. Brink, ''On deep learning-based channel decoding,'' in Proc. 51st Annu. Conf. Inf. Sci. Syst. (CISS), Mar. 2017, pp. 1–6.
[82] H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, ''Learning to optimize: Training deep neural networks for wireless resource management,'' in Proc. IEEE 18th Int. Workshop Signal Process. Adv. Wireless Commun. (SPAWC), Jul. 2017, pp. 1–6.
[83] T. A. Tang, L. Mhamdi, D. McLernon, S. A. R. Zaidi, and M. Ghogho, ''Deep learning approach for network intrusion detection in software defined networking,'' in Proc. Int. Conf. Wireless Netw. Mobile Commun. (WINCOM), Oct. 2016, pp. 258–263.
[84] S. Hochreiter and J. Schmidhuber, ''Long short-term memory,'' Neural Comput., vol. 9, no. 8, pp. 1735–1780, Nov. 1997.
[85] K. Cho, B. van Merrienboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, ''Learning phrase representations using RNN encoder–decoder for statistical machine translation,'' in Proc.
[86] P. Veličković, ''Everything is connected: Graph neural networks,'' 2023, arXiv:2301.08210.
[87] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, ''Generative adversarial nets,'' in Proc. Adv. Neural Inf. Process. Syst., Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K. Weinberger, Eds., vol. 27. Red Hook, NY, USA: Curran Associates, 2014, pp. 2672–2680. [Online]. Available: https://proceedings.neurips.cc/paperfiles/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf
[88] C. Han, H. Hayashi, L. Rundo, R. Araki, W. Shimoda, S. Muramatsu, Y. Furukawa, G. Mauri, and H. Nakayama, ''GAN-based synthetic brain MR image generation,'' in Proc. IEEE 15th Int. Symp. Biomed. Imag. (ISBI), Apr. 2018, pp. 734–738.
[89] Y. Chen, Y. Pan, T. Yao, X. Tian, and T. Mei, ''Mocycle-GAN: Unpaired video-to-video translation,'' in Proc. 27th ACM Int. Conf. Multimedia, Oct. 2019, pp. 647–655.
[90] J. Kong, J. Kim, and J. Bae, ''HiFi-GAN: Generative adversarial networks for efficient and high fidelity speech synthesis,'' in Proc. Adv. Neural Inf. Process. Syst., vol. 33, 2020, pp. 17022–17033.
[91] A. Cheng, ''PAC-GAN: Packet generation of network traffic using generative adversarial networks,'' in Proc. IEEE 10th Annu. Inf. Technol., Electron. Mobile Commun. Conf. (IEMCON), Oct. 2019, pp. 0728–0734.
[92] D. Bahdanau, K. Cho, and Y. Bengio, ''Neural machine translation by jointly learning to align and translate,'' 2014, arXiv:1409.0473.
[93] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, ''Attention is all you need,'' in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–14.
[94] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, ''BERT: Pre-training of deep bidirectional transformers for language understanding,'' 2018, arXiv:1810.04805.
[95] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, ''An image is worth 16×16 words: Transformers for image recognition at scale,'' 2020, arXiv:2010.11929.
[96] C. Joshi, ''Transformers are graph neural networks,'' in The Gradient, vol. 12, 2020. Accessed: Apr. 12, 2024. [Online]. Available: https://thegradient.pub/transformers-are-graph-neural-networks/
[97] D. K. Kholgh and P. Kostakos, ''PAC-GPT: A novel approach to generating synthetic network traffic with GPT-3,'' IEEE Access, vol. 11, pp. 114936–114951, 2023.
[98] N. Ziems, G. Liu, J. Flanagan, and M. Jiang, ''Explaining tree model decisions in natural language for network intrusion detection,'' 2023, arXiv:2310.19658.
[99] T. Ali and P. Kostakos, ''HuntGPT: Integrating machine learning-based anomaly detection and explainable AI with large language models (LLMs),'' 2023, arXiv:2309.16021.
[100] S. K. Mani, Y. Zhou, K. Hsieh, S. Segarra, T. Eberl, E. Azulai, I. Frizler, R. Chandra, and S. Kandula, ''Enhancing network management using code generated by large language models,'' in Proc. 22nd ACM Workshop Hot Topics Netw., Nov. 2023, pp. 196–204.
[101] Y. Huang, H. Du, X. Zhang, D. Niyato, J. Kang, Z. Xiong, S. Wang, and T. Huang, ''Large language models for networking: Applications, enabling techniques, and challenges,'' 2023, arXiv:2311.17474.
[102] J. Sun, Q. V. Liao, M. Müller, M. Agarwal, S. Houde, K. Talamadupula, and J. D. Weisz, ''Investigating explainability of generative AI for code through scenario-based design,'' in Proc. 27th Int. Conf. Intell. User Interface. New York, NY, USA: Association for Computing Machinery, Mar. 2022, pp. 212–228, doi: 10.1145/3490099.3511119.
[103] Y. Gao, Y. Xiong, X. Gao, K. Jia, J. Pan, Y. Bi, Y. Dai, J. Sun, M. Wang, and H. Wang, ''Retrieval-augmented generation for large language models: A survey,'' 2023, arXiv:2312.10997.
[104] Cisco. (2023). Cisco Unveils Next-Gen Solutions That Empower Security and Productivity With Generative AI. [Online]. Available: https://newsroom.cisco.com/c/r/newsroom/en/us/a/y2023/m06/cisco-unveils-next-gen-solutions-that-empower-security-and-productivity-with-generative-ai.html
[105] Juniper. (2023). AI for IT Operations (AIOps). [Online]. Available: https://www.juniper.net/us/en/solutions/artificial-intelligence-for-it-operations-aiops.html
[106] O. Santos. (2023). Securing AI: Navigating the Complex Landscape of
Conf. Empirical Methods Natural Lang. Process. (EMNLP), 2014, Models, Fine-Tuning, and Rag. [Online]. Available: https://blogs.cisco.
pp. 1724–1734. [Online]. Available: http://aclweb.org/anthology/D14- com/security/securing-ai-navigating-the-complex-landscape-of-models-
1179 fine-tuning-and-rag
[107] Cisco Systems, Inc. (2023). Cisco AI Assistant. [Online]. Available: https://www.cisco.com/site/us/en/solutions/artificial-intelligence/ai-assistant/index.html
[108] S. Thrun and A. Schwartz, "Issues in using function approximation for reinforcement learning," in Proc. 4th Connectionist Models Summer School, vol. 255, 1993, p. 263.
[109] V. Mnih et al., "Human-level control through deep reinforcement learning," Nature, vol. 518, pp. 529–533, Feb. 2015.
[110] T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning," 2015, arXiv:1509.02971.
[111] A. Ramaswamy, S. Bhatnagar, and N. Saxena, "A framework for provably stable and consistent training of deep feedforward networks," 2023, arXiv:2305.12125.
[112] A. Tampuu, T. Matiisen, D. Kodelja, I. Kuzovkin, K. Korjus, J. Aru, J. Aru, and R. Vicente, "Multiagent cooperation and competition with deep reinforcement learning," PLoS ONE, vol. 12, no. 4, Apr. 2017, Art. no. e0172395.
[113] R. Lowe, Y. I. Wu, A. Tamar, J. Harb, P. Abbeel, and I. Mordatch, "Multi-agent actor-critic for mixed cooperative-competitive environments," in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–21.
[114] A. Redder, A. Ramaswamy, and H. Karl, "3DPG: Distributed deep deterministic policy gradient algorithms for networked multi-agent systems," 2022, arXiv:2201.00570.
[115] C. Qiu, H. Yao, F. R. Yu, F. Xu, and C. Zhao, "Deep Q-learning aided networking, caching, and computing resources allocation in software-defined satellite-terrestrial networks," IEEE Trans. Veh. Technol., vol. 68, no. 6, pp. 5871–5883, Jun. 2019.
[116] S. Schneider, R. Khalili, A. Manzoor, H. Qarawlus, R. Schellenberg, H. Karl, and A. Hecker, "Self-learning multi-objective service coordination using deep reinforcement learning," IEEE Trans. Netw. Service Manage., vol. 18, no. 3, pp. 3829–3842, Sep. 2021.
[117] A. Redder, A. Ramaswamy, and D. E. Quevedo, "Deep reinforcement learning for scheduling in large-scale networked control systems," IFAC-PapersOnLine, vol. 52, no. 20, pp. 333–338, 2019.
[118] H. Afifi, A. Ramaswamy, and H. Karl, "Reinforcement learning for autonomous vehicle movements in wireless sensor networks," in Proc. IEEE Int. Conf. Commun., Jun. 2021, pp. 1–6.
[119] B. Jang, M. Kim, G. Harerimana, and J. W. Kim, "Q-learning algorithms: A comprehensive classification and applications," IEEE Access, vol. 7, pp. 133653–133667, 2019.
[120] N. C. Luong, D. T. Hoang, S. Gong, D. Niyato, P. Wang, Y.-C. Liang, and D. I. Kim, "Applications of deep reinforcement learning in communications and networking: A survey," IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3133–3174, 4th Quart., 2019.
[121] G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors," 2012, arXiv:1207.0580.
[122] H. Riiser, P. Vigmostad, C. Griwodz, and P. Halvorsen, "Commute path bandwidth traces from 3G networks: Analysis and applications," in Proc. 4th ACM Multimedia Syst. Conf., Feb. 2013, pp. 114–118.
[123] X. Zuo, J. Yang, M. Wang, and Y. Cui, "Adaptive bitrate with user-level QoE preference for video streaming," in Proc. IEEE INFOCOM Conf. Comput. Commun., May 2022, pp. 1279–1288.
[124] J. van der Hooft, S. Petrangeli, T. Wauters, R. Huysegems, P. R. Alface, T. Bostoen, and F. De Turck, "HTTP/2-based adaptive streaming of HEVC video over 4G/LTE networks," IEEE Commun. Lett., vol. 20, no. 11, pp. 2177–2180, Nov. 2016.
[125] L. Zhang, Y. Zhang, X. Wu, F. Wang, L. Cui, Z. Wang, and J. Liu, "Batch adaptative streaming for video analytics," in Proc. IEEE INFOCOM Conf. Comput. Commun., May 2022, pp. 2158–2167.
[126] A. Alhilal, T. Braud, B. Han, and P. Hui, "Nebula: Reliable low-latency video transmission for mobile cloud gaming," in Proc. ACM Web Conf., Apr. 2022, pp. 3407–3417.
[127] D. Raca, J. J. Quinlan, A. H. Zahran, and C. J. Sreenan, "Beyond throughput: A 4G LTE dataset with channel and context metrics," in Proc. 9th ACM Multimedia Syst. Conf., Jun. 2018, pp. 460–465.
[128] S. Farthofer, M. Herlich, C. Maier, S. Pochaba, J. Lackner, and P. Dorfinger, "An open mobile communications drive test data set and its use for machine learning," IEEE Open J. Commun. Soc., vol. 3, pp. 1688–1701, 2022.
[129] J. Wu, L. Wang, Q. Pei, X. Cui, F. Liu, and T. Yang, "HiTDL: High-throughput deep learning inference at the hybrid mobile edge," IEEE Trans. Parallel Distrib. Syst., vol. 33, no. 12, pp. 4499–4514, Dec. 2022.
[130] D. Raca, D. Leahy, C. J. Sreenan, and J. J. Quinlan, "Beyond throughput, the next generation: A 5G dataset with channel and context metrics," in Proc. 11th ACM Multimedia Syst. Conf., May 2020, pp. 303–308.
[131] Geant/Abilene Network Topology Data and Traffic Traces, 3rd Party, Ocala, FL, USA, 2020.
[132] GENI. Accessed: Apr. 12, 2024. [Online]. Available: https://www.geni.net/
[133] (Nov. 2020). CAIDA Data Completed Datasets. [Online]. Available: https://www.caida.org/catalog/datasets/completed-datasets/
[134] N. Spring, R. Mahajan, D. Wetherall, and T. Anderson, "Measuring ISP topologies with Rocketfuel," ACM Trans. Netw., vol. 12, no. 1, pp. 2–16, 2004.
[135] (2021). The Internet Topology Zoo. [Online]. Available: http://www.topology-zoo.org/dataset.html
[136] M. Roughan, "A case study of the accuracy of SNMP measurements," J. Electr. Comput. Eng., vol. 2010, pp. 1–7, May 2010, doi: 10.1155/2010/812979.
[137] J. Kua, G. Armitage, and P. Branch, "A survey of rate adaptation techniques for dynamic adaptive streaming over HTTP," IEEE Commun. Surveys Tuts., vol. 19, no. 3, pp. 1842–1866, 3rd Quart., 2017.
[138] G. Zhou, R. Wu, M. Hu, Y. Zhou, T. Z. J. Fu, and D. Wu, "Vibra: Neural adaptive streaming of VBR-encoded videos," in Proc. 31st ACM Workshop Netw. Operating Syst. Support Digit. Audio Video, Jul. 2021, pp. 1–8.
[139] Y. Yuan, W. Wang, Y. Wang, S. S. Adhatarao, B. Ren, K. Zheng, and X. Fu, "VSiM: Improving QoE fairness for video streaming in mobile environments," in Proc. IEEE INFOCOM Conf. Comput. Commun., May 2022, pp. 1309–1318.
[140] S. Lederer, C. Müller, and C. Timmerer, "Dynamic adaptive streaming over HTTP dataset," in Proc. 3rd Multimedia Syst. Conf., Feb. 2012, pp. 89–94.
[141] S. Lederer, C. Mueller, C. Timmerer, C. Concolato, J. Le Feuvre, and K. Fliegel, "Distributed DASH dataset," in Proc. 4th ACM Multimedia Syst. Conf., 2013, pp. 131–135.
[142] J. Le Feuvre, J.-M. Thiesse, M. Parmentier, M. Raulet, and C. Daguet, "Ultra high definition HEVC DASH data set," in Proc. 5th ACM Multimedia Syst. Conf., Mar. 2014, pp. 7–12.
[143] A. Zabrovskiy, C. Feldmann, and C. Timmerer, "Multi-codec DASH dataset," in Proc. 9th ACM Multimedia Syst. Conf., Jun. 2018, pp. 438–443.
[144] A. Chandramohan, M. Poel, B. Meijerink, and G. Heijenk, "Machine learning for cooperative driving in a multi-lane highway environment," in Proc. Wireless Days (WD), Apr. 2019, pp. 1–4.
[145] L. N. Alegre, T. Ziemke, and A. L. C. Bazzan, "Using reinforcement learning to control traffic signals in a real-world scenario: An approach based on linear function approximation," IEEE Trans. Intell. Transp. Syst., vol. 23, no. 7, pp. 9126–9135, Jul. 2022.
[146] C. Liu, Y. Zhang, W. Chen, F. Wang, H. Li, and Y.-D. Shen, "Adaptive matching strategy for multi-target multi-camera tracking," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), May 2022, pp. 2934–2938.
[147] M. Maciejewski, "A comparison of microscopic traffic flow simulation systems for an urban area," Transp. Problems, vol. 5, no. 4, pp. 29–40, 2010.
[148] F. K. Karnadi, Z. H. Mo, and K.-C. Lan, "Rapid generation of realistic mobility models for VANET," in Proc. IEEE Wireless Commun. Netw. Conf., Mar. 2007, pp. 2506–2511.
[149] M. Tsao, D. Milojevic, C. Ruch, M. Salazar, E. Frazzoli, and M. Pavone, "Model predictive control of ride-sharing autonomous mobility-on-demand systems," in Proc. Int. Conf. Robot. Autom. (ICRA), May 2019, pp. 6665–6671.
[150] C. M. Moyano, J. F. Ortega, and D. E. Mogrovejo, "Efficiency analysis during calibration of traffic microsimulation models in conflicting intersections near Universidad del Azuay, using Aimsun 8.1," in Proc. MOVICI-MOYCOT Joint Conf. Urban Mobility Smart City, Apr. 2018, pp. 1–6.
[151] L. Yang and W. Lan, "On secondary development of PTV-VISSIM for traffic optimization," in Proc. 13th Int. Conf. Comput. Sci. Educ. (ICCSE), Aug. 2018, pp. 1–5.
[152] L. Lu, T. Yun, L. Li, Y. Su, and D. Yao, "A comparison of phase transitions produced by PARAMICS, TransModeler, and VISSIM," IEEE Intell. Transp. Syst. Mag., vol. 2, no. 3, pp. 19–24, Fall 2010.
[153] Z. Tang, M. Naphade, M.-Y. Liu, X. Yang, S. Birchfield, S. Wang, R. Kumar, D. Anastasiu, and J.-N. Hwang, "CityFlow: A city-scale benchmark for multi-target multi-camera vehicle tracking and re-identification," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2019, pp. 8789–8798.
[154] Z. Wang, B. Li, and B. Liang, "Quick: Quality-of-service improvement with cooperative relaying and network coding," in Proc. IEEE Int. Conf. Commun., Jun. 2010, pp. 1–5.
[155] T. Mangla, E. Halepovic, M. Ammar, and E. Zegura, "eMIMIC: Estimating HTTP-based video QoE metrics from encrypted network traffic," in Proc. Netw. Traffic Meas. Anal. Conf. (TMA), Jun. 2018, pp. 1–8.
[156] C. Gutterman, K. Guo, S. Arora, X. Wang, L. Wu, E. Katz-Bassett, and G. Zussman, "Requet: Real-time QoE detection for encrypted YouTube traffic," in Proc. 10th ACM Multimedia Syst. Conf., Jun. 2019, pp. 48–59.
[157] M. Seufert, P. Casas, N. Wehner, L. Gang, and K. Li, "Stream-based machine learning for real-time QoE analysis of encrypted video streaming traffic," in Proc. 22nd Conf. Innov. Clouds, Internet Netw. Workshops (ICIN), Feb. 2019, pp. 76–81.
[158] N. Wehner, M. Ring, J. Schüler, A. Hotho, T. Hoßfeld, and M. Seufert, "On learning hierarchical embeddings from encrypted network traffic," in Proc. NOMS IEEE/IFIP Netw. Oper. Manage. Symp., Apr. 2022, pp. 1–7.
[159] K. Dietz, M. Mühlhauser, M. Seufert, N. Gray, T. Hoßfeld, and D. Herrmann, "Browser fingerprinting: How to protect machine learning models and data with differential privacy?" Electron. Commun. EASST, vol. 80, pp. 1–7, Sep. 2021.
[160] N. Wehner, M. Seufert, J. Schüler, P. Casas, and T. Hoßfeld, "How are your apps doing? QoE inference and analysis in mobile devices," in Proc. 17th Int. Conf. Netw. Service Manage. (CNSM), Oct. 2021, pp. 49–55.
[161] A. Azab, M. Khasawneh, S. Alrabaee, K.-K.-R. Choo, and M. Sarsour, "Network traffic classification: Techniques, datasets, and challenges," Digit. Commun. Netw., Sep. 2022. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S2352864822001845
[162] D. Shamsimukhametov, M. Liubogoshchev, E. Khorov, and I. Akyildiz, "YouTube Netflix web dataset for encrypted traffic classification," in Proc. Int. Conf. Eng. Telecommun., 2021, pp. 1–5.
[163] G. Aceto, D. Ciuonzo, A. Montieri, V. Persico, and A. Pescapé, "MIRAGE: Mobile-app traffic capture and ground-truth creation," in Proc. 4th Int. Conf. Comput., Commun. Secur. (ICCCS), Oct. 2019, pp. 1–8.
[164] C. Wang, A. Finamore, L. Yang, K. Fauvel, and D. Rossi, "AppClassNet: A commercial-grade dataset for application identification research," ACM SIGCOMM Comput. Commun. Rev., vol. 52, no. 3, pp. 19–27, Jul. 2022.
[165] M. Ring, S. Wunderlich, D. Scheuring, D. Landes, and A. Hotho, "A survey of network-based intrusion detection data sets," Comput. Secur., vol. 86, pp. 147–167, Sep. 2019.
[166] (2023). Datasets. [Online]. Available: https://www.unb.ca/cic/datasets/
[167] A. Dvir, Y. Zion, J. Muehlstein, O. Pele, C. Hajaj, and R. Dubin, "Robust machine learning for encrypted traffic classification," 2016, arXiv:1603.04865.
[168] R. Poorzare and O. P. Waldhorst, "Toward the implementation of MPTCP over mmWave 5G and beyond: Analysis, challenges, and solutions," IEEE Access, vol. 11, pp. 19534–19566, 2023.
[169] R. Poorzare and A. C. Augé, "Challenges on the way of implementing TCP over 5G networks," IEEE Access, vol. 8, pp. 176393–176415, 2020.
[170] T. R. Henderson, M. Lacage, G. F. Riley, C. Dowell, and J. Kopena, "Network simulations with the NS-3 simulator," SIGCOMM Demonstration, vol. 14, no. 14, p. 527, 2008.
[171] M. Mezzavilla, M. Zhang, M. Polese, R. Ford, S. Dutta, S. Rangan, and M. Zorzi, "End-to-end simulation of 5G mmWave networks," IEEE Commun. Surveys Tuts., vol. 20, no. 3, pp. 2237–2263, 3rd Quart., 2018.
[172] P. Gawłowicz and A. Zubow, "Ns3-gym: Extending OpenAI gym for networking research," 2018, arXiv:1810.03943.
[173] H. Yin, P. Liu, K. Liu, L. Cao, L. Zhang, Y. Gao, and X. Hei, "Ns3-AI: Fostering artificial intelligence algorithms for networking research," in Proc. Workshop Ns-3, New York, NY, USA: Association for Computing Machinery, Jun. 2020, pp. 57–64, doi: 10.1145/3389400.3389404.
[174] M. Schettler, D. S. Buse, A. Zubow, and F. Dressler, "How to train your ITS? Integrating machine learning with vehicular network simulation," in Proc. IEEE Veh. Netw. Conf. (VNC), Dec. 2020, pp. 1–4.
[175] D. Stolpmann. (2021). Machine Learning in OMNeT++. GitHub repository. [Online]. Available: https://github.com/ComNetsHH/omnetpp-ml
[176] "FlowEmu: An open-source flow-based network emulator," Electron. Commun. EASST, vol. 80, Sep. 2021.
[177] F. Ruffy, M. Przystupa, and I. Beschastnikh, "Iroko: A framework to prototype reinforcement learning for data center traffic control," 2018, arXiv:1812.09975.
[178] J. Charlier, A. Singh, G. Ormazabal, R. State, and H. Schulzrinne, "SynGAN: Towards generating synthetic network attacks using GANs," 2019, arXiv:1908.09899.
[179] M. Ring, D. Schlör, D. Landes, and A. Hotho, "Flow-based network traffic generation using generative adversarial networks," Comput. Secur., vol. 82, pp. 156–172, May 2019, doi: 10.1016/j.cose.2018.12.012.
[180] A. Mozo, Á. González-Prieto, A. Pastor, S. Gómez-Canaval, and E. Talavera, "Synthetic flow-based cryptomining attack generation through generative adversarial networks," Sci. Rep., vol. 12, no. 1, p. 2091, Feb. 2022, doi: 10.1038/s41598-022-06057-2.
[181] Y. Guo, G. Xiong, Z. Li, J. Shi, M. Cui, and G. Gou, "Combating imbalance in network traffic classification using GAN based oversampling," in Proc. IFIP Netw. Conf. (IFIP Networking), Jun. 2021, pp. 1–9.
[182] T. J. Anande and M. S. Leeson, "Generative adversarial networks (GANs): A survey on network traffic generation," Int. J. Mach. Learn. Comput., vol. 12, no. 6, pp. 333–343, 2022.
[183] M. Rigaki and S. Garcia, "Bringing a GAN to a knife-fight: Adapting malware communication to avoid detection," in Proc. IEEE Secur. Privacy Workshops (SPW), May 2018, pp. 70–75.
[184] C. Zhang, X. Ouyang, and P. Patras, "ZipNet-GAN: Inferring fine-grained mobile traffic patterns via a generative adversarial neural network," 2017, arXiv:1711.02413.
[185] B. Dowoo, Y. Jung, and C. Choi, "PcapGAN: Packet capture file generator by style-based generative adversarial networks," in Proc. 18th IEEE Int. Conf. Mach. Learn. Appl. (ICMLA), Dec. 2019, pp. 1149–1154.
[186] L. Engstrom, A. Ilyas, S. Santurkar, D. Tsipras, F. Janoos, L. Rudolph, and A. Madry, "Implementation matters in deep RL: A case study on PPO and TRPO," in Proc. Int. Conf. Learn. Represent., 2020, pp. 1–14.
[187] A. Raffin, A. Hill, A. Gleave, A. Kanervisto, M. Ernestus, and N. Dormann, "Stable-Baselines3: Reliable reinforcement learning implementations," J. Mach. Learn. Res., vol. 22, no. 268, pp. 12348–12355, 2021.
[188] S. Huang, R. F. J. Dossa, C. Ye, J. Braga, D. Chakraborty, K. Mehta, and J. G. Araújo, "CleanRL: High-quality single-file implementations of deep reinforcement learning algorithms," J. Mach. Learn. Res., vol. 23, no. 274, pp. 1–18, 2022. [Online]. Available: http://jmlr.org/papers/v23/21-1342.html
[189] E. Liang, R. Liaw, P. Moritz, R. Nishihara, R. Fox, K. Goldberg, J. E. Gonzalez, M. I. Jordan, and I. Stoica, "RLlib: Abstractions for distributed reinforcement learning," 2017, arXiv:1712.09381.
[190] M. Andrychowicz, F. Wolski, A. Ray, J. Schneider, R. Fong, P. Welinder, B. McGrew, J. Tobin, P. Abbeel, and W. Zaremba, "Hindsight experience replay," in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–14.
[191] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, "Proximal policy optimization algorithms," 2017, arXiv:1707.06347.
[192] T. Haarnoja, A. Zhou, K. Hartikainen, G. Tucker, S. Ha, J. Tan, V. Kumar, H. Zhu, A. Gupta, P. Abbeel, and S. Levine, "Soft actor-critic algorithms and applications," 2018, arXiv:1812.05905.
[193] S. Fujimoto, H. van Hoof, and D. Meger, "Addressing function approximation error in actor-critic methods," in Proc. Int. Conf. Mach. Learn., 2018, pp. 1587–1596.
[194] H. Mania, A. Guy, and B. Recht, "Simple random search provides a competitive approach to reinforcement learning," 2018, arXiv:1803.07055.
[195] W. Dabney, M. Rowland, M. Bellemare, and R. Munos, "Distributional reinforcement learning with quantile regression," in Proc. AAAI Conf. Artif. Intell., 2018, vol. 32, no. 1, pp. 1–10.
[196] S. Huang, R. F. J. Dossa, A. Raffin, A. Kanervisto, and W. Wang, "The 37 implementation details of proximal policy optimization," in Proc. ICLR Blog Track, 2023, pp. 1–12.
[197] A. Kuznetsov, P. Shvechikov, A. Grishin, and D. Vetrov, "Controlling overestimation bias with truncated mixture of continuous distributional quantile critics," in Proc. Int. Conf. Mach. Learn., 2020, pp. 5556–5566.
[198] J. Schulman, S. Levine, P. Abbeel, M. Jordan, and P. Moritz, "Trust region policy optimization," in Proc. 32nd Int. Conf. Mach. Learn., vol. 37, Lille, France, 2015, pp. 1889–1897.
[199] S. Huang and S. Ontañón, "A closer look at invalid action masking in policy gradient algorithms," 2020, arXiv:2006.14171.
[200] M. G. Bellemare, W. Dabney, and R. Munos, "A distributional perspective on reinforcement learning," in Proc. Int. Conf. Mach. Learn., 2017, pp. 449–458.
[201] K. W. Cobbe, J. Hilton, O. Klimov, and J. Schulman, "Phasic policy gradient," in Proc. ICML, 2021, pp. 2020–2027.
[202] D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai, A. Guez, M. Lanctot, L. Sifre, D. Kumaran, T. Graepel, T. Lillicrap, K. Simonyan, and D. Hassabis, "Mastering chess and shogi by self-play with a general reinforcement learning algorithm," 2017, arXiv:1712.01815.
[203] Q. Wang, J. Xiong, L. Han, H. Liu, and T. Zhang, "Exponentially weighted imitation learning for batched historical data," in Proc. Adv. Neural Inf. Process. Syst., vol. 31, 2018, pp. 1–6.
[204] A. Kumar, A. Zhou, G. Tucker, and S. Levine, "Conservative Q-learning for offline reinforcement learning," in Proc. Int. Conf. Adv. Neural Inf. Process. Syst., vol. 33, 2020, pp. 1179–1191.
[205] Z. Wang et al., "Critic regularized regression," in Proc. Int. Conf. Adv. Neural Inf. Process. Syst., vol. 33, 2020, pp. 7768–7778.
[206] D. Hafner, T. Lillicrap, J. Ba, and M. Norouzi, "Dream to control: Learning behaviors by latent imagination," 2019, arXiv:1912.01603.
[207] L. Espeholt et al., "IMPALA: Scalable distributed deep-RL with importance weighted actor-learner architectures," in Proc. Int. Conf. Mach. Learn., 2018, pp. 1407–1416.
[208] S. Kapturowski, G. Ostrovski, J. Quan, R. Munos, and W. Dabney, "Recurrent experience replay in distributed reinforcement learning," in Proc. Int. Conf. Learn. Represent., 2019, pp. 1–12.
[209] M. Hessel, J. Modayil, H. Van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M. Azar, and D. Silver, "Rainbow: Combining improvements in deep reinforcement learning," in Proc. AAAI Conf. Artif. Intell., vol. 32, no. 1, 2018, pp. 3215–3222.
[210] E. Ie, V. Jain, J. Wang, S. Narvekar, R. Agarwal, R. Wu, H.-T. Cheng, T. Chandra, and C. Boutilier, "SlateQ: A tractable decomposition for reinforcement learning with recommendation sets," in Proc. 28th Int. Joint Conf. Artif. Intell., Aug. 2019, pp. 2592–2599.
[211] E. Wijmans, A. Kadian, A. Morcos, S. Lee, I. Essa, D. Parikh, M. Savva, and D. Batra, "DD-PPO: Learning near-perfect PointGoal navigators from 2.5 billion frames," 2019, arXiv:1911.00357.
[212] TensorFlow, "TensorBoard: A unified platform for visualizing live, rich data for TensorFlow models," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR) Workshops, Jun. 2016.
[213] L. Biewald. (2020). Experiment Tracking With Weights and Biases. [Online]. Available: https://www.wandb.com/
[214] Comet.ml. (2018). Comet.ML: Machine Learning Operations Platform. [Online]. Available: https://www.comet.ml/
[215] A. Chen, A. Chow, A. Davidson, A. DCunha, A. Ghodsi, S. A. Hong, A. Konwinski, C. Mewald, S. Murching, T. Nykodym, P. Ogilvie, M. Parkhe, A. Singh, F. Xie, M. Zaharia, R. Zang, J. Zheng, and C. Zumar, "Developments in MLflow: A system to accelerate the machine learning lifecycle," in Proc. 4th Int. Workshop Data Manage. End-to-End Mach. Learn., New York, NY, USA: Association for Computing Machinery, Jun. 2020, pp. 1–4, doi: 10.1145/3399579.3399867.
[216] I. Habibie, M. Kleinsorge, Z. Al-Ars, J. Schneider, W. Kessler, and T. Kuhlen, "Visdom: A tool for visualization and monitoring of machine learning experiments," Tech. Rep., Mar. 2017.
[217] Microsoft. (2021). TensorWatch. GitHub repository. [Online]. Available: https://github.com/microsoft/tensorwatch
[218] Microsoft Research Asia. (2021). NNI (Neural Network Intelligence): An Open-Source AutoML Toolkit for Neural Architecture Search and Hyper-Parameter Tuning. GitHub repository. [Online]. Available: https://github.com/microsoft/nni
[219] T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama, "Optuna: A next-generation hyperparameter optimization framework," in Proc. 25th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, Jul. 2019, pp. 2623–2631.
[220] R. Liaw, E. Liang, R. Nishihara, P. Moritz, J. E. Gonzalez, and I. Stoica, "Tune: A research platform for distributed model selection and training," 2018, arXiv:1807.05118.
[221] M. Jaderberg, V. Dalibard, S. Osindero, W. M. Czarnecki, J. Donahue, A. Razavi, O. Vinyals, T. Green, I. Dunning, K. Simonyan, C. Fernando, and K. Kavukcuoglu, "Population based training of neural networks," 2017, arXiv:1711.09846.
[222] J. Bergstra, D. Yamins, and D. Cox, "Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures," in Proc. 30th Int. Conf. Mach. Learn., vol. 28, S. Dasgupta and D. McAllester, Eds. Atlanta, GA, USA: PMLR, Feb. 2013, pp. 115–123. [Online]. Available: https://proceedings.mlr.press/v28/bergstra13.html
[223] R. Ostrovskiy and A. Gordon. (2020). Keras Tuner. GitHub repository. [Online]. Available: https://github.com/keras-team/keras-tuner
[224] J. Bergstra, R. Bardenet, Y. Bengio, and B. Kégl, "Algorithms for hyper-parameter optimization," in Proc. Adv. Neural Inf. Process. Syst., 2011, pp. 2546–2554. [Online]. Available: http://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization
[225] M. Feurer, A. Klein, K. Eggensperger, J. T. Springenberg, M. Blum, and F. Hutter, "Efficient and robust automated machine learning," in Proc. Adv. Neural Inf. Process. Syst., 2015, pp. 2962–2970. [Online]. Available: http://papers.nips.cc/paper/5872-efficient-and-robust-automated-machine-learning
[226] M. Merenda, C. Porcaro, and D. Iero, "Edge machine learning for AI-enabled IoT devices: A review," Sensors, vol. 20, no. 9, p. 2533, Apr. 2020.
[227] A.-S. Tonneau, N. Mitton, and J. Vandaele, "A survey on (mobile) wireless sensor network experimentation testbeds," in Proc. IEEE Int. Conf. Distrib. Comput. Sensor Syst., May 2014, pp. 263–268.
[228] M. Chernyshev, Z. Baig, O. Bello, and S. Zeadally, "Internet of Things (IoT): Research, simulators, and testbeds," IEEE Internet Things J., vol. 5, no. 3, pp. 1637–1647, Jun. 2018.
[229] S. Zhu, S. Yang, X. Gou, Y. Xu, T. Zhang, and Y. Wan, "Survey of testing methods and testbed development concerning Internet of Things," Wireless Pers. Commun., vol. 123, no. 1, pp. 165–194, Mar. 2022.
[230] R. Lim, F. Ferrari, M. Zimmerling, C. Walser, P. Sommer, and J. Beutel, "FlockLab: A testbed for distributed, synchronized tracing and profiling of wireless embedded systems," in Proc. ACM/IEEE Int. Conf. Inf. Process. Sensor Netw. (IPSN), Apr. 2013, pp. 153–165.
[231] R. Trüb, R. Da Forno, L. Daschinger, A. Biri, J. Beutel, and L. Thiele, "Non-intrusive distributed tracing of wireless IoT devices with the FlockLab 2 testbed," ACM Trans. Internet Things, vol. 3, no. 1, pp. 1–31, 2021.
[232] C. Adjih, E. Baccelli, E. Fleury, G. Harter, N. Mitton, T. Noel, R. Pissard-Gibollet, F. Saint-Marcel, G. Schreiner, J. Vandaele, and T. Watteyne, "FIT IoT-LAB: A large scale open experimental IoT testbed," in Proc. IEEE 2nd World Forum Internet Things (WF-IoT), Dec. 2015, pp. 459–464.
[233] M. Schuß, C. A. Boano, M. Weber, and K. Römer, "A competition to push the dependability of low-power wireless protocols to the edge," in Proc. 14th EWSN Conf., 2017, pp. 54–65.
[234] D. Molteni, G. P. Picco, M. Trobinger, and D. Vecchia, "Cloves: A large-scale ultra-wideband testbed," in Proc. 20th ACM Conf. Embedded Netw. Sensor Syst., New York, NY, USA: Association for Computing Machinery, Nov. 2022, pp. 808–809, doi: 10.1145/3560905.3568072.
[235] B. Chun, D. Culler, T. Roscoe, A. Bavier, L. Peterson, M. Wawrzoniak, and M. Bowman, "PlanetLab: An overlay testbed for broad-coverage services," SIGCOMM Comput. Commun. Rev., vol. 33, no. 3, pp. 3–12, Jul. 2003, doi: 10.1145/956993.956995.
[236] B. White, J. Lepreau, L. Stoller, R. Ricci, S. Guruprasad, M. Newbold, M. Hibler, C. Barb, and A. Joglekar, "An integrated experimental environment for distributed systems and networks," ACM SIGOPS Operating Syst. Rev., vol. 36, pp. 255–270, Dec. 2002.
[237] M. Berman, J. S. Chase, L. Landweber, A. Nakao, M. Ott, D. Raychaudhuri, R. Ricci, and I. Seskar, "GENI: A federated testbed for innovative network experiments," Comput. Netw., vol. 61, pp. 5–23, Mar. 2014.
[238] L. Yang, F. Wen, J. Cao, and Z. Wang, "EdgeTB: A hybrid testbed for distributed machine learning at the edge with high fidelity," IEEE Trans. Parallel Distrib. Syst., vol. 33, no. 10, pp. 2540–2553, Oct. 2022.
[239] F. Hussain, R. Hussain, and E. Hossain, "Explainable artificial intelligence (XAI): An engineering perspective," 2021, arXiv:2101.03613.
[240] S. Mukherjee, J. Rupe, and J. Zhu, "XAI for communication networks," in Proc. IEEE Int. Symp. Softw. Rel. Eng. Workshops (ISSREW), Oct. 2022, pp. 359–364.
[241] C. Liaskos, S. Nie, A. Tsioliaridou, A. Pitsillides, S. Ioannidis, and I. Akyildiz, "End-to-end wireless path deployment with intelligent surfaces using interpretable neural networks," IEEE Trans. Commun., vol. 68, no. 11, pp. 6792–6806, Nov. 2020.
[242] A.-D. Marcu, S. K. G. Peesapati, J. M. Cortes, S. Imtiaz, and J. Gross, "Explainable artificial intelligence for energy-efficient radio resource management," in Proc. IEEE Wireless Commun. Netw. Conf. (WCNC), Mar. 2023, pp. 1–6.
[243] P. Barnard, I. Macaluso, N. Marchetti, and L. A. DaSilva, "Resource reservation in sliced networks: An explainable artificial intelligence (XAI) approach," in Proc. IEEE Int. Conf. Commun., May 2022, pp. 1530–1535.
[244] A. Palaios, C. L. Vielhaus, D. F. Külzer, C. Watermann, R. Hernangomez, S. Partani, P. Geuer, A. Krause, R. Sattiraju, M. Kasparick, G. Fettweis, F. H. P. Fitzek, H. D. Schotten, and S. Stanczak, "The story of QoS prediction in vehicular communication: From radio environment statistics to network-access throughput prediction," 2023, arXiv:2302.11966.
[245] S. Hariharan, A. Velicheti, A. S. Anagha, C. Thomas, and N. Balakrishnan, "Explainable artificial intelligence in cybersecurity: A brief review," in Proc. 4th Int. Conf. Secur. Privacy (ISEA-ISAP), Oct. 2021, pp. 1–12.
[246] N. Capuano, G. Fenza, V. Loia, and C. Stanzione, "Explainable artificial intelligence in CyberSecurity: A survey," IEEE Access, vol. 10, pp. 93575–93600, 2022.
[247] C. Molnar, Interpretable Machine Learning, 2nd ed., 2022. [Online]. Available: https://christophm.github.io/interpretable-ml-book and https://www.amazon.de/Interpretable-Machine-Learning-Making-Explainable/dp/B09TMWHVB4
[248] A. Barredo Arrieta, N. Díaz-Rodríguez, J. Del Ser, A. Bennetot, S. Tabik, A. Barbado, S. Garcia, S. Gil-Lopez, D. Molina, R. Benjamins, R. Chatila, and F. Herrera, "Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI," Inf. Fusion, vol. 58, pp. 82–115, Jun. 2020.
[249] K. Simonyan, A. Vedaldi, and A. Zisserman, "Deep inside convolutional networks: Visualising image classification models and saliency maps," 2013, arXiv:1312.6034.
[250] C. Rudin, "Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead," Nature Mach. Intell., vol. 1, no. 5, pp. 206–215, May 2019.
[251] T. Shapira and Y. Shavitt, "FlowPic: Encrypted Internet traffic classification is as easy as image recognition," in Proc. IEEE INFOCOM Conf. Comput. Commun. Workshops (INFOCOM WKSHPS), Apr. 2019, pp. 680–687.
[252] S. M. Lundberg and S.-I. Lee, "A unified approach to interpreting model predictions," in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–7.
[253] M. T. Ribeiro, S. Singh, and C. Guestrin, "'Why should I trust you?' Explaining the predictions of any classifier," in Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, 2016, pp. 1135–1144.
[254] H. Nori, S. Jenkins, P. Koch, and R. Caruana, "InterpretML: A unified framework for machine learning interpretability," 2019, arXiv:1909.09223.
[255] R. Agarwal et al., "Neural additive models: Interpretable machine learning with neural nets," in Proc. Adv. Neural Inf. Process. Syst., vol. 34. Red Hook, NY, USA: Curran Associates, 2021, pp. 4699–4711.
[256] N. Wehner, A. Seufert, T. Hoßfeld, and M. Seufert, "Explainable data-driven QoE modelling with XAI," in Proc. 15th Int. Conf. Quality Multimedia Exper. (QoMEX), Jun. 2023, pp. 7–12.
[257] K. Brunnström, S. A. Beker, K. De Moor, A. Dooms, S. Egger, M.-N. Garcia, T. Hossfeld, S. Jumisko-Pyykkö, C. Keimel, and M.-C. Larabi, "Qualinet white paper on definitions of quality of experience," Tech. Rep., 2013. [Online]. Available: https://hal.science/hal-00977812
[258] N. Wehner, M. Seufert, J. Schuler, S. Wassermann, P. Casas, and T. Hossfeld, "Improving web QoE monitoring for encrypted network traffic through time series modeling," ACM SIGMETRICS Perform. Eval. Rev., vol. 48, no. 4, pp. 37–40, May 2021.
[259] E. Hüllermeier and W. Waegeman, "Aleatoric and epistemic uncertainty in machine learning: An introduction to concepts and methods," Mach. Learn., vol. 110, no. 3, pp. 457–506, Mar. 2021.
[260] A. F. Psaros, X. Meng, Z. Zou, L. Guo, and G. E. Karniadakis, "Uncertainty quantification in scientific machine learning: Methods, metrics, and comparisons," J. Comput. Phys., vol. 477, Mar. 2023, Art. no. 111902.
[261] A. Kendall and Y. Gal, "What uncertainties do we need in Bayesian deep learning for computer vision?" in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–5.
[262] V. Dignum, Responsible Artificial Intelligence: How To Develop and Use AI in a Responsible Way, vol. 2156. Cham, Switzerland: Springer, 2019.
[263] Y. Siriwardhana, P. Porambage, M. Liyanage, and M. Ylianttila, "AI and 6G security: Opportunities and challenges," in Proc. Joint Eur. Conf. Netw. Commun. 6G Summit (EuCNC/6G Summit), Jun. 2021, pp. 616–621.
[264] Q. Lu, L. Zhu, X. Xu, J. Whittle, D. Zowghi, and A. Jacquet, "Responsible AI pattern catalogue: A collection of best practices for AI governance and engineering," ACM Comput. Surv., vol. 56, no. 7, pp. 1–35, Oct. 2023, doi: 10.1145/3626234.
[265] W. Yang, H. Le, T. Laud, S. Savarese, and S. C. H. Hoi, "OmniXAI: A library for explainable AI," 2022, arXiv:2206.01612.
[266] V. Arya, R. K. E. Bellamy, P.-Y. Chen, A. Dhurandhar, M. Hind, S. C. Hoffman, S. Houde, Q. V. Liao, R. Luss, A. Mojsilović, S. Mourad, P. Pedemonte, R. Raghavendra, J. Richards, P. Sattigeri, K. Shanmugam, M. Singh, K. R. Varshney, D. Wei, and Y. Zhang, "AI explainability 360 toolkit," in Proc. 3rd ACM India Joint Int. Conf. Data Sci. Manage. Data (8th ACM IKDD CODS 26th COMAD), Jan. 2021, pp. 376–379.
[267] J. Klaise, A. Van Looveren, G. Vacanti, and A. Coca, "Alibi explain: Algorithms for explaining machine learning models," J. Mach. Learn. Res., vol. 22, no. 1, pp. 8194–8200, 2021.
[268] M. Sundararajan, A. Taly, and Q. Yan, "Axiomatic attribution for deep networks," in Proc. Int. Conf. Mach. Learn., vol. 70, 2017, pp. 3319–3328.
[269] N. Kokhlikyan, V. Miglani, M. Martin, E. Wang, B. Alsallakh, J. Reynolds, A. Melnikov, N. Kliushkina, C. Araya, S. Yan, and O. Reblitz-Richardson, "Captum: A unified and generic model interpretability library for PyTorch," 2020, arXiv:2009.07896.
[270] (2019). TorchRay. [Online]. Available: https://github.com/facebookresearch/TorchRay
[271] (2019). TF-Explain. [Online]. Available: https://tf-explain.readthedocs.io/en/latest/index.html
[272] C. Zhang, P. Patras, and H. Haddadi, "Deep learning in mobile and wireless networking: A survey," IEEE Commun. Surveys Tuts., vol. 21, no. 3, pp. 2224–2287, 3rd Quart., 2019.
[273] H. Hellström, J. Mairton B. da Silva Jr., M. M. Amiri, M. Chen, V. Fodor, H. V. Poor, and C. Fischione, "Wireless for machine learning," 2020, arXiv:2008.13492.
[274] D. Jin, Z. Yu, P. Jiao, S. Pan, D. He, J. Wu, P. S. Yu, and W. Zhang, "A survey of community detection approaches: From statistical modeling to deep learning," IEEE Trans. Knowl. Data Eng., vol. 35, no. 2, pp. 1149–1170, Feb. 2023.
[275] J. Zhou, G. Cui, S. Hu, Z. Zhang, C. Yang, Z. Liu, L. Wang, C. Li, and M. Sun, "Graph neural networks: A review of methods and applications," 2018, arXiv:1812.08434.
[276] M. A. Ridwan, N. A. M. Radzi, F. Abdullah, and Y. E. Jalil, "Applications of machine learning in networking: A survey of current issues and future challenges," IEEE Access, vol. 9, pp. 52523–52556, 2021.
[277] F. Tang, B. Mao, N. Kato, and G. Gui, "Comprehensive survey on machine learning in vehicular network: Technology, applications and challenges," IEEE Commun. Surveys Tuts., vol. 23, no. 3, pp. 2027–2057, 3rd Quart., 2021.
[278] E. García-Martín, C. F. Rodrigues, G. Riley, and H. Grahn, "Estimation of energy consumption in machine learning," J. Parallel Distrib. Comput., vol. 134, pp. 75–88, Dec. 2019. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0743731518308773
[279] L. Song, X. Hu, G. Zhang, P. Spachos, K. N. Plataniotis, and H. Wu, "Networking systems of AI: On the convergence of computing and communications," IEEE Internet Things J., vol. 9, no. 20, pp. 20352–20381, Oct. 2022.
[280] G. Drainakis, K. V. Katsaros, P. Pantazopoulos, V. Sourlas, and A. Amditis, "Federated vs. centralized machine learning under privacy-elastic users: A comparative analysis," in Proc. IEEE 19th Int. Symp. Netw. Comput. Appl. (NCA), Nov. 2020, pp. 1–8.
[281] I. A. Majeed, S. Kaushik, A. Bardhan, V. S. K. Tadi, H.-K. Min, K. Kumaraguru, and R. D. Muni, "Comparative assessment of federated and centralized machine learning," 2022, arXiv:2202.01529.
[282] W. Hassan, T.-S. Chou, O. Tamer, J. Pickard, P. Appiah-Kubi, and L. Pagliari, "Cloud computing survey on services, enhancements and challenges in the era of machine learning and data science," Int. J. Informat. Commun. Technol. (IJ-ICT), vol. 9, no. 2, p. 117, Aug. 2020.
[283] Y. Ko, K. Choi, H. Jei, D. Lee, and S.-W. Kim, "ALADDIN: Asymmetric centralized training for distributed deep learning," in Proc. 30th ACM Int. Conf. Inf. Knowl. Manage., Oct. 2021, pp. 863–872.
[284] X. Wang, Y. Han, V. C. M. Leung, D. Niyato, X. Yan, and X. Chen, "Convergence of edge computing and deep learning: A comprehensive survey," IEEE Commun. Surveys Tuts., vol. 22, no. 2, pp. 869–904, 2nd Quart., 2020.
[285] F. Samie, L. Bauer, and J. Henkel, "From cloud down to things: An overview of machine learning in Internet of Things," IEEE Internet Things J., vol. 6, no. 3, pp. 4921–4934, Jun. 2019.
[286] W. Toussaint and A. Y. Ding, "Machine learning systems in the IoT: Trustworthiness trade-offs for edge intelligence," in Proc. IEEE 2nd Int. Conf. Cognit. Mach. Intell. (CogMI), Oct. 2020, pp. 177–184.
[287] A. Smola and S. Narayanamurthy, "An architecture for parallel topic models," Proc. VLDB Endowment, vol. 3, nos. 1–2, pp. 703–710, Sep. 2010.
[288] M. Li, D. G. Andersen, J. W. Park, A. J. Smola, A. Ahmed, V. Josifovski, J. Long, E. J. Shekita, and B.-Y. Su, "Scaling distributed machine learning with the parameter server," in Proc. 11th USENIX Symp. Operating Syst. Des. Implement. (OSDI), 2014, pp. 583–598.
[289] J. Dean, G. S. Corrado, R. Monga, K. Chen, M. Devin, Q. V. Le, M. Z. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Y. Ng, "Large scale distributed deep networks," in Proc. Adv. Neural Inf. Process. Syst., vol. 25, 2012, pp. 1–8.
[290] J. Jiang, B. Cui, C. Zhang, and L. Yu, "Heterogeneity-aware distributed parameter servers," in Proc. ACM Int. Conf. Manage. Data, New York, NY, USA: Association for Computing Machinery, May 2017, pp. 463–478.
[291] B. McMahan et al., "Communication-efficient learning of deep networks from decentralized data," in Proc. 20th Int. Conf. Artif. Intell. Statist., 2017, pp. 1273–1282.
[292] Y. Liu, Y. Kang, C. Xing, T. Chen, and Q. Yang, "A secure federated transfer learning framework," IEEE Intell. Syst., vol. 35, no. 4, pp. 70–82, Jul. 2020.
[293] F. Chen, M. Luo, Z. Dong, Z. Li, and X. He, "Federated meta-learning with fast convergence and efficient communication," 2018, arXiv:1802.07876.
[294] A. Gibiansky, "Bringing HPC techniques to deep learning," Baidu Res., Beijing, China, Tech. Rep., 2017.
[295] P. Patarasuk and X. Yuan, "Bandwidth optimal all-reduce algorithms for clusters of workstations," J. Parallel Distrib. Comput., vol. 69, no. 2, pp. 117–124, Feb. 2009.
[296] H. Zhao and J. Canny, "Butterfly mixing: Accelerating incremental-update algorithms on clusters," in Proc. SIAM Int. Conf. Data Mining, May 2013, pp. 785–793.
[297] X. Wan, H. Zhang, H. Wang, S. Hu, J. Zhang, and K. Chen, "RAT–Resilient allreduce tree for distributed machine learning," in Proc. 4th Asia–Pacific Workshop Netw., Aug. 2020, pp. 52–57.
[298] O. Gupta and R. Raskar, "Distributed learning of deep neural network over multiple agents," J. Netw. Comput. Appl., vol. 116, pp. 1–8, Aug. 2018.
[299] E. Samikwa, A. D. Maio, and T. Braun, "ARES: Adaptive resource-aware split learning for Internet of Things," Comput. Netw., vol. 218, Dec. 2022, Art. no. 109380.
[300] V. Turina, Z. Zhang, F. Esposito, and I. Matta, "Federated or split? A performance and privacy analysis of hybrid split and federated learning architectures," in Proc. IEEE 14th Int. Conf. Cloud Comput. (CLOUD), Sep. 2021, pp. 250–260.
[301] Y. Gao, M. Kim, C. Thapa, A. Abuadbba, Z. Zhang, S. Camtepe, H. Kim, and S. Nepal, "Evaluation and optimization of distributed machine learning techniques for Internet of Things," IEEE Trans. Comput., vol. 71, no. 10, pp. 2538–2552, Oct. 2022.
[302] E. Samikwa, A. Di Maio, and T. Braun, "Adaptive early exit of computation for energy-efficient and low-latency machine learning over IoT networks," in Proc. IEEE 19th Annu. Consum. Commun. Netw. Conf. (CCNC), Jan. 2022, pp. 200–206.
[303] Y. Matsubara, M. Levorato, and F. Restuccia, "Split computing and early exiting for deep learning applications: Survey and research challenges," ACM Comput. Surv., vol. 55, no. 5, pp. 1–30, May 2023.
[304] Y. Matsubara, R. Yang, M. Levorato, and S. Mandt, "SC2 benchmark: Supervised compression for split computing," 2022, arXiv:2203.08875.
[305] M. G. S. Murshed, C. Murphy, D. Hou, N. Khan, G. Ananthanarayanan, and F. Hussain, "Machine learning at the network edge: A survey," ACM Comput. Surveys, vol. 54, no. 8, pp. 1–37, Oct. 2021, doi: 10.1145/3469029.
[306] J. Shuja, K. Bilal, W. Alasmary, H. Sinky, and E. Alanazi, "Applying machine learning techniques for caching in next-generation edge networks: A comprehensive survey," J. Netw. Comput. Appl., vol. 181, May 2021, Art. no. 103005.
[307] J. Xie, F. R. Yu, T. Huang, R. Xie, J. Liu, C. Wang, and Y. Liu, "A survey of machine learning techniques applied to software defined networking (SDN): Research issues and challenges," IEEE Commun. Surveys Tuts., vol. 21, no. 1, pp. 393–430, 1st Quart., 2019.
[308] R. Boutaba, M. A. Salahuddin, N. Limam, S. Ayoubi, N. Shahriar, F. Estrada-Solano, and O. M. Caicedo, "A comprehensive survey on machine learning for networking: Evolution, applications and research opportunities," J. Internet Services Appl., vol. 9, no. 1, pp. 1–99, Dec. 2018.
[309] Y. Liu, F. R. Yu, X. Li, H. Ji, and V. C. M. Leung, "Blockchain and machine learning for communications and networking systems," IEEE Commun. Surveys Tuts., vol. 22, no. 2, pp. 1392–1431, 2nd Quart., 2020.
[310] O. Nassef, W. Sun, H. Purmehdi, M. Tatipamula, and T. Mahmoodi, "A survey: Distributed machine learning for 5G and beyond," Comput. Netw., vol. 207, Apr. 2022, Art. no. 108820.
[311] O. A. Wahab, A. Mourad, H. Otrok, and T. Taleb, "Federated machine learning: Survey, multi-level classification, desirable criteria and future directions in communication and networking systems," IEEE Commun. Surveys Tuts., vol. 23, no. 2, pp. 1342–1397, 2nd Quart., 2021.
[312] W. Y. B. Lim, N. C. Luong, D. T. Hoang, Y. Jiao, Y.-C. Liang, Q. Yang, D. Niyato, and C. Miao, "Federated learning in mobile edge networks: A comprehensive survey," IEEE Commun. Surveys Tuts., vol. 22, no. 3, pp. 2031–2063, 3rd Quart., 2020.
[313] A. Imteaj, K. Mamun Ahmed, U. Thakker, S. Wang, J. Li, and M. H. Amini, "Federated learning for resource-constrained IoT devices: Panoramas and state of the art," in Federated and Transfer Learning. Cham, Switzerland: Springer, 2022, pp. 7–27.
[314] M. Abbasi, A. Shahraki, and A. Taherkordi, "Deep learning for network traffic monitoring and analysis (NTMA): A survey," Comput. Commun., vol. 170, pp. 19–41, Mar. 2021.
[315] F. Hussain, S. A. Hassan, R. Hussain, and E. Hossain, "Machine learning for resource management in cellular and IoT networks: Potentials, current solutions, and open challenges," IEEE Commun. Surveys Tuts., vol. 22, no. 2, pp. 1251–1275, 2nd Quart., 2020.
[316] A. Talpur and M. Gurusamy, "Machine learning for security in vehicular networks: A comprehensive survey," IEEE Commun. Surveys Tuts., vol. 24, no. 1, pp. 346–379, 1st Quart., 2022.
[317] M. A. Hossain, R. M. Noor, K. A. Yau, S. R. Azzuhri, M. R. Z'aba, and I. Ahmedy, "Comprehensive survey of machine learning approaches in cognitive radio-based vehicular ad hoc networks," IEEE Access, vol. 8, pp. 78054–78108, 2020.
[318] W. Guo, "Explainable artificial intelligence for 6G: Improving trust between human and machine," IEEE Commun. Mag., vol. 58, no. 6, pp. 39–45, Jun. 2020.
[319] Y. Zheng, Z. Liu, X. You, Y. Xu, and J. Jiang, "Demystifying deep learning in networking," in Proc. 2nd Asia–Pacific Workshop Netw., Aug. 2018, pp. 1–7.
[320] A. Adadi and M. Berrada, "Peeking inside the black-box: A survey on explainable artificial intelligence (XAI)," IEEE Access, vol. 6, pp. 52138–52160, 2018.
[321] A. Heuillet, F. Couthouis, and N. Díaz-Rodríguez, "Explainability in deep reinforcement learning," Knowl.-Based Syst., vol. 214, Feb. 2021, Art. no. 106685.
[322] A. Paleyes, R.-G. Urma, and N. D. Lawrence, "Challenges in deploying machine learning: A survey of case studies," ACM Comput. Surv., vol. 55, no. 6, pp. 1–29, Jul. 2023.
[323] S. Faezi and A. Shirmarz, "A comprehensive survey on machine learning using in software defined networks (SDN)," Hum.-Centric Intell. Syst., vol. 3, no. 3, pp. 312–343, Jun. 2023.
[324] S. Sezer, S. Scott-Hayward, P. K. Chouhan, B. Fraser, D. Lake, J. Finnegan, N. Viljoen, M. Miller, and N. Rao, "Are we ready for SDN? Implementation challenges for software-defined networks," IEEE Commun. Mag., vol. 51, no. 7, pp. 36–43, Jul. 2013.
[325] C. Lu, A. Saifullah, B. Li, M. Sha, H. Gonzalez, D. Gunatilaka, C. Wu, L. Nie, and Y. Chen, "Real-time wireless sensor-actuator networks for industrial cyber-physical systems," Proc. IEEE, vol. 104, no. 5, pp. 1013–1024, May 2016.
[326] H. Kopetz and W. Steiner, "Real-time communication," in Real-Time Systems: Design Principles for Distributed Embedded Applications. Cham, Switzerland: Springer, 2022, pp. 177–200.
[327] D. Zhang, P. Shi, Q.-G. Wang, and L. Yu, "Analysis and synthesis of networked control systems: A survey of recent advances and challenges," ISA Trans., vol. 66, pp. 376–392, Jan. 2017.
[328] S. Ramstedt and C. Pal, "Real-time reinforcement learning," in Proc. Adv. Neural Inf. Process. Syst., vol. 32, 2019, pp. 1–9.
[329] J. Mendez, K. Bierzynski, M. P. Cuéllar, and D. P. Morales, "Edge intelligence: Concepts, architectures, applications, and future directions," ACM Trans. Embedded Comput. Syst., vol. 21, no. 5, pp. 1–41, Sep. 2022.
[330] L. E. Lwakatare, A. Raj, I. Crnkovic, J. Bosch, and H. H. Olsson, "Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions," Inf. Softw. Technol., vol. 127, Nov. 2020, Art. no. 106368.
[331] A. Redder, A. Ramaswamy, and H. Karl, "Stability and convergence of distributed stochastic approximations with large unbounded stochastic information delays," 2023, arXiv:2305.07091.
[332] R. Bless, B. Bloessl, M. Hollick, M. Corici, H. Karl, D. Krummacker, D. Lindenschmitt, H. D. Schotten, and L. Wimmer, "Dynamic network (re-)configuration across time, scope, and structure," in Proc. Joint Eur. Conf. Netw. Commun. 6G Summit (EuCNC/6G Summit), Jun. 2022, pp. 547–552.
[333] J. Hoydis, F. A. Aoudia, A. Valcarce, and H. Viswanathan, "Toward a 6G AI-native air interface," IEEE Commun. Mag., vol. 59, no. 5, pp. 76–81, May 2021.
[334] F. Ait Aoudia, J. Hoydis, A. Valcarce, and H. Viswanathan. (2021). Toward a 6G AI-Native Air Interface. [Online]. Available: https://www.bell-labs.com/institute/white-papers/toward-6g-ai-native-air-interface/
[335] Rohde & Schwarz. (2023). Enabling an AI-Native Air Interface for 6G: Rohde & Schwarz Showcases AI/ML-Based Neural Receiver With Optimized Modulation at Brooklyn 6G Summit, in Collaboration With Nvidia. [Online]. Available: https://www.rohde-schwarz.com/se/about/news-press/all-news/enabling-an-ai-native-air-interface-for-6g-rohde-schwarz-showcases-ai-ml-based-neural-receiver-with-optimized-modulation-at-brooklyn-6g-summit-in-collaboration-with-nvidia-press-release-detailpage229356-1425541.html
[336] Ericsson. (2021). Defining AI Native: A Key Enabler for Advanced Intelligent Telecom Networks. [Online]. Available: https://www.ericsson.com/en/reports-and-papers/white-papers/ai-native
[337] C. Chaccour, W. Saad, M. Debbah, Z. Han, and H. V. Poor, "Less data, more knowledge: Building next generation semantic communication networks," 2022, arXiv:2211.14343.
[338] C. K. Thomas, C. Chaccour, W. Saad, M. Debbah, and C. S. Hong, "Causal reasoning: Charting a revolutionary course for next-generation AI-native wireless networks," IEEE Veh. Technol. Mag., vol. 19, no. 1, pp. 16–31, Mar. 2024.
[339] A. Mughees, M. Tahir, M. A. Sheikh, and A. Ahad, "Towards energy efficient 5G networks using machine learning: Taxonomy, research challenges, and future research directions," IEEE Access, vol. 8, pp. 187498–187522, 2020.
[340] D. Li, X. Chen, M. Becchi, and Z. Zong, "Evaluating the energy efficiency of deep convolutional neural networks on CPUs and GPUs," in Proc. IEEE Int. Conf. Big Data Cloud Comput. (BDCloud), Social Comput. Netw. (SocialCom), Sustain. Comput. Commun. (SustainCom) (BDCloud-SocialCom-SustainCom), Oct. 2016, pp. 477–484.
[341] M. Svedin, S. W. D. Chien, G. Chikafa, N. Jansson, and A. Podobas, "Benchmarking the Nvidia GPU lineage: From early K80 to modern A100 with asynchronous memory transfers," in Proc. 11th Int. Symp. Highly Efficient Accel. Reconfigurable Technol., Jun. 2021, pp. 1–6.
[342] E. Samikwa, A. D. Maio, and T. Braun, "DISNET: Distributed micro-split deep learning in heterogeneous dynamic IoT," IEEE Internet Things J., vol. 11, no. 4, pp. 6199–6216, Feb. 2024.
[343] Y. Chen, T.-J. Yang, J. Emer, and V. Sze, "Understanding the limitations of existing energy-efficient design approaches for deep neural networks," in Proc. SysML Conf., 2018.
[344] E. Hossain and F. Fredj, "Editorial energy efficiency of machine-learning-based designs for future wireless systems and networks," IEEE Trans. Green Commun. Netw., vol. 5, no. 3, pp. 1005–1010, Sep. 2021.
[345] R. Desislavov, F. Martínez-Plumed, and J. Hernández-Orallo, "Trends in AI inference energy consumption: Beyond the performance-vs-parameter laws of deep learning," Sustain. Comput., Informat. Syst., vol. 38, Apr. 2023, Art. no. 100857. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S2210537923000124
[346] T.-J. Yang, Y.-H. Chen, J. Emer, and V. Sze, "A method to estimate the energy consumption of deep neural networks," in Proc. 51st Asilomar Conf. Signals, Syst., Comput., Oct. 2017, pp. 1916–1920.
[347] T.-J. Yang, Y.-H. Chen, and V. Sze, "Deep neural network energy estimation tool," MIT, Tech. Rep., Jan. 2017. Accessed: Jan. 25, 2024. [Online]. Available: https://energyestimation.mit.edu/
[348] J. Lin, W.-M. Chen, Y. Lin, C. Gan, S. Han, and J. Cohn, "MCUNet: Tiny deep learning on IoT devices," in Proc. Adv. Neural Inf. Process. Syst., vol. 33, 2020, pp. 11711–11722.
[349] H. Cai, C. Gan, L. Zhu, and S. Han, "TinyTL: Reduce activations, not trainable parameters for efficient on-device learning," 2020, arXiv:2007.11622.
[350] L. Heim, A. Biri, Z. Qu, and L. Thiele, "Measuring what really matters: Optimizing neural networks for TinyML," 2021, arXiv:2104.10645.
[351] Y. Lin, S. Han, H. Mao, Y. Wang, and W. J. Dally, "Deep gradient compression: Reducing the communication bandwidth for distributed training," 2017, arXiv:1712.01887.
[352] Y. Abadade, A. Temouden, H. Bamoumen, N. Benamar, Y. Chtouki, and A. S. Hafid, "A comprehensive survey on TinyML," IEEE Access, vol. 11, pp. 96892–96922, 2023.

HAITHAM AFIFI (Member, IEEE) received the B.Sc. degree in information engineering and technology and the M.Sc. degree in communication engineering from German University in Cairo, in 2014 and 2015, respectively, and the Ph.D. degree from the Hasso Plattner Institute, in 2023. He has industry experience as a Network Engineer with Orange Business Services and as an IT Consultant integrating generative AI into network operations. His research interests include wireless network virtualization, reinforcement learning, and network optimization.

SABRINA POCHABA received the master's degree in mathematics from Ruprecht Karls University, Heidelberg, Germany. She is currently pursuing the Ph.D. degree with Paris Lodron University of Salzburg. Since 2021, she has been a Data Scientist with Salzburg Research Forschungsgesellschaft, where she works on various machine learning methods, focusing on networks and communication.

ANDREAS BOLTRES received the B.S. and M.S. degrees in informatics from Karlsruhe Institute of Technology (KIT), Germany, in 2017 and 2021, respectively, where he is currently pursuing the Ph.D. degree with the Autonomous Learning Robots Laboratory. His research interests include multi-agent and swarm reinforcement learning and their applications to robotics and computer networking, in particular routing optimization and traffic engineering.

DOMINIC LANIEWSKI (Graduate Student Member, IEEE) received the B.S. degree in information systems from the University of Münster, in 2017, and the M.S. degree in computer science from Osnabrück University, in 2019, where he is currently pursuing the Ph.D. degree with the Distributed Systems Group. His research interests include machine learning for networks, QoE of streaming applications, video and point cloud streaming, and robot communications.
JANEK HABERER received the B.Sc. and M.Sc. degrees in computer science from Kiel University, Germany, in 2019 and 2021, respectively, where he is currently pursuing the Ph.D. degree with the Distributed Systems Group. His research interests include distributed machine learning and its applications in the Internet of Things, particularly edge computing and split learning.

LEONARD PAELEKE received the B.Eng. degree in mechanical engineering from Berliner Hochschule für Technik (BHT), in 2018, and the M.S. degree in computational engineering from TU Berlin, in 2022. Since April 2022, he has been a Ph.D. Researcher with the Internet-Technologies and Softwarization Group, Hasso Plattner Institute (HPI). His research interests include machine learning for networks, networks for machine learning, distributed machine learning, and their application in mobile telecommunication networks, such as 6G.

REZA POORZARE (Member, IEEE) received the B.S. and M.S. degrees in computer engineering from Azad University, Iran, in 2010 and 2014, respectively, and the Ph.D. degree in network engineering from Universitat Politècnica de Catalunya, Barcelona, Spain, in 2022. Currently, he is a Postdoctoral Researcher with the Data-Centric Software Systems (DSS) Research Group, Institute of Applied Research, Karlsruhe University of Applied Sciences, Karlsruhe, Germany. His research interests include 5G, mmWave, wireless mobile networks, TCP, MPTCP, congestion control, and artificial intelligence.

NIKOLAS WEHNER received the master's degree in computer science from the University of Würzburg, Germany. In 2018, he started working as a Research Engineer with the Center for Technology Experience, AIT Austrian Institute of Technology, Vienna, Austria. Since October 2019, he has been a Ph.D. Researcher with the Chair of Communication Networks, University of Würzburg. His research interests include QoE of Internet applications, machine learning for networks, and user-centric communication networks.

ADRIAN REDDER received the Master of Science degree in electrical engineering from Paderborn University (UPB), Germany, in 2019, with a major in control and information theory, and the Ph.D. degree in computer science, with a thesis on distributed stochastic approximation algorithms, in April 2024. After his graduate studies, he was a member of the Computer Networks Group, UPB; since October 2023, he has been a member of the Automatic Control Group, UPB. His research interests include control theory, reinforcement learning, distributed systems, and stochastic approximation algorithms.

ERIC SAMIKWA received the M.Sc. degree in computer science and engineering from the Royal Institute of Technology (KTH), Stockholm, Sweden, in 2020. He is currently pursuing the Ph.D. degree with the Communication and Distributed Systems Group, Institute of Computer Science, University of Bern, Bern, Switzerland. His research interests include distributed machine learning, federated learning, split learning, edge computing, and the Internet of Things.