0% found this document useful (0 votes)

7 views

Project Report - Performance of Various ML Algorithms

Uploaded by

guptakomal12122001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Project Report - Performance of Various ML Algorithms

Uploaded by

guptakomal12122001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 46

1

Performance of Various ML Algorithms for

detection of DDoS Attack

REPORT on B. Tech Project

(CS 4272)

Jaykishan Padia (2020CSB032)

Amrita Kesh (2020CSB036)
Komal Gupta (2020CSB087)

under the guidance of

Prof. Sipra Das Bit

DEPARTMENT OF COMPUTER SCIENCE AND TECHNOLOGY,

IIEST, SHIBPUR
HOWRAH – 711103

2023-2024
2

Declaration

I hereby declare that this thesis is the record of bona fide research work
carried out by us under the supervision of Dr. Sipra Das Bit, Professor,
Department of Computer Science and Technology, Bengal Engineering and
Science University, Shibpur. I further declare that this thesis has not
previously formed the basis for the award of any degree, diploma, associate
ship, fellowship or other similar title of recognition.

Jaykishan Padia
Enrollment No. 2020CSB032

Amrita Kesh
Enrollment No. 2020CSB036

Komal Gupta
Enrollment No. 2020CSB087
3

This page is intentionally left blank

Acknowledgements

I would like to thank my supervisor Prof. Sipra Das Bit for her guidance,
support and advice at all stages of our work. I would also like to thank the
head Prof. Apurba Sarkar and all other professors of the Department of
Computer Science and Technology
for their valuable suggestions.
5

Contents

1. Introduction………………………………………………………………………………10
2. Related Works………………………………………………..………………………….11
2.1. Literature Review…………………………………………………………………11
2.2. Motivation………………………………………………………………………….12
2.3. Objective…………………………………………………………………………..13
2.4. Relevance…………………………………………………………….………….. 13
3. System Model……………………………………………………………………………14
3.1. Architecture……………………………………………………………………….14
3.2. Proposed Model…………………………………………………………………..15
3.2.1. Processing Model………………………………………………………….15
3.2.2. Tools for Analysis…………………………………………………………..15
3.2.3. Input and Output……………….……………….……………….…..……..15
3.3. Various DDoS Attacks considered……………………………………...………16
3.3.1. Udp Flood…………………………………………………………….…….16
3.3.2. Udp Lag Attack…………………………………………………………….16
3.3.3. NetBIOS Amplification…………………………………………………….16
3.3.4 Syn Flood…………………………………………………………………..17
3.3.5 LDAP Reflection……………………………………………….…………..17
3.3.6 MSSQL ……………………………………………………….……………17
3.4. Algorithms Used………………………………………………………….……….18
3.4.1 Logistic regression…………………………………………………………18
3.4.2 Random Forest……………………………………………………………..18
3.4.3 Naive-Bayes……………………………………………………………..…19
3.5. Metrics Used…………………………………………………..…………………..19
3.5.1 Accuracy…………………………………………………..….…………….19
3.5.2 Precision…………………………………………………..….…………….20
3.5.3 Recall…….……………………………………………..………….………. 20
3.5.4 F1 Score.….……………………………………………..………...……….20
6

4. Analysis on Performance of Classification Algorithms………………………………21

4.1. UDP Flood...……………….……………….……………….…………….………22
4.1.1. Random Forest Classifier………….……………….…………………….23
4.1.2. Logistic Regression………….……………….…………………………...24
4.1.3. Naive Bayes………….……………….……………………………………25

4.2. UDP Flood……………….……………….……………….………………………26

4.2.1. Random Forest Classifier………….……………….…………………….27
4.2.2. Logistic Regression………….……………….…………………………...28
4.2.3. Naive Bayes………….……………….……………………………………29

4.3. Syn Flood…………………………………………………………………….……30

4.3.1. Random Forest Classifier………….……………….…………………….31
4.3.2. Logistic Regression………….……………….………………………….. 32
4.3.3. Naive Bayes………….……………….………………………………….. 33

4.4. LDAP Reflection………………………………………………………………….34

4.4.1. Random Forest Classifier………….……………….……………………35
4.4.2. Logistic Regression………….……………….…………………………...36
4.4.3. Naive Bayes………….……………….……………………………………37

4.5. MSSQL………….……………………….……..…………………….……………38
4.5.1. Random Forest Classifier………………………….…….……………….39
4.5.2. Logistic Regression…………………….……………..….……………….40
4.5.3. Naive Bayes …………………….……………….………….………….…41

5. Performance Evaluation………….………………….………………………….………42
6. Conclusion……………….……………….……………….………………..……………43
6.1. Future Scope…………………….……………….……………….………………44
6.1.1. Real-Time Data Ingestion…………………….……………….………….44
6.1.2. Stream Processing Frameworks…………………………...……………44
6.1.3. User Interface and Visualization…………………….………....………..44
6.1.4. Trigger-Based Alerting…………………….……………….……………..44
References………….……………………….……………………….…………………………45
7

List of figures

Figure 1: DDoS attack taxonomy………………………………….………21

Figure 2: Relative importance of different parameters for a UDP attack
……………………………………………………………….……22
Figure 3: Bar graph for comparing Precision, Recall and F1 Score for
Random Forest Classifier……………………………………….23
Figure 4: Bar graph for comparing Precision, Recall and F1 Score for
Logistic Regression………………………………………………24
Figure 5: Bar graph for comparing Precision, Recall and F1 Score for
Naive Bayes………………………………………………………25
Figure 6: Relative importance of different features for a Netbios attack
……………………………………………………………………..26
Figure 7: Bar graph for comparing Precision, Recall and F1 Score for
Random Forest Classifier……………………………………….….27
Figure 8: Bar graph for comparing Precision, Recall and F1 Score for
Logistic Regression………………………………………………28
Figure 9: Bar graph for comparing Precision, Recall and F1 Score for
Naive Bayes………………………………………………………29
Figure 10:Relative importance of different parameters for a Syn Flood
attack……………………………………………………………...30
Figure 11: Bar graph for comparing Precision, Recall and F1 Score for
Random Forest Classifier………………………………...……31
Figure 12: Bar graph for comparing Precision, Recall and F1 Score for
Logistic Regression…………………………………………….32
Figure 13: Bar graph for comparing Precision, Recall and F1 Score for
Naive Bayes……………………………………………………..33
8

Figure 14: Relative importance of different parameters for a Ldap

attack……………………………………………….…………….34
Figure 15: Bar graph for comparing Precision, Recall and F1 Score for
Random Forest Classifier……………………………………...35
Figure 16: Bar graph for comparing Precision, Recall and F1 Score for
Logistic Regression…………………….………………………36
Figure 17: Bar graph for comparing Precision, Recall and F1 Score for
Naive Bayes……………………………………………………..37
Figure 18: Relative importance of different parameters for a Syn Flood
attack…………………………………………………………..…38
Figure 19: Bar graph for comparing Precision, Recall and F1 Score for
Random Forest Classifier…………………………………..….39
Figure 20: Bar graph for comparing Precision, Recall and F1 Score for
Logistic Regression……………………………………….……40
Figure 21: Bar graph for comparing Precision, Recall and F1 Score for
Naive Bayes……………………………………………………..41
Figure 22: Performance evaluation Table………………………………...42
9

Abstract
Distributed Denial of Service (DDoS) attack is a menace to network security that aims at
exhausting the target network with malicious traffic. This research explores the effectiveness of
different machine learning (ML) classification algorithms in detection of Distributed Denial of
Service (DDoS) attacks. We compare the performance of logistic regression, random forest and
Naive-Bayes algorithms in identifying DDoS attacks. Our work primarily focuses on
preprocessing datasets, training machine learning models, and making predictions based on the
trained models. The study evaluates the algorithms based on various metrics such as accuracy,
precision, recall, and F1-score.
10

Chapter-1
Introduction
Network security is one of the most important challenges that we face today. Among the many
threats, Distributed Denial of Service (DDoS) attack is a very powerful technique to attack
internet resources.

A DDoS attack is a malicious attempt to disrupt the normal traffic of a targeted server, service,
or network by overwhelming the target or its infrastructure with a flood of internet traffic.

DDoS attacks are carried out with networks of Internet-connected machines.

These networks consist of computers and other devices (such as IoT devices) which have been
infected with malware, allowing them to be controlled remotely by an attacker. These
individual devices are referred to as bots (or zombies), and a group of bots is called a botnet.

ML techniques are good as they do not have any prior known data distribution, but defining the
best feature-set is one of the main concerns for them.
11

Chapter-2
Related Works

2.1 Literature Review

Distributed Denial of Service (DDoS) attacks pose significant challenges to network security and
availability, prompting researchers to develop sophisticated detection mechanisms to mitigate their
impact. In recent years, various approaches have been proposed to detect and mitigate DDoS attacks,
leveraging advanced machine learning and statistical techniques. This literature review examines two
notable studies in the field of DDoS attack detection, highlighting their methodologies, contributions,
and findings.

Jin and Yeung (2004) proposed a covariance analysis model for DDoS attack detection, as presented in
[1]. The study focused on analyzing the covariance structure of network traffic features to identify
patterns indicative of DDoS attacks. By capturing correlations and relationships between different
traffic attributes, such as packet sizes, transmission rates, and protocol types, the model aimed to
distinguish between normal and attack traffic effectively. Experimental results demonstrated the
effectiveness of the covariance analysis model in detecting various forms of DDoS attacks, showcasing
its potential for enhancing network security.

In [2], Subbulakshmi et al. (2011) introduced an approach for DDoS attack detection using enhanced
support vector machines (SVMs) with real-time generated datasets. The study leveraged SVMs, a
popular machine learning algorithm known for its ability to classify complex and high-dimensional
data, to identify DDoS attack patterns from network traffic data. To address the challenges of limited
and imbalanced training data, the researchers proposed a method for generating synthetic datasets in
real-time, enabling the SVM model to adapt and learn from dynamic network environments.
Experimental evaluations demonstrated the effectiveness of the proposed approach in accurately
detecting DDoS attacks while minimizing false positives.
12

In a similar vein, Singh and De (2015) presented an approach for DDoS attack detection using
classifiers, as discussed in [6]. The study explored the use of various classification algorithms to
distinguish between normal and attack traffic based on features extracted from network packets. By
training classifiers on labeled datasets and evaluating their performance, the researchers aimed to
identify the most effective algorithm for detecting DDoS attacks in real-world scenarios.

Both studies underscore the importance of leveraging advanced machine learning and statistical
techniques for DDoS attack detection. While Jin and Yeung (2004) focused on covariance analysis to
identify attack patterns, Subbulakshmi et al. (2011) explored the use of SVMs with real-time generated
datasets for adaptive detection. These contributions highlight the diverse strategies and methodologies
employed by researchers to combat the evolving threat landscape of DDoS attacks, paving the way for
more robust and adaptive detection mechanisms in the future.

Prior research has investigated the use of statistical methods[1][3] and machine learning
methods[2][6][11]. Previous works have demonstrated the effectiveness of ML-based approaches in
detecting anomalies and identifying malicious traffic patterns. However, there is a need for further
research to evaluate the performance of different ML algorithms under DDoS attacks.

2.2 Motivation

The increasing prevalence and sophistication of Distributed Denial of Service (DDoS) attacks pose a
significant threat to modern network infrastructures, including emerging IoT environments. With the
proliferation of interconnected devices the susceptibility of networks to DDoS attacks has heightened,
necessitating robust detection and mitigation strategies. This project aims to address this critical need
by evaluating the performance of different classification algorithms in detecting and mitigating various
types of DDoS attacks, thereby enhancing the security and resilience of networks.
13

2.3 Objective

The primary objective of this project is to compare the effectiveness of logistic regression, random
forest, and naive Bayes classification algorithms in detecting and mitigating DDoS attacks targeting
IoT devices. Specifically, the project aims to assess the accuracy, precision, and F1 score of each
algorithm in distinguishing between normal network traffic and different types of DDoS attacks,
including UDP flood, SYN flood, LDAP reflection, and others. By achieving these objectives, the
project seeks to provide valuable insights into the strengths and limitations of different machine
learning techniques for combating DDoS threats in IoT environments, ultimately contributing to the
development of more adaptive and resilient cybersecurity solutions.

2.4 Relevance

While this project may not be directly based on 5G IoT devices, its findings hold significant relevance
in the context of 5G IoT environments. With the rapid deployment of 5G technology and the
proliferation of interconnected IoT devices, the threat landscape for DDoS attacks has expanded
exponentially. Therefore, understanding the performance of classification algorithms in detecting and
mitigating DDoS attacks is crucial for ensuring the security and resilience of 5G IoT networks. By
benchmarking logistic regression, random forest, and naive Bayes algorithms against various types of
DDoS attacks, this work provides valuable insights into their effectiveness and applicability in 5G IoT
environments. The findings of this project can inform the development of tailored cybersecurity
solutions and strategies to safeguard 5G IoT networks.
14

Chapter-3
System Model

3.1 Architecture

The architecture of our proposed system will include our computer/server system connected to the
network.

Our computer/server system will do the task of network data collection and DDoS attack
detection.

The data consisting of 80 traffic features will be extracted using CICFlowMeter[9]. Afterwards,
this data will be passed through our trained ML models which will detect whether or not our
network is experiencing a DDoS attack.

The detection of network will be done when:

● There is a suspicious amount of traffic originating from a single IP address or IP range.

● A flood of traffic from users who share a single behavioral profile, such as device type or
geolocation, or web browser version.
● An immediate surge in requests to a single page or endpoint.
● Odd traffic patterns of sudden spikes which appear unnatural

The monitoring of network traffic will be done continuously and in suspicious cases, we will use
the ML models to detect whether there is a DDoS attack on the system.
15

3.2 Proposed Model

We propose to evaluate the performance of three ML algorithms (logistic regression, random

forest and Naive Bayes) regarding detection of DDoS attacks. These algorithms are chosen for
their simplicity, scalability, and interpretability. To train the models, we will use the
CIC-DDoS2019 dataset. [10]

3.2.1 Processing Model

Before training the models, we will preprocess the data to ensure its suitability for ML
algorithms. This preprocessing phase involves tasks such as data cleaning, feature selection,
normalization, and handling of missing values. We used CICFlow Meter to analyse the
network traffic data [12]. By preparing the data in a structured and standardized format, we
aim to enhance the robustness and effectiveness of the ML models.

3.2.2 Tools for analyzing

Libraries utilized for data preprocessing, model training, and performance evaluation include
popular Python libraries such as pandas, scikit-learn, and matplotlib for data manipulation,
machine learning, and visualization, respectively. Running Location - The machine learning
algorithms are deployed on a centralized server or cloud platform, where they analyze
incoming data streams.

3.2.3 Input and Output

The input to the algorithms consists of features extracted from network traffic, system logs,
and device telemetry data. These features include packet headers, traffic volume, packet
length, protocol type, and other relevant attributes. The output of the algorithms is a binary
classification indicating whether a DDoS attack is detected or not. This output provides
actionable insights for network administrators and security personnel to respond to potential
security threats in real-time.
16

3.3 Various DDOS attacks considered

DDoS attacks can take various forms, including volumetric, protocol, and application layer
attacks. Volumetric attacks flood the network with a high volume of traffic, consuming
bandwidth and resources. Protocol attacks target vulnerabilities in network protocols,
exploiting weaknesses in packet handling and processing. Application layer attacks focus on
specific services or applications, aiming to exhaust server resources or disrupt communication
channels.

` 3.3.1 UDP FLOOD

UDP flood attacks involve sending a massive volume of UDP (User Datagram Protocol)
packets to a target. These packets may be sent from a single source (single-flow UDP flood) or
from multiple distributed sources (distributed UDP flood). he goal is to consume the target's
network bandwidth, exhaust its computational resources (such as CPU and memory), or
overwhelm its network infrastructure (such as routers and switches).

3.3.2 UDP Lag Attack

UDP Lag attacks, the flood of UDP packets can introduce significant network latency or lag,
causing delays in packet delivery and increasing round-trip times for network communication.
While the primary goal is still to overwhelm the target's resources, UDP Lag attacks may
emphasize the disruption of real-time applications that are sensitive to latency, such as online
gaming, VoIP (Voice over IP), and streaming media.

3.3.3 NetBIOS Amplification

NetBIOS services can be leveraged in reflection and amplification attacks. Attackers send
spoofed NetBIOS queries to vulnerable servers, which then respond with larger NetBIOS
responses to the victim's IP address. This amplification of response traffic can consume the
victim's bandwidth and exhaust its network resources, resulting in a denial of service.
17

3.3.4 SYN FLOOD (Synchronize)

SYN flood attacks exploit the three-way handshake process of the TCP protocol. Attackers
flood the target server with a large number of SYN requests, but they do not complete the
handshake by sending the final ACK packet. This results in the target server maintaining
half-open connections, eventually exhausting its resources and preventing legitimate users from
establishing connections.

3.3.5 LDAP Reflection

LDAP servers can be exploited in reflection attacks similar to DNS and NTP reflection attacks.
Attackers send spoofed LDAP queries to vulnerable servers, which then respond with larger
LDAP responses to the victim's IP address. This amplification of response traffic can
overwhelm the victim's network infrastructure, leading to service disruption.

3.3.6 MSSQL

Attackers can exploit vulnerabilities in Microsoft SQL Server (MSSQL) to launch DDoS
attacks. By sending specially crafted SQL queries or exploiting known vulnerabilities in
MSSQL services, attackers can cause the target server to become unresponsive or crash,
resulting in denial of service for legitimate users.
18

3.4 Algorithms Used

3.4.1 Logistic regression

Logistic regression is a linear model suitable for binary classification tasks and it forms a S
shaped curve (sigmoid). In machine Learning, we use sigmoid to map predictions to
probabilities.

[8]

3.4.2 Random Forest

Random forest is an ensemble learning method that combines multiple decision trees to
improve accuracy and robustness. Random forests are a combination of tree predictors such
that each tree depends on the values of a random vector sampled independently and with the
same distribution for all trees in the forest. The generalisation error for forests converges as to
a limit as the number of trees in the forest becomes large. The generalisation error of a forest
of tree classifiers depends on the strength of the individual trees in the forest and the
correlation between them. Using a random selection of features to split each node yields error
rates that compare favourably to Adaboost, but are more robust with respect to noise. [4]
19

3.4.3 Naive-Bayes

Naive Bayes is simple, scalable and can handle high dimensional data. Naïve Bayes is part of
a family of generative learning algorithms, meaning that it seeks to model the distribution of
inputs of a given class or category. Unlike discriminative classifiers, like logistic regression, it
does not learn which features are most important to differentiate between classes.[13]

[5]

3.5 Metrics Used

3.5.1 Accuracy

The proportion of correctly classified instances by the machine learning algorithms.

[7]
20

3.5.2 Precision

The ratio of true positive predictions to the total number of positive predictions, indicating
the accuracy of positive predictions.

[7]

3.5.3 Recall

The ratio of true positive predictions to the total number of actual positive instances,
measuring the algorithm's ability to identify all positive instances.

[7]

3.5.4 F1 Score

The harmonic mean of precision and recall, providing a balanced measure of the algorithm's
performance.

[7]
21

Chapter-4
Analysis on Performance of classification

Figure 1:DDoS Attack Taxonomy

4.1 UDP FLOOD

This was plotted using the Random Forest Regression algorithm. This is useful for
gaining insights regarding the important features for a particular type of DDoS attack.

Figure 2:relative importance of different parameters for a UDP attack

4.1.1. Random Forest Classifier

● Accuracy: 0.9935479152176735
● Precision: 0.993548146621589
● Recall: 0.999994834821675
● F1 Score: 0.9967610671217557

Figure 3: Bar graph for comparing Precision, Recall and F1 Score

for Random Forest Classifier

4.1.2. Logistic Regression

● Accuracy: 0.9928200112329862

● Precision: 0.9928203940645784

● Recall: 0.9999994562970184

● F1 Score: 0.9963969940233413

Figure 4: Bar graph for comparing Precision, Recall and F1 Score for Logistic Regression
25

4.1.3. Naive Bayes

● Accuracy: 0.9930048885891267

● Precision: 0.993015693902095

● Recall: 0.9999874948314236

● F1 Score: 0.996489400204582

Figure 5: Bar graph for comparing Precision, Recall and F1 Score

4.2 NetBIOS AMPLIFICATION

This was plotted using the Random Forest Regression algorithm. This is useful for gaining
insights regarding the important features for a particular type of DDoS attack.

Figure 6: Relative importance of different features for a NetBios Attack

4.2.1. Random Forest Classifier

● Accuracy: 0.999976844466083

● Precision: 0.9999771367717423

● Recall: 0.9999996991612616

● F1 Score: 0.9999884178392351

Figure 7: Bar graph for comparing Precision, Recall and F1 Score

4.2.2. Logistic Regression

● Accuracy: 0.9583733645982876

● Precision: 0.9942776481173973

● Recall: 0.9635509138501749

● F1 Score: 0.9786739129643197

Figure 8: Bar graph for comparing Precision, Recall and F1 Score

4.2.3. Naive Bayes

● Accuracy: 0.9985535309332372

● Precision: 0.9997380283471395

● Recall: 0.9988146953706935

● F1 Score:0.9992761485686252

Figure 9: Bar graph for comparing Precision, Recall and F1 Score

4.3 SYN FLOOD

This was plotted using the Random Forest Regression algorithm. This is useful for gaining
insights regarding the important features for a particular type of DDoS attack.

Figure 10: Relative importance of different features for a Syn Flood attack
31

4.3.1. Random Forest Classifier

● Accuracy: 0.9990102700580686

● Precision: 0.9990887261190069

● Recall: 0.9999135449134174

● F1 Score: 0.9995009653498113

Figure 11: Bar graph for comparing Precision, Recall and F1 Score
32

4.3.2. Logistic Regression

● Accuracy: 0.9583733645982876

● Precision: 0.9942776481173973

● Recall: 0.9635509138501749

● F1 Score: 0.9786739129643197

Figure 12: Bar graph for comparing Precision, Recall and F1 Score
33

4.3.3. Naive Bayes

● Accuracy: 0.9583731375999545

● Precision: 0.9942775356532425

● Recall: 0.9635506854189249

● F1 Score: 0.9786729914953117

Figure 13: Bar graph for comparing Precision, Recall and F1 Score
34

4.4 LDAP REFLECTION

This was plotted using the Random Forest Regression algorithm. This is useful for gaining insights
regarding the important features for a particular type of DDoS attack.

Figure 14: Relative importance of different features for a Ldap attack

4.4.1. Random Forest Classifier

● Accuracy: 0.992175791429551

● Precision: 0.9915325900231428

● Recall: 0.9998872310662302

● F1 Score: 0.9956923853533081

Figure 15: Bar graph for comparing Precision, Recall and F1 Score
36

4.4.2. Logistic Regression

● Accuracy: 0.9057088409526002

● Precision: 0.9056206168720637

● Recall: 0.9999484484874196

● F1 Score: 0.9504498653147526

Figure 16: Bar graph for comparing Precision, Recall and F1 Score
37

4.4.3. Naive Bayes

● Accuracy: 0.9049857172134531

● Precision: 0.9049888969837413

● Recall: 0.9999162287920568

● F1 Score: 0.9500873011742544

Figure 17: Bar graph for comparing Precision, Recall and F1 Score
38

4.5 MSSQL EXPLOIT

This was plotted using the Random Forest Regression algorithm. This is useful for gaining insights
regarding the important features for a particular type of DDoS attack.

Figure 18: Relative importance of different features for a Mssql attack

4.5.1. Random Forest Classifier

● Accuracy: 0.9981453535427659

● Precision: 0.9981470721747966

● Recall: 0.9999974824101391

● F1 Score: 0.9990714204930858

Figure 19: Bar graph for comparing Precision, Recall and F1 Score
40

4.5.2. Logistic Regression

● Accuracy: 0.9979361518624781

● Precision: 0.9979752096892731

● Recall: 0.9999602580457676

● F1 Score: 0.9989667477453353

Figure 20: Bar graph for comparing Precision, Recall and F1 Score
41

4.5.3. Naive Bayes

● Accuracy: 0.9971901307761687

● Precision: 0.997849051501561

● Recall: 0.99933787386659

● F1 Score: 0.9985929077555391

Figure 21: Bar graph for comparing Precision, Recall and F1 Score
42

Chapter-5
PERFORMANCE EVALUATION
Based on our evaluation results, we determined the preferred ML algorithm for each type of DDoS
attack. The figure below illustrates the preferred algorithm for different DDoS attacks:

DDoS Attack ML Algorithm

UDP Flood Logistic Regression

MSSQL Random Forest Classification

SYN Random Forest Classification

LDAP Random Forest Classification

NetBIOS Random Forest Classification

Figure 22: Preferred Algorithm for different DDoS attacks

The observed variance in the performance of machine learning (ML) algorithms across different types of
Distributed Denial of Service (DDoS) attacks underscores the complex nature of cyber threats and the
need for tailored detection approaches. In our evaluation, Random Forest Classification demonstrated
superior performance for LDAP, MSSQL, NetBIOS, and SYN attacks, while Logistic Regression
exhibited better performance for UDP Flood attacks.

The contrasting performance of Random Forest and Logistic Regression highlights the importance of
selecting the most appropriate ML algorithm based on the characteristics of the targeted attacks. By
leveraging the strengths of different algorithms for different attack types, organizations can enhance
their overall defense mechanisms against evolving cyber threats.

Additionally, our findings underscore the need for ongoing research and development in DDoS attack
detection to address the dynamic nature of cyber threats and ensure robust protection for network
environments.
43

Chapter-6
Conclusion
In conclusion, this project delves into an extensive exploration of the performance of logistic regression,
random forest, and Naive Bayes algorithms in the detection of Distributed Denial of Service (DDoS)
attacks. Through comprehensive analysis and experimentation, the study sheds light on the efficacy of
machine learning (ML)-based methodologies in fortifying cybersecurity measures.

The findings underscore the significance of employing ML approaches for bolstering security protocols,
particularly in the realm of DDoS attack detection. Among the algorithms investigated, random forest
emerges as the frontrunner, exhibiting remarkable accuracy and robustness in identifying and mitigating
DDoS threats. Its ability to handle complex data patterns and maintain high performance levels under
diverse conditions positions it as a formidable tool in the arsenal against cyber-attacks.

Nevertheless, it is imperative to acknowledge the nuanced strengths and limitations inherent in each
algorithm. While random forest excels in various aspects, logistic regression and Naive Bayes
algorithms demonstrate notable potential in real-time DDoS detection. Their relatively simpler structures
and computational efficiency make them viable options, particularly in scenarios where resource
constraints or latency considerations are paramount.

Looking ahead, the trajectory of research in this domain should prioritize continual refinement of
existing algorithms and exploration of alternative ML techniques. Fine-tuning the parameters and
architectures of logistic regression, random forest, and Naive Bayes models could yield further
enhancements in detection accuracy and response efficacy. Additionally, investigating novel ML
methodologies, such as deep learning or ensemble techniques, holds promise for extending the frontier
of cybersecurity defense mechanisms.
44

In essence, this project serves as a stepping stone towards a more comprehensive understanding of
ML-driven cybersecurity paradigms. By leveraging the insights garnered herein, stakeholders can
proactively fortify their systems against evolving cyber threats, thereby fostering a safer and more
resilient digital ecosystem.

6.1. Future Scope

While our current implementation provides a foundation for detecting DDoS attacks using
machine learning algorithms, there are several enhancements and additional features that can be
explored to make the system more effective, scalable, and real-time. Here are some future scope
considerations.

6.1.1. Real-Time Data Ingestion

Implement mechanisms to ingest live network traffic and system logs in real-time,
allowing for continuous monitoring and analysis of incoming data streams.

6.1.2. Stream Processing Frameworks

Explore the use of stream processing frameworks such as Apache Kafka or Apache Flink
to handle high-volume, real-time data streams and enable parallelized processing and
analysis.

6.1.3. User Interface and Visualization

Develop a user-friendly interface and visualization tools to facilitate monitoring, analysis,

and reporting of DDoS attack detection results, enabling security analysts to gain insights
and take appropriate actions effectively.

6.1.4. Trigger-Based Alerting

Implement a trigger mechanism that monitors the output of the detection system and
alerts security personnel in real-time when ongoing DDoS attacks are detected, providing
timely notifications for proactive response and mitigation efforts.
45

REFERENCES
[1] S. Jin and D. S. Yeung, “A covariance analysis model for ddos attack detection,” in 2004 IEEE International
Conference on Communications, vol. 4, pp. 1882–1886 Vol.4, 2004.

[2] T. Subbulakshmi, K. BalaKrishnan, S. M. Shalinie, D. AnandKumar, V.GanapathiSubramanian, and K.

Kannathal, “Detection of ddos attacks using enhanced support vector machines with real time generated dataset,”
in Third International Conference on Advanced Computing, pp. 17–22, 2011.

[3] Freedman, D. A. (2008). Logistic Regression: Why We Cannot Do What We Think We Can Do, and What We
Can Do About It. Journal of the American Statistical Association, 95(450), 1-4.

[4] L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, pp. 5–

32, 2001.

[5] Domingos, P., & Pazzani, M. (1997). The Optimality of Naive Bayes. Proceedings of the 13th International
Conference on Machine Learning, 118-126. - Naive Bayes

[6] K. J. Singh and T. De, “An approach of ddos attack detection using classifiers,” Emerging Research in
Computing, Information, Communication and Applications, 2015.

[7] Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness
& Correlation. Journal of Machine Learning Technologies, 2(1), 37-63..

[8]Joseph Berkson, (1944) Logistic Regression

[9] CICFlowMeter, 2021 https://github.com/ahlashkari/CICFlowMeter.

[10] CIC-DDoS2019 https://www.unb.ca/cic/datasets/ddos-2019.html

[11]Developing Realistic Distributed Denial of Service (DDoS) Attack Dataset and Taxonomy
https://ieeexplore.ieee.org/document/8888419

[12]B. H. Ali, N. Sulaiman, S. A. R. Al-Haddad, R. Atan and S. L. M. Hassan, "DDoS Detection Using Active
and Idle Features of Revised CICFlowMeter and Statistical Approaches," 2022 4th International Conference on
Advanced Science and Engineering (ICOASE), Zakho, Iraq, 2022, pp. 148-153, doi:
10.1109/ICOASE56293.2022.10075591. keywords: {Sensitivity;Databases;Scalability;Bidirectional
control;Denial-of-service attack;Feature extraction;Entropy;Sequential probability ratio test;Shannon
Entropy;Confusion Matrix;CICFlowMeter;DDoS},

[13] https://www.ibm.com/topics/naive-bayes
46

oscp+
100% (1)
oscp+
6 pages
01 JWT Authentication in Spring Boot 3 With Spring Security 6
No ratings yet
01 JWT Authentication in Spring Boot 3 With Spring Security 6
38 pages
Godrej Placement-Paper - Free Online Questions and Answers
No ratings yet
Godrej Placement-Paper - Free Online Questions and Answers
6 pages
DDoS(research_paper) (3)
No ratings yet
DDoS(research_paper) (3)
5 pages
DDoS Detection A Network Guradian A Threat Stopper
No ratings yet
DDoS Detection A Network Guradian A Threat Stopper
10 pages
Major Project Research
No ratings yet
Major Project Research
6 pages
20BIT0127
No ratings yet
20BIT0127
32 pages
SSRN Id4515135
No ratings yet
SSRN Id4515135
10 pages
applsci-13-03183
No ratings yet
applsci-13-03183
27 pages
New Detect 2
No ratings yet
New Detect 2
23 pages
journal.pone.0312425
No ratings yet
journal.pone.0312425
29 pages
9 PresentationTemplate JU PMSCS-3e7ac0
No ratings yet
9 PresentationTemplate JU PMSCS-3e7ac0
15 pages
DDoS Detection A Network Guradian A Threat Stopper AI
No ratings yet
DDoS Detection A Network Guradian A Threat Stopper AI
10 pages
4paprre
No ratings yet
4paprre
6 pages
1180-Article_Text-10722-1-10-20241005_241111_180753[1]
No ratings yet
1180-Article_Text-10722-1-10-20241005_241111_180753[1]
18 pages
Screenshot 2024-03-06 at 10.49.16 AM
No ratings yet
Screenshot 2024-03-06 at 10.49.16 AM
5 pages
Synopsis Batch Mitm
No ratings yet
Synopsis Batch Mitm
12 pages
IOP with vivek
No ratings yet
IOP with vivek
13 pages
Ajresd Template Cnaiadd24
No ratings yet
Ajresd Template Cnaiadd24
8 pages
capestone team 69
No ratings yet
capestone team 69
18 pages
1-s2.0-S0045790624002052-main[1]
No ratings yet
1-s2.0-S0045790624002052-main[1]
19 pages
IJSATE032503
No ratings yet
IJSATE032503
7 pages
Algorithms 17 00099 v2
No ratings yet
Algorithms 17 00099 v2
21 pages
Ddos Attacks + Description Data Set
No ratings yet
Ddos Attacks + Description Data Set
27 pages
Enhancing Cybersecurity: Machine Learning Approaches For Predicting Ddos Attack
No ratings yet
Enhancing Cybersecurity: Machine Learning Approaches For Predicting Ddos Attack
7 pages
13-Sarah+Zghair+Arrak
No ratings yet
13-Sarah+Zghair+Arrak
15 pages
DDos Attack Prediction - DL
No ratings yet
DDos Attack Prediction - DL
5 pages
NCFTEAS - 2024 Paper 16
No ratings yet
NCFTEAS - 2024 Paper 16
8 pages
Implementation_of_QOS_in_SDN_and_Distributed_Networks_for_mitigation_of_DDOS_based_attacks_using_Mach
No ratings yet
Implementation_of_QOS_in_SDN_and_Distributed_Networks_for_mitigation_of_DDOS_based_attacks_using_Mach
6 pages
Deep Learning Approaches For Detecting Ddos Attacks: A Systematic Review
No ratings yet
Deep Learning Approaches For Detecting Ddos Attacks: A Systematic Review
37 pages
Performance Comparison of Machine Learning and Deep Learning Models in DDoS Attack Detection _ SpringerLink
No ratings yet
Performance Comparison of Machine Learning and Deep Learning Models in DDoS Attack Detection _ SpringerLink
10 pages
Deep Learning Approach To DDoS Attack With Imbalanced Data at The Application Layer
No ratings yet
Deep Learning Approach To DDoS Attack With Imbalanced Data at The Application Layer
8 pages
Machine Learning Approaches For Combating Distributed Denial of Service Attacks in Modern Networking Environments
No ratings yet
Machine Learning Approaches For Combating Distributed Denial of Service Attacks in Modern Networking Environments
29 pages
Mmep 10.04 04
No ratings yet
Mmep 10.04 04
10 pages
Development Machine Learning Techniques
No ratings yet
Development Machine Learning Techniques
11 pages
Predict and Prevent DDOS Attacks Using Machine Lea
No ratings yet
Predict and Prevent DDOS Attacks Using Machine Lea
13 pages
Feature Selection For Ddos Detection Using Classification Machine Learning Techniques
No ratings yet
Feature Selection For Ddos Detection Using Classification Machine Learning Techniques
9 pages
Semi-supervised Machine Learning Approach for DDoS Detection (2)
No ratings yet
Semi-supervised Machine Learning Approach for DDoS Detection (2)
11 pages
DDos Attack Prediction - ML
No ratings yet
DDos Attack Prediction - ML
5 pages
An Ensemble-Based Approach For Effective Distributed Denial of Service Attack Detection in Software Defined Networking
No ratings yet
An Ensemble-Based Approach For Effective Distributed Denial of Service Attack Detection in Software Defined Networking
8 pages
AI-Driven_DDoS_Mitigation_at_the_Edge_Leveraging_Machine_Learning_for_Real-Time_Threat_Detection_and_Response
No ratings yet
AI-Driven_DDoS_Mitigation_at_the_Edge_Leveraging_Machine_Learning_for_Real-Time_Threat_Detection_and_Response
7 pages
Machine_Learning_Algorithms_for_DoS_and_DDoS_Cyberattacks_Detection_in_Real-Time_Environment
No ratings yet
Machine_Learning_Algorithms_for_DoS_and_DDoS_Cyberattacks_Detection_in_Real-Time_Environment
2 pages
AIP - Aip 202203 0005
No ratings yet
AIP - Aip 202203 0005
13 pages
major review 1.2
No ratings yet
major review 1.2
20 pages
Sada
No ratings yet
Sada
11 pages
DDoS Attack Identification and Defense Using SDN Based On Machine Learning Method
No ratings yet
DDoS Attack Identification and Defense Using SDN Based On Machine Learning Method
5 pages
DDoS Detection Using Hybrid Deep Neural Network Approaches
No ratings yet
DDoS Detection Using Hybrid Deep Neural Network Approaches
8 pages
DDos and big data-1
No ratings yet
DDos and big data-1
18 pages
Efficient Detection of DDoS Attacks Using A Hybrid Deep Learning Model With Improved Feature Selection
No ratings yet
Efficient Detection of DDoS Attacks Using A Hybrid Deep Learning Model With Improved Feature Selection
22 pages
Elevating Cybersecurity Using AI and Deep Learning for Intrusion Detection Reinforcement
No ratings yet
Elevating Cybersecurity Using AI and Deep Learning for Intrusion Detection Reinforcement
27 pages
Computational Intelligent Techniques To Detect DDOS Attacks: A Survey
No ratings yet
Computational Intelligent Techniques To Detect DDOS Attacks: A Survey
18 pages
Computational Intelligent Techniques To Detect DDOS Attacks: A Survey
No ratings yet
Computational Intelligent Techniques To Detect DDOS Attacks: A Survey
18 pages
DDOS Attack Classifier Using Machine Learning
No ratings yet
DDOS Attack Classifier Using Machine Learning
6 pages
3 Ai4ddos
No ratings yet
3 Ai4ddos
7 pages
Project
No ratings yet
Project
20 pages
RTL-DL: A Hybrid Deep Learning Framework For Ddos Attack Detection in A Big Data Environment
No ratings yet
RTL-DL: A Hybrid Deep Learning Framework For Ddos Attack Detection in A Big Data Environment
16 pages
Detection_of_Distributed_Denial_of_Service_Attacks
No ratings yet
Detection_of_Distributed_Denial_of_Service_Attacks
13 pages
IJISAE 14 Shankar+G 10 2164
No ratings yet
IJISAE 14 Shankar+G 10 2164
17 pages
Symmetry 14 01095
No ratings yet
Symmetry 14 01095
15 pages
Group 9 - Real-time DDoS Detection using Machine Learning
No ratings yet
Group 9 - Real-time DDoS Detection using Machine Learning
11 pages
DDOS in NMIMS Temp
No ratings yet
DDOS in NMIMS Temp
23 pages
s41598-024-67984-w
No ratings yet
s41598-024-67984-w
18 pages
Securing ChatGPT: Best Practices for Protecting Sensitive Data in AI Language Models
From Everand
Securing ChatGPT: Best Practices for Protecting Sensitive Data in AI Language Models
Matthew C. Smith
No ratings yet
5g Architecture
No ratings yet
5g Architecture
26 pages
Phillip's Curve Dornbusch Startz
No ratings yet
Phillip's Curve Dornbusch Startz
6 pages
atII Bks Lec 2021 34 35
No ratings yet
atII Bks Lec 2021 34 35
13 pages
Final - Support Vector Machine - Class - Modifie
No ratings yet
Final - Support Vector Machine - Class - Modifie
69 pages
00-Installations - Rev G
No ratings yet
00-Installations - Rev G
69 pages
Nso303 (Cisco Nso Administration and Devops) 3.0: Objetivo
No ratings yet
Nso303 (Cisco Nso Administration and Devops) 3.0: Objetivo
2 pages
Mini Project Report-Template
No ratings yet
Mini Project Report-Template
12 pages
Weld 334 M
No ratings yet
Weld 334 M
94 pages
Ideapad 100S 14 Spec
No ratings yet
Ideapad 100S 14 Spec
1 page
NIOS 6.0.0 CLIGuide
No ratings yet
NIOS 6.0.0 CLIGuide
114 pages
spc58nh Disp
No ratings yet
spc58nh Disp
7 pages
Introduction To Devops - PostQuiz - Attempt Review
No ratings yet
Introduction To Devops - PostQuiz - Attempt Review
3 pages
Sketch and Guess Student Guide Challenge
No ratings yet
Sketch and Guess Student Guide Challenge
10 pages
Arun - Data Engineer - Resume
No ratings yet
Arun - Data Engineer - Resume
2 pages
Towel Workout
No ratings yet
Towel Workout
4 pages
FBAcc Hack
No ratings yet
FBAcc Hack
2 pages
Send SMS Messages Using Delphi and SMS Messaging Server
No ratings yet
Send SMS Messages Using Delphi and SMS Messaging Server
5 pages
OTP Generation Using SHA
No ratings yet
OTP Generation Using SHA
2 pages
Hfe Pioneer Remote Control PT Num List
No ratings yet
Hfe Pioneer Remote Control PT Num List
20 pages
Radha Bai Gound
No ratings yet
Radha Bai Gound
3 pages
CloudDefense - White Paper
No ratings yet
CloudDefense - White Paper
42 pages
On GSM Based Smart Energy Meter
No ratings yet
On GSM Based Smart Energy Meter
17 pages
Delmia Robotics Manual
No ratings yet
Delmia Robotics Manual
14 pages
DATA VOLT - Branding Questionnaire Fileld by Clinet
No ratings yet
DATA VOLT - Branding Questionnaire Fileld by Clinet
4 pages
Prefix To Infix Conversion: Data Structure
No ratings yet
Prefix To Infix Conversion: Data Structure
5 pages
Numerical Methods
No ratings yet
Numerical Methods
72 pages
Recent Trends
No ratings yet
Recent Trends
28 pages
What Is An Operating System
No ratings yet
What Is An Operating System
2 pages
Car Booking Using Java
No ratings yet
Car Booking Using Java
21 pages
Artemii Kropachev Opinion On Certification
No ratings yet
Artemii Kropachev Opinion On Certification
2 pages
Tips For Creating An Instagram Account Without A Mobile Number
No ratings yet
Tips For Creating An Instagram Account Without A Mobile Number
4 pages