Module -IV

The document discusses the use of the NSL-KDD dataset for training machine learning models to detect network intrusions, categorizing attacks into four main types: DOS, R2L, U2R, and probing. It highlights the advantages of the NSL-KDD dataset over the original KDD dataset, such as the absence of redundant records and improved evaluation consistency. Additionally, it explains the concept of confusion matrices for evaluating classification models, detailing performance metrics and their importance in machine learning.

Uploaded by

teddy haile

Ethiopian Defence University, College of Engineering

CT-6713: Machine Learning in Cybersecurity

Capt. Mehari K (Ph.D) Ethiopian University, Engineering College 1

Intrusion Detection Using NSL-KDD dataset
• Software to detect network intrusions protects a computer
network from unauthorized users, possibly including
insiders
• The intrusion detector learning task is to build a predictive
model (i.e. a classifier) capable of distinguishing between
bad connections, called intrusions or attacks, and good
normal connections
• A connection is a sequence of TCP packets starting and
ending at some well-defined times

Contd…
• During a connection, data flows between a source IP address
and a target IP address under some well-defined protocol
• Each connection is labeled as either normal, or as an
attack, with exactly one specific attack type. Each
connection record consists of about 100 bytes

Attacks fall into four main categories:
• DOS: denial-of-service, e.g. SYN flood;
• R2L: unauthorized access from a remote machine, e.g.
guessing passwords;
• U2R: unauthorized access to local superuser (root)
privileges, e.g. various "buffer overflow" attacks;
• Probing: surveillance and other probing, e.g. port
scanning.

Contd…
• It is important to note that the test data is not from the
same probability distribution as the training data
• It includes specific attack types not in the training data
• This makes the task more realistic
• Some intrusion experts believe that most novel attacks are
variants of known attacks
• The "signature" of known attacks can be sufficient to catch
novel variants
• The datasets contain a total of 24 training attack types,
with an additional 14 types in the test data only
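
One way to see the train/test mismatch described above is to list the attack labels that occur only in the test data. The snippet below is a minimal sketch: the two label lists are small hypothetical samples, not values read from the NSL-KDD files, and `unseen_labels` is an illustrative helper name.

```python
# Hypothetical label samples; the real NSL-KDD files are not read here.
train_labels = ["normal", "neptune", "smurf", "nmap", "normal", "back"]
test_labels = ["normal", "neptune", "mscan", "apache2", "smurf"]


def unseen_labels(train, test):
    """Return, in sorted order, the labels that occur in test but never in train."""
    return sorted(set(test) - set(train))


novel = unseen_labels(train_labels, test_labels)
print("Labels seen only in the test sample:", novel)
```

A classifier trained on `train_labels` has never seen the labels in `novel`, so it can only flag such records if they resemble known attacks.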

Training attacks
Attack             Category
back               dos
buffer_overflow    u2r
ftp_write          r2l
guess_passwd       r2l
imap               r2l
ipsweep            probe
land               dos
loadmodule         u2r
multihop           r2l
neptune            dos
nmap               probe
perl               u2r
phf                r2l
pod                dos
portsweep          probe
rootkit            u2r
satan              probe
smurf              dos
spy                r2l
teardrop           dos
warezclient        r2l
warezmaster        r2l

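
The training-attack table can be turned into a lookup that collapses each specific attack label into one of the four categories. This is a sketch under assumptions: the dictionary copies the 22 entries listed in the table, while the names `ATTACK_CATEGORY` and `categorize`, and the choice to return "unknown" for labels outside the table, are illustrative.

```python
from collections import Counter

# Attack label -> category, copied from the training-attack table.
ATTACK_CATEGORY = {
    "back": "dos", "land": "dos", "neptune": "dos",
    "pod": "dos", "smurf": "dos", "teardrop": "dos",
    "ftp_write": "r2l", "guess_passwd": "r2l", "imap": "r2l",
    "multihop": "r2l", "phf": "r2l", "spy": "r2l",
    "warezclient": "r2l", "warezmaster": "r2l",
    "buffer_overflow": "u2r", "loadmodule": "u2r",
    "perl": "u2r", "rootkit": "u2r",
    "ipsweep": "probe", "nmap": "probe",
    "portsweep": "probe", "satan": "probe",
}


def categorize(label):
    """Map a connection label to normal, dos, r2l, u2r, probe, or unknown."""
    if label == "normal":
        return "normal"
    # Assumption: labels missing from the table are reported as "unknown".
    return ATTACK_CATEGORY.get(label, "unknown")


# Number of listed attack types per category.
per_category = Counter(ATTACK_CATEGORY.values())
print(dict(per_category))
print(categorize("smurf"), categorize("normal"), categorize("mscan"))
```

Labels that appear only in the test data fall through to "unknown" here; a real experiment would need its own mapping for them.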
NSL-KDD dataset description
• NSL-KDD is a data set suggested to solve some of the inherent
problems of the KDD'99 data set
• The NSL-KDD data set has the following advantages over the
original KDD data set:
• It does not include redundant records in the train set, so the
classifiers will not be biased towards more frequent records
• There are no duplicate records in the proposed test sets;
• Therefore, the performance of the learners is not biased by the
methods which have better detection rates on the frequent records
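
The effect of redundant records can be illustrated with a few toy rows. The sketch below assumes each record is a tuple whose last field is the label; the field values and the helper name `drop_duplicates` are hypothetical, and this is not the procedure used to build NSL-KDD.

```python
from collections import Counter

# Toy records: (protocol, service, flag, label). Values are illustrative only.
records = [
    ("tcp", "http", "SF", "normal"),
    ("tcp", "private", "S0", "neptune"),
    ("tcp", "private", "S0", "neptune"),
    ("tcp", "private", "S0", "neptune"),
    ("icmp", "ecr_i", "SF", "smurf"),
]


def drop_duplicates(rows):
    """Remove exact duplicate rows while keeping first-seen order."""
    return list(dict.fromkeys(rows))


unique = drop_duplicates(records)
before = Counter(row[-1] for row in records)
after = Counter(row[-1] for row in unique)
print("label counts before:", dict(before))
print("label counts after: ", dict(after))
```

Before deduplication the repeated neptune row dominates the sample, so a learner that simply memorises it looks better than it is; afterwards each distinct record counts once.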

Contd…
• The number of selected records from each difficulty level group is
inversely proportional to the percentage of records in the original
KDD data set
• As a result, the classification rates of distinct machine learning
methods vary over a wider range, which makes it easier to evaluate
different learning techniques accurately
• The number of records in the train and test sets is reasonable,
which makes it affordable to run the experiments on the complete set
without the need to randomly select a small portion
• Consequently, evaluation results of different research works will be
consistent and comparable
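
The inverse-proportional selection mentioned in the first bullet can be sketched as a quota calculation. Everything here is an assumption made for illustration: the group names, their shares, the budget of 1000 records, and the function name are hypothetical, and the slides do not spell out the exact NSL-KDD sampling procedure.

```python
def inverse_proportional_quotas(group_share, budget):
    """Split a record budget so each group's quota is proportional to 1 / share."""
    weights = {group: 1.0 / share for group, share in group_share.items()}
    total_weight = sum(weights.values())
    return {group: round(budget * w / total_weight) for group, w in weights.items()}


# Hypothetical share of the original data held by each difficulty group.
shares = {"easy": 0.5, "medium": 0.25, "hard": 0.25}
quotas = inverse_proportional_quotas(shares, budget=1000)
print(quotas)
```

The group that is most common in the original data receives the smallest quota, which is the idea behind spreading classifier scores over a wider range.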

Demonstration of Random forest Classifier results

The classifier Tree

Confusion Matrix
• A confusion matrix is an N x N matrix used for evaluating the
performance of a classification model, where N is the total number
of target classes
• The matrix compares the actual target values with those predicted
by the machine learning model
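
A minimal sketch of building such an N x N matrix from two label lists follows. It uses the layout these slides adopt for the multi-class case (rows = predicted, columns = actual); the labels and the function name are illustrative assumptions.

```python
def confusion_matrix(actual, predicted, classes):
    """Build an N x N count matrix with rows = predicted and columns = actual."""
    index = {name: i for i, name in enumerate(classes)}
    n = len(classes)
    matrix = [[0] * n for _ in range(n)]
    for a, p in zip(actual, predicted):
        matrix[index[p]][index[a]] += 1
    return matrix


# Hypothetical labels for seven connections.
classes = ["normal", "dos", "probe"]
actual = ["normal", "normal", "dos", "dos", "dos", "probe", "probe"]
predicted = ["normal", "dos", "dos", "dos", "normal", "probe", "dos"]

cm = confusion_matrix(actual, predicted, classes)
for name, row in zip(classes, cm):
    print(f"predicted {name:<7}", row)
```

Note that scikit-learn's `confusion_matrix` uses the transposed layout (rows = true labels, columns = predictions), so it is worth checking the orientation before reading values off a matrix.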

Understanding Confusion Matrix
• The following four basic terms help determine the metrics we are
looking for:
• True Positives (TP): the actual value is positive and the prediction
is also positive
• True Negatives (TN): the actual value is negative and the
prediction is also negative
• False Positives (FP): the actual value is negative but the prediction
is positive. Also known as a Type I error
• False Negatives (FN): the actual value is positive but the prediction
is negative. Also known as a Type II error
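
The four terms can be counted directly from paired labels. In this sketch "attack" is assumed to be the positive class; the label lists and the function name are hypothetical.

```python
def binary_counts(actual, predicted, positive="attack"):
    """Count TP, TN, FP and FN, treating `positive` as the positive class."""
    tp = tn = fp = fn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1  # predicted positive, actually positive
        elif p == positive:
            fp += 1  # predicted positive, actually negative (Type I error)
        elif a == positive:
            fn += 1  # predicted negative, actually positive (Type II error)
        else:
            tn += 1  # predicted negative, actually negative
    return {"TP": tp, "TN": tn, "FP": fp, "FN": fn}


# Hypothetical ground truth and predictions for eight connections.
actual = ["attack", "attack", "attack", "normal",
          "normal", "normal", "normal", "attack"]
predicted = ["attack", "normal", "attack", "normal",
             "attack", "normal", "normal", "attack"]

counts = binary_counts(actual, predicted)
print(counts)
```

In intrusion detection a false negative is a missed attack and a false positive is a false alarm, so the two error types usually carry different costs.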

Confusion Matrix
• A confusion matrix, as the name suggests, is a matrix of
numbers that tells us where a model gets confused
• It is a class-wise distribution of the predictive performance of a
classification model
• That is, the confusion matrix is an organized way of mapping
the predictions to the original classes to which the data belong
• This also implies that confusion matrices can only be used when
the true class labels are known, i.e., in supervised
learning frameworks
• The confusion matrix not only allows the calculation of the
accuracy of a classifier, whether global or class-wise, but also
helps compute other important metrics that developers often use
to evaluate their models

Confusion Matrix
• Confusion matrices computed on the same test set but with
different classifiers can help compare the classifiers' relative
strengths and weaknesses
• They can also indicate how the classifiers might be combined
(ensemble learning) to obtain the best performance

Confusion Matrix for binary classes

The Most Common performance metrics in classification
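
A short sketch of the metrics most often derived from the four counts (accuracy, precision, recall and F1-score) is given here, using their standard definitions. The counts passed in are hypothetical, and returning 0.0 when a denominator is zero is a convention chosen for this example.

```python
def classification_metrics(tp, tn, fp, fn):
    """Compute accuracy, precision, recall and F1 from the four basic counts."""
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical counts for 100 test connections.
metrics = classification_metrics(tp=40, tn=45, fp=5, fn=10)
for name, value in metrics.items():
    print(f"{name:<9} {value:.3f}")
```

Precision answers "of the connections flagged as attacks, how many were attacks?", while recall answers "of the real attacks, how many were flagged?".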

Confusion Matrix for Multiple Classes
• The concept of the multi-class confusion matrix is similar to the
binary-class matrix
• The columns represent the actual or expected class distribution, and
the rows represent the predicted or output distribution by the classifier
• Let us elaborate on the features of the multi-class confusion matrix
with an example
• Suppose we have the test set (consisting of 191 total samples) of a
dataset with the following distribution:

Contd…
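
Class-wise metrics can be read off a multi-class matrix by treating each class in turn as the positive class. The sketch below uses a small made-up 3-class matrix of 100 samples (not the 191-sample example from the slides), with rows = predicted and columns = actual as described earlier; all names and values are assumptions for illustration.

```python
def per_class_metrics(matrix, classes):
    """Class-wise accuracy, precision, recall, F1 (rows = predicted, columns = actual)."""
    n = len(classes)
    total = sum(sum(row) for row in matrix)
    results = {}
    for k, name in enumerate(classes):
        tp = matrix[k][k]
        fp = sum(matrix[k]) - tp                       # rest of row k
        fn = sum(matrix[r][k] for r in range(n)) - tp  # rest of column k
        tn = total - tp - fp - fn
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        if precision + recall:
            f1 = 2 * precision * recall / (precision + recall)
        else:
            f1 = 0.0
        results[name] = {
            "accuracy": (tp + tn) / total if total else 0.0,
            "precision": precision,
            "recall": recall,
            "f1": f1,
        }
    return results


# Made-up confusion matrix for 100 test samples.
classes = ["normal", "dos", "probe"]
matrix = [
    [50, 3, 2],  # predicted normal
    [4, 30, 1],  # predicted dos
    [1, 2, 7],   # predicted probe
]

results = per_class_metrics(matrix, classes)
print(f"{'class':<8}{'acc':>7}{'prec':>7}{'rec':>7}{'f1':>7}")
for name, m in results.items():
    print(f"{name:<8}{m['accuracy']:>7.3f}{m['precision']:>7.3f}"
          f"{m['recall']:>7.3f}{m['f1']:>7.3f}")
```

For a class k, the diagonal cell is its true positives, the rest of row k are false positives, and the rest of column k are false negatives.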

Confusion Matrix for Multiple Classes
• Assignment: Explain how to use and interpret results using a
confusion matrix
• Maximum marks: 10%
• Use a numerical example
• Using this concept, calculate the class-wise accuracy, precision,
recall, and F1-scores and put the results in a table
• Submit in PPTX format to the address: [email protected]
• Due date: 5/12/2024, by midnight
• Penalty: marks will be deducted for submissions after the due date
