0% found this document useful (0 votes)

134 views11 pages

Machine Learning in Traffic Classification of SDN - Final Project Report

This document summarizes a student project that used machine learning models to classify network traffic flows in a software-defined networking (SDN) environment. The student implemented a virtual network using VirtualBox and Open vSwitch. Traffic was generated between hosts using D-ITG and classified using logistic regression (supervised) and K-means clustering (unsupervised). Logistic regression performed better at classifying DNS, Telnet, ping and voice traffic flows based on flow statistics collected by the Ryu SDN controller. The project provided a proof of concept for machine learning-based traffic classification but had limitations that could be addressed in future work.

Uploaded by

faizan jutt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

134 views11 pages

Machine Learning in Traffic Classification of SDN - Final Project Report

Uploaded by

faizan jutt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Machine Learning in Traffic Classification of

SDN Based Networks

ECE 1508 Network Softwarization – Technologies and Enablers:
Final Project Report

Ahmed Khan (998325272)

April 15th, 2019

Abstract

Traffic Classification has become an important research area especially with the advancements

of Machine Learning and Software Defined Networking. In this project, two machine learning models –

Logistic Regression (supervised) and K-Means Clustering (unsupervised) were used to classify DNS,

Telnet, Ping, and Voice traffic flows simulated by the Distributed Internet Traffic Generator (D-ITG) tool.

Each host in the network was connected through an overlay network to an Open vSwitch (OVS). The OVS

was connected to a Ryu controller which collected basic flow statistics between hosts. These statistics

were then parsed by a Python traffic classification script which periodically outputted the learned traffic

labels of each flow. Logistic Regression was found to work much better than K-Means Clustering. Further

improvements in the project could be to add functionality to detect unique traffic flows between the

same pair of source and destination.

Introduction

The classification of traffic flows in today’s IP networks has become an important research area with the

adoption of Machine Learning (ML) techniques and Software Defined Networking (SDN) principles.

Traditional methodologies including identifying traffic based on port number and payload inspection are

not effective due to the dynamic and encrypted nature of current traffic. This project will attempt to

utilize Supervised and Unsupervised ML algorithms to classify flows by their required bandwidth,

required QoS, and their application based on various flow level details as features.

As mentioned earlier, traffic classification using Machine Learning is a growing trend in the network

analytics domain. One application of these techniques is in cybersecurity. Large datasets produced from

enormous Internet traffic flows are difficult to process and analyze, even for talented experts in the field

using sophisticated tools. In addition, ML classification and clustering of flows can help identify network

hotspots and potential bottlenecks. When using bandwidth and QoS of flows as classifiers, we can use

Traffic Engineering (TE) to adjust flow paths and add virtual resources to the network infrastructure.

Finally, identification of applications or web-based protocols is important for forecasting future trends

and ensuring the network can meet the demand. Network classification is therefore of great interest to

ISPs, governments, and enterprise alike.

Implementation
A simple network topology was created using VirtualBox as shown in Figure 1. It consisted of 5 Virtual

Machines: 1 Controller, 1 Layer 2 Switch, and 3 Hosts. The nodes in the network were modeled as VMs

so that network traffic would experience some delay. Another option was to use Mininet but it wasn’t

chosen due to the network simulation taking place within a single VM. An overlay network was deployed
so that the traffic generated by the hosts went through the Switch VM instead of using the VirtualBox

internal switching mechanism to communicate. The hosts in the network used OVS with 2 interfaces.

The first interface was internal and the second connected towards the Switch VM OVS using VXLAN

tunnelling. The Switch OVS had VXLAN interfaces towards the hosts and connected to the Controller VM

directly (i.e. using underlay IP). The Ryu controller was deployed on the Controller VM.

Figure 1: VirtualBox Network Simulation

After the network is successfully deployed and configured, the controller should be aware of packets

flowing through the switch between any of the hosts. The sample Python script, simple_monitor_13.py,

was modified to display the following flow information each second:

• time: the UTC clock value at the time of flow information

• datapath: the switch ID to identify the switch in Ryu

• in-port: the port receiving the incoming traffic

• eth_src: the source MAC address of the flow

• eth_dst: the destination MAC address of the flow

• out-port: the port sending the outgoing traffic

• total_packets: total packets of the flow so far

• total_bytes: total Bytes of the flow so far

This data was used as input data to the traffic_classifier.py script. The script used the input data to

create a Flow object with the following attributes:

• time_start: the UTC clock value at the time the flow is first detected

• datapath: the switch ID to identify the switch in Ryu

• in-port: the port receiving the incoming traffic

• eth_src: the source MAC address of the flow

• eth_dst: the destination MAC address of the flow

• out-port: the port sending the outgoing traffic

• forward_packets: the total number of packets seen in the forward direction (src -> dst)

• forward_bytes: the total number of Bytes seen in the forward direction (src -> dst)

• forward_delta_packets: the number of packets seen since the last forward flow detection

• forward_delta_bytes: the number of Bytes seen since the last forward flow detection

• forward_inst_pps: the instantaneous packets per second in the forward direction (src->dst)

• forward_avg_pps: the average packets per second in the forward direction (src->dst)

• forward_inst_bps: the instantaneous Bytes per second in the forward direction (src->dst)

• forward_avg_bps: the average Bytes per second in the forward direction (src->dst)

• forward_status: the status (active/inactive) of the forward flow

• forward_last_time: the UTC clock value of the last time forward flow was detected
• reverse_packets: the total number of packets seen in the reverse direction (dst -> src)

• reverse _bytes: the total number of Bytes seen in the reverse direction (dst -> src)

• reverse _delta_packets: the number of packets seen since the last reverse flow detection

• reverse _delta_bytes: the number of Bytes seen since the last reverse flow detection

• reverse _inst_pps: the instantaneous packets per second in the reverse direction (dst->src)

• reverse _avg_pps: the average packets per second in the reverse direction (dst->src)

• reverse _inst_bps: the instantaneous Bytes per second in the reverse direction (dst->src)

• reverse _avg_bps: the average Bytes per second in the reverse direction (dst->src)

• reverse _status: the status (active/inactive) of the reverse flow

• reverse _last_time: the UTC clock value of the last time reverse flow was detected

The traffic_classifier.py script can do the following tasks:

1) Collect Training Data – collect training data for a specified traffic type. The traffic must be

flowing between two hosts before the script is run.

2) Classify using Supervised Machine Learning – classify traffic type of flow between hosts using

Logistic Regression

3) Classify using Unsupervised Machine Learning – classify traffic type of flow between hosts using

K-Means Clustering.

Simulating Traffic Flows

The Distributed Internet Traffic Generator (D-ITG) application was used to generate the traffic flow data

used for training the Machine Learning models. D-ITG is described as ‘a platform capable to produce

IPv4 and IPv6 traffic by accurately replicating the workload of current Internet applications. D-ITG can

generate traffic following stochastic models for packet size (PS) and inter departure time (IDT) that

mimic application-level protocol behavior. D-ITG is able to replicate statistical properties of traffic of
different well-known applications (e.g. Telnet, VoIP – G.711, G.723, G.729, Voice Activity Detection,

Compressed RTP – DNS, network games’ [1]. For the purposes of this proof of concept traffic

classification, the following traffic types were used: Ping, Telnet, DNS, Voice (G.711). The choice of traffic

classes was due to limitations in simulation tools and issues faced in using D-ITG as discussed further in

the Limitations and Future Work section.

D-ITG describes the traffic used as follows:

• Telnet - Generates traffic with Telnet characteristics. It works with TCP transport layer protocol.

Different settings will be ignored.

• DNS - Generates traffic with DNS characteristics. It works with both UDP and TCP transport layer

protocols.

• VoIP (voice) - Generate traffic with VoIP characteristics. It only works with UDP transport layer

protocol. Different settings will be ignored. The emulation of G.711 codec is used.

For Ping traffic, a simple ‘ping’ command was run with the overlay IP of the destination host.

The process to collect training data for the models is as follows: First simulate the flow of the specific

traffic between a certain pair of hosts using D-ITG or other tools. Second, start the traffic_classifier.py

script with the appropriate options for training the traffic type. The script starts the Ryu controller and

simple_monitor_AK.py (the modified version of simple_monitor_13.py). The data generated from

simple monitor script is collected and transformed to update the attributes of a Flow object. The

attributes of the Flow object are then periodically outputted to a CSV file. After the CSV files for each

traffic type are generated, they are combined in a complete Pandas Dataframe object used for the

model training and testing.

Supervised Learning - Logistic Regression

The first Machine Learning algorithm used was a supervised Logistic Regression model. It is used to

predict categorical target variables. Mostly, the outcome is a binary value but in the case of multiple

targets, it picks the target with the highest probability of occurring. In our program, Logistic Regression

performed exceptionally well with an accuracy of over 99%. It is clear to see from the decision

boundaries, Figure 2, formed with the first two Principle Components why the accuracy is so high.

Figure 2: Decision Boundaries for Logistic Regression

A Confusion Matrix can help pinpoint where the model is failing to accurately decide the target. It is a

matrix with predicted labels on the y-axis and true labels on the x-axis. If a model is tending to predict a

certain traffic class as another class more often, then it will be clearly evident in the Confusion Matrix.

However, we see, in Figure 3, that for Logistic Regression, there is almost no failure.
Figure 3: Confusion Matrix for Logistic Regression

Unsupervised Learning – K-Means Clustering

K-Means Clustering is an unsupervised Machine Learning model that groups data points in multi-

dimensional space together to form clusters and label each data point in the cluster the same. The initial

desired number of clusters are chosen – in our case of four traffic types, four clusters were used as an

input parameter to the model. In Figure 4 below, we can attempt to visualize where each centroid of the

four clusters are. Each square represents a dimension. The darker the color, the higher the value in that

dimension. This shows where in the 12-dimensional vector space, the cluster centroids lie.

Figure 4: Visualization of Cluster Centroids

To visualize model performance, we again use Principle Component Analysis to reduce the number of

dimensions from 12 to 2 and then plot in two-dimensional space. We can clearly understand why model

performance for this algorithm was only around 30%. It is not very good at clustering non-circularly

groups of data.

Figure 5: Visualizing K-Means Clustering Performance

Finally, we confirm our understanding by looking at the confusion matrix and seeing a very disbalanced

matrix.

Figure 6: Confusion Matrix of K-Means Clustering

Limitations and Future Work

Although the proof of concept design of this problem works well, it is limited in the following cases:

1) It does not classify multiple flows between the same source and destination nodes. For example,

if we start a ping command from host1 to host2, then add voice traffic from host1 to host2, it

will assume the ping and voice are 1 unique flow and classify as such.

2) If a flow is started and stopped, the algorithm will not delete the old flow but instead update it

with the latest flow statistics. This is a problem because the flow classifier considers average

packet size and average number of Bytes. If the flow is stopped for a significant amount of time,

these two features will be reduced, and the resulting label of traffic will be inaccurate.

3) The simulation tool (D-ITG) was unable to generate flows for game play or other video traffic.

With a lot of internet traffic being video nowadays, this would have been helpful for real world

classification situations.

4) We see that unsupervised learning using K-Means clustering performs very poorly. Perhaps this

model needs to be further tuned using Hyperparameter tuning. We can also consider other

models such as DBSCAN (Density Based Scanning) or CNN (Convoluted Neural Networks) for

future work on this project.

In addition to the above, the following items will also help improve the quality of the project.

• Create GUI to visualize flow classifications on different SDN controllers

• Use visual analytics to point out ‘hot’ areas of high bandwidth and QoS traffic

• Connect ML application to actual SAVI testbed controllers and visualize real traffic flows
References

[1] Distributed Internet Traffic Generator

http://traffic.comics.unina.it/software/ITG/manual/D-ITG-2.8.1-manual.pdf

[2] Classifying Network Traffic Flows with Deep-Learning

https://www.eleceng.adelaide.edu.au/students/wiki/projects/index.php/Projects:2017s1-
101_Classifying_Network_Traffic_Flows_with_Deep-Learning#Padding_style

[3] QoS -aware Traffic Classification Architecture Using Machine Learning and Deep Packet Inspection
https://reader.elsevier.com/reader/sd/pii/S1877050918307129?token=481A2C61587FE2C6743C320DD
C7612381E99A6BAD5300B53B6C87D475380BC37D4A801FA23FA6B514383FD67EA186865

[4] A Survey of Traffic Classification in Software Defined Networks

https://hoticn.com/files/hoticnPapers/032-paper%20101.pdf

[5] Identification and Selection of Flow Features for Accurate Traffic Classification in SDN
https://ieeexplore.ieee.org/document/7371715?ALU=LU1046369

[6] Unsupervised Learning with Python

https://towardsdatascience.com/unsupervised-learning-with-python-173c51dc7f03

Major Project Sunny - Docxhiii
0% (1)
Major Project Sunny - Docxhiii
52 pages
Project Report Final
No ratings yet
Project Report Final
40 pages
Crime Analysis and Prediction Using Data
No ratings yet
Crime Analysis and Prediction Using Data
7 pages
Anomaly Detection Report
No ratings yet
Anomaly Detection Report
33 pages
Driver Drowsiness Detection System
No ratings yet
Driver Drowsiness Detection System
68 pages
Cryptographic Solutions For Cyber-Physical System Security
No ratings yet
Cryptographic Solutions For Cyber-Physical System Security
179 pages
A Survey and Analysis of Intrusion Detection Models Based On Information Security and Object Technology-Cloud Intrusion Dataset
No ratings yet
A Survey and Analysis of Intrusion Detection Models Based On Information Security and Object Technology-Cloud Intrusion Dataset
8 pages
Ravi Internship Report
No ratings yet
Ravi Internship Report
39 pages
Intrusion Detection System in Software Defined Networks Using Machine Learning Approach
No ratings yet
Intrusion Detection System in Software Defined Networks Using Machine Learning Approach
8 pages
CRIME ANALYSIS AND PREDICTION USING DATA MINING TECHNIQUES (1) (AutoRecovered) (Repaired)
No ratings yet
CRIME ANALYSIS AND PREDICTION USING DATA MINING TECHNIQUES (1) (AutoRecovered) (Repaired)
49 pages
Smart Parking
No ratings yet
Smart Parking
110 pages
LLM Soar
No ratings yet
LLM Soar
27 pages
A Review of Intrusion Detection System
100% (1)
A Review of Intrusion Detection System
3 pages
PUMMP: Phishing URL Detection Using Machine Learning With Monomorphic and Polymorphic Treatment of Features
No ratings yet
PUMMP: Phishing URL Detection Using Machine Learning With Monomorphic and Polymorphic Treatment of Features
20 pages
Netwrokiing Project 1 Semester Esoft Metro Campus
100% (1)
Netwrokiing Project 1 Semester Esoft Metro Campus
41 pages
Thesis Philippine History Tower Defense Android Game
No ratings yet
Thesis Philippine History Tower Defense Android Game
112 pages
Final Report
No ratings yet
Final Report
51 pages
Crime Type and Occurrence Prediction Using Machine Learning
No ratings yet
Crime Type and Occurrence Prediction Using Machine Learning
28 pages
Einal - Report On Predictive Modeling of Global Terrorist Attacks Using Machine Learning PDF
No ratings yet
Einal - Report On Predictive Modeling of Global Terrorist Attacks Using Machine Learning PDF
69 pages
Classification of Malware Attacks Using Machine Learning in Decision Tree
No ratings yet
Classification of Malware Attacks Using Machine Learning in Decision Tree
16 pages
Project-Human Emotion Detection
No ratings yet
Project-Human Emotion Detection
28 pages
Disease Prediction Using ML
100% (1)
Disease Prediction Using ML
43 pages
Malicious Url Detection Based On Machine Learning
No ratings yet
Malicious Url Detection Based On Machine Learning
52 pages
C1a - Anomaly Detection
No ratings yet
C1a - Anomaly Detection
12 pages
Militant and Weapon Detection Final Report
No ratings yet
Militant and Weapon Detection Final Report
63 pages
b3 Plant Leaf Disease Detection
No ratings yet
b3 Plant Leaf Disease Detection
62 pages
A Main Project ON: Intrusion Detection System
No ratings yet
A Main Project ON: Intrusion Detection System
24 pages
Identification of Edible and Non-Edible Mushroom Through Convolution Neural Network
No ratings yet
Identification of Edible and Non-Edible Mushroom Through Convolution Neural Network
10 pages
M.E (FT) 2021 Regulation-Cse Syllabus
No ratings yet
M.E (FT) 2021 Regulation-Cse Syllabus
88 pages
Survey of Machine Learning in Phishing Detection Research
No ratings yet
Survey of Machine Learning in Phishing Detection Research
21 pages
Accident Detection System A Deep Learning Approach To Detect Accidents
No ratings yet
Accident Detection System A Deep Learning Approach To Detect Accidents
4 pages
66
No ratings yet
66
82 pages
Yahya Thesis - Draft
100% (1)
Yahya Thesis - Draft
58 pages
Face Recognition Using CNN
No ratings yet
Face Recognition Using CNN
17 pages
Object Detection Using Yolo
No ratings yet
Object Detection Using Yolo
42 pages
Classification of Fruits and Detection of Disease Using CNN: Bachelor of Engineering IN Information Technology
No ratings yet
Classification of Fruits and Detection of Disease Using CNN: Bachelor of Engineering IN Information Technology
65 pages
A Human-Detection Method Based On YOLOv5 and Trans
No ratings yet
A Human-Detection Method Based On YOLOv5 and Trans
12 pages
IOT Based Fire Detection System - Formatted Paper
No ratings yet
IOT Based Fire Detection System - Formatted Paper
7 pages
Find Where To Park in Real Time Using Opencv: This Problem Can Be Solved Using Deep Learning and Opencv
No ratings yet
Find Where To Park in Real Time Using Opencv: This Problem Can Be Solved Using Deep Learning and Opencv
6 pages
Crime Analysis and Prediction Using Machine Learning
No ratings yet
Crime Analysis and Prediction Using Machine Learning
24 pages
Plant Disease Identification and Crop Management - K - MITRA Report
100% (1)
Plant Disease Identification and Crop Management - K - MITRA Report
59 pages
Detection of Phishing WebsitesUsing Random Forest and XGBOOST
No ratings yet
Detection of Phishing WebsitesUsing Random Forest and XGBOOST
14 pages
"Accident Detection and Alert System": Visvesvaraya Technological University "Jnana Sangama" Belagavi-590018
No ratings yet
"Accident Detection and Alert System": Visvesvaraya Technological University "Jnana Sangama" Belagavi-590018
23 pages
Deep Audio Classification
No ratings yet
Deep Audio Classification
10 pages
Digital Media Marketing Using Trend Analysis On Social Media Seminar Presentation
100% (1)
Digital Media Marketing Using Trend Analysis On Social Media Seminar Presentation
16 pages
Python Programming-Grade 9
No ratings yet
Python Programming-Grade 9
53 pages
Plant Disease Detection Robot Using Raspberry Pi
No ratings yet
Plant Disease Detection Robot Using Raspberry Pi
10 pages
Stroke Prediction Project Report
No ratings yet
Stroke Prediction Project Report
7 pages
Project Report
100% (1)
Project Report
60 pages
Classification of Lung Sounds Using CNN
No ratings yet
Classification of Lung Sounds Using CNN
10 pages
Dpb20043 Chapter 2
No ratings yet
Dpb20043 Chapter 2
140 pages
Minor Project Alcohal Detection
No ratings yet
Minor Project Alcohal Detection
14 pages
Blockchain Based Certificate Validation
No ratings yet
Blockchain Based Certificate Validation
7 pages
400-007 Cisco Exam Valid Questions
No ratings yet
400-007 Cisco Exam Valid Questions
20 pages
Fruit Disease Detection Using Color, Texture Analysis: A Project Report
No ratings yet
Fruit Disease Detection Using Color, Texture Analysis: A Project Report
10 pages
Nimbalkar Sandesh Seminar PPT Final
No ratings yet
Nimbalkar Sandesh Seminar PPT Final
20 pages
Drug Recommender System Using Machine Learning For Sentiment Analysis
No ratings yet
Drug Recommender System Using Machine Learning For Sentiment Analysis
4 pages
30 Hrs Deep Learning CV Images Video
No ratings yet
30 Hrs Deep Learning CV Images Video
6 pages
HPE Networking Comware Switch 48-Port 1GBaseT 4XG 2QSFP+ 5901AF Data sheet-PSN1013968503HREN
No ratings yet
HPE Networking Comware Switch 48-Port 1GBaseT 4XG 2QSFP+ 5901AF Data sheet-PSN1013968503HREN
4 pages
Weather Prediction Using CPT+ Algorithm: Proposed Scheme
No ratings yet
Weather Prediction Using CPT+ Algorithm: Proposed Scheme
12 pages
Strokeprediction DRAFTArticle
No ratings yet
Strokeprediction DRAFTArticle
6 pages
Abstract On The Artificial Intelegence
No ratings yet
Abstract On The Artificial Intelegence
15 pages
Leaf Disease Detection
No ratings yet
Leaf Disease Detection
8 pages
Exam JN0-211: IT Certification Guaranteed, The Easy Way!
No ratings yet
Exam JN0-211: IT Certification Guaranteed, The Easy Way!
24 pages
SDN Book PDF
No ratings yet
SDN Book PDF
183 pages
CS5460 ChadMaughan Assignment4
No ratings yet
CS5460 ChadMaughan Assignment4
3 pages
Alcatel-Lucent Omniswitch 6860: Stackable Lan Switches For Mobility, Iot and Network Analytics
No ratings yet
Alcatel-Lucent Omniswitch 6860: Stackable Lan Switches For Mobility, Iot and Network Analytics
17 pages
5 Acc
No ratings yet
5 Acc
14 pages
Cloud-Based Network Management
No ratings yet
Cloud-Based Network Management
19 pages
Analysis of The Use of Network Automation in Modern Network Infrastructure Management - Jaringan Komputer-Muhamad Rido 2411600063
No ratings yet
Analysis of The Use of Network Automation in Modern Network Infrastructure Management - Jaringan Komputer-Muhamad Rido 2411600063
7 pages
First Level Report
No ratings yet
First Level Report
38 pages
Contrail DPDK1
No ratings yet
Contrail DPDK1
196 pages
Final Assessment Inb23804 Muhammad Naim Bin Zulmi 52223122576
No ratings yet
Final Assessment Inb23804 Muhammad Naim Bin Zulmi 52223122576
22 pages
Rice Leaf Color Chart Using Low-Cost Visible Spectro Sensor - IJITEE - Volume-8 - Issue-7C2 - May - 2019
No ratings yet
Rice Leaf Color Chart Using Low-Cost Visible Spectro Sensor - IJITEE - Volume-8 - Issue-7C2 - May - 2019
73 pages
Ddos Detection and Mitigation in SDN Using Onos Controller: Dr. Shashank Srivastava
No ratings yet
Ddos Detection and Mitigation in SDN Using Onos Controller: Dr. Shashank Srivastava
57 pages
CCNA Training CCNAv7 (2020) - New Questions Part 5
No ratings yet
CCNA Training CCNAv7 (2020) - New Questions Part 5
29 pages
CV Muhammad Tahir
No ratings yet
CV Muhammad Tahir
5 pages
Electronics: An Adaptable Train-to-Ground Communication Architecture Based On The 5G Technological Enabler SDN
No ratings yet
Electronics: An Adaptable Train-to-Ground Communication Architecture Based On The 5G Technological Enabler SDN
12 pages
Chapter 2
No ratings yet
Chapter 2
32 pages
Unit I - TCS 552
No ratings yet
Unit I - TCS 552
35 pages
11th International Conference On Networks, Mobile Communication (NMCO 2025)
No ratings yet
11th International Conference On Networks, Mobile Communication (NMCO 2025)
2 pages
Phycom S 24 01996
No ratings yet
Phycom S 24 01996
25 pages
Allied Telesis Autonomous Management Framework (AMF) : Automate and Simplify Network Management
No ratings yet
Allied Telesis Autonomous Management Framework (AMF) : Automate and Simplify Network Management
8 pages
Course Manual - NGN
No ratings yet
Course Manual - NGN
12 pages
Network Node Manager I: Data Sheet
No ratings yet
Network Node Manager I: Data Sheet
10 pages
Question Bank - CN (Module 4,5,6)
No ratings yet
Question Bank - CN (Module 4,5,6)
3 pages
DRL-Driven Digital Twin Function Virtualization For Adaptive Service Response in 6G Networks
No ratings yet
DRL-Driven Digital Twin Function Virtualization For Adaptive Service Response in 6G Networks
5 pages
116AI01
No ratings yet
116AI01
2 pages
IT490 Assignment 1 MohammedAlomar 438011410
No ratings yet
IT490 Assignment 1 MohammedAlomar 438011410
3 pages
The Today and Future of WSN, AI, and IoT: A Compass and Torchbearer for the Technocrats
From Everand
The Today and Future of WSN, AI, and IoT: A Compass and Torchbearer for the Technocrats
Dr.Chandrakant
No ratings yet

Machine Learning in Traffic Classification of SDN - Final Project Report

Uploaded by

Machine Learning in Traffic Classification of SDN - Final Project Report

Uploaded by

Machine Learning in Traffic Classification of

SDN Based Networks

Ahmed Khan (998325272)

April 15th, 2019

same pair of source and destination.

ISPs, governments, and enterprise alike.

Figure 1: VirtualBox Network Simulation

was modified to display the following flow information each second:

• time: the UTC clock value at the time of flow information

• datapath: the switch ID to identify the switch in Ryu

• eth_src: the source MAC address of the flow

• eth_dst: the destination MAC address of the flow

• out-port: the port sending the outgoing traffic

• total_packets: total packets of the flow so far

• total_bytes: total Bytes of the flow so far

create a Flow object with the following attributes:

• datapath: the switch ID to identify the switch in Ryu

• in-port: the port receiving the incoming traffic

• eth_src: the source MAC address of the flow

• eth_dst: the destination MAC address of the flow

• out-port: the port sending the outgoing traffic

• forward_status: the status (active/inactive) of the forward flow

• reverse _status: the status (active/inactive) of the reverse flow

The traffic_classifier.py script can do the following tasks:

flowing between two hosts before the script is run.

Simulating Traffic Flows

the Limitations and Future Work section.

D-ITG describes the traffic used as follows:

Different settings will be ignored.

simple_monitor_AK.py (the modified version of simple_monitor_13.py). The data generated from

model training and testing.

Figure 2: Decision Boundaries for Logistic Regression

Unsupervised Learning – K-Means Clustering

Figure 4: Visualization of Cluster Centroids

Figure 5: Visualizing K-Means Clustering Performance

Figure 6: Confusion Matrix of K-Means Clustering

future work on this project.

• Create GUI to visualize flow classifications on different SDN controllers

[1] Distributed Internet Traffic Generator

[2] Classifying Network Traffic Flows with Deep-Learning

[4] A Survey of Traffic Classification in Software Defined Networks

[6] Unsupervised Learning with Python

You might also like