Enhanced Network Anomaly Detection Based On Deep Neural Networks

This document discusses using deep learning approaches like convolutional neural networks, autoencoders, and recurrent neural networks for anomaly-based intrusion detection systems. It compares models based on these deep learning techniques to conventional machine learning classification methods. The deep learning models were trained on a standard dataset and evaluated using standard classification metrics to analyze their performance for intrusion detection.

Uploaded by

Yishak Tadele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views16 pages

Enhanced Network Anomaly Detection Based On Deep Neural Networks

Uploaded by

Yishak Tadele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

SPECIAL SECTION ON CYBER-THREATS AND COUNTERMEASURES

IN THE HEALTHCARE SECTOR

Received June 3, 2018, accepted July 16, 2018, date of publication August 17, 2018, date of current version September 21, 2018.
Digital Object Identifier 10.1109/ACCESS.2018.2863036

Enhanced Network Anomaly Detection

Based on Deep Neural Networks
SHERAZ NASEER1,2 , YASIR SALEEM1 , SHEHZAD KHALID3 , MUHAMMAD KHAWAR BASHIR1,4 ,
JIHUN HAN5 , MUHAMMAD MUNWAR IQBAL 6 , AND KIJUN HAN5
1 Department of Computer Science & Engineering, University of Engineering and Technology, Lahore 54890, Pakistan
2 Department of Informatics and Systems, University of Management and Technology, Lahore 10033, Pakistan
3 Department of Computer Engineering, Bahria University, Islamabad 44000, Pakistan
4 Department of Statistics and Computer Science, University of Veterinary and Animal Sciences, Lahore 54000, Pakistan
5 School of Computer Science and Engineering, Kyungpook National University, Daegu 37224, South Korea
6 Department of Computer Science, University of Engineering and Technology, Taxila 47080, Pakistan

Corresponding author: Kijun Han ([email protected])

ABSTRACT Due to the monumental growth of Internet applications in the last decade, the need for security
of information network has increased manifolds. As a primary defense of network infrastructure, an intrusion
detection system is expected to adapt to dynamically changing threat landscape. Many supervised and
unsupervised techniques have been devised by researchers from the discipline of machine learning and
data mining to achieve reliable detection of anomalies. Deep learning is an area of machine learning which
applies neuron-like structure for learning tasks. Deep learning has profoundly changed the way we approach
learning tasks by delivering monumental progress in different disciplines like speech processing, computer
vision, and natural language processing to name a few. It is only relevant that this new technology must be
investigated for information security applications. The aim of this paper is to investigate the suitability of deep
learning approaches for anomaly-based intrusion detection system. For this research, we developed anomaly
detection models based on different deep neural network structures, including convolutional neural networks,
autoencoders, and recurrent neural networks. These deep models were trained on NSLKDD training data set
and evaluated on both test data sets provided by NSLKDD, namely NSLKDDTest+ and NSLKDDTest21.
All experiments in this paper are performed by authors on a GPU-based test bed. Conventional machine
learning-based intrusion detection models were implemented using well-known classification techniques,
including extreme learning machine, nearest neighbor, decision-tree, random-forest, support vector machine,
naive-bays, and quadratic discriminant analysis. Both deep and conventional machine learning models were
evaluated using well-known classification metrics, including receiver operating characteristics, area under
curve, precision-recall curve, mean average precision and accuracy of classification. Experimental results of
deep IDS models showed promising results for real-world application in anomaly detection systems.

INDEX TERMS Deep learning, convolutional neural networks, autoencoders, LSTM, k_NN, decision_tree,
intrusion detection, convnets, information security.

I. INTRODUCTION patterns and intrusions. This idea pioneered a new breed

Network intrusion detection refers to the problem of monitor- of intrusion detection systems which were based on learn-
ing and differentiating such network flows and activities from ing algorithms rather than always-updating signatures of
the normal expected behavior of network which can adversely intrusions. Over the last three decades, machine learning
impact the security of information systems. The search of reli- techniques were applied as a conventional approach for devel-
able solutions by Governments and organizations to protect oping network anomaly detection models. These approaches
their information assets from unauthorized disclosures and employ supervised, unsupervised and semi-supervised learn-
illegal accesses has brought intrusion detection and preven- ing algorithms to propose solutions for anomaly detection
tion at the forefront of information security landscape. problem.
Denning [1] proposed the idea of developing intru- Anomaly detection is modeled as a classification prob-
sion detection system by employing Artificial Intelligence lem in supervised learning. Supervised learning uses labeled
techniques on security events to identify abnormal usage data to train anomaly detection models. The goal of this

2169-3536
2018 IEEE. Translations and content mining are permitted for academic research only.
VOLUME 6, 2018 Personal use is also permitted, but republication/redistribution requires IEEE permission. 48231
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
S. Naseer et al.: Enhanced Network Anomaly Detection Based on DNNs

type of training is to classify the test data as anomalous or Precision-Recall Curve, mean average precision (mAP) and
normal on the basis of feature vectors. Unsupervised learn- accuracy of classification.
ing, on the other hand, uses unlabeled or untagged data The primary contribution of this work is filling above-
to perform the task learning. One of the popular unsuper- mentioned research gaps by designing and implementing
vised learning technique is clustering [2], which searches anomaly detection models based on state of the art Deep Neu-
for similarities among instances of the dataset to build clus- ral Networks and their evaluation using standardized classi-
ters. Instances sharing related characteristics are assumed to fication quality metrics. The first gap is filled by developing
be alike and placed in the same cluster. Semi-Supervised anomaly detection models using Deep CNN, LSTM and mul-
Learning (SSL) is a combination of supervised and unsu- tiple types of Autoencoders. To the best of our knowledge,
pervised learning. The SSL approach utilizes both labeled the DNN structures (DCNN, Contractive, and Convolutional
and unlabeled data [3] for learning. SSL methods learn Autoencoders) investigated in this study have not been ana-
feature-label associations from labeled data and assign the lyzed for anomaly detection. In addition, comparisons of deep
labels to unlabeled instances having similar features that learning based anomaly detection models are provided with
of a labeled instance on the basis of learned feature-label well-known classification schemes including SVM, K-NN,
associations. Decision-Tree, Random-Forest, QDA and Extreme Learning
Deep Learning is an area of Machine Learning which machine. To fill second research gap, we opted to train all
applies neuron like mathematical structures [4] for learn- models on training dataset without ever exposing test dataset
ing tasks. Neural Networks have been around for many to the model during training and then tested/evaluated the
decades [5] and have been gaining and losing the favor of models on testing datasets. This approach provided a fair esti-
research community. The latest rise of this technology is mate of model capabilities by using unseen data instances at
attributed to Alexnet [6], a Deep Neural Network, which won evaluation time. To bridge the third research gap, Deep learn-
the ImageNet classification challenge. Alexnet achieved top- ing based anomaly detection models were evaluated amongst
1 and top-5 error rates of 37.5 % and 17.0% on ImageNet themselves and with conventional machine learning models
Dataset [7] which were considerably better than the previous by using unseen test data and employing standard classifica-
state-of-the-art mechanisms. Since then, Deep Neural Net- tion quality metrics including RoC Curve, Area under RoC,
works (DNNs) have attracted the attention of research com- Precision-Recall Curve, mean average precision (mAP) and
munity once again and multiple DNN structures including accuracy of classification.
Convolutional neural networks (CNNs) [8], Recurrent Neural All experiments in this study are performed by authors on
networks (LSTM) [9], Deep belief nets (DBNs) and differ- NSLKDD dataset provided by [14] using a GPU-powered
ent types of Autoencoders including Sparse, Denoising [10], test-bed. NSLKDD is derived from KDDCUP99 [15] which
Convolutional [11], Contractive [12] and variational Autoen- was generated in 1999 from the DARPA98 network traf-
coders have been proposed. These DNN structures have been fic. Tavallaee et al. [14] discovered some inherent flaws
successfully applied to devise state of the art solutions in in original KDDCUP99 dataset which had adverse impacts
multiple disciplines. on the performance of IDS models trained and evaluated
Application of Deep Neural Networks for the solution of on the Dataset. A statistically enhanced version of dataset
Information security problems is a relatively new area of called NSLKDD was proposed by [14] to counter discov-
research. We observed three research gaps during literature ered statistical problems. Some advantages of NSLKDD
review of anomaly detection problem. The first gap was lack over KDDCUP99 dataset include removal of redundant
of investigation of well-known deep learning approaches for records from training dataset for reducing complexity
anomaly detection. Although isolated studies were available and bias towards frequent records and the introduction
as described in II, no comprehensive research work was of non-duplicate records in testing datasets for unbiased
available to fill this gap. The second research gap was the use evaluation.
of training datasets for both training and testing of models NSLKDD Dataset is available in four partitions. Two
using cross-validation mechanisms. Most of the recent works partitions namely NSLKDDTrain20p and NSLKDDTrain+
followed this approach and reported very high detection rates, serve as training Dataset for model learning and provide
e.g., Kim et al. [13] used a four-layer DNN with 100 units 25,192 and 125,973 training records respectively. Remain-
for intrusion detection on the KDD99 dataset and reported ing two partitions called NSLKDDTest+ and NSLKD-
99% accuracy. We believe that this approach does not provide DTest21 are available for performance evaluation of trained
a reliable solution of anomaly detection problem, as, given models on unseen data and provide 22,543 and 11,850 data
sufficient training, models can be over-fitted to achieve such instances respectively. Additionally, NSLKDDTest21 con-
high rates. The 3rd gap turned out to be lack of compari- tains records for attack types not available in other NSLKDD
son/evaluation of deep learning models amongst themselves train and test Datasets. These attack types include pro-
and with conventional machine learning based models using cesstable, mscan, snmpguess, snmpgetattack, saint, apache2,
standardized classification quality metrics which was a natu- httptunnel, back and mailbomb. All models in our study
ral consequence of previous two gaps. Standardized classifi- were trained on NSLKDD training datasets (NSLKD-
cation quality metrics include RoC Curve, Area under RoC, DTrain20p and NSLKDDTrain+) and tested on NSLKDD