Android Malware Detection With Different IP Coding Methods

Uploaded by

Thoughts

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views9 pages

Android Malware Detection With Different IP Coding Methods

Uploaded by

Thoughts

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

NETWORK SECURITY AND CRYPTOGRAPHY

(NSC)

Lab Report

Android Malware Detection with IP Coding Methods

Submitted By:
Eman Tariq
20-CP-12
Faria Raghib
20-CP-56

Submitted to:
Dr.Asim Raheel

Dated:
23/05/2024
Android Malware Detection with IP Coding Methods

Introduction:

The growing influence of telecommunication networks and the metaphor of the internet have
revolutionized the way organizations carry out their activities. Indeed, the spectacular evolution of
technology, digitalization, cloud/fog/edge computing, quantum computing, and the deployment of
an exorbitant number of connected objects have given rise to unprecedented cybercriminal
activities. Cybercriminals, individuals or groups with malicious intent, exploit vulnerabilities in
digital systems for financial gain, espionage, disruption, and political motivations.The current
landscape of cyber threats is dynamic and multifaceted, with cybercriminals continuously adapting
their tactics and techniques to exploit emerging vulnerabilities and circumvent traditional security
measures. One of the most prevalent forms of cybercrime is the proliferation of malware, malicious
software designed to infiltrate computer systems and compromise their integrity or steal sensitive
information. Malware comes in various forms, including viruses, worms, trojans, ransomware,
spyware, and adware, each posing unique threats to individuals, businesses, and governments
worldwide.

Ransomware attacks, in particular, have become increasingly prevalent and damaging in recent
years. These attacks involve cybercriminals encrypting victims' data and demanding ransom
payments in exchange for decryption keys. High-profile ransomware incidents have targeted
critical infrastructure, healthcare systems, financial institutions, and government agencies, causing
widespread disruption, financial losses, and reputational damage. The evolution of ransomware-
as-a-service (RaaS) platforms has democratized cybercrime, enabling less technically skilled
individuals to carry out sophisticated attacks with minimal effort.Supply chain attacks represent
another significant cyber threat, where cybercriminals exploit vulnerabilities in third-party vendors
and service providers to gain unauthorized access to their customers' networks. By compromising
trusted entities within the supply chain, cybercriminals can infiltrate target organizations, exfiltrate
sensitive data, and deploy malware payloads, often with devastating consequences.

The proliferation of cloud computing and edge computing technologies has expanded the attack
surface for cybercriminals, presenting new challenges for cybersecurity professionals.
Misconfigured cloud instances, insecure APIs, and data breaches resulting from unauthorized
access to cloud storage repositories are just a few examples of the security risks associated with
cloud computing environments. Similarly, the rapid adoption of IoT devices has introduced new
vulnerabilities into digital ecosystems, with many IoT devices lacking adequate security controls
and protocols.In response to these evolving cyber threats, organizations must prioritize
cybersecurity and implement robust security measures to protect their digital assets and
infrastructure. This includes regular security assessments, employee training programs, incident
response plans, and the adoption of advanced security technologies such as endpoint detection and
response (EDR), network segmentation, and threat intelligence platforms.

Furthermore, collaboration and information sharing among industry stakeholders, government

agencies, and cybersecurity researchers are essential for detecting and mitigating cyber threats
effectively. By working together to identify emerging threats, share threat intelligence, and
develop proactive cybersecurity strategies, we can collectively enhance our resilience to cyber
attacks and safeguard the integrity and security of digital systems worldwide.

Dataset:

The dataset utilized in this study was sourced from the University of New Brunswick’s Canadian
Institute for Cybersecurity website . The CICAndMal2017 dataset, created by Lashkari et al.,
includes over 10,854 samples (comprising 4,354 malware and 6,500 benign samples) collected
from various sources. Through dynamic analysis conducted on real devices, 426 malware and
5,065 benign samples were obtained. The benign software was gathered from the most popular
free applications available on the Google Play market in 2015, 2016, and 2017. The malware
samples are categorized into four types: adware, ransomware, scareware, and SMS malware, with
each sample labeled accordingly. Figure 2 illustrates the distribution of examples by attack type in
the CICAndMal2017 dataset.

The breakdown of malicious applications is as follows:

Adware Malicious Applications: Includes 104 applications from families such as Ewind,
Dowgin, Gooligan, Feiwo, Shuanet, Kemoge, Youmi, Koodous, Mobidash, and Selfmite.

Ransomware Malicious Applications: Comprises 101 applications from families like Charger,
Pletor, Jisut, PornDroid, Koler, RansomBO, LockerPin, Svpeng, Simplocker, and WannaLocker.
Scareware Malicious Applications: Contains 102 applications from families including
AndroidDefender, FakeApp.AL, AndroidSpy.277, FakeAV, AV, FakeJobOffer, FakeTaoBao,
Penetho, and FakeApp.

SMS Malware Applications: Consists of 99 applications from families such as Bean Bot, Ji Fake,
Bilge, Mazarbot, FakeInst, Nandrobox, FakeMart, Plankton, FakeNotify, and SMS Sniffer.

Benign Applications: Includes 1,700 benign applications sourced from the Google Play market
in 2015-2016.

The CICAndMal2017 dataset encompasses 84 features along with a label (Attack, Normal). This
study aims to analyze the impact of IP Addresses. To achieve this, 375,564 data points from the
adware category and 410,548 data points from the benign category in the CICAndMal2017 dataset
were combined, resulting in a comprehensive dataset of 786,112 data points.

Implementation:

In this implementation, we performed a comprehensive preprocessing and classification of the

CICAndMal2017 dataset using a Random Forest model. Initially, data from multiple CSV files
located within specific directories for adware and benign applications were loaded and
concatenated into single DataFrames using a custom function. Labels were added to differentiate
between adware (1) and benign (0) samples. The combined dataset was then subjected to
preprocessing steps, including the conversion of 'Timestamp' columns to integer format and the
splitting of IP addresses into four separate integer features, followed by the removal of the original
IP address columns.To ensure the data's integrity, the 'Flow ID' column was dropped, and all
remaining features were converted to numeric types with missing values filled with zero. Negative
and infinite values were handled by replacing them with zero and clipping extreme values,
respectively. Feature selection was performed using the SelectKBest method with the chi-squared
test, narrowing down to the top 50 features.

The preprocessed dataset was then split into training and testing sets. A Random Forest classifier
was trained on the training set and evaluated on the testing set. The model's performance was
measured using accuracy and classification report metrics, demonstrating the efficacy of the
preprocessing and feature selection steps. This comprehensive approach ensured robust handling
of the dataset as shown in table 1, preparing it for effective machine learning model training and
evaluation.

Table:

Feature Feature Feature

Flow ID Fwd IAT Min Avg Bwd Segment Size
Source IP Bwd IAT Total Fwd Header Length
Source Port Bwd IAT Mean Fwd Avg Bytes/Bulk
Destination IP Bwd IAT Std Fwd Avg Packets/Bulk
Destination Port Bwd IAT Max Fwd Avg Bulk Rate
Protocol Bwd IAT Min Bwd Avg Bytes/Bulk
Timestamp Fwd PSH Flags Bwd Avg Packets/Bulk
Flow Duration Bwd PSH Flags Bwd Avg Bulk Rate
Total Fwd Packets Fwd URG Flags Subflow Fwd Packets
Total Backward Packets Bwd URG Flags Subflow Fwd Bytes
Total Length of Fwd Packets Fwd Header Length Subflow Bwd Packets
Total Length of Bwd Packets Bwd Header Length Subflow Bwd Bytes
Fwd Packet Length Max Fwd Packets/s Init_Win_bytes_forward
Fwd Packet Length Min Bwd Packets/s Init_Win_bytes_backward
Fwd Packet Length Mean Min Packet Length act_data_pkt_fwd
Fwd Packet Length Std Max Packet Length min_seg_size_forward
Bwd Packet Length Max Packet Length Mean Active Mean
Bwd Packet Length Min Packet Length Std Active Std
Bwd Packet Length Mean Packet Length Variance Active Max
Bwd Packet Length Std FIN Flag Count Active Min
Flow Bytes/s SYN Flag Count Idle Mean
Flow Packets/s RST Flag Count Idle Std
Flow IAT Mean PSH Flag Count Idle Max
Flow IAT Std ACK Flag Count Idle Min
Flow IAT Max URG Flag Count
Flow IAT Min CWE Flag Count
Fwd IAT Total ECE Flag Count
Fwd IAT Mean Down/Up Ratio
Fwd IAT Std Average Packet Size
Fwd IAT Max Avg Fwd Segment Size
Table 1 CICAndMal2017 Dataset Feature

Results:

After preprocessing and classifying the CICAndMal2017 dataset using a Random Forest model,
the results were evaluated in terms of accuracy and other classification metrics. The dataset,
consisting of features from both adware and benign applications, was split into training and testing
sets. The Random Forest classifier achieved some accuracy, reflecting the model's capability to
distinguish between adware and benign samples effectively. The classification report provided
detailed metrics such as precision, recall, and F1-score for each class (adware and benign),
indicating the robustness and reliability of the model in identifying different types of
applications.These results underscore the preprocessing steps, including the handling of IP
addresses, timestamps, and feature selection. The Random Forest model demonstrated strong
performance, suggesting that the feature engineering and selection processes significantly
contributed to the model's accuracy and overall classification success. The exact numerical results,
such as the specific accuracy score and detailed classification metrics, were obtained from the
classification report, confirming the model's suitability for this binary classification task.

Figure 1 Features Finalized

Figure 2 Accuray

The above Figure 2 shows the accuracy score we achieved using the random forest classifier
model.Furthermore in order to evaluate its metrics and parameters we use the following confusin
matrix plots and graphical representation to showcase how well our model has performed on the
given dataset.

Confusion Matrix:

Figure 3 Confusion Matrix Plot

The confusion matrix shows how well the model performs on the given dataset and how its value
should be normalized.

Graphical Plots:

Figure 4 F1 Score,Precision and Recall Parameters

The above figure 4 shows graphical plot representation shows the F1-score,Precision and Recall
parameters.
Reference:

1. Bayazit, E. C., Sahingoz, O. K., & Dogan, B. (2021, June). Neural network based Android malware
detection with different IP coding methods. In 2021 3rd International Congress on Human-Computer
Interaction, Optimization and Robotic Applications (HORA) (pp. 1-6). IEEE.
2. Noorbehbahani, F., & Saberi, M. (2020, October). Ransomware detection with semi-supervised
learning. In 2020 10th International Conference on Computer and Knowledge Engineering
(ICCKE) (pp. 024-029). IEEE.
3. Chen, R., Li, Y., & Fang, W. (2019, July). Android malware identification based on traffic analysis.
In International conference on artificial intelligence and security (pp. 293-303). Cham: Springer
International Publishing.
4. Bayazit, E. C., Sahingoz, O. K., & Dogan, B. (2022, June). A deep learning based android malware
detection system with static analysis. In 2022 International Congress on Human-Computer Interaction,
Optimization and Robotic Applications (HORA) (pp. 1-6). IEEE.
5. Arslan, R. S. (2021, October). Identify type of android malware with machine learning based ensemble
model. In 2021 5th international symposium on multidisciplinary studies and innovative technologies
(ISMSIT) (pp. 628-632). IEEE.

A Comprehensive Survey On Deep Learning Based Malware Detectiontechniques
No ratings yet
A Comprehensive Survey On Deep Learning Based Malware Detectiontechniques
36 pages
Threat Hunting Via Network Traffic Analysis!
No ratings yet
Threat Hunting Via Network Traffic Analysis!
61 pages
Final Report2 1
No ratings yet
Final Report2 1
83 pages
Chapter One 1.1 Background of The Study
No ratings yet
Chapter One 1.1 Background of The Study
40 pages
IDS and IPS with Snort 3: Get up and running with Snort 3 and discover effective solutions to your security issues
From Everand
IDS and IPS with Snort 3: Get up and running with Snort 3 and discover effective solutions to your security issues
Ashley Thomas
No ratings yet
HRM 206 MCQ
100% (1)
HRM 206 MCQ
47 pages
Cyber Malware
No ratings yet
Cyber Malware
310 pages
Mohak RR
No ratings yet
Mohak RR
57 pages
Thesis
No ratings yet
Thesis
76 pages
Phase 1 Report Group ID CSE19-G58 Malware Detection Using ML
No ratings yet
Phase 1 Report Group ID CSE19-G58 Malware Detection Using ML
30 pages
Mushkan Report
No ratings yet
Mushkan Report
67 pages
Masters Thesis
100% (1)
Masters Thesis
93 pages
23 Jan 7th
No ratings yet
23 Jan 7th
31 pages
1.1 Project Description: Robust Malware Detection
No ratings yet
1.1 Project Description: Robust Malware Detection
36 pages
Jury Instructions For Civil Rights Claims Under Section 1983
No ratings yet
Jury Instructions For Civil Rights Claims Under Section 1983
232 pages
Written Request For Mortgage
100% (8)
Written Request For Mortgage
5 pages
A Malicious Code Detection Method Based On Stacked Depthwise Separable Convolutions and Attention Mechanism
No ratings yet
A Malicious Code Detection Method Based On Stacked Depthwise Separable Convolutions and Attention Mechanism
27 pages
Analyzing and Comparing The Effectiveness of Malware Detection - A Study of Machine Learning Approaches - ScienceDirect
No ratings yet
Analyzing and Comparing The Effectiveness of Malware Detection - A Study of Machine Learning Approaches - ScienceDirect
39 pages
Cyber Security Forensic Presentation
No ratings yet
Cyber Security Forensic Presentation
36 pages
Symmetry 15 00677 v3
No ratings yet
Symmetry 15 00677 v3
24 pages
A Survey of The Recent Trends in Deep Le
No ratings yet
A Survey of The Recent Trends in Deep Le
30 pages
Malware Classification Based On Multilayer Perception and
No ratings yet
Malware Classification Based On Multilayer Perception and
22 pages
The Chain Rule
100% (1)
The Chain Rule
40 pages
3.cyber Threat Landscape-Andy Choy
No ratings yet
3.cyber Threat Landscape-Andy Choy
19 pages
1 s2.0 S2405844023107821 Main
No ratings yet
1 s2.0 S2405844023107821 Main
19 pages
Cisco Introduction To Cyber Security Chap-2
100% (1)
Cisco Introduction To Cyber Security Chap-2
9 pages
Malware Detection Using Machine Learning and Deep Learning
No ratings yet
Malware Detection Using Machine Learning and Deep Learning
10 pages
Digital Forensics and Incident Response - Second Edition: Incident response techniques and procedures to respond to modern cyber threats, 2nd Edition
From Everand
Digital Forensics and Incident Response - Second Edition: Incident response techniques and procedures to respond to modern cyber threats, 2nd Edition
Gerard Johansen
No ratings yet
Ransomware Attack Detection Using Supervised Machine Learning Classifiers
No ratings yet
Ransomware Attack Detection Using Supervised Machine Learning Classifiers
44 pages
Unit Ii Ais
No ratings yet
Unit Ii Ais
26 pages
AI-driven Data Analytics For Cyber Threat Intelligence and Anomaly Detection-2108
No ratings yet
AI-driven Data Analytics For Cyber Threat Intelligence and Anomaly Detection-2108
14 pages
Document 14
No ratings yet
Document 14
18 pages
Review On
No ratings yet
Review On
9 pages
Radon Transform Based Malware Classification in Cyb 2024 Results in Control
No ratings yet
Radon Transform Based Malware Classification in Cyb 2024 Results in Control
14 pages
Ransomware Detection & Identification Using AI: by Leon Wiskie
No ratings yet
Ransomware Detection & Identification Using AI: by Leon Wiskie
45 pages
Ijett V73i1p132
No ratings yet
Ijett V73i1p132
15 pages
Synergy Project Malware Detection
No ratings yet
Synergy Project Malware Detection
12 pages
Dynamic Malware Detection in Wireless Networks Using Deep Learning
No ratings yet
Dynamic Malware Detection in Wireless Networks Using Deep Learning
16 pages
14 ArticleText 51 1 10 20200331
No ratings yet
14 ArticleText 51 1 10 20200331
10 pages
p6 Digital Forensics For Malware Classification An Approach For
No ratings yet
p6 Digital Forensics For Malware Classification An Approach For
12 pages
Malware KA Webinar Slides
No ratings yet
Malware KA Webinar Slides
40 pages
Electronics 11 03665 v2
No ratings yet
Electronics 11 03665 v2
20 pages
20-CP-93 NSC Lab 1
No ratings yet
20-CP-93 NSC Lab 1
5 pages
The State-of-the-Art in AI-Based Malware Detection Techniques: A Review
No ratings yet
The State-of-the-Art in AI-Based Malware Detection Techniques: A Review
18 pages
Lightweight and Robust Malware Detection Using Dictionaries of API Calls
No ratings yet
Lightweight and Robust Malware Detection Using Dictionaries of API Calls
12 pages
15709-Article Text-55876-2-10-20220114
No ratings yet
15709-Article Text-55876-2-10-20220114
26 pages
Information Security Project
No ratings yet
Information Security Project
7 pages
Ransomware Attack Detection Based On Pertinent System Calls Using Machine Learning Techniques
No ratings yet
Ransomware Attack Detection Based On Pertinent System Calls Using Machine Learning Techniques
23 pages
An Analysis of Internet of Things IoT Malwares and Detection Based On Static and Dynamic Techniques
No ratings yet
An Analysis of Internet of Things IoT Malwares and Detection Based On Static and Dynamic Techniques
6 pages
Ransomware Attack Detection Based On Pertinent System Calls Using Machine Learning Techniques
No ratings yet
Ransomware Attack Detection Based On Pertinent System Calls Using Machine Learning Techniques
23 pages
IEEE Conference LaTeX Template PDF
No ratings yet
IEEE Conference LaTeX Template PDF
7 pages
Android Malware Classification Using LSTM Model: Revue D'intelligence Artificielle
No ratings yet
Android Malware Classification Using LSTM Model: Revue D'intelligence Artificielle
7 pages
Malware Detection and Prevention Using Machine Learning - 25!03!23!16!20 - 14
No ratings yet
Malware Detection and Prevention Using Machine Learning - 25!03!23!16!20 - 14
6 pages
A Comprehensive Survey On Identification of Malware Types and Malware Classification Using Machine Learning Techniques
No ratings yet
A Comprehensive Survey On Identification of Malware Types and Malware Classification Using Machine Learning Techniques
8 pages
Ransomware Detection Using Network Traffic Analysis and Generative Adversarial Networks
No ratings yet
Ransomware Detection Using Network Traffic Analysis and Generative Adversarial Networks
8 pages
Obfuscated Malware Detection Using Artificial Neural Network ANN
No ratings yet
Obfuscated Malware Detection Using Artificial Neural Network ANN
5 pages
Malware Detection and Classification Based On Graph Convolutional Networks and Function Call Graphs
No ratings yet
Malware Detection and Classification Based On Graph Convolutional Networks and Function Call Graphs
11 pages
Derivatives of Polynomials and Exponential Functions
No ratings yet
Derivatives of Polynomials and Exponential Functions
54 pages
Judy S Detection and Classification of Malware For
No ratings yet
Judy S Detection and Classification of Malware For
6 pages
Comprehensive Review On CNN-based Malware Detection With Hybrid Optimization Algorithm
No ratings yet
Comprehensive Review On CNN-based Malware Detection With Hybrid Optimization Algorithm
13 pages
Simex Script
0% (1)
Simex Script
3 pages
Project - Software Development
No ratings yet
Project - Software Development
3 pages
Anomaly Detection Using Machine Learning
No ratings yet
Anomaly Detection Using Machine Learning
4 pages
Winston Churchill Great Speach
No ratings yet
Winston Churchill Great Speach
1 page
Mini Project
No ratings yet
Mini Project
11 pages
Analysis of Cyber Security Threats Using
No ratings yet
Analysis of Cyber Security Threats Using
5 pages
Malcode Detection
No ratings yet
Malcode Detection
5 pages
Online Human Trafficking: Media and Literacy
No ratings yet
Online Human Trafficking: Media and Literacy
8 pages
Gulliver and The Little People
No ratings yet
Gulliver and The Little People
10 pages
Indian Contracts Act, 1872
No ratings yet
Indian Contracts Act, 1872
5 pages
Cag 5
No ratings yet
Cag 5
45 pages
Summary-Mrs. Shehla Zia Vs WAPDA: Syed Ijlal Haider ERP 13309 Course: Legal and Regulatory Environment For Business
No ratings yet
Summary-Mrs. Shehla Zia Vs WAPDA: Syed Ijlal Haider ERP 13309 Course: Legal and Regulatory Environment For Business
1 page
Misinterpreted Monsters - Aurora Krec
No ratings yet
Misinterpreted Monsters - Aurora Krec
8 pages
Board Notes - MVA - Edited
No ratings yet
Board Notes - MVA - Edited
28 pages
Drobot Complaint 2015-02-09 PDF
No ratings yet
Drobot Complaint 2015-02-09 PDF
67 pages
Guntupalli Srinivas Rao vs. The State of Telangana 2021000771820242-551583
No ratings yet
Guntupalli Srinivas Rao vs. The State of Telangana 2021000771820242-551583
4 pages
It 000144628237 2024 10
No ratings yet
It 000144628237 2024 10
1 page
Board of Trustees Vs Velasco, G R No 170436, February 2, 2011 Kinds
No ratings yet
Board of Trustees Vs Velasco, G R No 170436, February 2, 2011 Kinds
2 pages
Jojo Rabbit Final Script Removed
No ratings yet
Jojo Rabbit Final Script Removed
98 pages
The Turk As A Threat and Europe's "Other"
100% (1)
The Turk As A Threat and Europe's "Other"
12 pages
001
No ratings yet
001
1 page
SSS Expanded Maternity Leave Checklist
No ratings yet
SSS Expanded Maternity Leave Checklist
2 pages
Wa0021.
No ratings yet
Wa0021.
16 pages
Woodbury Man Charged With Identity Theft
No ratings yet
Woodbury Man Charged With Identity Theft
10 pages
SCL 182 Cocklin Apr13
No ratings yet
SCL 182 Cocklin Apr13
16 pages
Allison Gonzalez Rubio - Rhetorical Analysis Essay Draft
No ratings yet
Allison Gonzalez Rubio - Rhetorical Analysis Essay Draft
4 pages
CL Sample Test
No ratings yet
CL Sample Test
3 pages
2I - Summons Illustration by Mononita Kundu Das
No ratings yet
2I - Summons Illustration by Mononita Kundu Das
3 pages
Peoria County Jail Booking Sheet For Sept. 12, 2016
No ratings yet
Peoria County Jail Booking Sheet For Sept. 12, 2016
7 pages
1908 Velasco - v. - Masa20210424 12 11bjt5j
No ratings yet
1908 Velasco - v. - Masa20210424 12 11bjt5j
5 pages
OS Lab03
No ratings yet
OS Lab03
12 pages
DSD Lab 5
No ratings yet
DSD Lab 5
5 pages
Behaviour Tracking Chart With Possible Function 1
No ratings yet
Behaviour Tracking Chart With Possible Function 1
1 page
HSG 3rd List
No ratings yet
HSG 3rd List
2 pages