0% found this document useful (0 votes)

45 views6 pages

Ensemble Learning for Malware Detection

Uploaded by

Saumya Verma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views6 pages

Ensemble Learning for Malware Detection

Uploaded by

Saumya Verma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Malware Detection Using Ensemble Learning And

File Monitoring
Tilak Vignesh Sowhith Reddy Sonit Kumar
Department of CSE Department of CSE Department of CSE
PES University PES University PES University
Bangalore, India Bangalore, India Bangalore, India
[email protected] [email protected] [email protected]

Akshat Chourey Chandrashekhar Pomu Chavan

Department of CSE Department of CSE
PES University PES University
Bangalore, India Bangalore, India
[email protected] [email protected]

Abstract—In essence, malware refers to harmful monitoring is a method in place to tackle the problem posed
programs that cybercriminals use to infiltrate a specific by all malware detecting ML models. Which is malicious files
machine or an organisation’s complete network. It takes bypassing the system as the ML model could not detect it. It’s
advantage of flaws in legitimate software (such a browser basically just keeping track of a file’s activities once it enters
or plugin for an online application) that can be hijacked. into a system to make sure it does not exploit the system.
ML is widely used to mitigate this problem which is an
excellent solution but the problem with this is that it’s II. LITERATURE REVIEW
possible for ML to falsely detect some files causing system The execution hardware-driven malware detection
exploits. This paper aims to provide a method to detect technique was an attempt made by the authors H. Sayadi et al
malware using ensemble learning and further monitor [2]. The HPC data was extracted using the Perf software,
files based on a probability value assigned to it by the
which is Linux-compatible. The result variable is the
model.
application’s class (like malware vs benign), whereas the
Keywords— Malware Detection, Ensemble Learning, Machine HPCs pulled at intervals of 10 ms from the running
Learning, File Monitoring, ML Classifiers programmes serve as the input factors for the classifiers. There
were 44 performance counters when they began. The features
are then given scores based on their importance and relevance
I. INTRODUCTION to the goal variable using the feature scoring technique. Using
a feature reduction technique, the 16 computer hardware
Malware intends to exploit systems. As technology is counters that were most closely linked to malware detection
advancing, malware is becoming more difficult to detect and were found and ranked.
reject. With the rise of modern cyber security, machine The majority of machine learning (ML) classifiers perform
learning is now being used to detect these advanced malware. well before feature reduction (16 HPCs), usually providing
Malware detection is vital at the time of entry to prevent a accuracy rates of above 80 percent. However, after using
cyber-attack. What about malwares that are unknown/have not ensemble learning, these accuracy values increased to 88
been detected? What about malwares that bypasses the percent. The efficiency of using ensemble methods to improve
security systems? Our paper works towards solving these 2 the performance of ML classifiers with fewer HPCs as
problems efficiently by building a model using ensemble opposed to extracting 16 or 8 hardware performance counters,
learning to predict whether the file is a malware or not and which would place a substantial implementation cost burden
monitoring files which enter into the system by assigning each on the systems in regard to resource use and energy
file a “likelihood” value. consumption.
To address a specific computational intelligence problem, They understood that using enormous HPCs to run ML
many models, such as classifiers or experts, are strategically algorithms would make it difficult and time-consuming.
developed and merged in a process known as ensemble Additionally, the classifier’s accuracy would decline if
learning [1]. Ensemble learning is mainly used to improve a irrelevant characteristics were added.
model’s performance (classification, prediction, utility In order to improve resistance to specific evasion
estimation, and so on) or decrease the likelihood of selecting techniques, the ensemble detector proposed by author M.
an inferior model unintentionally. The reason to use ensemble Ficco [3] makes use of the advantages of the main analysis
learning in specific is because it has a high accuracy. This is algorithms published in the literature. In order to enhance the
very important in the case of malware detection. A malware unpredictability of the analysis process in general and the
which has escaped detection is unacceptable. Another reason detection method in particular, the research provides a variety
is that ensemble learning is more stable and less noisy. File of strategies for combining general and specialised detectors.

Authorized licensed use limited to: VIT University. Downloaded on January 17,2024 at 05:43:09 UTC from IEEE Xplore. Restrictions apply.
The suggested methods further assist to increase detection malware with an accuracy rate of 0.998 and a false positive
rates when unidentified malware families are present and they rate of 0.002, according to experimental tests. The adoption of
provide better detection performance when re-training the ML-based methodologies to replace traditional signature-
detector on a regular basis is not necessary to keep up with based techniques was emphasised. These ML models make a
malware evolution. The performance of the specialised fortune off of the fast rising prevalence of undetected
ensemble detector, which includes the four best specialised malware, which has been a problem for commercial antivirus
low detectors, is compared to that of the alpha count software. The accuracy, adaptation, tweaking, and
ensemble, which consists of two generic and two specialised dependability of suggested machine learning models improve
low detectors. The results show that the alpha-count- as additional training is carried out using more malware
ensemble detector performs better than specialised ensemble training samples. Compared to signature-based detection, uses
detectors, particularly in terms of sensitivity and accuracy. more processing resources and has a more complicated model.
Additionally, the number of false positives has significantly
decreased, though it is still marginally higher than the III. PROPOSED METHODOLOGY
specialised ensemble detector.
Author H. Rathore et al [4] presented his work on malware A. Dataset
detection using (1) a variety of machine learning methods and The dataset [7] utilised the BIG 2015 model that was
(2) Deep learning models, According to the authors’ findings, proposed and made available by Microsoft on the Kaggle
Random forests beat deep neural networks with high opcode platform. The dataset is 0.5 terabytes in size and contains
frequencies. The deep auto encoder was overkill for the 10868 training samples and 10873 test samples. There are 9
dataset, even with feature reduction, and simple features like families of malware in the dataset namely,
variance cutoff outperformed others. Along with the suggested
approach, this needs to go over additional problems and
• Ramnit: It steals user credentials.
particular difficulties that are specific to the field, as well as
unanswered research topics, restrictions, and future directions. • Lollipop: It’s an adware and can also monitor user traffic.
It provides highly accurate training models. The authors used • Kelihosver1 and Kelihosver3: These are Trojans that take
threshold variance and random forest to attain a maximum full control of a system and their propagation is via
accuracy of 99.78 percent. The model will not work against email.
malware that has not been previously detected or used to train • Vundo: Install other malicious content and show pop up
the model. You can also discard non-malware files. ads.
• Simda: Steal user passwords and create a backdoor.
The ensemble learning technique (SMASH) developed by • Traceur: The attacker shows fake ads using this and gains
Y. Dai et al. [5] fundamentally combines software and money out of it.
hardware features, extracts the API call sequence, hardware • Obfuscator.ACY: These are obfuscated malwares.
performance counters, and memory dump from malware as • Gatak: Trojan that infects systems via malicious code.
detector features, and produces various feature vector types.
Therefore, in this instance, hardware features balance out the B. Malware Detection
susceptibility of software feature evasion while software This section discusses how to train the model using
features make up for a lack of hardware feature detection ensemble learning to classify a file into any of the 9 types of
precision. Using an existing neural network with good malwares mentioned above.
detection performance, each feature was assigned to a
particular detector for malware classification, and all detection 1) Combined Workflow:
results were added together to determine the maliciousness of • A File is downloaded. A python library called
the tested sample in accordance with the approach. Firstly, the ”watchdogs” is used to detect this file download.
technique combines low-level hardware characteristics like • The File is pre-processed. It’s decompiled to get the
resistance to evasion of the memory dump grayscale and ASM data and also the byte matrix.
hardware performance counters with software properties like • The ASM data is further converted into opcode
API call sequences with high detection precision. Secondly, frequencies which are stored in a csv. The byte matrix is
they tried to improve each feature based on the original stored in a csv as well.
research. They tried to select a more advanced classifier model • The csvs are combined.
to improve the detection precision of a single feature. Finally, • The combined csv is passed to the model which we
they came to the approach of using an ensemble learning trained earlier.
algorithm composed of multiple classification algorithms for • The model outputs 9 values between 0-1 depicting the
detection of the malware. This approach won’t work for the probability of the file begin one of the 9 malware or not.
types of malwares that were difficult to detect which are • We select the maximum value from the 9 and make a
highly threatening. decision based on it.
• If the value is less that 0.4 we conclude it isn’t any of the
The authors Amer et al [6] have provided an ensemble 9 types of malware.
learning based detection technique. The file header is mined • If the value is in between 0.4 to 0.7 we say it might be a
for the fewest possible significant attributes that can be used to malware and monitor the file.
train the model. Evaluations show that ensemble models • If the value is greater than 0.7 we discard the file.
outperform individual categorization models by a small
margin. The model they proposed could predict unknown

Authorized licensed use limited to: VIT University. Downloaded on January 17,2024 at 05:43:09 UTC from IEEE Xplore. Restrictions apply.
Fig. 3. Opcode frequency csv

As seen in figure 3, assembly is preprocessed and converted

into a frequency csv
3).byte Preprocess: The byte files are converted into images
of size 32*32. These are further converted into csvs and
finally cnn is applied to get the final features.
Figure 4 shows the image for a specific byte file.

Fig. 1. Combined ML and file monitoring

The combined workflow gives an understanding of how the

model and file monitoring system interact with each other and
the practical use of this interaction is highlighted as shown in
figure 1.
The malware detection has 2 components namely Fig. 4. Flowchart Depicting byte preprocess figure
2).ASM Preprocess: The .asm data is first converted to .txt As seen in figure 5 each byte file is converted into a png which looks
files and unwanted data is removed. Once this is done, the like this
opcode frequencies of each opcode are calculated from the
text files and stored in a csv. Normalization is done and svm is
applied to get the final features.
ASM data is essentially assembly level code of an application.
The data gives away a lot of important information which can
be used to make certain decisions based on its characteristics.
Figure 2 explains the flow of the ASM pre-process

Fig. 5. Byte file converted to image

The png is further converted into a csv as shown in figure 6.

The png is first converted into a matrix. The matrix is then
normalized and this matrix is flattened and stored in a csv.
This is done for multiple files providing multiple entries in a
csv. This csv is later on used to obtain a hybrid dataset on
which the final model would be applied to obtain the results.
Figure 6 shows the byte file represented as a csv.
Fig. 2. Flowchart Depicting ASM preprocess

Fig. 6. csv representing byte file

Authorized licensed use limited to: VIT University. Downloaded on January 17,2024 at 05:43:09 UTC from IEEE Xplore. Restrictions apply.
4) Final Model: Figure 7 shows how the final pre-processed data
is converted into a hybrid dataset and finally ANN is applied on it to
get the result.

Fig. 9. Confusion matrix

Fig. 7. Flowchart Depicting Final Model Fig. 10. A comparison of the performance of different ML models for the
BIG2015 dataset
C. File Monitoring
The model has now been trained using the implementation As seen in figure 10 bar chart with the accuracy’s obtained
explained above and it can now be used to classify files. We from [8] the author Hemalata J et al, The proposed model
monitor the file based on its CPU usage and activities. We use poses the highest accuracy with an accuracy of 97.63 percent
a library called ”psutil” in python to do so. In case the compared to the rest of the models.
program detects a file exceeding its utilization, the process is Performance of the proposed methodology compared to other
killed and the program is discarded. deep learning models is shown in the figure 11
IV. RESULT
A. Malware Detection
The model classifies a file as one of the 9 types of malwares
with an accuracy of 97.6 percent with a test loss of 9 percent
as shown in figure 8.

Fig. 8. Model performance

Figure 9 shows us the confusion matrix depicting the

performance of the model. The 9*9 Confusion matrix shows
us how many files has the model correctly classified in the
right class. Fig. 11. A comparison of the performance of different deep Learning
In figure 9; models for the BIG2015 dataset
The first row and column represent Gatak
The second row and column represent Khelios ver1 As seen in the figure 11 with the accuracy’s obtained from [8]
The third row and column represent khelois ver3 the author Hemalata J et al, The proposed model poses the
The fourth row and column represent Lollipop highest accuracy with an accuracy of 97.63 percent compared
The fifth row and column represent Obfuscator.ACY to the rest of the deep learning models.
The sixth row and column represent Ramnit The bar charts show that the model performs better than
The seventh row and column represent Simda conventional ML methods as well as deep learning methods
The eighth row and column represent Tracur • The ninth row for malware classification against the BIG2015 dataset.
and column represent Vundo B. File Monitoring
The file monitoring application monitors a file which have a
slight probability to be a malware as detected by the model.
• When File is Safe

Authorized licensed use limited to: VIT University. Downloaded on January 17,2024 at 05:43:09 UTC from IEEE Xplore. Restrictions apply.
conventional ML methods and also other deep learning
methods. The increased accuracy was due to the use of
multiple models and further improvement in accuracy for
malware detection will be made by combining different model
results as proposed in this paper.
The file monitoring part was crucial in the whole workflow to
make the system foolproof. Even if there were cases where the
Fig. 12. A screenshot of the brave executable passed to the application model might not be able to flag a file as a malware, the file
marked as safe. monitor took care of this. Hence the combination of the file
monitor with the model reduces system exploits and increases
The example taken here in figure 12 of brave.exe As seen the security in a system substantially.
probability of it belonging to one of the malware class is 0.25
hence the file is marked as safe. REFERENCES
When File is sent for monitoring: In figure 13 we see that the [1] https://en.wikipedia.org/wiki/Ensemblelearning
file might be a malware as it has a probability greater than 0.4, [2] H. Sayadi, N. Patel, S. M. P.D., A. Sasan, S. Rafatirad and H.
Hence it’s PIDs are monitored in order to track its activity. In Homayoun, ”Ensemble Learning for Effective Run-Time Hardware-
the above diagram, we can see the PIDs stored in an array and Based Malware
constantly tracked. Detection: A Comprehensive Analysis and Classification,” 2018 55th
ACM/ESDA/IEEE Design Automation Conference (DAC), 2018, pp.
16, doi: 10.1109/DAC.2018.8465828.
[3] M. Ficco, ”Malware Analysis By Combining Multiple Detectors and
Observation Windows,” in IEEE Transactions on Computers, doi:
10.1109/TC.2021.3082002.
[4] H. Rathore and S. K. Sahay, ”Towards Robust Android Malware
Detection Models using Adversarial Learning,” 2021 IEEE
International Conference on Pervasive Computing and
Communications Workshops and other Affiliated Events (PerCom
Workshops),2021,pp.424-425,doi:
10.1109/PerComWorkshops51409.2021.9430980.
Fig. 13. A screenshot of EasyBCD executable passed to the application [5] Y. Dai, H. Li, Y. Qian, R. Yang and M. Zheng, ”SMASH: A Malware
marked for monitoring Detection Method Based on Multi-Feature Ensemble Learning,” in
IEEE Access, vol. 7, pp. 112588-112597, 2019, doi:
10.1109/ACCESS.2019.2934012.
• When File is a malware
[6] Amer, Eslam and Zelinka, Ivan. (2019). An Ensemble-Based Malware
Detection Model Using Minimum Feature Set. MENDEL. 25. 1-10.
10.13164/mendel.2019.2.001.
[7] https://www.kaggle.com/competitions/malware- lassification/overview
[8] Hemalatha J, Roseline SA, Geetha S, Kadry S, Damaseviˇ cius R. Anˇ
Efficient DenseNet-Based Deep Learning Model for Malware
Detection. Entropy (Basel). 2021 Mar 15;23(3):344. doi:
10.3390/e23030344. PMID: 33804035; PMCID: PMC7998822
[9] https://www.eicar.org/download-anti-malware-testfile/
[10] Chandrashekhar Pomu Chavan and Pallapa Venkataram Designing a
Routing Protocol for Ubiquitous Networks using ECA Scheme in Fifth
Fig. 14. A screenshot of eicar text file passed to the application flagged as International Conference on Advances in Computing and Information
malware and deleted Technology during 25-26, 2015 at Chennai, India
[11] Chandrashekhar Pomu Chavan. Intelligent dynamic routing decisions
in ubiquitous network. In IEEE 2022 7th International Conference for
In figure 14 we see that the file is a malware as it has a Convergence in Technology (I2CT), Pune, Maharashtra, India., 7-9
probability of 0.895, Hence the file is deleted. The file is April 2022.
sourced from the [9-17] eicar website where we can obtain [12] Chandrashekhar Pomu Chavan, Srinivas Talabattula. Design and
anti malware test files. Development of Novel Routing Protocol for Ubiquitous Network. In
IEEE 2022 7th International Conference for Convergence in
V. CONCLUSION Technology (I2CT), Pune, Maharashtra, India., 7-9 April 2022
[13] Aratrika Ray, Akhil Khubchandan, Siddhartha Shenoy, Canute Rollin
The paper explained an approach to detect malware using a Cardoza, and Chandrashekhar Pomu Chavan. Smart emergency
type of machine learning called ensemble learning. reporting system for animals. In IEEE 2022 7th International
Throughout the course of this paper we proposed an Conference for Convergence in Technology (I2CT),Pune,
architecture to build a model that classifies a file into 1 of the Maharashtra, India., 7-9 April 2022 .
9 types of malware mentioned above. We used ensemble [14] Chandrashekhar Pomu Chavan and Pallapa Venkataram.
Design and Implementation of Event-based Multicast AODV Routing
learning to do so. The paper also took into account the Protocol for Ubiquitous Network. Elsevier Journal, Volume-
malware not detected by the model and provides a method to 14(25900056):100129,2022.DOI:
tackle this. The undetected malware were constantly https://doi.org/10.1016/j.array.2022.100129
monitored in a system and were discarded once malicious [15] Chandrashekhar Pomu Chavan and Pallapa Venkataram Feasible QOS
activity is detected. Routing in Ubiquitous Network Springer Journal of Wireless Personal
Communications, 2022 (In print)
[16]A. S. Alva, A. S. Dinesh and C. P. Chavan, “IoT for Enabling
As seen above, the proposed ML model worked better for Smart Environment System,” 2022 International Conference on Smart
malware detection and classification compared to

Authorized licensed use limited to: VIT University. Downloaded on January 17,2024 at 05:43:09 UTC from IEEE Xplore. Restrictions apply.
Generation Computing, Communication and Networking(SMART
GENCON),Bangalore,India,2022,pp.1-6,doi:
10.1109/SMARTGENCON56628.2022.10083922.
[17] Vignesh L, Nishanth J C, Hari Prasad H R, Jayanth Kumar A and
Chandrashekhar Pomu Chavan. Smart Farm Android Application
Using IoT and Machine Learning. In IEEE 2023 8th International
Conference for Convergence in Technology (I2CT), Pune,
Maharashtra, India., 7-9 April 2023

Authorized licensed use limited to: VIT University. Downloaded on January 17,2024 at 05:43:09 UTC from IEEE Xplore. Restrictions apply.

Supervised Malware Detection Model
No ratings yet
Supervised Malware Detection Model
21 pages
Malware
No ratings yet
Malware
10 pages
Malware Detection with Ensemble Learning
No ratings yet
Malware Detection with Ensemble Learning
70 pages
Ensemble Model
No ratings yet
Ensemble Model
6 pages
Malware Detection Research Paper Updated Soheb6
No ratings yet
Malware Detection Research Paper Updated Soheb6
6 pages
Amutenda r206668v Technical Paper
No ratings yet
Amutenda r206668v Technical Paper
5 pages
Malware Detection Using Machine Learning and Deep Learning
No ratings yet
Malware Detection Using Machine Learning and Deep Learning
10 pages
Synopsis 1
No ratings yet
Synopsis 1
7 pages
A - Multi-Strategy - Adversarial - Attack - Method - For - Deep - Learning - Based - Malware - Detectors
No ratings yet
A - Multi-Strategy - Adversarial - Attack - Method - For - Deep - Learning - Based - Malware - Detectors
5 pages
IEEE Conference Template 1
No ratings yet
IEEE Conference Template 1
4 pages
Detection of Obfuscated Malware Using EnsembleLearning Techniques
No ratings yet
Detection of Obfuscated Malware Using EnsembleLearning Techniques
8 pages
Malware Detection Using Machine Learning
No ratings yet
Malware Detection Using Machine Learning
4 pages
Machine Learning for Malware Detection
No ratings yet
Machine Learning for Malware Detection
11 pages
Malware - Detection - Research - Paper - Updated Soheb6
No ratings yet
Malware - Detection - Research - Paper - Updated Soheb6
8 pages
A Multi-View Feature Fusion Approach For Effective Malware Classification Using Deep Learning
No ratings yet
A Multi-View Feature Fusion Approach For Effective Malware Classification Using Deep Learning
15 pages
Deep Neural Network for Malware Detection
No ratings yet
Deep Neural Network for Malware Detection
10 pages
Malware Detection for Researchers
No ratings yet
Malware Detection for Researchers
11 pages
Preprints202407 1214 v1
No ratings yet
Preprints202407 1214 v1
20 pages
Malware Detection Using Machine Leaning
No ratings yet
Malware Detection Using Machine Leaning
9 pages
Malware Detection for Tech Experts
No ratings yet
Malware Detection for Tech Experts
6 pages
DEF: Deep Ensemble Neural Network Classifier For Android Malware Detection
No ratings yet
DEF: Deep Ensemble Neural Network Classifier For Android Malware Detection
11 pages
NextComp2024 Paper 21
No ratings yet
NextComp2024 Paper 21
6 pages
Investigating The Performance of Optimizing The Convolutional Neural Net-Work in Detecting Malware Attack
No ratings yet
Investigating The Performance of Optimizing The Convolutional Neural Net-Work in Detecting Malware Attack
10 pages
Compusoft, 3 (10), 1116-1123 PDF
No ratings yet
Compusoft, 3 (10), 1116-1123 PDF
8 pages
6 Thsemminiproject
No ratings yet
6 Thsemminiproject
12 pages
Automated Machine Learning For Deep Learning Based Malware Detection
No ratings yet
Automated Machine Learning For Deep Learning Based Malware Detection
17 pages
Final Synposis
No ratings yet
Final Synposis
10 pages
Malware Detection Using Machine Learning
No ratings yet
Malware Detection Using Machine Learning
38 pages
When Machine Learning Meets Hardware Cybersecurity Delving Into Accurate Zero-Day Malware Detection
No ratings yet
When Machine Learning Meets Hardware Cybersecurity Delving Into Accurate Zero-Day Malware Detection
6 pages
Detecting Malware Using Deep Learning Mo
No ratings yet
Detecting Malware Using Deep Learning Mo
4 pages
Analysis Study of Malware Classification Portable Executable Using Hybrid Machine Learning
No ratings yet
Analysis Study of Malware Classification Portable Executable Using Hybrid Machine Learning
6 pages
Malware - Detection - Using - Machine - Learning (2) - Removed
No ratings yet
Malware - Detection - Using - Machine - Learning (2) - Removed
31 pages
Malware Detection with Machine Learning
No ratings yet
Malware Detection with Machine Learning
31 pages
Major 2 Mid Sem Report
No ratings yet
Major 2 Mid Sem Report
4 pages
Adversarial Examples For Malware Detection: Abstract
No ratings yet
Adversarial Examples For Malware Detection: Abstract
18 pages
Radon Transform Based Malware Classification in Cyb 2024 Results in Control
No ratings yet
Radon Transform Based Malware Classification in Cyb 2024 Results in Control
14 pages
A Framework For Detection of Malicious Code by Exploiting Machine Learning Techniques On Portable Executables
No ratings yet
A Framework For Detection of Malicious Code by Exploiting Machine Learning Techniques On Portable Executables
4 pages
Development of Malware Detection and Analysis Mode
No ratings yet
Development of Malware Detection and Analysis Mode
50 pages
Machine Learning Based Ensemble Classifier For Android Malware Detection
No ratings yet
Machine Learning Based Ensemble Classifier For Android Malware Detection
18 pages
Malware Classification ML Report TechGB2336 Group13
No ratings yet
Malware Classification ML Report TechGB2336 Group13
27 pages
Malware Detection With LSTM Using Opcode Language
100% (1)
Malware Detection With LSTM Using Opcode Language
7 pages
A Case Study Malware Classification
No ratings yet
A Case Study Malware Classification
32 pages
Malware Detection Using ANN
No ratings yet
Malware Detection Using ANN
10 pages
Internet 2016 1 40 40038
No ratings yet
Internet 2016 1 40 40038
6 pages
Udayakumar 2017
No ratings yet
Udayakumar 2017
6 pages
Malcode Detection
No ratings yet
Malcode Detection
5 pages
Malware Detection with Machine Learning
No ratings yet
Malware Detection with Machine Learning
29 pages
Research Paper 2 Malware Detection
No ratings yet
Research Paper 2 Malware Detection
24 pages
Malware Application Detection Using Machine Learning
No ratings yet
Malware Application Detection Using Machine Learning
7 pages
Detecting Malware in Portable Executable Files Using Machine Learning Approach
No ratings yet
Detecting Malware in Portable Executable Files Using Machine Learning Approach
7 pages
Malware Final
No ratings yet
Malware Final
13 pages
Sample Project Base Paper
No ratings yet
Sample Project Base Paper
9 pages
AI-Powered Windows Malware Detection
No ratings yet
AI-Powered Windows Malware Detection
10 pages
GR20 Final
No ratings yet
GR20 Final
10 pages
Review 1
No ratings yet
Review 1
10 pages
FuzzyRNN NIT SUB 2columns PDF
No ratings yet
FuzzyRNN NIT SUB 2columns PDF
8 pages
Amogh Bajpai PBL
No ratings yet
Amogh Bajpai PBL
1 page
Malware Application Detection Using Machine Learning
No ratings yet
Malware Application Detection Using Machine Learning
8 pages
CDX-C4900R-C5000R-C5000RX Ver 1.2
No ratings yet
CDX-C4900R-C5000R-C5000RX Ver 1.2
78 pages
(Ebook PDF) Complex Analysis: A First Course With Applications 3rd Edition Download
100% (3)
(Ebook PDF) Complex Analysis: A First Course With Applications 3rd Edition Download
44 pages
Ccs337 - Cognitive Science Laboratory Lab Manual Record
No ratings yet
Ccs337 - Cognitive Science Laboratory Lab Manual Record
27 pages
3phase Ac-Dc - Design
No ratings yet
3phase Ac-Dc - Design
9 pages
Genesys Voice Platform Overview
No ratings yet
Genesys Voice Platform Overview
8 pages
HALion 6 Operation Manual en
No ratings yet
HALion 6 Operation Manual en
550 pages
GNSS RTK Best Practices Guide
No ratings yet
GNSS RTK Best Practices Guide
20 pages
(AC-S09) Week 09 - Task: Assignment - How Is Your Life On Campus?
100% (1)
(AC-S09) Week 09 - Task: Assignment - How Is Your Life On Campus?
3 pages
Project Management Essentials
100% (1)
Project Management Essentials
5 pages
Product BackLog Exercise
No ratings yet
Product BackLog Exercise
64 pages
DHCP Questions
No ratings yet
DHCP Questions
23 pages
(2009) Weighted Nonnegative Matrix Factorization
No ratings yet
(2009) Weighted Nonnegative Matrix Factorization
4 pages
Engineering Document Review
No ratings yet
Engineering Document Review
1 page
How To Use MS Word, Excel, PowerPoint For Increase Productivity
No ratings yet
How To Use MS Word, Excel, PowerPoint For Increase Productivity
2 pages
Chapter 2 - Trading Software and Technology - P
No ratings yet
Chapter 2 - Trading Software and Technology - P
142 pages
Lecture 2
No ratings yet
Lecture 2
26 pages
Industrial Summer Training Project Report: Submitted by Ritu Singh
No ratings yet
Industrial Summer Training Project Report: Submitted by Ritu Singh
28 pages
Europe Since Napoleon David Thompson PDF
33% (6)
Europe Since Napoleon David Thompson PDF
2 pages
Day 1 Notes - Graphing Rational Functions - Keyed
No ratings yet
Day 1 Notes - Graphing Rational Functions - Keyed
4 pages
PRELIM Networking 2
No ratings yet
PRELIM Networking 2
2 pages
Harmony XVS XVSV7BBP
No ratings yet
Harmony XVS XVSV7BBP
5 pages
16 Crypto
No ratings yet
16 Crypto
8 pages
VBA Lecture
No ratings yet
VBA Lecture
15 pages
Authentication Form - Ceuzkac2wn86
No ratings yet
Authentication Form - Ceuzkac2wn86
1 page
Additional Information About Printmusic 2014A For Windows
No ratings yet
Additional Information About Printmusic 2014A For Windows
7 pages
10795-Article Text-14323-1-2-20201228part2
No ratings yet
10795-Article Text-14323-1-2-20201228part2
9 pages
Core Lec 18
No ratings yet
Core Lec 18
13 pages
Prolog Solution for Water Jugs
100% (1)
Prolog Solution for Water Jugs
5 pages
From Industry 40 To Tourism 40
No ratings yet
From Industry 40 To Tourism 40
26 pages
Excel Charts
No ratings yet
Excel Charts
18 pages

Ensemble Learning for Malware Detection

Uploaded by

Ensemble Learning for Malware Detection

Uploaded by

Malware Detection Using Ensemble Learning And

Akshat Chourey Chandrashekhar Pomu Chavan

As seen in figure 3, assembly is preprocessed and converted

Fig. 1. Combined ML and file monitoring

The combined workflow gives an understanding of how the

Fig. 5. Byte file converted to image

The png is further converted into a csv as shown in figure 6.

Fig. 6. csv representing byte file

Fig. 9. Confusion matrix

Fig. 8. Model performance

Figure 9 shows us the confusion matrix depicting the

You might also like