An Analysis of Machine Learning Algorithms and Deep Neural Networks for Email Spam Classification using Natural Language Processing

Md. Mohidul Hasan, Computer Science (Software Engineering), University of Hertfordshire, London, UK
Syed Mahbubuz Zaman, Computer Science & Engineering, BRAC University, Dhaka, Bangladesh
Md. Asif Talukdar, Computer Science & Engineering, BRAC University, Dhaka, Bangladesh
Ayesha Siddika, Computer Science & Engineering, BRAC University, Dhaka, Bangladesh
Md. Golam Rabiul Alam, PhD, Computer Science & Engineering, BRAC University, Dhaka, Bangladesh

2021 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI) | 978-1-6654-6722-3/21/$31.00 ©2021 IEEE | DOI: 10.1109/SOLI54607.2021.9672398

Abstract—Due to the extensive use of technology in our daily lives, email has become essential for online correspondence between individuals from all walks of life. As such, certain individuals have weaponized this service by bulk mailing malicious emails to recipients with the goal of retrieving some form of classified information. Thus, email classification has become a major area of research, as it enables identification and isolation of such malicious emails. The objectives of this paper include a robust comparison of several traditional machine learning (ML) algorithms, an exploration of transfer learning with static (non-trainable) pretrained GLOVE (Global Vectors for Word Representation) embedding, and a comparison of several deep learning models trained with GLOVE and Keras embedding separately. Among ML classifiers, XGBoost achieved the highest evaluation scores. Among deep learning algorithms, Keras embedding based models outperformed GLOVE embedding based models by a small margin, which shows the efficiency of transfer learning in downstream NLP tasks (parts of speech tagging).

Index Terms—XGBoost, Transfer Learning, Bi-directional Long Short Term Memory, Artificial Neural Network & Convolutional Neural Network.

I. INTRODUCTION
Electronic mails (emails) play a significant role in day-to-day communication for a wide variety of professionals and businesses alike. Approximately 319 billion emails are being sent and received per day in 2021, and this number is likely to grow to over 376 billion by the end of 2025, according to the Email Statistics Report 2021 by the Radicati Group [1]. As such, malicious actors have begun using unsolicited emails to exploit users, customers or professionals of particular businesses.

Despite the use of several spam email detection systems, the proportion of spam emails in total email traffic remains enormous [1], [2]. Statista states that 45.1% of all emails exchanged in March 2021 were identified as spam [3].

Traditionally, rule- and protocol-based systems were employed to identify spam and phishing emails [4], [5]. These rule-based systems were static in nature, rendering them ineffective against modern spam and phishing attempts [6]. Malicious attackers are growing more versatile in circumventing existing email filters as computational resources become more widely available. As such, various machine learning based spam email detection systems have been proposed in the existing literature.

The primary contributions of this paper include exploring transfer learning in training deep learning models (GLOVE embedding) as well as a comparison of the ML classifiers and deep learning models using appropriate performance metrics.

II. LITERATURE REVIEW

A. Related Work
In their paper [7], I. AbdulNabi et al. trained K-NN (K-nearest neighbour), NB (Naive Bayes), Bi-LSTM (Bi-directional Long Short-Term Memory) and Google BERT models for email classification. These models were evaluated using accuracy and f1-score. The results show that BERT outperformed all the other models with an accuracy of 97.30% and an F1-score of 96.96%.

S. Srinivasan et al., in their paper [8], explored three Deep Convolutional Neural Network (DCNN) architectures as well as popular pretrained CNN architectures such as VGG29 and Xception to classify spam images. The authors used several image spam data-sets, namely the Image Spam Hunter data-set, an improved data-set developed by the authors of [9] and the Dredze ImageSpam data-set. These models were evaluated using accuracy and f1-score.

In their paper [10], S. Isik et al. explored several Recurrent Neural Network architectures for email classification on an agglutinative language, Turkish. The data-set was collected from [11]. The authors trained an Artificial Neural Network (ANN), LSTM and Bi-LSTM with MI (Mutual Information) and WMI (Weighted Mutual Information) as feature selection methods. The authors show that Bi-LSTM achieved an accuracy of 100%.

Authorized licensed use limited to: REVA UNIVERSITY. Downloaded on March 24,2024 at 16:41:49 UTC from IEEE Xplore. Restrictions apply.

Ankit Narendrakumar Soni, in his paper [12], explored the efficacy of DCNN algorithms for email classification. The author used the Enron and SpamAssassin data-sets to train the DCNN models. Finally, the author proposed THEMIS, an email classification model based on a mathematical approach whereby the emails are divided into several sections and complex functions are employed to extract and classify the email signature. The models were evaluated using accuracy and f1-score. The proposed THEMIS model achieved an accuracy of 99.84%.

A. Barushka et al., in their paper [13], evaluated review spam classification models based on ANN, CNN, NB, SVM and Random Forest with n-gram and skip-gram word representation models. The data-sets used by the authors include the Cornell University positive hotel review spam and negative hotel review spam data-sets and TripAdvisor (Amazon Mechanical Turk). These models were evaluated using accuracy, AU-ROC, FN and FP. ANN and CNN models with a combination of n-gram and skip-gram word representations outperformed all the other models, with an accuracy of 88.38% on the negative data-set and 89.75% on the positive data-set.

In their paper [14], Feng Wei et al. proposed a Bi-LSTM with GLOVE word embedding to detect Twitter bots. The Cresci-2017 Twitter data-set was used by the authors in their work. The model evaluation metrics include precision, recall, specificity, accuracy, F-measure and MCC. The proposed model achieved an accuracy of 96.1%, a recall of 97.6%, a precision of 94% and a specificity of 93.5%.

Ismaila Idris, in his paper [15], proposed an ANN with a negative selection algorithm (genetic algorithm) to classify spam and non-spam emails. The author contrasted the proposed model with an SVM classifier. The models were evaluated using train and test accuracy. The proposed model achieved a train accuracy of 94.30% and a test accuracy of 91.37%.

Sarit Chakraborty et al., in their paper [16], employed several variations of the decision tree classifier to filter spam emails from non-spam emails. The authors specifically used the NBTree classifier, the C4.5/J48 decision tree algorithm and the Logistic Model Tree induction (LMT) classifier. These models were evaluated using accuracy. The authors show that LMT outperforms all the other classifiers with an accuracy of over 85%, followed by NBTree with an accuracy of over 82% and lastly J48 with an accuracy of 78%.

Yoon Kim, in his paper [17], employed variations of a CNN model along with a pretrained word2vec word embedding to classify sentences. The CNN variations considered are CNN-rand, CNN-static, CNN-nonstatic and CNN-multichannel. The author concludes that simple CNN based architectures perform quite well in classification tasks related to natural language processing using a pretrained word2vec word representation model.

B. Observation
Most existing studies lack appropriate performance metrics for model evaluation; as such, it is difficult to conclude whether these models are generalizing to the trained corpus or overfitting. The use of transfer learning and data wrangling in NLP is quite limited in the existing literature. Our work incorporates these techniques to provide a clear, concise and updated analysis of machine learning models in email classification.

III. METHODOLOGY
Supervised classification tasks generally consist of six steps: Data Acquisition, Pre-processing, Feature Extraction, Model Selection, Model Evaluation and Model Deployment. Figure 1 illustrates the steps performed within our work.

Fig. 1. Workflow diagram

A. Data Acquisition and Pre-processing
The data set used in our work was acquired from the Enron data-set [18], a well-known publicly available benchmark corpus dedicated to spam email classification. Only the Kaminski folder of the data-set was used to generate the .csv file. We divided the data-set into a training set (80 percent) and a test set (20 percent). The resulting training set had 4396 email samples and the test set had 1099 email samples.

Several data wrangling/pre-processing steps were performed on the raw emails to optimise the data-set for spam email classification. These steps include Normalization (removing repeated emails, stopwords, words shorter than three characters and punctuation), Tokenization and Lemmatization. Stopwords and punctuation usually hold negligible value for the classification of texts or documents, but they may be very useful in predicting words, completing sentences and other similar tasks.

Usually, raw texts contain empty spaces, new line characters or other document specific symbols. For the purpose of our
task, the raw emails were converted into a list of words, where each word is referred to as a token (Tokenization). Words in a document are used in varying forms due to grammatical requirements, for example: Democracy - Democratic - Democratization. Lemmatization converts these words to their base, root or dictionary form. This allows optimal feature extraction, as words used in varying contexts will have the same base form. These tokens were lemmatized and converted back to sentences/emails.

B. Feature Extraction
Machines are unable to process natural language, such as text in English, directly. For this reason, linguistic data must be transformed into a numeric representation which concisely encapsulates the statistical properties (distribution, frequencies) of the data as well as, in many cases, its contextual and semantic meaning. This numeric representation is used as features for training ML models.

1) TF-IDF (Term Frequency - Inverse Document Frequency): TF-IDF was used to train the traditional (non neural network based) ML classifiers within our work. A TF-IDF score is assigned to a word based on the frequency of the word in a document and the number of documents it occurs in. Generally, within linguistic data or a corpus, certain words are used more frequently despite having lower significance or relevance, in contrast to certain other words that are used rarely despite holding higher relevance to the meaning of the message. The TF-IDF score is used to balance the weights assigned to words such that frequent non-relevant words hold lower values compared to infrequent highly relevant words. That is, the TF-IDF score gives more weight to rare terms in the corpus and penalizes more commonly occurring terms [19].

2) Word Embedding: Word embeddings are able to represent linguistic items or words in a low dimensional vector space. These numeric vectors (of words) are grouped together within the vector space/word space based on semantic similarity, for instance, Boat - Ship. There are primarily two ways to obtain word embeddings, namely a learnable embedding (Keras Embedding) and a pretrained word embedding (GLOVE). The DNN models within this work were trained using Keras and GLOVE embedding separately with varying architectures.

a) Keras Embedding: Keras Embedding requires specific input and output dimensions as arguments. The input texts/words must be integer-encoded (each word index standing in for a one hot encoded vector) prior to training the embedding matrix. The parameters of the Keras Embedding matrix are updated during gradient descent. In our work, a 100-dimensional word embedding was trained using the Keras Embedding layer, meaning that each word from the corpus/emails was transformed into a 100-dimensional vector.

b) GLOVE Embedding: GLOVE is a pretrained word embedding model developed by Pennington et al. [20]. GLOVE combines the global statistics of matrix factorization methods such as LSA (Latent Semantic Analysis) with the local context window approach of the word2vec model. Pennington et al. have published several GLOVE embedding matrices of varying dimensions (50, 100, 200, 300). For this study, we employed the 100-dimensional GLOVE embedding matrix. The GLOVE embedding layer was static; that is, the GLOVE embedding matrix was not fine-tuned (not updated during gradient descent) for email classification.

C. Model Selection
1) Machine Learning Classifiers (ML): The traditional machine learning classifiers trained within this work include Multinomial Naive Bayes, Random Forest, Decision Tree, Gradient Boosting, XGBoost, Logistic Regression, K-nearest neighbors, SVM (Linear) and SVM (RBF).

2) Deep Neural Network Classifiers (DNN):
a) ANN – Artificial Neural Network: An ANN is a feed-forward neural network that can identify patterns within data. An ANN comprises several interconnected layers of nodes. The connections between the nodes have adjustable parameters. These parameters, along with the connections among the nodes, determine the output of the ANN.

b) Bi-LSTM – Bi-Directional Long Short-Term Memory: RNNs (Recurrent Neural Networks) specialize in processing sequential or time-dependent data because of their ability to utilize context (retained memory across inputs) when making final predictions. Bi-LSTM is a type of RNN that processes sequential data in the forward (past) and backward (future) directions, making it more effective in sequential learning tasks (e.g., machine translation).

c) CNN – Convolutional Neural Network: CNNs employ convolution operations on the data matrix to reduce its dimensions while retaining important features of the data-set. CNNs take in sequential data (text) as a 1-dimensional matrix and, consequently, perform a 1-dimensional convolution operation.

D. Model Evaluation
The classifiers and neural network models trained within this work were evaluated using the following metrics.
True Positives (TP): Total number of spam emails correctly recognized.
True Negatives (TN): Total number of benign/ham emails correctly recognized.
False Positives (FP): Total number of ham emails falsely recognized as spam emails.
False Negatives (FN): Total number of spam emails falsely recognized as ham emails.
Precision: Precision, in the context of this work, is the ratio of correctly predicted spam emails to all emails predicted as spam.

$$\mathrm{Precision} = \frac{tp}{tp + fp} \tag{1}$$

High precision means the model produces few false positives relative to true positives.
Recall: Recall, in the context of this work, is the ratio of correctly predicted spam emails to all true spam emails.

$$\mathrm{Recall} = \frac{tp}{tp + fn} \tag{2}$$
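Equations (1) and (2) can be illustrated with a small worked example. The following is a minimal Python sketch, not the authors' evaluation code: the function names and the toy label lists are hypothetical, and spam is assumed to be encoded as 1 and ham as 0.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, FP, FN and TN, treating `positive` (spam) as the positive class."""
    tp = fp = fn = tn = 0
    for truth, guess in zip(y_true, y_pred):
        if guess == positive and truth == positive:
            tp += 1      # spam correctly recognized
        elif guess == positive:
            fp += 1      # ham falsely recognized as spam
        elif truth == positive:
            fn += 1      # spam falsely recognized as ham
        else:
            tn += 1      # ham correctly recognized
    return tp, fp, fn, tn


def precision(tp, fp):
    """Equation (1): tp / (tp + fp); returns 0.0 if nothing was predicted as spam."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp, fn):
    """Equation (2): tp / (tp + fn); returns 0.0 if there is no true spam."""
    return tp / (tp + fn) if (tp + fn) else 0.0


# Toy labels for illustration only (1 = spam, 0 = ham); not data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 1, 1]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
print(f"precision={precision(tp, fp):.2f} recall={recall(tp, fn):.2f}")
# prints "precision=0.60 recall=0.75"
```

In this toy case the classifier flags five emails as spam, of which three are truly spam, and misses one of the four true spam emails, so precision is lower than recall.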

The higher the recall, the more of the positive events (spam emails) the model identifies and labels correctly.
F1 score: The F1 score is the harmonic mean of precision and recall.

$$F1 = \frac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}} \tag{3}$$

The best performing models have an F1 score close to 1.
Accuracy: Accuracy is the ratio of correctly classified emails (both spam and ham) among all emails in the test or train set.

$$\mathrm{Accuracy} = \frac{tp + tn}{tp + fp + fn + tn} \tag{4}$$

AU-ROC: The Receiver Operating Characteristic (ROC) curve plots the true positive rate against the false positive rate across classification thresholds. The area under the ROC curve (AUC) is a measure of separability, that is, the ability of a model to distinguish between classes correctly.

IV. RESULT ANALYSIS AND DISCUSSION
A. Experimental Setup
The traditional machine learning models were trained on a laptop using a Jupyter notebook environment. The deep learning models (ANN, CNN, Bi-LSTM) were trained using Google Colab with a GPU and high-RAM configuration. Libraries used within this work include Pandas, NumPy, Seaborn, Matplotlib, WordCloud, Scikit-learn, Keras, NLTK and TensorFlow.

B. Comparison of traditional machine learning classifiers
Table I compares all ML classifiers trained in our work. The table shows that XGBoost achieved the best scores for recall, f1 score, accuracy and AU-ROC. SVM (RBF) achieved the best precision score. SVM (Linear) achieved the second-best evaluation scores. Multinomial Naive Bayes achieved the lowest evaluation scores, followed by KNN and Decision Tree. Apart from Multinomial Naive Bayes, all the classifiers achieved precision and accuracy scores above 0.95, which was expected and is an improvement over [4], [6], [7], [16], [21].

[4] shows that word embedding with CNN-LSTM achieves an accuracy of 95.9%, a recall of 1.0, a precision of 0.936, an f1 score of 0.967 and a G-mean of 96.7%. The same paper also shows that FastText email representation in conjunction with CNN-LSTM achieves the same evaluation scores as word embedding, with the exception of precision, which is 93.5%. [6] shows that Text CNN achieves an accuracy of 97.54% and an f1 score of 0.97. [7] shows a BERT based model (the best performing model) with an accuracy of 0.9730 and an f1 score of 0.9696 on the training set, and 0.9867 and 0.9866 respectively on the holdout set. [16] shows a Logistic Model Tree classifier with an accuracy of 85.9%. [21] shows that the proposed model, QUAGGA, produces precision, recall and accuracy scores of 0.98 each.

The top three classifiers (XGBoost, SVM-Linear, SVM-RBF) within our work outperform all the models from [4], [6], [7], [16], [21].

C. Comparison of Deep Neural Network (DNN) models
Table II compares all deep learning models (ANN, CNN, Bi-LSTM) trained in our work with Keras embedding and pretrained GLOVE embedding. The table shows that Keras embedding based DNN models outperform pretrained GLOVE embedding based models by a very small margin in terms of evaluation scores.

The GLOVE embedding layers were static, meaning that they were used in conjunction with the DNN models but were not updated or fine-tuned during gradient descent (training). As such, these models had a significantly lower number of trainable parameters compared to Keras embedding based models. Despite being static, GLOVE based models achieved very high evaluation scores. The reason is that pretrained word embedding models (word2vec, GLOVE) encapsulate word similarities off the shelf.

Among GLOVE based models, ANN GLOVE-1 and ANN GLOVE-2 have the lowest evaluation metrics, which was expected due to the complexity of the problem, the structure of the data and the general working principle of artificial neural networks (ANN).

Overall, the DNN models trained in this work outperform all other models proposed within the literature, specifically [4], [6], [7], [16], [21].

Figure 2 shows heat-maps of the classification report for the DNN models (both Keras and GLOVE embedding based). GLOVE based ANN models have the highest, and Keras embedding based models the lowest, false positives and false negatives.

Figure 3 shows the AU-ROC curves for the DNN models. ANN GLOVE-1 and ANN GLOVE-2 incurred the lowest AU-ROC scores because of low precision and recall as well as high false negatives and false positives. All other DNN models have an AU-ROC score of over 0.95, which means these models have generalized to the imbalanced data-set and were able to distinguish spam and ham emails with moderately high accuracy.

V. CONCLUSION
The primary objective of this paper was threefold. We have provided a concise comparison of traditional machine learning algorithms (e.g., Naive Bayes, SVM) for email classification using the Enron corpus. XGBoost achieved the highest evaluation scores among these classifiers. We have trained six DNN models using pretrained GLOVE embedding and three DNN models using Keras embedding, and have provided a rigorous comparison of these nine DNN models. Keras embedding based DNN models, due to their large number of trainable parameters (Table II), have outperformed the other models and classifiers. We have also observed that pretrained GLOVE embedding based DNN models (CNN GLOVE-1, CNN GLOVE-2, Bi-LSTM GLOVE-1 and Bi-LSTM GLOVE-2) have achieved extremely high evaluation scores. This shows that transfer learning can be extremely useful and, in many cases, better for downstream NLP tasks like text classification.

Future work includes using all Enron directories to generate a balanced data-set with approximately 0.5 million messages or emails, generating adversarial attacks to evaluate the robustness of the trained models, implementing a Google BERT embedding layer and using BERT models.

Fig. 2. Heat-map of the classification report for Deep learning models

Fig. 3. AU-ROC for DNN-Classifiers

TABLE I
COMPARISON OF MACHINE LEARNING CLASSIFIERS

Model              Precision  Recall  f1 score  Accuracy  AU-ROC  Train Time(s)
XGBoost            0.9844     0.9898  0.9871    0.9900    0.9898  18.89
SVM (Linear)       0.9868     0.9801  0.9833    0.9873    0.9801  0.15
SVM (RBF)          0.9879     0.9789  0.9833    0.9873    0.9789  15.33
Logistic Regr.     0.9861     0.9644  0.9746    0.9809    0.9644  2.16
Gradient Boosting  0.9787     0.9620  0.9699    0.9773    0.9620  24.85
Random Forest      0.9816     0.9458  0.9620    0.9718    0.9458  7.06
Decision Tree      0.9602     0.9419  0.9506    0.9627    0.9419  2.92
KNN                0.9625     0.9350  0.9477    0.9609    0.9350  0.31
MultinomialNB      0.9352     0.7885  0.8312    0.8899    0.7885  0.02
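
The classifiers in Table I were trained on TF-IDF features (Section III-B). The weighting idea can be sketched in a few lines of Python. This is an illustrative sketch only: the paper does not state which TF-IDF variant was used, so the plain `tf * log(N / df)` form, the whitespace tokenizer, the `tf_idf` function name and the three toy emails are all assumptions made for this example.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    n_docs = len(docs)
    tokenized = [doc.lower().split() for doc in docs]
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            # Term frequency scaled by inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus for illustration only; not taken from the Enron data-set.
emails = [
    "meeting schedule attached please review",
    "please claim free prize today",
    "please review quarterly forecast",
]
scores = tf_idf(emails)

# "please" occurs in every email, so its weight is zero, whereas "prize"
# occurs in a single email and receives the largest weight in that email.
print(scores[1]["please"], scores[1]["prize"] > scores[1]["please"])
```

A term that appears in every document gets `log(N / N) = 0`, which is the penalty on commonly occurring terms that the text describes. Library implementations typically add smoothing so that such terms keep a small non-zero weight.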

TABLE II
COMPARISON OF DEEP LEARNING MODELS

Model                         Precision  Recall  f1 score  Train Accuracy(%)  Test Accuracy(%)  Error  Loss  AU-ROC  Train Time(s)
ANN(Keras embedding)          0.9888     0.9894  0.9899    99.91              99.18             0.81   0.02  0.9888  48.53
CNN(Keras embedding)          0.9865     0.9893  0.9922    100                99.18             0.81   0.03  0.9865  54.47
Bi-LSTM(Keras embedding)      0.9789     0.9833  0.9879    99.82              98.73             1.27   0.05  0.9789  119.11
CNN(GLOVE embedding-1)        0.9799     0.9788  0.9777    100                98.36             1.63   0.69  0.9799  942.53
Bi-LSTM(GLOVE embedding-1)    0.9775     0.9764  0.9754    97.52              98.18             1.81   0.07  0.9775  3592.29
Bi-LSTM(GLOVE embedding-2)    0.9630     0.9678  0.9728    98.27              97.54             2.45   0.05  0.9630  4142.67
CNN(GLOVE embedding-2)        0.9601     0.9665  0.9733    99.36              97.45             2.54   0.13  0.9601  202.71
ANN(GLOVE embedding-1)        0.9109     0.8810  0.8623    88.81              90.17             9.82   0.25  0.9109  806.91
ANN(GLOVE embedding-2)        0.7085     0.7461  0.8954    81.30              84.53             15.46  0.34  0.7085  318.35
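
Section IV-C notes that the static GLOVE models had far fewer trainable parameters than the Keras embedding models. The arithmetic behind that remark can be sketched as follows. This is a hedged illustration, not the authors' model code: the `EmbeddingSpec` class is hypothetical, and the vocabulary size of 20,000 is an assumed value because the paper does not report one. Only the 100-dimensional embedding size comes from the text.

```python
from dataclasses import dataclass


@dataclass
class EmbeddingSpec:
    vocab_size: int   # assumed vocabulary size; not reported in the paper
    dim: int          # embedding dimension (100 in the paper)
    trainable: bool   # True for a learnable embedding, False for a frozen one

    def total_params(self) -> int:
        # One dim-sized vector is stored per vocabulary entry.
        return self.vocab_size * self.dim

    def trainable_params(self) -> int:
        # A frozen (static) embedding contributes nothing to gradient descent.
        return self.total_params() if self.trainable else 0


keras_style = EmbeddingSpec(vocab_size=20_000, dim=100, trainable=True)
glove_style = EmbeddingSpec(vocab_size=20_000, dim=100, trainable=False)

print(keras_style.trainable_params())  # prints 2000000
print(glove_style.trainable_params())  # prints 0
```

Under these assumptions the learnable embedding adds two million weights that must be fitted on the training emails, while the frozen GLOVE matrix adds none, so only the layers above the embedding are trained.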

REFERENCES

[1] “Email statistics report.” [Online]. Available: https://www.radicati.com/wp/wp-content/uploads/2021/EmailStatisticsReport,2021-2025ExecutiveSummary.pdf
[2] J. Johnson, “Spam statistics: Spam e-mail traffic share 2019,” Jul 2021. [Online]. Available: https://www.statista.com/statistics/420391/spam-email-traffic-share/
[3] P. by Statista Research Department and O. 21, “Global average daily spam volume 2021,” Oct 2021. [Online]. Available: https://www.statista.com/statistics/1270424/daily-spam-volume-global/
[4] S. Srinivasan, V. Ravi, M. Alazab, S. Ketha, A.-Z. Ala’M, and S. K. Padannayil, “Spam emails detection based on distributed word embedding with deep learning,” in Machine Intelligence and Big Data Analytics for Cybersecurity Applications. Springer, 2021, pp. 161–189.
[5] S. Nazirova, “Survey on spam filtering techniques,” Aug 2011. [Online]. Available: https://www.scirp.org/journal/paperinformation.aspx?paperid=6769
[6] S. Seth and S. Biswas, “Multimodal spam classification using deep learning techniques,” in 2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS). IEEE, 2017, pp. 346–349.
[7] Q. Yaseen et al., “Spam email detection using deep learning techniques,” Procedia Computer Science, vol. 184, pp. 853–858, 2021.
[8] S. Srinivasan, V. Ravi, V. Sowmya, M. Krichen, D. B. Noureddine, S. Anivilla, and K. Soman, “Deep convolutional neural network based image spam classification,” in 2020 6th Conference on data science and machine learning applications (CDMA). IEEE, 2020, pp. 112–117.
[9] A. Chavda, K. Potika, F. D. Troia, and M. Stamp, “Support vector machines for image spam analysis,” in Proceedings of the 15th International Joint Conference on e-Business and Telecommunications - Volume 2: BASS, INSTICC. SciTePress, 2018, pp. 431–441.
[10] S. Isik, Z. Kurt, Y. Anagun, and K. Ozkan, “Spam e-mail classification recurrent neural networks for spam e-mail classification on an agglutinative language,” International Journal of Intelligent Systems and Applications in Engineering, vol. 8, no. 4, pp. 221–227, 2020.
[11] L. Özgür, T. Güngör, and F. S. Gürgen, “Adaptive anti-spam filtering for agglutinative languages: a special case for turkish,” Pattern Recognit. Lett., vol. 25, pp. 1819–1831, 2004.
[12] A. N. Soni, “Spam-e-mail-detection-using-advanced-deep-convolution-neuralnetwork-algorithms,” JOURNAL FOR INNOVATIVE DEVELOPMENT IN PHARMACEUTICAL AND TECHNICAL SCIENCE, vol. 2, no. 5, pp. 74–80, 2019.
[13] A. Barushka and P. Hajek, “Review spam detection using word embeddings and deep neural networks,” in IFIP International Conference on Artificial Intelligence Applications and Innovations. Springer, 2019, pp. 340–350.
[14] F. Wei and U. T. Nguyen, “Twitter bot detection using bidirectional long short-term memory neural networks and word embeddings,” in 2019 First IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA). IEEE, 2019, pp. 101–109.
[15] I. Idris, “E-mail spam classification with artificial neural network and negative selection algorithm,” International Journal of Computer Science & Communication Networks, vol. 1, no. 3, pp. 227–231, 2011.
[16] S. Chakraborty and B. Mondal, “Spam mail filtering technique using different decision tree classifiers through data mining approach-a comparative performance analysis,” International Journal of Computer Applications, vol. 47, no. 16, 2012.
[17] Y. Kim, “Convolutional neural networks for sentence classification,” CoRR, vol. abs/1408.5882, 2014. [Online]. Available: http://arxiv.org/abs/1408.5882
[18] B. Klimt and Y. Yang, “The enron corpus: A new dataset for email classification research,” in European Conference on Machine Learning. Springer, 2004, pp. 217–226.
[19] S. Robertson, “Understanding inverse document frequency: On theoretical arguments for idf,” Journal of Documentation - J DOC, vol. 60, pp. 503–520, 10 2004.
[20] J. Pennington, R. Socher, and C. D. Manning, “Glove: Global vectors for word representation,” in Empirical Methods in Natural Language Processing (EMNLP), 2014, pp. 1532–1543. [Online]. Available: http://www.aclweb.org/anthology/D14-1162
[21] T. Repke and R. Krestel, “Bringing back structure to free text email conversations with recurrent neural networks,” in European Conference on Information Retrieval. Springer, 2018, pp. 114–126.

Richard Johnson
No ratings yet
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
6 pages
AI Based Threat Detection System - IEEE Report
No ratings yet
AI Based Threat Detection System - IEEE Report
10 pages
Mental Health Assessment Using AI With Sentiment A
No ratings yet
Mental Health Assessment Using AI With Sentiment A
8 pages
Mapping Remote Roads Using Satellite-2024
No ratings yet
Mapping Remote Roads Using Satellite-2024
18 pages
2021-Utilizing Artificial Neural Network For Real-Time Prediction of DifferentialSticking Symptoms
No ratings yet
2021-Utilizing Artificial Neural Network For Real-Time Prediction of DifferentialSticking Symptoms
14 pages
Basri MohammadAhmed
No ratings yet
Basri MohammadAhmed
103 pages
Automatic Detection of Schizophrenia by Applying Deep Learning Over Spectrogram Images of EEG Signals
No ratings yet
Automatic Detection of Schizophrenia by Applying Deep Learning Over Spectrogram Images of EEG Signals
10 pages
Deep Learning and Optimisation For Quality of Service Modelling
No ratings yet
Deep Learning and Optimisation For Quality of Service Modelling
10 pages
Preprints202502 2059 v1
No ratings yet
Preprints202502 2059 v1
19 pages
Lab 4 Specification
No ratings yet
Lab 4 Specification
3 pages
Research Papers
No ratings yet
Research Papers
16 pages
Amazon SageMaker
No ratings yet
Amazon SageMaker
1,055 pages
Dataset Details
No ratings yet
Dataset Details
6 pages
Using Language Models To Disambiguate Lexical Choices in Translation
No ratings yet
Using Language Models To Disambiguate Lexical Choices in Translation
12 pages
Fra Milestone 2 Graded Project Umendra Pratap
No ratings yet
Fra Milestone 2 Graded Project Umendra Pratap
16 pages
Weighted Boxes Fusion: Ensembling Boxes From Different Object Detection Models
No ratings yet
Weighted Boxes Fusion: Ensembling Boxes From Different Object Detection Models
9 pages
ClassX PreAnnual Sahodaya 2024-25
No ratings yet
ClassX PreAnnual Sahodaya 2024-25
9 pages
Machine Learning
No ratings yet
Machine Learning
115 pages
Solarsaksham: Aiml-Powered Solar Forecasting"
No ratings yet
Solarsaksham: Aiml-Powered Solar Forecasting"
30 pages
Dyscalculia Research Paper
No ratings yet
Dyscalculia Research Paper
5 pages
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
No ratings yet
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
31 pages
Mini Project (PPT) ... Last
No ratings yet
Mini Project (PPT) ... Last
19 pages
Batch - 142 - Minor Final Report
No ratings yet
Batch - 142 - Minor Final Report
48 pages
AI-417-IX Unit 1 Project - Cycle - Notes Session 2
No ratings yet
AI-417-IX Unit 1 Project - Cycle - Notes Session 2
13 pages
Final
No ratings yet
Final
63 pages
SS ZG568 EC 2R SECOND SEM 2020 2021 Solution 1617000149821
No ratings yet
SS ZG568 EC 2R SECOND SEM 2020 2021 Solution 1617000149821
6 pages
Mldap
No ratings yet
Mldap
6 pages
BatteryML Paper
No ratings yet
BatteryML Paper
22 pages
DP-100 Overview
No ratings yet
DP-100 Overview
13 pages
The Museums and AI
No ratings yet
The Museums and AI
15 pages


