Email Spam Filtering Using Logistic Regression With Artificial Bee Colony

The document describes using an artificial bee colony (ABC) algorithm with logistic regression for email spam filtering. It discusses how ABC optimization can help logistic regression handle high-dimensional data more efficiently by combining the algorithm's exploitation and exploration abilities. It then details how the ABC algorithm is used to find the optimal weight vector for training a logistic regression classifier that differentiates spam from ham emails.

Uploaded by

imt2020007

Email Spam Filtering Using Logistic Regression With Artificial Bee Colony

Group Members:
Aman Kumar (2020IMT-007)
Milind Yadav (2020IMT-057)
Sahil Punia (2020IMT-083)
Yashraj Patil (2020IMT-117)
Deependra Yadav (2019IMT-031)
Siddharth Kumar Gautam (2018IMT-099)

Introduction
Email is used in practically every industry today, from business to education.

Email spam, also known as junk email or unwanted email, can harm users by wasting their time and computing resources and by stealing critical data.

Spam volume is rising quickly, and several machine learning and deep learning techniques have been applied to the problem, e.g., Naive Bayes, decision trees, neural networks, and random forests.

Current spam detection methods typically have low detection rates and struggle to handle high-dimensional data.

To deal with this problem, we can combine the Artificial Bee Colony (ABC) algorithm with Logistic Regression, since ABC's mix of exploitation and exploration helps handle high-dimensional data more efficiently.

Literature Review
Email is an electronic means of communication, and messages are categorized as spam or ham (legitimate email).

In email filtering, content-based filtering is the most effective approach. It relies on machine learning algorithms that use features of the message to differentiate between ham and spam.

The complete dataset is divided into a training set and a testing set. Machine learning algorithms are trained on emails that have already been separated into ham and spam, and the testing set is used to analyze the efficiency of the technique.

Naive Bayes is commonly used in spam filtering because of its simplicity, quick convergence, linear computational complexity, and ease of interpretation.

Logistic Regression minimizes the error associated with the output calculated by a logistic activation function. It has also been applied to email classification and has demonstrated good performance in spam filtering.

What is Logistic Regression?
Logistic regression is a statistical method used to model the relationship between a binary response variable and one or more predictor variables.

It is widely used in machine learning and statistical modeling, particularly in classification problems where the goal is to predict the probability of an event occurring.

Logistic regression
The logistic regression model applies a sigmoid function to a weighted combination of the predictor variables to model the binary response variable.
The sigmoid function ensures that the output of the model lies between 0
and 1, which represents the probability of the event occurring.
The model is trained by maximizing the likelihood of the observed data
(maximum likelihood estimation), and the parameters of the model are
estimated using iterative algorithms such as gradient descent.
Logistic regression is a powerful tool for modeling binary data and has
several advantages, including its simplicity, ease of interpretation, and
ability to handle a large number of predictor variables.
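
As a minimal sketch of the model described above, the following Python code computes the sigmoid, the predicted probability for one example, and the average negative log-likelihood (the quantity that is minimized when the likelihood is maximized). The function names, the separate bias term, and the clipping constant `eps` are illustrative assumptions, not taken from the slides.

```python
import math

def sigmoid(z):
    """Squash a real number into the range 0 to 1."""
    # Use the algebraically equivalent form for negative z to avoid overflow.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def predict_proba(weights, bias, x):
    """Probability that example x belongs to the positive class."""
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    return sigmoid(z)

def negative_log_likelihood(weights, bias, X, y, eps=1e-12):
    """Average negative log-likelihood of binary labels y given features X."""
    total = 0.0
    for x, label in zip(X, y):
        # Clip the probability so that log() never receives 0.
        p = min(max(predict_proba(weights, bias, x), eps), 1.0 - eps)
        total -= label * math.log(p) + (1 - label) * math.log(1.0 - p)
    return total / len(X)
```

With all weights and the bias set to zero, every prediction is 0.5, so the loss equals ln 2 regardless of the data; training moves the parameters away from this baseline.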
What is Artificial Bee Colony?
The Artificial Bee Colony (ABC) algorithm is a nature-inspired optimization algorithm that was first proposed in 2005.

It is based on the behavior of honey bees in a colony, where bees work together to find the best food sources.

Artificial Bee Colony
In the ABC algorithm, the problem to be solved is defined as an objective function that the
bees try to optimize.

The algorithm starts with a population of solutions (food sources), and the bees explore
the search space by flying to different solutions and evaluating their quality using the
objective function.

The bees communicate with each other by sharing information about the best solutions
found so far, and they use this information to adjust their search behavior.

The ABC algorithm has been shown to be effective at solving a wide range of optimization
problems, including continuous, discrete, and combinatorial optimization problems.
It has also been used in many applications, such as image processing, machine learning,
and engineering design.

The ABC algorithm is a simple and robust optimization algorithm that is easy to implement
and can often find high-quality solutions with relatively few function evaluations.
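
The loop described above can be sketched compactly. The following is a minimal Python sketch of a basic ABC variant, assuming a box-bounded continuous search space. The function `abc_minimize`, its parameter names and default values, and the sphere test objective are illustrative choices, not taken from the slides.

```python
import random

def abc_minimize(objective, dim, lower, upper, n_sources=10, limit=20,
                 max_iter=200, seed=0):
    rng = random.Random(seed)

    def random_source():
        return [lower + rng.random() * (upper - lower) for _ in range(dim)]

    def fitness(cost):
        # Common ABC convention: lower cost gives higher fitness.
        return 1.0 / (1.0 + cost) if cost >= 0 else 1.0 + abs(cost)

    sources = [random_source() for _ in range(n_sources)]
    costs = [objective(s) for s in sources]
    trials = [0] * n_sources
    best_cost = min(costs)
    best = list(sources[costs.index(best_cost)])

    def try_neighbor(i):
        # Perturb one dimension relative to a different, randomly chosen source.
        k = rng.choice([j for j in range(n_sources) if j != i])
        d = rng.randrange(dim)
        candidate = list(sources[i])
        phi = rng.uniform(-1.0, 1.0)
        candidate[d] += phi * (sources[i][d] - sources[k][d])
        candidate[d] = min(max(candidate[d], lower), upper)
        cost = objective(candidate)
        if cost < costs[i]:  # greedy selection
            sources[i], costs[i], trials[i] = candidate, cost, 0
        else:
            trials[i] += 1

    for _ in range(max_iter):
        # Employed bees: one neighbor search per food source.
        for i in range(n_sources):
            try_neighbor(i)
        # Onlooker bees: revisit sources in proportion to their fitness.
        fits = [fitness(c) for c in costs]
        for _ in range(n_sources):
            i = rng.choices(range(n_sources), weights=fits, k=1)[0]
            try_neighbor(i)
        # Remember the best solution found so far.
        i_best = min(range(n_sources), key=costs.__getitem__)
        if costs[i_best] < best_cost:
            best_cost, best = costs[i_best], list(sources[i_best])
        # Scout bee: abandon the most exhausted source if it is over the limit.
        worst = max(range(n_sources), key=trials.__getitem__)
        if trials[worst] > limit:
            sources[worst] = random_source()
            costs[worst] = objective(sources[worst])
            trials[worst] = 0
    return best, best_cost

def sphere(x):
    """Toy objective with its minimum of 0 at the origin."""
    return sum(v * v for v in x)

best, best_cost = abc_minimize(sphere, dim=3, lower=-5.0, upper=5.0)
```

The fixed seed only makes the run repeatable; in practice the number of food sources, the abandonment limit, and the iteration budget would be tuned to the problem.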
LR classifier based on the ABC algorithm
In logistic regression classification based on Artificial Bee Colony, the ABC algorithm is run on the training dataset to find the optimal weight vector.

The food sources generated in the initial step represent candidate weight vectors for the logistic regression model.
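
One way to picture this encoding is shown below: each food source is a flat list holding one weight per feature followed by a bias, and its quality is derived from the logistic regression loss on the training data. The layout (bias stored last), the initialization range, and the loss-to-fitness conversion `1 / (1 + loss)` are assumptions made for this sketch; the slides do not specify them.

```python
import math
import random

def init_food_sources(n_sources, n_features, low=-1.0, high=1.0, seed=0):
    """Each food source holds n_features weights plus one bias (last entry)."""
    rng = random.Random(seed)
    return [[rng.uniform(low, high) for _ in range(n_features + 1)]
            for _ in range(n_sources)]

def log_loss(solution, X, y, eps=1e-12):
    """Average logistic loss of the model encoded by `solution`."""
    weights, bias = solution[:-1], solution[-1]
    total = 0.0
    for x, label in zip(X, y):
        z = bias + sum(w * xi for w, xi in zip(weights, x))
        z = max(min(z, 500.0), -500.0)   # keep exp() within float range
        p = 1.0 / (1.0 + math.exp(-z))
        p = min(max(p, eps), 1.0 - eps)  # keep log() away from 0
        total -= label * math.log(p) + (1 - label) * math.log(1.0 - p)
    return total / len(X)

def fitness(solution, X, y):
    """Higher is better: a loss of 0 maps to fitness 1."""
    return 1.0 / (1.0 + log_loss(solution, X, y))
```

For a dataset with d features, every food source therefore has d + 1 entries, which is also the dimension of the ABC search space.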

In the first step, n random solutions are generated using the initialization equation. The fitness value associated with each solution is then calculated at the start of the Employed Bees Phase.

For each solution, a neighboring solution is generated using the neighborhood search equation. The fitness of this new solution is calculated, and greedy selection is applied between the newly generated solution and the existing solution.
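
The equations referred to above are not reproduced in the text. For orientation, the basic ABC algorithm is usually written with the following initialization, neighborhood search, and fitness formulas; treating these as the forms intended here is an assumption.

```latex
\begin{align}
x_{ij} &= x_j^{\min} + \mathrm{rand}(0,1)\,\bigl(x_j^{\max} - x_j^{\min}\bigr) \\
v_{ij} &= x_{ij} + \phi_{ij}\,\bigl(x_{ij} - x_{kj}\bigr),
          \qquad \phi_{ij} \in [-1, 1],\ k \neq i \\
\mathrm{fit}_i &=
  \begin{cases}
    \dfrac{1}{1 + f_i}, & f_i \ge 0 \\[1ex]
    1 + \lvert f_i \rvert, & f_i < 0
  \end{cases}
\end{align}
```

Here $x_{ij}$ is component $j$ of solution $i$, $x_j^{\min}$ and $x_j^{\max}$ bound that component, $x_k$ is a randomly chosen different solution, $v_i$ is the candidate neighbor, and $f_i$ is the objective value (for example, the classification loss) of solution $i$. Greedy selection keeps $v_i$ only if its fitness exceeds that of $x_i$.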

After that, a selection probability is computed for each solution using the probability equation. In the Onlooker Bees Phase, a random number is generated at every iteration while the existing solutions are iterated over.

If the random number is less than a solution's selection probability, that solution is selected and a corresponding neighboring solution is generated. Of the two, the solution with the better fitness is kept, and we then break out of the loop.
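
The selection step can be sketched as follows, assuming the usual fitness-proportional probability (each solution's fitness divided by the sum of all fitness values) and the usual convention that a solution is accepted when the random draw falls below its probability. The function names and the `start` parameter are hypothetical.

```python
import random

def selection_probabilities(fitness_values):
    """Fitness-proportional probabilities; they sum to 1."""
    total = sum(fitness_values)
    return [f / total for f in fitness_values]

def pick_source(probabilities, rng, start=0):
    """Cycle through the solutions from `start` until one is accepted.

    A solution is accepted when a fresh random number in [0, 1) is
    smaller than its selection probability. Returns the accepted index.
    """
    n = len(probabilities)
    i = start % n
    while True:
        if rng.random() < probabilities[i]:
            return i  # break out of the loop with the chosen solution
        i = (i + 1) % n

rng = random.Random(0)
probs = selection_probabilities([1.0, 3.0])
chosen = pick_source(probs, rng)
```

Because fitter solutions have larger probabilities, onlooker bees spend more of their neighbor searches around the promising food sources.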

This process is carried out for each of the Onlooker Bees. The solution with the best fitness value achieved so far is memorized so that it is not lost in the process.

In the Scout Bees Phase, if the trial count of a solution exceeds the limit, a new solution is generated to replace the older one. The whole process repeats until the maximum number of iterations is reached.
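
A minimal sketch of the scout step is given below. It assumes each solution carries a trial counter that is incremented whenever a neighbor search fails to improve it, and that replacements are drawn uniformly from a fixed range. The function name and parameters are hypothetical, and this sketch replaces every exhausted solution, whereas some ABC variants replace at most one per cycle.

```python
import random

def scout_phase(sources, trials, limit, low, high, rng):
    """Replace every solution whose trial counter exceeds `limit`.

    Modifies `sources` and `trials` in place and returns the indices
    of the solutions that were replaced.
    """
    replaced = []
    for i, count in enumerate(trials):
        if count > limit:
            # Abandon the exhausted solution and draw a fresh random one.
            sources[i] = [rng.uniform(low, high) for _ in sources[i]]
            trials[i] = 0
            replaced.append(i)
    return replaced

sources = [[0.1, 0.2], [0.3, 0.4]]
trials = [5, 1]
replaced = scout_phase(sources, trials, limit=3, low=-1.0, high=1.0,
                       rng=random.Random(0))
```

Only the first solution has exceeded the limit here, so it alone is re-drawn and its counter is reset.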

After that the weight vector is applied to the LR model, and an output is calculated using the
weights and bias value in the solution vector.
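
Applying the final solution vector might look like the sketch below. It assumes the bias is stored as the last entry of the solution vector, that label 1 means spam and 0 means ham, and that a probability threshold of 0.5 separates the two; none of these details are stated in the slides.

```python
import math

def classify(solution, x, threshold=0.5):
    """Return 1 (spam) or 0 (ham) for feature vector x."""
    *weights, bias = solution
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    z = max(min(z, 500.0), -500.0)  # keep exp() within float range
    probability = 1.0 / (1.0 + math.exp(-z))
    return 1 if probability >= threshold else 0

def accuracy(solution, X, y):
    """Fraction of examples whose predicted label matches the true label."""
    correct = sum(1 for x, label in zip(X, y) if classify(solution, x) == label)
    return correct / len(X)

# Toy example: two features, weights 2.0 and -1.0, bias 0.0.
solution = [2.0, -1.0, 0.0]
X_test = [[3.0, 1.0], [0.0, 4.0]]
y_test = [1, 0]
test_accuracy = accuracy(solution, X_test, y_test)
```

The same `accuracy` function, evaluated on the held-out testing set, is one simple way to analyze the efficiency of the trained classifier.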
Understanding the Model
Model Architecture
