0% found this document useful (0 votes)

159 views22 pages

Phishing Detection

Uploaded by

Shanthireddy Matam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

159 views22 pages

Phishing Detection

Uploaded by

Shanthireddy Matam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 22

Phishing Detection System Through

Hybrid Machine Learning Based on URL

-Matam Shanthi
21BRA16630
01 Introduction to Phishing Attacks

Content 02 Literature Survey on Phishing Detection

03 Proposed Phishing Detection System

04 System Design and Architecture

05 Implementation and Results

06 Conclusion and Future Work

01
Introduction to Phishing Attacks
Overview of Internet and Cybercrime

Role of the Internet in Daily Life

Importance of Cybersecurity

The Internet serves as a vital tool for communication, education,

Cybersecurity is crucial in protecting

and commerce, connecting individuals and businesses globally.

confidential data and maintaining the integrity

of
online transactions. As cybercrimes
become
increasingly sophisticated, effective

cybersecurity measures are essential for

safeguarding user privacy and ensuring trust

Types of Cybercrime
in digital platforms.

Cybercrime encompasses a range of illegal activities conducted

online, including identity theft, online fraud, distribution of malware,
and phishing
Understanding Phishing

Definition and History of Phishing Mechanisms of Phishing Attacks

Phishing is a cybercrime tactic aimed Phishing attacks typically employ

at tricking individuals into providing deceptive emails, fake websites, and

personal information by appearing instant messages that mimic legitimate

legitimate. Since its inception in the entities. These methods are designed

mid-1990s, phishing has evolved from to manipulate users into entering

simple email scams to complex sensitive information, such as

schemes utilizing various passwords and credit card numbers,

communication channels. thereby enabling unauthorized access.

02
Literature Survey on Phishing
Detection
Existing Anti-Phishing Mechanisms

Summary of Past Studies Focus on URL Structures

Previous research has focused on URL structures have garnered

various anti-phishing techniques, attention as significant indicators
of
including heuristic-based filters and phishing attempts. Analyzing URL
blacklisting strategies. However, these attributes helps in discerning legitimate
methods often fall short in identifying from fraudulent sites, providing
a
new attacks due to the dynamic nature foundation for effective
phishing
of phishing tactics. detection models.
Machine Learning in Cybersecurity

Role of Machine Learning

Machine learning algorithms are increasingly utilized

to enhance phishing detection systems. These 1
algorithms analyze patterns in historical data to predict
and identify potential phishing threats in real-time.

Feature Selection
Methods

Effective feature
selection is crucial for improving the
accuracy of machine
learning models. Techniques
2
such as
dimensionality reduction and importance
scoring prioritize
relevant features, thus enhancing
model performance
in detecting phishing URLs.
03
Proposed Phishing Detection
System
System Overview

Objectives of the Study Phishing URL Dataset

The primary objective of this study is This study utilizes a curated dataset
to develop a robust phishing detection containing attributes of both phishing
system that combines various and legitimate URLs. Sourced from a
machine learning algorithms to reputable dataset repository, it
achieve high accuracy in identifying comprises over 11,000 entries used
phishing URLs, thereby improving for training and evaluating the
user security. proposed models.
Machine Learning Approaches

Algorithms Used Proposed Hybrid LSD Model

The proposed system employs multiple The Hybrid LSD model integrates Logistic
machine learning algorithms, including Regression, Support Vector Machine, and
Decision Tree, Random Forest, and Decision Tree into a single framework.
Naive Bayes. Each algorithm contributes Utilizing both soft and hard
voting
unique strengths to enhance overall techniques, this model aims to maximize
detection performance. detection rates and minimize false

positives.
04
System Design and Architecture
System Architecture

Overview of the Architecture UML Diagrams

The system architecture consists of UML diagrams provide a visual
various modules, including data representation of the system's
preprocessing, feature extraction, components and their
relationships.
model training, and prediction. This These diagrams facilitate
understanding
modular design promotes scalability of the system workflows and help
in
and maintainability, ensuring effective identifying potential areas
for
system operation. improvement.
Input and Output Design

Input Requirements

The input design entails clean and validated data,

including URL attributes and their associated labels.
1
Proper input structuring is critical for the model's
learning process and subsequent performance.

Output
Specifications

The output of the

system includes classification
results indicating
whether a URL is phishing or
2
legitimate.
Additionally, performance metrics such as
accuracy,
precision, and recall are generated to
evaluate system
effectiveness.
05
Implementation and Results
Implementation Process

Development Environment Challenges Faced

The system is implemented in Python During implementation, challenges

using libraries such as Scikit-learn include data quality issues, model
and Pandas, which facilitate machine overfitting, and ensuring the system
learning and data manipulation. A effectively generalizes across diverse
rigorous development environment phishing scenarios. Addressing these
ensures consistency and reliability challenges is vital for developing a
during the implementation phase. robust detection system.
Evaluation of Results

Metrics for Performance

Measurement Comparative Analysis of Models
Key metrics for measuring performance A comparative analysis is
conducted
include accuracy, F1-score, precision, across different models,
highlighting
and recall. These metrics provide a their strengths and weaknesses.
This
comprehensive assessment of the analysis helps in identifying the
best-
system's ability to correctly identify performing model and provides
insights
phishing URLs compared to legitimate for enhancing the overall
system
ones. performance.
06
Conclusion and Future Work
Summary of Findings

Effectiveness of Proposed
System Lessons Learned

The proposed system demonstrates Key lessons from this study include the
significant effectiveness in detecting importance of feature selection and the
phishing attacks, achieving a higher need for continuous updates to the models
accuracy rate compared to existing as phishing tactics evolve. Dynamic
models. This success underscores the adaptations are essential for
maintaining
potential of hybrid machine learning detection
efficacy.
approaches in cybersecurity.
Recommendations for Future Research

Expanding Research in
Potential Improvements
Cybersecurity

Future research could explore the The findings encourage broader

integration of additional features, such research in cybersecurity, with a
as behavioral analytics, to enhance focus on developing comprehensive
model accuracy further. Investigating frameworks that encompass various
deep learning techniques may also cyber threats beyond phishing.
yield promising results in phishing Collaborative efforts across
detection. disciplines will be vital to creating
robust defense mechanisms.
Thank you for listening.
-Matam Shanthi

ESP32 - ESP - IDF Programming Guide
100% (2)
ESP32 - ESP - IDF Programming Guide
2,314 pages
Canva 101 - A Beginners Journey Ebook PDF
100% (1)
Canva 101 - A Beginners Journey Ebook PDF
33 pages
Final Report (Yau Jia Xin)
No ratings yet
Final Report (Yau Jia Xin)
68 pages
Mini Project Report Sample Format 2024 - Final
No ratings yet
Mini Project Report Sample Format 2024 - Final
80 pages
Phishingdmreport
No ratings yet
Phishingdmreport
19 pages
Phishing Website Detection
No ratings yet
Phishing Website Detection
63 pages
PHISHING WEBSITE DETECTION USING MACHINE LEARNING - COMPLETED (1) Full
No ratings yet
PHISHING WEBSITE DETECTION USING MACHINE LEARNING - COMPLETED (1) Full
73 pages
Detecting Phishing Website With Code Implementation
No ratings yet
Detecting Phishing Website With Code Implementation
13 pages
SE Report G7
No ratings yet
SE Report G7
21 pages
Across The Spectrum In-Depth Review AI-Based Models For Phishing Detection
No ratings yet
Across The Spectrum In-Depth Review AI-Based Models For Phishing Detection
28 pages
The Official Ubuntu Book Matthew Helmke Download
100% (1)
The Official Ubuntu Book Matthew Helmke Download
56 pages
URL Phishing
No ratings yet
URL Phishing
36 pages
Malicious URL Detection Using Random Forest
No ratings yet
Malicious URL Detection Using Random Forest
36 pages
Chapter Report
No ratings yet
Chapter Report
44 pages
DESERTATION
No ratings yet
DESERTATION
18 pages
Aaaaaaaaaaa
No ratings yet
Aaaaaaaaaaa
52 pages
Avanti Kumari - A Report
No ratings yet
Avanti Kumari - A Report
39 pages
81.phishing Detection System Through Hybrid Machine Learning Based On Url
No ratings yet
81.phishing Detection System Through Hybrid Machine Learning Based On Url
99 pages
Depuuu DOCNW
No ratings yet
Depuuu DOCNW
28 pages
My Mini Project Final
No ratings yet
My Mini Project Final
32 pages
Devops Unit - 2 Material Final
No ratings yet
Devops Unit - 2 Material Final
25 pages
AVEVA Licensing System 4.1 User Guide
No ratings yet
AVEVA Licensing System 4.1 User Guide
62 pages
FR - Detecting Malicious Urls Using Data Analytics
No ratings yet
FR - Detecting Malicious Urls Using Data Analytics
17 pages
Manohar DC Inte
No ratings yet
Manohar DC Inte
17 pages
FE Chat Bypass DrawOrOof
No ratings yet
FE Chat Bypass DrawOrOof
4 pages
Detection of Phishing Website
No ratings yet
Detection of Phishing Website
23 pages
Midterm Project Report
No ratings yet
Midterm Project Report
21 pages
Cyberbullying and Online Aggression Survey Instrument
100% (2)
Cyberbullying and Online Aggression Survey Instrument
4 pages
1822 B.E Cse Batchno 287
No ratings yet
1822 B.E Cse Batchno 287
65 pages
Major Project File
No ratings yet
Major Project File
53 pages
1 PB
No ratings yet
1 PB
11 pages
Researchpaper
No ratings yet
Researchpaper
6 pages
Innovative Nitesh
No ratings yet
Innovative Nitesh
14 pages
Updated Phishing Url Detection
No ratings yet
Updated Phishing Url Detection
13 pages
Edited Phishing Domains Detection Using Deep Learning
No ratings yet
Edited Phishing Domains Detection Using Deep Learning
11 pages
Final Paper On Phishing Domains Detection Using Deep Learning
No ratings yet
Final Paper On Phishing Domains Detection Using Deep Learning
11 pages
Detection of Phishing Website
No ratings yet
Detection of Phishing Website
12 pages
PhishNotCloud-Based ML
No ratings yet
PhishNotCloud-Based ML
11 pages
Automated Phishing Detection Through URL Analysis and Machine Learning
No ratings yet
Automated Phishing Detection Through URL Analysis and Machine Learning
9 pages
Review Paper
No ratings yet
Review Paper
9 pages
A Multi-Algorithm Approach For Phishing Uniform Resource Locator's Detection
No ratings yet
A Multi-Algorithm Approach For Phishing Uniform Resource Locator's Detection
10 pages
Final Yr Project PhishingAttack
No ratings yet
Final Yr Project PhishingAttack
12 pages
Fake Url
No ratings yet
Fake Url
64 pages
Project Report1
No ratings yet
Project Report1
83 pages
Malicious Site Detection (MSD)
No ratings yet
Malicious Site Detection (MSD)
58 pages
Test 1Z0-1085-23
0% (1)
Test 1Z0-1085-23
12 pages
Research Gap
No ratings yet
Research Gap
3 pages
Report PUD
No ratings yet
Report PUD
20 pages
Research Report
No ratings yet
Research Report
19 pages
Final Synopsisi 2
No ratings yet
Final Synopsisi 2
11 pages
Nces CW
No ratings yet
Nces CW
22 pages
Introduction
No ratings yet
Introduction
4 pages
Tittle of The Project
No ratings yet
Tittle of The Project
1 page
Phishing Detection (Yamu Research Project)
No ratings yet
Phishing Detection (Yamu Research Project)
19 pages
Paper 1412
No ratings yet
Paper 1412
8 pages
ML 36pages
No ratings yet
ML 36pages
36 pages
ML 36pages
No ratings yet
ML 36pages
36 pages
1NT21MC081 Research Report
No ratings yet
1NT21MC081 Research Report
5 pages
Drone Technology in Architecture Engineering and Construction A Strategic Guide To Unmanned Aerial Vehicle Operation and Implementation Daniel Tal Jon Altschuld PDF Download
No ratings yet
Drone Technology in Architecture Engineering and Construction A Strategic Guide To Unmanned Aerial Vehicle Operation and Implementation Daniel Tal Jon Altschuld PDF Download
48 pages
Review 0 - Phishing Website in SEO
No ratings yet
Review 0 - Phishing Website in SEO
6 pages
ITB1 Documentation Detection of Phishing Website Using ML
No ratings yet
ITB1 Documentation Detection of Phishing Website Using ML
49 pages
Chapter 1
No ratings yet
Chapter 1
27 pages
CyberSec Review3 Team10
No ratings yet
CyberSec Review3 Team10
28 pages
Shanthi ML
No ratings yet
Shanthi ML
26 pages
P Series
No ratings yet
P Series
36 pages
Detecting Phishing Websites Using Machine Learning
No ratings yet
Detecting Phishing Websites Using Machine Learning
16 pages
Calstate - Edu Backlinks
No ratings yet
Calstate - Edu Backlinks
3 pages
Phishing Seminar
No ratings yet
Phishing Seminar
19 pages
Blood Group Detection
No ratings yet
Blood Group Detection
1 page
AI IMP Questions III-I
No ratings yet
AI IMP Questions III-I
6 pages
2023 12 01 FIMM Virtual Examination iFVE Schedule Q1 2024
No ratings yet
2023 12 01 FIMM Virtual Examination iFVE Schedule Q1 2024
6 pages
Phishing Website Detection Using ML 2-1
No ratings yet
Phishing Website Detection Using ML 2-1
20 pages
A Comparative Analysis of Different Feature Set On The Performance of Different Algorithms in Phishing Website Detection
No ratings yet
A Comparative Analysis of Different Feature Set On The Performance of Different Algorithms in Phishing Website Detection
7 pages
Fake URL Detection Using Machine LearningNKKKKKKKKKKKKKKK
No ratings yet
Fake URL Detection Using Machine LearningNKKKKKKKKKKKKKKK
7 pages
Ionic Framework
No ratings yet
Ionic Framework
2 pages
Development of A Phishing Detection System Using Support Vector Machine
No ratings yet
Development of A Phishing Detection System Using Support Vector Machine
11 pages
How To Configure Wireless Network in Packet Tracer
No ratings yet
How To Configure Wireless Network in Packet Tracer
7 pages
Leveraging Advanced Machine Learning Techniques For Phishing Website Detection
No ratings yet
Leveraging Advanced Machine Learning Techniques For Phishing Website Detection
6 pages
Strategic Security Information and Event Management: Definitive Reference for Developers and Engineers
From Everand
Strategic Security Information and Event Management: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Instructions CSMNRF - 2023 Draft 1 Final
No ratings yet
Instructions CSMNRF - 2023 Draft 1 Final
7 pages
En GMS 8.1.2 Deployment Book
No ratings yet
En GMS 8.1.2 Deployment Book
84 pages
Comprehensive Guide to Checkmarx Security Automation: Definitive Reference for Developers and Engineers
From Everand
Comprehensive Guide to Checkmarx Security Automation: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
(English) NoSQL Database Tutorial - Full Course For Beginners (DownSub - Com)
No ratings yet
(English) NoSQL Database Tutorial - Full Course For Beginners (DownSub - Com)
72 pages
Presentation MANETs
No ratings yet
Presentation MANETs
24 pages
Sentry Error Monitoring and Application Observability: Definitive Reference for Developers and Engineers
From Everand
Sentry Error Monitoring and Application Observability: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Manas Raj - Resume
No ratings yet
Manas Raj - Resume
1 page
Checkpresence (Userlist) : and Enginepriority - Java To 4)
No ratings yet
Checkpresence (Userlist) : and Enginepriority - Java To 4)
4 pages
Paragraph e
No ratings yet
Paragraph e
8 pages
Level 0 The Lobby Backrooms
No ratings yet
Level 0 The Lobby Backrooms
1 page
What Is Banner Advertising
No ratings yet
What Is Banner Advertising
1 page
Kiem Tra Giua Ki 2 Right On 7
No ratings yet
Kiem Tra Giua Ki 2 Right On 7
5 pages
Lookup Field Not Show All Columns Dynamics 365 Business Central
No ratings yet
Lookup Field Not Show All Columns Dynamics 365 Business Central
5 pages
Link Assignment
No ratings yet
Link Assignment
1 page
Thycotic Weak Password Finder Report Sample
No ratings yet
Thycotic Weak Password Finder Report Sample
7 pages
OC200 Release Note
No ratings yet
OC200 Release Note
6 pages
EX - No 8. Simulation of Distance Vector/Link State Routing
No ratings yet
EX - No 8. Simulation of Distance Vector/Link State Routing
2 pages
MemoQ Server Migration Guide
No ratings yet
MemoQ Server Migration Guide
5 pages