0% found this document useful (0 votes)

55 views11 pages

Sat - 26.Pdf - Phishing Website Detection Using Novel Machine Learning Fusion Approach

Here are a few potential research questions addressed in the literature review: - How effective are different machine learning algorithms (such as decision trees, neural networks, etc.) at detecting phishing URLs compared to previous rule-based detection methods? - What types of features (e.g. lexical, syntactic, website content-based) are most useful for machine learning models to accurately identify phishing websites? - How do models perform at detecting new or "zero-day" phishing attacks that use emerging techniques to evade detection? - What are some challenges and open problems in using machine learning for phishing detection, such as the need for large labeled training datasets or the arms race with adapting phishing techniques?

Uploaded by

Vj Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views11 pages

Sat - 26.Pdf - Phishing Website Detection Using Novel Machine Learning Fusion Approach

Uploaded by

Vj Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

ChapterNo. TITLE Page No.

ABSTRACT 8
LIST OF FIGURES vii
1 INTRODUCTION 9

2 LITERATURE SURVEY 10
2.1. Detection of Phishing URL using Machine Learning 10

2.2. A Survey of Machine Learning-Based Solutions for Phishing

23
Website Detection

2.3. Detection of Phishing Websites using MachineLearning 44

2.4 Detecting Phishing Websites Using Machine Learning 52

3 METHODOLOGY 61
v
3.1. EXISTING SYSTEM 61

3.2. PROPOSED SYSTEM 62

3.3. SYSTEM ARCHITECTURE 62

3.4. WORKING OFDECISION TREE 63

4 RESULTS AND DISCUSSION

76
4.1. MACHINE LEARNING

5 CONCLUSION

87
5.1.CONCLUSION

6 CONCLUSION 90

6.1. CONCLUSION

90
REFERENCES

APPENDICES

A. SOURCE CODE 92
B. SCREENSHOTS 95
C. PLAGIARISM REPORT 97
D. JOURNAL PAPER 99

LIST OF FIGURES
vi
Figure No. Figure Name Page No.

3.1 Block Diagram 62

3.2 Flow Diagram 63

3.3 Real Life Analogy 70

3.4 Bagging Parallel & Boosting Sequential 71

3.5 Bagging 72

3.6 Bagging Ensemble Method 73

4.1 Model 80

4.2 Clustering 83

4.3 ML Overview 84

4.4 Training and Testing 85

4.5 Validation Test 85

vii
ABSTARCT:

Phishing websites have proven to be a major security concern. Several cyber attacks risk the
confidentiality, integrity, and availability of company and consumer data, and phishing is the
beginning point for many of them.Many researchers have spent decades creating unique
approaches to automatically detect phishing websites. While cutting-edge solutions can deliver
better results, they need a lot of manual feature engineering and aren't good at identifying new
phishing attacks. As a result, finding strategies that can automatically detect phishing websites and
quickly manage zero-day phishing attempts is an open challenge in this field. The web page in the
URL which hosts that contains a wealth of data that can be used to determine the web server's
maliciousness.Machine Learning is an effective method for detecting phishing.It also eliminates the
disadvantages of the previous method.We conducted a thorough review of the literature and
suggested a new method for detecting phishing websites using features extraction and a machine
learning algorithm. The goal of this research is to use the dataset collected to train ML models and
deep neural nets to anticipate phishing websites.

8
Chapter 1

INTRODUCTION:

Phishing is the most unsafe criminal exercises in cyber space. Since most of the users go online to
access the services provided by government and financial institutions, there has been a significant
increase in phishing attacks for the past few years. Phishers started to earn money and they are
doing this as a successful business. Various methods are used by phishers to attack the
vulnerable users such as messaging, VOIP, spoofed link and counterfeit websites. It is very easy
to create counterfeit websites, which looks like a genuine website in terms of layout and content.
Even, the content of these websites would be identical to their legitimate websites. The reason for
creating these websites is to get private data from users like account numbers, login id, passwords
of debit and credit card, etc. Moreover, attackers ask security questions to answer to posing as a
high level security measure providing to users. When users respond to those questions, they get
easily trapped into phishing attacks. Many researches have been going on to prevent phishing
attacks by different communities around the world. Phishing attacks can be prevented by detecting
the websites and creating awareness to users to identify the phishing websites. Machine learning
algorithms have been one of the powerful techniques in detecting phishing websites. In this study,
various methods of detecting phishing websites have been discussed.

9
Chapter 2

Literature review

2.1 Detection of Phishing URL using Machine Learning

Abstract:

Phishing websites have proven to be a major security concern. Several cyberattacks

risk the confidentiality, integrity, and availability of company and consumer data, and phishing is
the beginning point for many of them. Many researchers have spent decades creating unique
approaches to automatically detect phishing websites. While cutting-edge solutions can deliver
better results, they need a lot of manual feature engineering and aren't good at identifying new
phishing attacks. As a result, finding strategies that can automatically detect phishing websites and
quickly manage zero-day phishing attempts is an open challenge in this field. The web page in the
URL which hosts that contains a wealth of data that can be used to determine the web server's
maliciousness. Machine Learning is an effective method for detecting phishing.It also eliminates
the disadvantages of the previous method. We conducted a thorough review of the literature and
suggested a new method for detecting phishing websites using features extraction and a machine
learning algorithm. The goal of this research is to use the dataset collected to train ML models and
deep neural nets to anticipate phishing websites.

INTRODUCTION

Phishing has become the most serious problem, harming individuals, corporations, and even entire
countries. The availability of multiple services such as online banking, entertainment, education,
software downloading, and social networking has accelerated the Web's evolution in recent years.
As a result, a massive amount of data is constantly downloaded and transferred to the Internet.
Spoofed e-mails pretending to be from reputable businesses and agencies are used in social
engineering techniques to direct consumers to fake websites that deceive users into giving
financial information such as usernames and passwords. Technical tricks involve the installation of
malicious software on computers to steal credentials directly, with systems frequently used to
intercept users' online account usernames and passwords.

A. Types of Phishing Attacks

10
• Deceptive Phishing:
This is the most frequent type of phishing assault, in which a
Cyber criminal impersonates a well-known institution, domain, or organization to acquire sensitive
personal information from the victim, such as login credentials, passwords, bank account
information, credit card information, and so on. Because there is no personalization or
customization for the people, this form of attack lacks sophistication.
•Spear Phishing: Emails containing malicious URLs in this sort of phishing email contain a lot of
personalization information about the potential victim. The recipient's name, company name,
designation, friends, co-workers, and other social information may be included in the email.
•Whale Phishing: To spear phish a "whale," here a top-level executive such as CEO, this sort of
phishing targets corporate leaders such as CEOs and top-level management employees.
• URL Phishing: To infect the target, the fraudster or cyber-criminal employs a URL link. People
are sociable creatures who will eagerly click the link to accept friend invitations and may even be
willing to disclose personal information such as email addresses.
This is because the phishers are redirecting users to a false web server. Secure browser
connections are also used by attackers to carry out their unlawful actions. Due to a lack of
appropriate tools for combating phishing attacks, firms are unable to train their staff in this area,
resulting in an increase in phishing attacks.Companies are educating their staff with mock phishing
assaults, updating all their systems with the latest security procedures, and encrypting important
Information as broad countermeasures. Browsing without caution is one of the most common ways
to become a victim of this phishing assault. The appearance of phishing websites is like that of
authentic websites.

Research question:
Are some of the research questions on which this research paper will elaborate.
• Is it possible to extract features from the URL using machine learning techniques?
• How can phishing URLs be detected using a Machine learning approach in terms of
efficiency?
The ultimate purpose of this study work is to provide a better understanding of the process of
identifying the presence of Phishing attacks using a machine learning technique to identify URL
based features like Address Bar, Domain, JavaScript, and HTML based features. The remaining
part of the paper is written out as follows. The Section 2 of paper is dedicated to a literature review.

11
Section 3 outlines the planned research approach, Section 4 presents the experimental data, and
Section 5 provides the conclusion.

Literature Review

Many scholars have done some sort of analysis on the statistics of phishing URLs. Our technique
incorporates key concepts from past research. We review past work in the detection of phishing
sites using URL features, which inspired our current approach. Happy describe phishing as "one of
the most dangerous ways for hackers to obtain users' accounts such as usernames, account
numbers and passwords, without their awareness." Users are ignorant of this type of trap and will
ultimately, they fall into Phishing scam. This could be due to a lack of a combination of financial aid
and personal experience, as well as a lack of market awareness or brand trust. In this article,
Mehmet et al. suggested a method for phishing detection based on URLs. To compare the results,
the researchers utilized eight different algorithms to evaluate the URLs of three separate datasets
using various sorts of machine learning methods and hierarchical architectures. The first method
evaluates various features of the URL; the second method investigates the website's authenticity
by determining where it is hosted and who operates it; and the third method investigates the
website's graphic presence.We employ Machine Learning techniques and algorithms to analyse
these many properties of URLs and websites. Garera et al. classify phishing URLs using logistic
regression over hand-selected variables. The inclusion of red flag keywords in the URL, as well as
features based on Google's Web page and Google's Page Rank quality recommendations, are
among the features. Without access to the same URLs and features as our approach, it's difficult
to conduct a direct comparison. In this research, Yong et al. created a novel approach for detecting
phishing websites that focuses on detecting a URL which has been demonstrated to be an
accurate and efficient way of detection. To offer you a better idea, our new capsule-based neural
network is divided into several parallel components. One method involves removing shallow
characteristics from URLs. The other two, on the other hand, construct accurate feature
representations of URLs and use shallow features to evaluate URL legitimacy. The final output of
our system is calculated by adding the outputs of all divisions. Extensive testing on a dataset
collected from the Internet indicate that our system can compete with other cutting-edge detection
methods while consuming a fair amount of time. For phishing detection, Vahid Shahrivari et al.
used machine learning approaches. They used the logistic regression classification method, KNN,
12
Adaboost algorithm, SVM, ANN and random forest. They found random forest algorithm provided
good accuracy. Dr.G. Ravi Kumar used a variety of machine learning methods to detect phishing
assaults. For improved results, they used NLP tools. They were able to achieve high accuracy
using a Support Vector Machine and data that had been pre-processed using NLP approaches.
Amani Alswailem et al. tried different machine learning model for phishing detection but was able
to achieve more accuracy in random forest. Hossein et al. created the ―Fresh-Phish‖ open-source
framework. This system can be used to build machine-learning data for phishing websites. They
used a smaller feature set and built the query in Python. They create a big, labelled dataset and
test several machine-learning classifiers on it. Using machine-learning classifiers, this analysis
yields very high accuracy. These studies look at how long it takes to train a model. X. Zhang
suggested a phishing detection model based on mining the semantic characteristics of word
embedding, semantic feature, and multi-scale statistical features in Chinese web pages to detect
phishing performance successfully. To obtain statistical aspects of web pages, eleven features
were retrieved and divided into five classes. To obtain statistical aspects of web pages, eleven
features were retrieved and divided into five classes. To learn and evaluate the model, AdaBoost,
Bagging, Random Forest, and SMO are utilized. The legitimate URLs dataset came from
DirectIndustry online guides, and the phishing data came from China's Anti-Phishing Alliance. With
novel methodologies, M. Aydin approaches a framework for extracting characteristics that is
versatile and straightforward. Phish Tank provides data, and Google provides authentic URLs. C#
programming and R programming were utilized to btain the text attributes. The dataset and third-
party service providers yielded a total of 133 features. The feature selection approaches of CFS
subset based and Consistency subset-based feature selection were employed and examined with
the WEKA tool. The performance of the Nave Bayes and Sequential Minimal Optimization (SMO)
algorithms was evaluated, and the author prefers SMO to NB for phishing detection.

Research Methodology

A phishing website is a social engineering technique that imitates legitimate webpages and uniform
resource locators (URLs). The Uniform Resource Locator (URL) is the most common way for
phishing assaults to occur. Phisher has complete control over the URL's sub-domains. The phisher
can alter the URL because it contains file components and directories.
Methodologies
13
This research used the linear-sequential model, often known as the waterfall model. Although the
waterfall approach is considered conventional, it works best in instances where there are few
requirements. The application was divided into smaller components that were built using
frameworks and hand-written code.

Research Framework:

The steps of this research in which some selected publications were read to determine the
research gap and, as a result, the research challenge was defined. Feature selection, classification
and phishing website detection were all given significant consideration. It's worth noting that most
phishing detection researchers rely on datasets they've created. However, because the datasets
utilized were not available online for those who use and check their results, it is difficult to assess
and compare the performance of a model with other models. As a result, such results cannot be
generalized.

Language

For the preparation of this dissertation, I used Python as the primary language. Python is a
language that is heavily focused on machine learning. It includes several machine learning libraries
that may be utilized straight from an import. Python is commonly used by developers all around the
world to deal with machine learning because of its extensive library of machine learning libraries.
Python has a strong community, and as a result, new features are added with each release.

Data Collection

The phishing URLs were gathered using the open source tool Phish Tank. This site provides a set
of phishing URLs in a variety of forms, including csv, json, and others, which are updated hourly.
This dataset is used to train machine learning models with 5000 random phishing URLs.

Data Cleaning

Fill in missing numbers, smooth out creaking data, detect and delete outliers, and repair anomalies
to clean up the data.
14
Data Pre-processing

Data pre-processing is a cleaning operation that converts unstructured raw data into a neat, well-
structured dataset that may be used for further research. Data pre-processing is a cleaning
operation that transforms unstructured raw data into well-structured and neat dataset which can be
used for further research.

Extraction of Features

In the literature and commercial products, there are numerous algorithms and data formats for
phishing URL detection. A phishing URL and its accompanying website have various
characteristics that distinguish them from harmful URLs. For example, to mask the true domain
name, an attacker can create a long and complicated domain name. Different types of features
that are used in machine learning algorithms in the academic study detection process are used.
The following is a list of features gathered from academic studies for phishing domain detection
using machine learning approaches. Because of some constraints, it may not be logical to use
some of the features in specific instances. Using Content-Based Features to construct a quick
detection mechanism capable of analyzing a huge number of domains may not be feasible. Page-
Based Features are not very effective when analyzing registered domains. As a result, the features
that the detection mechanism will use are determined by the detection mechanism's purpose. So,
which features should be used in the detecting technique been carefully chosen.

Models and Training

The data is split into 8000 training samples and 2000 testing samples, before the ML model is
trained. It is evident from the dataset that this is a supervised machine learning problem.
Classification and regression are the two main types of supervised machine learning issues.
Because the input URL is classed as legitimate or phishing, this data set has a classification
problem. The following supervised machine learning models were examined for this project's
dataset training:

• Decision Tree
• Multilayer Perceptron

Detecting Phishing Website With Code Implementation
No ratings yet
Detecting Phishing Website With Code Implementation
13 pages
Design of Pressure Vessel
No ratings yet
Design of Pressure Vessel
91 pages
Gray Hair and Excessive Facial and Body Hair Proto
100% (1)
Gray Hair and Excessive Facial and Body Hair Proto
6 pages
Ch06 Roth3e
100% (1)
Ch06 Roth3e
85 pages
(IJCST-V9I3P26) :P.Hema Sujatha, S.Sushma Sree, N. Vinay Sreenath, S. Suresh, DR - Bala Brahmeswara Kadaru
No ratings yet
(IJCST-V9I3P26) :P.Hema Sujatha, S.Sushma Sree, N. Vinay Sreenath, S. Suresh, DR - Bala Brahmeswara Kadaru
6 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
Detection of Phishing On Apps and Websites - Project Report
No ratings yet
Detection of Phishing On Apps and Websites - Project Report
21 pages
1NH16CS054
No ratings yet
1NH16CS054
95 pages
ITB1 Documentation Detection of Phishing Website Using ML
No ratings yet
ITB1 Documentation Detection of Phishing Website Using ML
49 pages
Fake Url
No ratings yet
Fake Url
64 pages
Project Report1
No ratings yet
Project Report1
83 pages
Classification of Features For Detecting Phishing Web Sites Based On Machine Learning Techniques
No ratings yet
Classification of Features For Detecting Phishing Web Sites Based On Machine Learning Techniques
51 pages
Phishing URL Detection Using ML: Project Report
No ratings yet
Phishing URL Detection Using ML: Project Report
25 pages
Detecting Phishing Websites Using Machine Learning
No ratings yet
Detecting Phishing Websites Using Machine Learning
6 pages
Phishing Seminar
No ratings yet
Phishing Seminar
19 pages
Detection of Phishing Website
No ratings yet
Detection of Phishing Website
12 pages
CyberSec Review3 Team10
No ratings yet
CyberSec Review3 Team10
28 pages
ISAA Report PDF
No ratings yet
ISAA Report PDF
24 pages
Detecting Phishing Websites Using Machine Learning
No ratings yet
Detecting Phishing Websites Using Machine Learning
6 pages
LIS 2022 New 1-154-160
No ratings yet
LIS 2022 New 1-154-160
7 pages
Phishing Phase1 Report
No ratings yet
Phishing Phase1 Report
20 pages
PHISHING WEBSITE DETECTION USING MACHINE LEARNING - COMPLETED (1) Full
No ratings yet
PHISHING WEBSITE DETECTION USING MACHINE LEARNING - COMPLETED (1) Full
73 pages
IJCRTI020051
No ratings yet
IJCRTI020051
4 pages
Logistic Regression Based Machine Learning Technique For Phishing Website Detection
No ratings yet
Logistic Regression Based Machine Learning Technique For Phishing Website Detection
4 pages
Fin Irjmets1682919970
No ratings yet
Fin Irjmets1682919970
5 pages
Properties of Ocean Water
100% (1)
Properties of Ocean Water
5 pages
Leveraging Advanced Machine Learning Techniques For Phishing Website Detection
No ratings yet
Leveraging Advanced Machine Learning Techniques For Phishing Website Detection
6 pages
Social Engineering Detection: Phishing URLs
No ratings yet
Social Engineering Detection: Phishing URLs
7 pages
Business Plan Group 2
No ratings yet
Business Plan Group 2
48 pages
1822 B.E Cse Batchno 287
No ratings yet
1822 B.E Cse Batchno 287
65 pages
Detection of Phising Websites Using Machine Learning Approaches
No ratings yet
Detection of Phising Websites Using Machine Learning Approaches
9 pages
Hazard Analysis and Risk Assessments For Industrial Processes Using FMEA and Bow-Tie Methodologies
No ratings yet
Hazard Analysis and Risk Assessments For Industrial Processes Using FMEA and Bow-Tie Methodologies
13 pages
Detecting Phishing Websites Using Machine Learning
No ratings yet
Detecting Phishing Websites Using Machine Learning
16 pages
Contents 1
No ratings yet
Contents 1
19 pages
V6I602
No ratings yet
V6I602
8 pages
CH 2. Literature Survey
No ratings yet
CH 2. Literature Survey
5 pages
Phishing Web Site Detection Using Diverse Machine Learning Algorithms
No ratings yet
Phishing Web Site Detection Using Diverse Machine Learning Algorithms
16 pages
Research Report
No ratings yet
Research Report
19 pages
Fake URL Detection Using Machine LearningNKKKKKKKKKKKKKKK
No ratings yet
Fake URL Detection Using Machine LearningNKKKKKKKKKKKKKKK
7 pages
Single Phase String Inverter 7-10 KW: Csi-7Ktl1P-Gi-Fl - Csi-8Ktl1P-Gi-Fl CSI-9KTL1P-GI-FL - CSI-10KTL1P-GI-FL
No ratings yet
Single Phase String Inverter 7-10 KW: Csi-7Ktl1P-Gi-Fl - Csi-8Ktl1P-Gi-Fl CSI-9KTL1P-GI-FL - CSI-10KTL1P-GI-FL
2 pages
A Machine Learning Based Approach For Phishing Detection Using
No ratings yet
A Machine Learning Based Approach For Phishing Detection Using
14 pages
Towards Detection of Phishing Websites On Client-Side Using Machine
No ratings yet
Towards Detection of Phishing Websites On Client-Side Using Machine
14 pages
SCHEME HND 1 General Computer II 2019-2020
No ratings yet
SCHEME HND 1 General Computer II 2019-2020
5 pages
Hazardous Substance Fact Sheet: Right To Know
No ratings yet
Hazardous Substance Fact Sheet: Right To Know
6 pages
Jain 2018
No ratings yet
Jain 2018
14 pages
Based Python Code Generator For CNN
No ratings yet
Based Python Code Generator For CNN
11 pages
Sat - 63.Pdf - Crime Detction Using Machine Learning
No ratings yet
Sat - 63.Pdf - Crime Detction Using Machine Learning
11 pages
Fluid Mechanics and Hydraulics - Gillesania
No ratings yet
Fluid Mechanics and Hydraulics - Gillesania
308 pages
Paper 1
No ratings yet
Paper 1
5 pages
Sat - 84.Pdf - Traffic-Sign Detection and Recognition Using Deep Learning
100% (1)
Sat - 84.Pdf - Traffic-Sign Detection and Recognition Using Deep Learning
11 pages
Phishing Website Detection
No ratings yet
Phishing Website Detection
19 pages
Crop Yield Prediction Using Random Forest Algorithm
No ratings yet
Crop Yield Prediction Using Random Forest Algorithm
11 pages
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
11 pages
DOST PCHRD Calls For Thesis Grant Applications
No ratings yet
DOST PCHRD Calls For Thesis Grant Applications
3 pages
Phishing Detection (Yamu Research Project)
No ratings yet
Phishing Detection (Yamu Research Project)
19 pages
Analyzing User Comments On YouTube Coding Tutorial Videos
No ratings yet
Analyzing User Comments On YouTube Coding Tutorial Videos
50 pages
Sat - 33.Pdf - Recognition and Listing of Acute Stroke Progression Based On Oct Images Using Curvelet Analysis
No ratings yet
Sat - 33.Pdf - Recognition and Listing of Acute Stroke Progression Based On Oct Images Using Curvelet Analysis
11 pages
Sat - 149.Pdf - Prediction of Bigmart Sales Using Machine Learning Algorihms
No ratings yet
Sat - 149.Pdf - Prediction of Bigmart Sales Using Machine Learning Algorihms
11 pages
Sat - 46.Pdf - Crop Yeild Prediction and Crop Recommendation Based On Machine Learning
No ratings yet
Sat - 46.Pdf - Crop Yeild Prediction and Crop Recommendation Based On Machine Learning
11 pages
TCC Catalog 2017 18
No ratings yet
TCC Catalog 2017 18
186 pages
Phishing Detection Using Machine Learnin
No ratings yet
Phishing Detection Using Machine Learnin
5 pages
Phish Guard Phishing Website Using Machine Learning Algorithms
No ratings yet
Phish Guard Phishing Website Using Machine Learning Algorithms
10 pages
Real Time Machine Learning Detection of Heart Disease
No ratings yet
Real Time Machine Learning Detection of Heart Disease
11 pages
Part 3 Discription
No ratings yet
Part 3 Discription
27 pages
Discrete and Stationary Wavelet Decomposition For Image Resolution Enhancement
100% (2)
Discrete and Stationary Wavelet Decomposition For Image Resolution Enhancement
61 pages
Review Paper
No ratings yet
Review Paper
9 pages
Sat - 61.Pdf - Detection of Abnormalities in Brain Using Machine Learning in Medical Image Analysis
No ratings yet
Sat - 61.Pdf - Detection of Abnormalities in Brain Using Machine Learning in Medical Image Analysis
11 pages
Our Paper
No ratings yet
Our Paper
8 pages
Batch-5 Journal-6 ECE-D New
No ratings yet
Batch-5 Journal-6 ECE-D New
6 pages
Paper Major1
No ratings yet
Paper Major1
6 pages
Sri Dev Suman Uttarakhand University ी देव सुमन उ तराख ड व व व यालय
No ratings yet
Sri Dev Suman Uttarakhand University ी देव सुमन उ तराख ड व व व यालय
1 page
Batch-5 ECE-D
No ratings yet
Batch-5 ECE-D
4 pages
Chapter 3
No ratings yet
Chapter 3
10 pages
Chemical Identity: Material Safety Data Sheet Gasoline/Petrol
No ratings yet
Chemical Identity: Material Safety Data Sheet Gasoline/Petrol
4 pages
PDF - Object Detection and Person Tracking Using Uav
No ratings yet
PDF - Object Detection and Person Tracking Using Uav
11 pages
Review 0 - Phishing Website in SEO
No ratings yet
Review 0 - Phishing Website in SEO
6 pages
UNIT 6 - 4000 Essential English Words 1
No ratings yet
UNIT 6 - 4000 Essential English Words 1
6 pages
Sat - 7.Pdf - Predicting Student's Performance Based On Machine Learning
No ratings yet
Sat - 7.Pdf - Predicting Student's Performance Based On Machine Learning
11 pages
Effective Heart Disease Prediction Using Data Mining Technique
No ratings yet
Effective Heart Disease Prediction Using Data Mining Technique
11 pages
Attendance System Based On The Face Recognition of Webcam's Image of The Classroom
No ratings yet
Attendance System Based On The Face Recognition of Webcam's Image of The Classroom
11 pages
Grape Leaf Processing Techniques and Image Processing Techniques
No ratings yet
Grape Leaf Processing Techniques and Image Processing Techniques
11 pages
Sat - 49.Pdf - PEdestrian Detection Using Compact-CNN
No ratings yet
Sat - 49.Pdf - PEdestrian Detection Using Compact-CNN
11 pages
Final
No ratings yet
Final
26 pages
Sat - 153.Pdf - Gmentation of Features Using Neural Network With Cardiac Dataset
No ratings yet
Sat - 153.Pdf - Gmentation of Features Using Neural Network With Cardiac Dataset
11 pages
Compromised Account Detection On Social Networks
No ratings yet
Compromised Account Detection On Social Networks
11 pages
Yuni - The Pcos Detector
No ratings yet
Yuni - The Pcos Detector
11 pages
Sat 21.PDF Secure Vault
No ratings yet
Sat 21.PDF Secure Vault
11 pages
Rubrics Essay
No ratings yet
Rubrics Essay
1 page
Sat - 73.Pdf - Social Media Analysis With Machine Learning
No ratings yet
Sat - 73.Pdf - Social Media Analysis With Machine Learning
8 pages
Sat - 37.Pdf - Quality Analysis of Rice Grains Using Morphological Technniques
No ratings yet
Sat - 37.Pdf - Quality Analysis of Rice Grains Using Morphological Technniques
10 pages
Base Paper
No ratings yet
Base Paper
16 pages
Data Security in Green Cloud
No ratings yet
Data Security in Green Cloud
11 pages
Sat - 41.Pdf - Assification of Quality of Drinking Water Using Machine Learning Technique
No ratings yet
Sat - 41.Pdf - Assification of Quality of Drinking Water Using Machine Learning Technique
11 pages
Sat - 31.Pdf - Failed To Extract Project Title.
No ratings yet
Sat - 31.Pdf - Failed To Extract Project Title.
11 pages
Daily Report Swiss Embassy Jakarta
No ratings yet
Daily Report Swiss Embassy Jakarta
1 page
List of Imran Series by Ibn-e-Safi - Wikipedia
No ratings yet
List of Imran Series by Ibn-e-Safi - Wikipedia
25 pages
Homework 1
No ratings yet
Homework 1
3 pages
Revision Plan - Class X - All in One - SCHEDULE 1
No ratings yet
Revision Plan - Class X - All in One - SCHEDULE 1
13 pages
Major Project Final Report
No ratings yet
Major Project Final Report
53 pages
Presentation Slides
No ratings yet
Presentation Slides
42 pages
Illrigger - GM Binder
No ratings yet
Illrigger - GM Binder
8 pages
Updated Phishing Url Detection
No ratings yet
Updated Phishing Url Detection
13 pages
Wiljam Flight Training: 050-01-01 Composition, Extent, Vertical Division
No ratings yet
Wiljam Flight Training: 050-01-01 Composition, Extent, Vertical Division
18 pages
Phishing 4
No ratings yet
Phishing 4
6 pages
Depuuu DOCNW
No ratings yet
Depuuu DOCNW
28 pages
THE Infinite Game: Simon Sinek
No ratings yet
THE Infinite Game: Simon Sinek
27 pages
PhishNotCloud-Based ML
No ratings yet
PhishNotCloud-Based ML
11 pages
Detection of Phishing Websites Using Mac
No ratings yet
Detection of Phishing Websites Using Mac
3 pages
Paper 2
No ratings yet
Paper 2
10 pages
Automated Phishing Detection Through URL Analysis and Machine Learning
No ratings yet
Automated Phishing Detection Through URL Analysis and Machine Learning
9 pages
Worksheet Geography CH 4
No ratings yet
Worksheet Geography CH 4
2 pages
Data Structure Programs Using C Language (Unit-3)
No ratings yet
Data Structure Programs Using C Language (Unit-3)
10 pages
Blue Zones Minestrone - Dan's Version - Dan Buettner
No ratings yet
Blue Zones Minestrone - Dan's Version - Dan Buettner
3 pages
Phishing
No ratings yet
Phishing
18 pages

Sat - 26.Pdf - Phishing Website Detection Using Novel Machine Learning Fusion Approach

Uploaded by

Sat - 26.Pdf - Phishing Website Detection Using Novel Machine Learning Fusion Approach

Uploaded by

TABLE OF CONTENTS

ChapterNo. TITLE Page No.

2.2. A Survey of Machine Learning-Based Solutions for Phishing

2.3. Detection of Phishing Websites using MachineLearning 44

2.4 Detecting Phishing Websites Using Machine Learning 52

3.2. PROPOSED SYSTEM 62

3.3. SYSTEM ARCHITECTURE 62

3.4. WORKING OFDECISION TREE 63

4 RESULTS AND DISCUSSION

3.1 Block Diagram 62

3.2 Flow Diagram 63

3.3 Real Life Analogy 70

3.4 Bagging Parallel & Boosting Sequential 71

3.6 Bagging Ensemble Method 73

4.4 Training and Testing 85

4.5 Validation Test 85

2.1 Detection of Phishing URL using Machine Learning

Phishing websites have proven to be a major security concern. Several cyberattacks

A. Types of Phishing Attacks

Models and Training

You might also like