0% found this document useful (0 votes)

26 views24 pages

Appendices A D

The document contains appendices that provide additional details related to a project. Appendix C specifically outlines a user's guide for a web-based system that detects phishing websites. It describes the user interface which allows users to input a URL, click a button to assess it, and see the results showing what features the URL matches and a percentage classification of it being a phishing site or not. The guide provides an overview of the system's main functionality for users.

Uploaded by

Jhon Emar Quillo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views24 pages

Appendices A D

Uploaded by

Jhon Emar Quillo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

43

APPENDICES

A. Project Gantt Chart

B. Relevant Source Code
C. User’s Guide
D. Evaluation Tool
E. Endorsement
F. IMRAD
G. IT Expert’s Resume
H. Curriculum Vitae
44

Appendix A
Project Gantt Chart
45

A. Project Gantt Chart

Appendix B
Relevant Source Code
47

B. Relevant Source Code

App.py
#importing required libraries

from flask import Flask, request, render_template

import numpy as np
import pandas as pd
from sklearn import metrics
import warnings
warnings.filterwarnings('ignore')
from feature import generate_data_set
# Gradient Boosting Classifier Model
from sklearn.ensemble import GradientBoostingClassifier

data = pd.read_csv("phishing.csv")
#droping index column
data = data.drop(['Index'],axis = 1)
# Splitting the dataset into dependant and independant fetature

X = data.drop(["class"],axis =1)
y = data["class"]

# instantiate the model

gbc = GradientBoostingClassifier(max_depth=4,learning_rate=0.7)

# fit the model

gbc.fit(X,y)

app = Flask(_name_)

@app.route("/")
def index():
return render_template("index.html", xx= -1)

@app.route("/predict", methods=["GET", "POST"])

def predict():
if request.method == "POST":

Features
import ipaddress
import re
import urllib.request
from bs4 import BeautifulSoup
import socket
48

import requests
from googlesearch import search
import whois
from datetime import date, datetime
import time
from dateutil.parser import parse as date_parse

def diff_month(d1, d2):

return (d1.year - d2.year) * 12 + d1.month - d2.month

def generate_data_set(url):

data_set = []

if not re.match(r"^https?", url):

url = "http://" + url

try:
response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')
except:
response = ""
soup = -999

domain = re.findall(r"://([^/]+)/?", url)[0]

if re.match(r"^www.", domain):
domain = domain.replace("www.", "")
whois_response = whois.whois(domain)

rank_checker_response =
requests.post("https://www.checkpagerank.net/index.php", {
"name": domain
})

try:
global_rank = int(re.findall(
r"Global Rank: ([0-9]+)", rank_checker_response.text)[0])
except:
global_rank = -1

# 1.UsingIP
try:
ipaddress.ip_address(url)
data_set.append(-1)
except:
49

data_set.append(1)

# 2.LongURL
if len(url) < 54:
data_set.append(1)
elif len(url) >= 54 and len(url) <= 75:
data_set.append(0)
else:
data_set.append(-1)

Gradientboostclassifier
#importing required libraries

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn import metrics
import warnings
warnings.filterwarnings('ignore')

#Loading Data
50

data = pd.read_csv("phishing.csv")
data.head()

#Listing the features of the dataset

data.columns

#Information about the dataset

data.info()

#nunique values in columns

data.nunique()

#droping index column

data = data.drop(['Index'],axis = 1)

#description of dataset

data.describe().T

#Splitting the dataset into dependant and independant features

X = data.drop(["class"],axis =1)
y = data["class"]

#Splitting the dataset into train and test sets: 80-20 split

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2,
random_state = 42)
X_train.shape, y_train.shape, X_test.shape, y_test.shape

#Model Building & Training

#Creating holders to store the model performance results

ML_Model = []
accuracy = []
f1_score = []
recall = []
precision = []

#function to call for storing the results

def storeResults(model, a,b,c,d):

ML_Model.append(model)
accuracy.append(round(a, 3))
f1_score.append(round(b, 3))
recall.append(round(c, 3))
precision.append(round(d, 3))

#Gradient Boosting Classifier Model

from sklearn.ensemble import GradientBoostingClassifier

# instantiate the model

gbc = GradientBoostingClassifier(max_depth=4,learning_rate=0.7)

# fit the model

gbc.fit(X_train,y_train)

#predicting the target value from the model for the samples

y_train_gbc = gbc.predict(X_train)
y_test_gbc = gbc.predict(X_test)

HTML
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<meta name="description" content="This website is developed to identify the
safety of a url.">
<meta name="keywords" content="phishing url,phishing,cyber
security,machine learning,classifier,python">
<meta name="author" content="VJCET">

<link href="static/styles.css" rel="stylesheet">

<title>Phishing Website Detection</title>
<script src="https://code.jquery.com/jquery-latest.min.js"></script>
</head>

<body>

<br><br><br>
<form action="/predict" method ="post">
<input type="text" class="form__input" name ='url' id="url" placeholder="
Type a URL" required="" />
<button class="button" id="bxt" role="button" >Click here to
check</button>
</form>

<br>
<h3 id="prediction"></h3>
<button class="button2" id="button2"
role="button" onclick="window.location.replace('http://127.0.0.1:5000')"
target="_blank" > Continue ?</button>
<button class="button1" id="button1"
role="button" onclick="window.location.replace('http://127.0.0.1:5000')"
target="_blank">Continue?</button>
</div>
</div>

$('#form2').hide();
let x = '{{xx}}';
let num = x*100;
if (0<=x && x<0.50){
num = 100-num;
}
let txtx = num.toString();
if(x<=1 && x>=0.50){
var label = "Website is "+txtx +"% safe to use...";
document.getElementById("prediction").innerHTML = label;
document.getElementById("button1").style.display="block";
$('#form1').hide();
setTimeout(function()
{
$('#form2').show();
},1000);

}
else if (0<=x && x<0.50){
var label = "Website is "+txtx +"% unsafe to use..."
document.getElementById("prediction").innerHTML = label ;
document.getElementById("button2").style.display="block";
$('#form1').hide();
setTimeout(function()
{
$('#form2').show();
},1000);

}
</script>

</body>

</html>
54

Appendix C
User’s Guide
55

C. User's Guide

The product is a web-based system that detect a phishing website by

providing a URL. This aim to lessen the people who get deceive by
phishers or people who make phishing website. The system is just giving
a percentage whether the URL is safe or not but the decision will always
on the users.

User Interface Overview

There are three (3) main features;

 Input field – where the user input the URL.
 Check button – clicking it will start to assess the URL.
 Features table – the user can see where the URL matches
features (factors of a phishing website) and the percentage of the
website (URL) to be classified as a phishing website.

Using System’s Features

1. Paste the URL to “Input field” area.

2. Click the “Check button”

3. After assessing the URL, the system will show “This Website is
Safe to use” or “This Website is not safe to use”. Check the
“Features table” to see where the URL matches to features and its
percentage to be classified as phishing site.

4. Check the button “Check button” again to assess new URL.

Appendix D
Evaluation Tool
59

D. Evaluation Tool
60
61
62
63
64
65
66

Final PPT - Phishing Website
100% (1)
Final PPT - Phishing Website
23 pages
MS Office Information For Competitive Exam PDF
No ratings yet
MS Office Information For Competitive Exam PDF
8 pages
Phishing URL Detection
No ratings yet
Phishing URL Detection
242 pages
Phishing
No ratings yet
Phishing
18 pages
Phishing URL Detection Presentation
No ratings yet
Phishing URL Detection Presentation
12 pages
Malicious Url: Analysis and Detection Using Machine Learning
No ratings yet
Malicious Url: Analysis and Detection Using Machine Learning
58 pages
ENM Command
87% (23)
ENM Command
4 pages
URL Crawling & Classification System
No ratings yet
URL Crawling & Classification System
131 pages
Final Report (Yau Jia Xin)
No ratings yet
Final Report (Yau Jia Xin)
68 pages
Mini Project Report Sample Format 2024 - Final
No ratings yet
Mini Project Report Sample Format 2024 - Final
80 pages
Fortiweb v5.8.0 Administration Guide
No ratings yet
Fortiweb v5.8.0 Administration Guide
932 pages
Technothon Phishing Detection
No ratings yet
Technothon Phishing Detection
30 pages
Template SEO Audit Worksheet - SV
No ratings yet
Template SEO Audit Worksheet - SV
33 pages
Phishing URL Detection - Jupyter Notebook
No ratings yet
Phishing URL Detection - Jupyter Notebook
25 pages
Major Project Final Report
No ratings yet
Major Project Final Report
53 pages
Final CPE
No ratings yet
Final CPE
29 pages
WT&DA
No ratings yet
WT&DA
21 pages
Openmind 3 Unit 8 Grammar and Vocabulary Test B
100% (2)
Openmind 3 Unit 8 Grammar and Vocabulary Test B
3 pages
Digi Tags Case Study - Assignment: Suraj Borlepawar
100% (1)
Digi Tags Case Study - Assignment: Suraj Borlepawar
12 pages
Worksheet Phishing
No ratings yet
Worksheet Phishing
15 pages
Url Pishing
No ratings yet
Url Pishing
28 pages
Pasolink NLiteN FULL Manual
No ratings yet
Pasolink NLiteN FULL Manual
438 pages
Ceragon FibeAir 1500 - Operation and Installation Manual
No ratings yet
Ceragon FibeAir 1500 - Operation and Installation Manual
137 pages
Atoll 3.1.0 GSM Gprs Edge Complete
No ratings yet
Atoll 3.1.0 GSM Gprs Edge Complete
135 pages
Detailed Steps For Building A Web Application Vulnerability Scanner
No ratings yet
Detailed Steps For Building A Web Application Vulnerability Scanner
10 pages
Second Review
No ratings yet
Second Review
26 pages
Project
No ratings yet
Project
3 pages
Phishing Detection Tool
No ratings yet
Phishing Detection Tool
16 pages
Chapter 3
No ratings yet
Chapter 3
20 pages
Worksheet Phishing
No ratings yet
Worksheet Phishing
15 pages
App
No ratings yet
App
10 pages
A Machine Learning-Based Solution For Enhanced Online Security
No ratings yet
A Machine Learning-Based Solution For Enhanced Online Security
13 pages
MaliciousURLDetection Acomparativestudy
No ratings yet
MaliciousURLDetection Acomparativestudy
6 pages
Phishing Website Detection by Machine Learning Techniques Presentation
No ratings yet
Phishing Website Detection by Machine Learning Techniques Presentation
12 pages
27 28 37 49 Cpe
No ratings yet
27 28 37 49 Cpe
19 pages
Appendices e F
No ratings yet
Appendices e F
6 pages
Report PUD
No ratings yet
Report PUD
20 pages
Phishing Final
No ratings yet
Phishing Final
13 pages
NIS Microproject
No ratings yet
NIS Microproject
10 pages
Review 4
No ratings yet
Review 4
9 pages
Phishing Review 2023
No ratings yet
Phishing Review 2023
17 pages
FKDGM BP WUq 7 Ik 3 TH JG CC ZIke J2 Lo 1 HK8 HSJM KMFX M3 FSBPN VOe EXc G4 S 9 T Yh U
No ratings yet
FKDGM BP WUq 7 Ik 3 TH JG CC ZIke J2 Lo 1 HK8 HSJM KMFX M3 FSBPN VOe EXc G4 S 9 T Yh U
18 pages
20mis0106 VL2023240103172 Pe003
No ratings yet
20mis0106 VL2023240103172 Pe003
5 pages
Phishing Detection Using ML
No ratings yet
Phishing Detection Using ML
11 pages
Organizational Behavior Chapter 9
No ratings yet
Organizational Behavior Chapter 9
25 pages
22 04 CPE Presentation
No ratings yet
22 04 CPE Presentation
18 pages
Phisingppt
No ratings yet
Phisingppt
15 pages
Comparative Evaluation of Machine Learning Models For Malicious URL Detection
No ratings yet
Comparative Evaluation of Machine Learning Models For Malicious URL Detection
7 pages
Phishing URL Detection Using ML: Project Report
No ratings yet
Phishing URL Detection Using ML: Project Report
24 pages
Phishing
No ratings yet
Phishing
10 pages
Problem Statement - Phishing URL Detection
No ratings yet
Problem Statement - Phishing URL Detection
2 pages
Updated Phishing Url Detection
No ratings yet
Updated Phishing Url Detection
13 pages
Project 3 - Phishing Detector Using LR
No ratings yet
Project 3 - Phishing Detector Using LR
3 pages
Phishing Detection Website Base Paper
No ratings yet
Phishing Detection Website Base Paper
8 pages
Paper 2
No ratings yet
Paper 2
10 pages
Fast and Memory Efficient Phishing Detection Using Extended XGBoost and LightGBM
No ratings yet
Fast and Memory Efficient Phishing Detection Using Extended XGBoost and LightGBM
6 pages
Phishing URL Detection Using ML: Project Report
No ratings yet
Phishing URL Detection Using ML: Project Report
25 pages
Malicious - Url - Detect - 1BY21IS087,88
No ratings yet
Malicious - Url - Detect - 1BY21IS087,88
5 pages
Phishing Seminar
No ratings yet
Phishing Seminar
19 pages
128 Submission
No ratings yet
128 Submission
7 pages
Ai Phishing Report
No ratings yet
Ai Phishing Report
3 pages
Integrating Machine Learning Into Web Applications With Flask
No ratings yet
Integrating Machine Learning Into Web Applications With Flask
7 pages
Paper 7AdvancesinEngineeringSoftware
No ratings yet
Paper 7AdvancesinEngineeringSoftware
6 pages
Phishing Detection Website
No ratings yet
Phishing Detection Website
7 pages
Sniffing Dtetction IEEE Paper
No ratings yet
Sniffing Dtetction IEEE Paper
3 pages
Deploy A Machine Learning Model As An API On AWS, Step by Step
No ratings yet
Deploy A Machine Learning Model As An API On AWS, Step by Step
12 pages
Reducing Web Vulnerabilities by Detecting Malicious Urls: Final Year Project Report
No ratings yet
Reducing Web Vulnerabilities by Detecting Malicious Urls: Final Year Project Report
11 pages
Manual Lexmark x3470
100% (1)
Manual Lexmark x3470
88 pages
CP R75.20 Firewall Admin Guide
No ratings yet
CP R75.20 Firewall Admin Guide
208 pages
Subscriber Unit - Data Sheet (RW5000/HSU/5510/F54/UNI/SFF/INT/23)
No ratings yet
Subscriber Unit - Data Sheet (RW5000/HSU/5510/F54/UNI/SFF/INT/23)
3 pages
IT Defense Database Logs D3
No ratings yet
IT Defense Database Logs D3
5 pages
Direct Marketing: Chapter Twenty Two
No ratings yet
Direct Marketing: Chapter Twenty Two
7 pages
Cooking Up Change The Cookbook
100% (1)
Cooking Up Change The Cookbook
52 pages
SR Iov
No ratings yet
SR Iov
82 pages
DNS and ENUM Guidelines For Service Providers
No ratings yet
DNS and ENUM Guidelines For Service Providers
71 pages
DOCSIS 3.0 Multicast
No ratings yet
DOCSIS 3.0 Multicast
49 pages
DF Unit 2
No ratings yet
DF Unit 2
32 pages
Avnet-ARM Global Seminar Series
No ratings yet
Avnet-ARM Global Seminar Series
4 pages
Intelligent Optical Huawei Optix Osn 3500 For Optical Network Switch-97178415 2
No ratings yet
Intelligent Optical Huawei Optix Osn 3500 For Optical Network Switch-97178415 2
3 pages
Homo Connectus: The Impact of Technology On People's Everyday Lives
No ratings yet
Homo Connectus: The Impact of Technology On People's Everyday Lives
50 pages
S/MIME Message Specification: (Followed by RFC 2633)
No ratings yet
S/MIME Message Specification: (Followed by RFC 2633)
24 pages
Setup Guide-Cisco SPA122
No ratings yet
Setup Guide-Cisco SPA122
9 pages
Practical-19: AIM: How To Install Active Directory Certificate Services (ADCS)
No ratings yet
Practical-19: AIM: How To Install Active Directory Certificate Services (ADCS)
7 pages
Home Sitemap: Introduction To Multiple Antenna Systems: SIMO, MISO, MIMO Miso
No ratings yet
Home Sitemap: Introduction To Multiple Antenna Systems: SIMO, MISO, MIMO Miso
2 pages
X509 Certificate - UNICORE - Lecture 2
No ratings yet
X509 Certificate - UNICORE - Lecture 2
2 pages
TESOL+TEYL PROGRAM DECK 2019v1
67% (3)
TESOL+TEYL PROGRAM DECK 2019v1
14 pages
F 2
No ratings yet
F 2
1 page