Mitigating Bias in Machine Learning Models for Phishing Webpage Detection

Kulkarni, Aditya; Balachandran, Vivek; Divakaran, Dinil Mon; Das, Tamal

arXiv:2401.08363 (cs)

[Submitted on 16 Jan 2024]

Abstract: The widespread accessibility of the Internet has led to a surge in online fraudulent activities, underscoring the necessity of shielding users' sensitive information from cybercriminals. Phishing, a well-known cyberattack, revolves around the creation of phishing webpages and the dissemination of corresponding URLs, aiming to deceive users into sharing their sensitive information, often for identity theft or financial gain. Various techniques are available for preemptively categorizing zero-day phishing URLs by distilling unique attributes and constructing predictive models. However, these existing techniques encounter unresolved issues. This proposal delves into persistent challenges within phishing detection solutions, particularly concentrated on the preliminary phase of assembling comprehensive datasets, and proposes a potential solution in the form of a tool engineered to alleviate bias in ML models. Such a tool can generate phishing webpages for any given set of legitimate URLs, infusing randomly selected content and visual-based phishing features. Furthermore, we contend that the tool holds the potential to assess the efficacy of existing phishing detection solutions, especially those trained on confined datasets.

Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2401.08363 [cs.CR]
	(or arXiv:2401.08363v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2401.08363

Submission history

From: Aditya Kulkarni
[v1] Tue, 16 Jan 2024 13:45:54 UTC (290 KB)