A Review Credit Card Fraud Detection in Banks Using Machine Learning Algorithms
A Review Credit Card Fraud Detection in Banks Using Machine Learning Algorithms
A Review Credit Card Fraud Detection in Banks Using Machine Learning Algorithms
Correspondence
ABSTRACT
University of Management and
Technology Fraud events are increasing on day to day bases. Credit card fraud is the most
frequent fraud that is making a big financial loss on a global level. Researchers
Dr. Ahmed Hassan Butt have implemented many machine learning algorithms to detect the credit card
[email protected] frauds. This research has briefly described several algorithms and compares the
performance of Random Forest, Naïve Bayes, K-Nearest Neighbor, Logistic
M. Hazeel Ahmed Butt
[email protected] Regression and Multilayer Perceptron. These Algorithms are also used to
classify the real transactions or fraudulent transactions. Datasets are compared
on the basis of accuracy, precision, recall & false positive rate. Comparison of
the result shows that Random Forest performs best in credit card fraud detection
dataset among others. Research shows that any ML algorithm can be used to
demonstrate the classification of fraud detection.
KEYWORDS
Fraud detection, Credit card fraud detection, Random forest, Machine learning,
fraudulent behavior detection.
organizations of the world suffer 10% to 15% lose fraud detection from the research of 2016 till 2022. It
in their revenues due to fraudulent activities. also provides positive knowledge about transactional
fraud that are occurring currently though credit cards
• Fraud Detection is a technique to protect a system
these days.
from fraud. It can also be used to enhance the
security of the system in order prevent it from a I. INTRODUCTION
fraudulent activity. In today’s world E-commerce systems are trending
and every single person is aware of it. Now the
WHAT THIS RESEARCH ADDS ON TO
THE FRAUD DETECTION IN BANKS? question arises with the enhancement in electronic
commerce systems, frauds are also increased on the
1. We have categorized and explored in a positive
day to day bases because these e-commerce system
sense to discuss a detailed knowledge about the
has security issues regarding the privacy of
scenarios in which fraud detection techniques
customers’ credit cards.
are used by researchers.
1
These E-commerce websites have two types of users, methods and techniques which are adopted. Section 4,
i.e. Authorized users & Scammers. brings the results with outcomes of doing the research.
At last section 5 contains the concluding part of this
Advancement in the system makes users aware of
online transactions through credit cards, which is the research paper.
most easiest and popular way of money transferring II. LITERATURE REVIEW
on a commercial scale [2]. There can be two types of In history, fraud detection has always been a hot topic
credit card frauds i.e. fraudster can make an unknown towards the research. Different researchers have done
identity to make an online fraud transaction or research in many areas of fraud. In [1], the author
fraudster can use the stolen or lost credit card to refers to the techniques of Artificial Intelligence
transact money or get cash. where he explains the working of neural network with
However, transactional fraud is increasing rapidly and the central nerve system of animals, because it is
every year it is a great loss of money globally [3]. In capable to learn and recognize easily. Researchers
today’s world a person does not need a physical credit represented many types of frauds i.e. bankruptcy
card to make any transaction online, but just the frauds, theft fraud and credit card frauds. Paper [1],
information on a credit card is needed to process the highlights some of the issues & challenges related to
transaction. Credit card has very sensitive a fraud detection system such as Concept Drift, Real
information, i.e. Card Number, CVV code and expiry Time Detection, and Skewed Distribution & Large
date. If a fraudster can have these three private details, amount of Data. It also provides an overview of
then he can make a large amount of purchase. different state-of-art Fraud Detection System
approaches & methods such as artificial neural
To overcome the transactional frauds organizations
network (ANN), support vector machines (SVM),
uses different machine learning algorithms for fraud
decision trees & meta-heuristics etc. They conclude
detection. We have analyzed some algorithms of
by highlighting some challenges that are improper to
machine learning, i.e. Random Forest, which is used
model & have weak accuracy.
as a classifier to classify normal & fraudulent
behaviors by using voting of base classifiers in the According to data mining concept of classification
data set. [4] Naïve Bayes (NB), Logistic Regression fraud detection falls in the bucket of classification
(LG), Multilayer Perceptron (MLP) and decision trees problem [2]. As fraud detection works on the
[5] to identify that which algorithm fits best in credit algorithm of data mining to classify the credit card
card fraud detection. transaction as an original or fraudulent one. The
author proposed in [2] that Credit Card Fraud
This paper also highlights the consequences of
Detection is a problem of Data Mining and there are
financial literacy [6], which is mainly focused on the
two major reasons for which credit card fraud
knowledge of finance and financial decisions in order
detection is becoming more complex & challenging.
to carefully manage the money and transactions of
They also performed a performance test on the bases
credit card. So, by combining the data results of the
of comparison on European cardholders having
above analysis from all techniques and algorithms,
284,807 transactions by using three techniques K-
this paper positively shows that which algorithm will
nearest, Naive Bayes & Logistic Regression. They
better perform on the commercial scale. Section 2,
conclude by showing the effect of hybrid sampling.
defines the literature review and related work of
previous researchers. Section 3, defines the research
2
There are two types of credit card frauds, one is a III. METHODOLOGY
fraud from the application in which fraudster gets a We come up with the search space that includes
card by using false or wrong information & second is different e-databases to acquire valid papers about
a fraud through stealing the card number and fraud detection. We assessed different online
password of card owner and make transactions refer research papers and explored the work of previous
to as behavioral fraud [7]. In [8], misuse detection and researchers to provide a positive review that will
anomaly detection are two fraud detection techniques help future researchers have a better understanding
4
Table 1: Represent Results of Classification
Evaluation
Multilayer 0.90 0.08 0.15 98.4% Figure 3: ROC Curve between TPR and FPR
Perceptron
V. CONCLUSION
Naïve Bayes 0.85 0.26 0.40 99.6%
Credit Card fraud has been increasing in recent years,
these fraudulent transactions, from credit cards are
Logitic 0.91 0.07 0.14 98.2%
Regression causing serious money loss at global level to the people
using bank accounts for the safety of their money.
Many Fraud detection systems have been proposed
As we can see that Random Forest gives highest
technically to prevent the fraudulent activities. Our
accuracy and K-Nearest Neighbor gives lowest
findings are based on the original dataset of credit card
accuracy among these models of classification. In our
fraud detection. The main goal of this paper is to make a
experiment Random forest performs better than other
comparison between different techniques & models of
four models, but on the other side for any other dataset
ML. This research shows that Random Forest gives better
it might be possible that some other model will work
results among other algorithms of ML.
better than Random forest. We have created a bar chart
We have extracted the results of classification evaluation
of comparison on the basis of accuracy and we have
in tabular forms to provide better understanding.
constructed ROC_Curve of Random Forest Classifier to
Furthermore, we have created a bar chart of comparison
demonstrate the pictorial graph.
on the bases of accuracy to give a pictorial representation
of the results.
All our findings are actual and reality-based. Future
researchers should make a comparison with other models
in order to provide a wider view of the fraud detection
techniques. We hope that this paper will help the
researchers have positive minds towards Credit Card
Fraud Detection.
VI. ACKNOWLEDGEMENT