UPI Fraud Detection Using Machine Learning
UPI Fraud Detection Using Machine Learning
| DOI: 10.15680/IJIRCCE.2023.1206055 |
ABSTACRT: UPI fraud detection is presently the most frequently occurring problem in the present world. This is due
to the rise in both online transactions and e-commerce platforms. UPIfraud generally happens when the card was stolen
for any of the unauthorized purposes or even when the fraudster uses the UPI information for his use. In the present
world, we are facing a lot of UPI problems. To detect the fraudulent activities the UPIfraud detection system was
introduced. This project aims to focus mainly on machine learning algorithms. The algorithms used are random forest
algorithm and the Adaboost algorithm. The results of the two algorithms are based on accuracy, precision, recall, and
F1-score. The ROC curve is plotted based on the confusion matrix. The Random Forest and the Adaboost algorithms
are compared and the algorithm that has the greatest accuracy, precision, recall, and F1-score is considered as the best
algorithm that is used to detect the fraud.
I. INTRODUCTION
1.1 OVERVIEW
Mobile payment has gained significant popularity as a mainstream payment method, leading to a high volume of
transactions on online trading platforms. Unfortunately, this popularity also attracts criminals who exploit the complex
network environment to commit fraud. Such fraudulent activities not only harm consumers but also impede the healthy
growth of the online economy. Consequently, effective transaction fraud detection becomes a vital tool in combating
network transaction fraud.
Traditional fraud detection approaches primarily rely on statistical and multi-dimensional analysis techniques.
However, these verification based methods struggle to uncover the underlying patterns in transaction data, limiting their
Traditional fraud detection approaches primarily rely on statistical and multi-dimensional analysis techniques.
However, these verification based methods struggle to uncover the underlying patterns in transaction data, limiting their
In 2018, Zhaohui Zhang proposed a reconstructed feature convolutional neural network prediction model specifically
tailored for transaction fraud detection. This model demonstrated improved stability and classification effectiveness
compared to other convolutional neural network models. However, a challenge remains in achieving high detection
accuracy due to imbalanced sample labels. To address this, the paper introduces two fraud detection algorithms: one
based on a Fully Connected Neural Network and another utilizing XGBoost. The former algorithm integrates two
neural network models with different cross-entropy loss functions, enabling a quick and convenient design process for
the combined model. The latter algorithm leverages Hyper opt to optimize the XGBoost classifier, resulting in a fraud
detection model with superior performance by selecting the best parameters. These two algorithms serve different
application scenarios.
| DOI: 10.15680/IJIRCCE.2023.1206055 |
In this blog, we will explore the benefits, limitations, and use cases of machine learning for fraud detection. We will
delve into how machine learning can empower businesses to detect and prevent fraudulent activities, highlighting the
role of unsupervised learning algorithms and the expertise of ML engineers. Additionally, we will discuss the impact of
machine learning on fraud prevention strategies and the advantages of adopting this technology.
3. Predictive-Analysis-based Machine Learning Model for Fraud Detection with Boosting Classifiers
Author- M. Valavan's, S. Rita's
Year-2023
Fraud detection for credit/debit card, loan defaulters and similar types is achievable with the assistance of Machine
Learning (ML) algorithms as they are well capable of learning from previous fraud trends or historical data and spot
them in current or future transactions. Fraudulent cases are scant in the comparison of non-fraudulent observations,
almost in all the datasets. In such cases detecting fraudulent transaction are quite difficult. The most effective way to
prevent loan default is to identify non-performing loans as soon as possible. Machine learning algorithms are coming
into sight as adept at handling such data with enough computing influence. In this paper, the rendering of different
machine learning algorithms such as Decision Tree, Random Forest, linear regression, and Gradient Boosting method
are compared for detection and prediction of fraud cases using loan fraudulent manifestations. Further model accuracy
metric have been performed with confusion matrix and calculation of accuracy, precision, recall and F-1 score along
with Receiver Operating Characteristic (ROC )curves.
| DOI: 10.15680/IJIRCCE.2023.1206055 |
EXISTING SYSTEM
To detect counterfeit transactions, three machine-learning algorithms were presented and implemented.
There are many measures used to evaluate the performance of classifiers or predictors, such as the Gradient Boost
Classifier, Vector Machine, Random Forest, and Decision Tree.
These metrics are either prevalence dependent or prevalence-independent.
Furthermore, these techniques are used in UPI fraud detection mechanisms, and the results of these algorithms
have been compared.
Disadvantages
Mobile payment has gained significant popularity as a mainstream payment method, leading to a high volume of
transactions on online trading platforms.
Unfortunately, this popularity also attracts criminals who exploit the complex network environment to commit
fraud. Such fraudulent activities not only harm consumers but also impede the healthy growth of the online
economy.
Consequently, effective transaction fraud detection becomes a vital tool in combating network transaction fraud.
ADVANTAGES
A typical organization loses an estimated 5% of its yearly revenue to fraud. In this course, learn to fight fraud by
using data. Apply supervised learning algorithms to detect fraudulent behavior based upon past fraud, and use
unsupervised learning methods to discover new types of fraud activities.
Fraudulent transactions are rare compared to the norm. As such, learn to properly classify imbalanced datasets.
The course provides technical and theoretical insights and demonstrates how to implement fraud detection models.
Finally, get tips and advice from real-life experience to help prevent common mistakes in fraud analytics.
V. SYSTEM CONFIGURATION
| DOI: 10.15680/IJIRCCE.2023.1206055 |
UPIFRAUD
Credits are typically used to refer to electronic financial transactions made without the use of physical cash. A UPI that
is extensively used for online transactions is a small piece made up of thin plastic material with credit services and
customer details. Fraudsters use credit cards to make unlawful transactions that result in massive losses to banks and
card holders. Moreover, the invention of counterfeit cards has aided fraudsters in performing illicit transactions more
easily. In general, it is regarded as illegitimate to use the card without the proper owners’ authorization. By obtaining
access to a certain account illegitimately, any transaction that is carried out is considered as fraudulent. UPI fraudulent
activities can be divided into two aspects, namely, offline and online fraud. In offline fraudulent activity, the fraudsters
conduct their illicit transactions with stolen credit cards such as genuine card holders, while online fraudsters conduct
their activities in online transactions through Internet Online fraud.
INSURANCE FRAUD
Insurance fraud can be defined as the act of misusing an insurance policy for gaining illegitimate benefits from an
insurance business. Usually, insurance is made to protect the organization’s transactions or individual’s transactions
against any financial risks. The main sectors of target by fraudulent insurance claims include healthcare and automobile
insurance companies although home and crop insurance fraudulent also occur, however, there is a paucity of the
literature on both. It has been estimated recently that the total cost of insurance fraud in the United States is over a
billion USD yearly and it is finally passed on to consumers in the form of higher insurance premiums.
In order to cover the relevant costs of theft or accidental damages to a car, an agreement between the insurance provider
and the insured person or organization is typically involved in automobile insurance claims. Individual fraudsters are
capable of committing fraudulent claims, and one method of committing fraud is through deception during the claims
process. Evidence of organized groups working together to conduct insurance fraud also exists.
VII. CONCLUSION
Online transaction are increasing with UPI, cards, net banking as it is easier and saves time of the customer purchase.
But as the online transaction is increasing so is the fraud. Hence using machine learning algorithms and ANN model, an
attempt has been made to gain the knowledge about the fraud and genuine transaction. With the increasing usage of
ecommerce and online shopping, customer’s vital information like card’s CVV, UPI pin, OTP, passwords etc. are
always vulnerable. The reason for rise in fraud cases is that fraudsters easily barge into the customers private
information. Even the banking systems are vulnerable to frauds, also the fraudsters are finding new ways of fraud like
identity theft from social media platform so the fraud prevention and detection is an ever evolving process. As the
online frauds are ever increasing, the traditional way of detecting and preventing frauds are being replaced by various
machine learning algorithms.
REFERENCES
1.Elisa Indriasari; Ford LumbanGaol; Tokuro Matsuo, “Digital Banking Transformation: Application of Artificial
Intelligence and Big Data Analytics forLeveraging Customer Experience in the Indonesia Banking
Sector”,presented at 2019 8th International Congress on Advanced Applied Informatics( IIAI-AAI) ,Toyanama,
Japan,7-11 Year: 2019, pp: 863-868.
| DOI: 10.15680/IJIRCCE.2023.1206055 |
2.Dr. Navleen Kaur, Ms. SupriyaLambaSahdev, “Banking 4.0:the influence of artificial intelligence on the banking
industry & how AIis changing the face of modern day banks”,International Journal of Management (IJM) Volume 11,
Issue 6, June 2020,pp 39-45.
3.Tushar Gupta; Naman Gupta; Ankit Agrawal; Aksh Agrawal; KartikKansal, “Role of Big Data Analytics In
Banking”, 2019 International Conference on contemporary Computing and Informatics (IC3I).
4.KaithekuzhicalLeenaKurien*1 & Dr. AjeetChikkamannur, “Detection and Prediction of credit card fraud
transactions using machine learning”,International Journal of Engineering Sciences & Research
Technology(IJESRT),ISSN:2277-9655,pp. 205-207
5.CorreaBahnsen, D. Aouada, A. Stojanovic, and B. Ottersten, “Feature engineering strategies for UPIfraud
detection,” Expert Systems with Applications, vol. 51, pp. 134–142, 2016.
6.Varun Kumar K S, Vijaya Kumar V G, Vijay Shankar A, Pratibha K, “UPIFraud Detection using Machine Learning
Algorithms”,International Journal of Engineering Research & Technology (IJERT) ,Volume 09, Issue 07 (July
2020),pp 5-8.
7.Dal Pozzolo, G. Boracchi, O. Caelen, C. Alippi and G. Bontempi, “UPIFraud Detection: A Realistic Modeling and a
Novel LearningStrategy,” in IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 8, pp. 3784-
3797, Aug. 2018.
8.Pratyush Sharma, Souradeep Banerjee, Devyanshi Tiwari, and Jagdish Chandra Patni, “Machine Learning
Model for UPIFraud Detection-A Comparative Analysis”,The International Arab Journal of Information
Technology, Vol. 18, No. 6, November 2021 , pp 789-790.
9.Ruttala Sailusha ; V. Gnaneswar ; R. Ramesh ; G. Ramakoteswara Rao ,”UPIFraud Detection Using Machine
Learning ”,2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS) ,19 June
2020
10.Kha Shing Lim, Lam Hong Lee and Yee-WaiSim,“A Review of Machine Learning Algorithms for Fraud
Detection in Credit Card Transaction”,IJCSNS International Journal of Computer Science and Network Security,
Vol.21 No.9, September 2021,pp 32-37.