0% found this document useful (0 votes)

6 views2 pages

Assignment1 921275

The document outlines a project on credit card fraud detection using a dataset from Kaggle, consisting of 284,807 transactions with a focus on preprocessing steps, model performance, and a front-end application. A Logistic Regression model was trained, achieving a precision of 0.92, recall of 0.85, and an ROC-AUC score of 0.98, indicating effective fraud detection. Additionally, an interactive web application was developed to predict fraud in real-time, with features for user input and probability display.

Uploaded by

jyotiprakashuprety2024

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views2 pages

Assignment1 921275

Uploaded by

jyotiprakashuprety2024

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Purnima Gosain

921275
Assignment 1

1. Data Preparation and Preprocessing Steps

Dataset Overview

The dataset for this project is taken from Credit Card Fraud Detection Dataset of Kaggle. It
consists of 284,807 transactions, of which only 0.172% are fraudulent. Each transation has 30
numberical features such as 'Time', 'Amount', and 28 anonymized PCA components (V1-V28).
The target variable ('Class') is: 1 for fraudulent transactions and 0 for legitimate transactions.

Preprocessing Steps

Managing Missing Values: There were no missing values hence imputation was not done

Feature Scaling:

To improve model performance 'Amount' and 'Time' were standardized using StandardScaler

Class Imbalance Handling:

Synthetic Minority Over-sampling Technique (SMOTE) outperformed better on the imbalance

dataset since fraudulent transactions were highly imbalanced.

Data Splitting:

80% training and 20% testing** of the dataset.

2. Model Performance Metrics

Logistic Regression Model

The trained Logistic Regression model was assessed by using standard classification
performance metrics :

Precision: The fraction of predicted frauds which were actually fraud.

Recall: Determines the number of actual fraud cases that have been identified correctly.

F1 Score: it is Harmonic mean of precision and recall.

ROC-AUC Score: Represents overall classification performance

Precision: 0.92

Recall: 0.85
Purnima Gosain
921275
Assignment 1

F1 Score: 0.88

ROC-AUC: 0.98

These scores suggest the model is able to identify fraud transactions while having relatively few
false positives.

3. Key Features of the Front-End Application

An interactive web application based on Streamlit was created to accept transaction details and
predict whether is fraud or not.

Key Features

User Inputs:

Users insert value for 'Time', 'Amount' and anonymized PCA characteristics (V1-V28).

– Fraud Prediction in Real Time: The model assigns a label to the transaction as Fraud (Class
1)/ Legitimacy (Class 0)

Probability Display: With the app's results, it shows both a fraud and a legitimate transaction
probability.

Input Validation: Yes, this allows for a freshness of data up until October 2023.

Easy to Use Interface: Very basic UI with buttons and color-coded messages (green for legit,
red for fakes).

4. Conclusion

It includes a machine learning pipeline for fraud detection, i.e a trained Logistic Regression
model, an interactive web app that can be used for predictions in real-time. Room for
improvement that might come:

Playing with more complex models (Random Forest, XGBoost, Neural Networks, etc).

Using real-time transaction monitoring in a production system

A Practical approach for efficient and accurate Detection of Fraudulent Credit Card
Transactions

Credit Card Fraud Detection (Data Analyst)
No ratings yet
Credit Card Fraud Detection (Data Analyst)
22 pages
Fine-Tune Whisper For Multilingual ASR With Transformers
No ratings yet
Fine-Tune Whisper For Multilingual ASR With Transformers
24 pages
1
No ratings yet
1
12 pages
Aindumps AI-900 v2021-04-29 by Mohammed 47q
No ratings yet
Aindumps AI-900 v2021-04-29 by Mohammed 47q
30 pages
Credit Card Fraud Detection Report
No ratings yet
Credit Card Fraud Detection Report
2 pages
Wa0006
No ratings yet
Wa0006
6 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
8 pages
Credit Card Fraud Detection Using Machine Learning
No ratings yet
Credit Card Fraud Detection Using Machine Learning
6 pages
Mano Phase 2
No ratings yet
Mano Phase 2
10 pages
B17 Discrete Report
No ratings yet
B17 Discrete Report
16 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
4 pages
.Trashed 1750261541 Phase 2 - Hari
No ratings yet
.Trashed 1750261541 Phase 2 - Hari
3 pages
11
No ratings yet
11
15 pages
Credit Card Fraud Detection Using Machine Learning Techniques
No ratings yet
Credit Card Fraud Detection Using Machine Learning Techniques
4 pages
Phase 5
No ratings yet
Phase 5
10 pages
Fraud Detection Synopsis
No ratings yet
Fraud Detection Synopsis
5 pages
Credit Card Fraud Detection Proposal
No ratings yet
Credit Card Fraud Detection Proposal
2 pages
Internship Project
No ratings yet
Internship Project
8 pages
Assignment 1 Individual Assignment Template
No ratings yet
Assignment 1 Individual Assignment Template
26 pages
Pdsreport
No ratings yet
Pdsreport
6 pages
Session 5
No ratings yet
Session 5
21 pages
Financial Fraud Detection
No ratings yet
Financial Fraud Detection
11 pages
ANN, KNN & Decision Tree
No ratings yet
ANN, KNN & Decision Tree
13 pages
10 Case Study
No ratings yet
10 Case Study
6 pages
Porposal Datamining
No ratings yet
Porposal Datamining
4 pages
AI and DS Final Document For Phase 5
No ratings yet
AI and DS Final Document For Phase 5
9 pages
307 A029 Seminar
No ratings yet
307 A029 Seminar
16 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
25 pages
Final Year Project
No ratings yet
Final Year Project
27 pages
IEEE Conference Template
No ratings yet
IEEE Conference Template
3 pages
Phase 3
No ratings yet
Phase 3
19 pages
Script KHDL
No ratings yet
Script KHDL
4 pages
Report
No ratings yet
Report
14 pages
Credit Card Fraud Detection: Sushant Singh Soumya Bhambani Nishkarsh Mittal Guide:-Ms. Indra Kumari BT4029
No ratings yet
Credit Card Fraud Detection: Sushant Singh Soumya Bhambani Nishkarsh Mittal Guide:-Ms. Indra Kumari BT4029
10 pages
Model
No ratings yet
Model
2 pages
Ibm Project
No ratings yet
Ibm Project
18 pages
Project Report
No ratings yet
Project Report
34 pages
Synopsis Major Project CreditCardFraudDetection
No ratings yet
Synopsis Major Project CreditCardFraudDetection
16 pages
Credit Card Fraud Detection and Analysis
No ratings yet
Credit Card Fraud Detection and Analysis
4 pages
Fraud Detection in Financial Transactions - PPT.PPTX - 20240805 - 175608 - 0000
No ratings yet
Fraud Detection in Financial Transactions - PPT.PPTX - 20240805 - 175608 - 0000
22 pages
Creditcard Fraud Detection
No ratings yet
Creditcard Fraud Detection
26 pages
FA Zoro
No ratings yet
FA Zoro
5 pages
Phase 2-AI Credit Card Fraud Detection System-1-2
No ratings yet
Phase 2-AI Credit Card Fraud Detection System-1-2
4 pages
Nityananda Vyawhare 2223216 Case Study 5
No ratings yet
Nityananda Vyawhare 2223216 Case Study 5
5 pages
IJIRSET Paper Sample
No ratings yet
IJIRSET Paper Sample
4 pages
A Comparison Study of Fraud Detection in Usage of Credit Cards Using Machine Learning
No ratings yet
A Comparison Study of Fraud Detection in Usage of Credit Cards Using Machine Learning
24 pages
Phase-3 Ai Credit Card Detection PDF
No ratings yet
Phase-3 Ai Credit Card Detection PDF
5 pages
ONLINE PAYMENT FRAUD DETECTION USING MACHINE LEARNING MODEL - Key
No ratings yet
ONLINE PAYMENT FRAUD DETECTION USING MACHINE LEARNING MODEL - Key
12 pages
Anti Fraud
No ratings yet
Anti Fraud
23 pages
Final Project Document
No ratings yet
Final Project Document
8 pages
Sowmya
No ratings yet
Sowmya
12 pages
21EBKCS42
No ratings yet
21EBKCS42
57 pages
Ad Batch
No ratings yet
Ad Batch
12 pages
Phase-2 For DS
No ratings yet
Phase-2 For DS
13 pages
New Report
No ratings yet
New Report
61 pages
Real-Time Fraud Detection System
No ratings yet
Real-Time Fraud Detection System
3 pages
Machine Learning Report
No ratings yet
Machine Learning Report
5 pages
Charlton Gotami Presentation
No ratings yet
Charlton Gotami Presentation
14 pages
Charlton Gotami Presentation
No ratings yet
Charlton Gotami Presentation
14 pages
Stripe Payment Integration for Beginners: A Practical Guide to Accepting Payments Online
From Everand
Stripe Payment Integration for Beginners: A Practical Guide to Accepting Payments Online
Steven Mcananey
No ratings yet
Defect Prediction in Software Development & Maintainence
From Everand
Defect Prediction in Software Development & Maintainence
Rudra Kumar
No ratings yet
Consumption-Based Forecasting and Planning: Predicting Changing Demand Patterns in the New Digital Economy
From Everand
Consumption-Based Forecasting and Planning: Predicting Changing Demand Patterns in the New Digital Economy
Charles W. Chase
No ratings yet
Short Brief - Machine Learning
No ratings yet
Short Brief - Machine Learning
10 pages
A Novel Study On Machine Learning Algorithm Based
No ratings yet
A Novel Study On Machine Learning Algorithm Based
10 pages
Final Project Report
No ratings yet
Final Project Report
58 pages
Lal Babu
No ratings yet
Lal Babu
19 pages
BatteryML Paper
No ratings yet
BatteryML Paper
22 pages
AP Computer Science Principles-EXAM1-5 Steps To A 5-MCQ-ANSWERS
No ratings yet
AP Computer Science Principles-EXAM1-5 Steps To A 5-MCQ-ANSWERS
12 pages
Smart Energy Management System
No ratings yet
Smart Energy Management System
11 pages
A Novel Coupled Optimization Prediction Model For Air Quality
No ratings yet
A Novel Coupled Optimization Prediction Model For Air Quality
19 pages
Wahyudi 2021 J. Phys. Conf. Ser. 1830 012016
No ratings yet
Wahyudi 2021 J. Phys. Conf. Ser. 1830 012016
13 pages
Artificial Intelligence and Deep Learning in Pathology 1st Edition Stanley Cohen MD (Editor) - Ebook PDF Download
100% (1)
Artificial Intelligence and Deep Learning in Pathology 1st Edition Stanley Cohen MD (Editor) - Ebook PDF Download
74 pages
Chapter 01 Notes
No ratings yet
Chapter 01 Notes
11 pages
Automatic Detection of Dress-Code Surveillance in A University Using YOLO Algorithm
No ratings yet
Automatic Detection of Dress-Code Surveillance in A University Using YOLO Algorithm
8 pages
Lecture 1 - Introduction To NN - CET
No ratings yet
Lecture 1 - Introduction To NN - CET
53 pages
2024 - Project Report Writing
No ratings yet
2024 - Project Report Writing
48 pages
Lecture 2
No ratings yet
Lecture 2
66 pages
Majeed MV-Soccer Motion-Vector Augmented Instance Segmentation For Soccer Player Tracking CVPRW 2024 Paper
No ratings yet
Majeed MV-Soccer Motion-Vector Augmented Instance Segmentation For Soccer Player Tracking CVPRW 2024 Paper
11 pages
OCR Free Table of Contents Detection in Urdu Books
No ratings yet
OCR Free Table of Contents Detection in Urdu Books
5 pages
Lecture 15
No ratings yet
Lecture 15
37 pages
Personality Prediction System Based On Graphology Using Machine Learning
No ratings yet
Personality Prediction System Based On Graphology Using Machine Learning
34 pages
Reinforced Model Merging: Jiaqi Han, Jingwen Ye, Shunyu Liu, Haofei Zhang, Jie Song, Zunlei Feng, Mingli Song
No ratings yet
Reinforced Model Merging: Jiaqi Han, Jingwen Ye, Shunyu Liu, Haofei Zhang, Jie Song, Zunlei Feng, Mingli Song
6 pages
Application of Machine Learning and Deep Learning For Predicting Groundwater Levels in The West Coast Aquifer System, South Africa
No ratings yet
Application of Machine Learning and Deep Learning For Predicting Groundwater Levels in The West Coast Aquifer System, South Africa
18 pages
Uncertainty, Calibration, and Membership Inference Attacks - An Information-Theoretic Perspective
No ratings yet
Uncertainty, Calibration, and Membership Inference Attacks - An Information-Theoretic Perspective
27 pages
Revised - Enhanced Landslide Detection Using Deep Learning With Recurrent Neural Networks For Temporal Monitoring
No ratings yet
Revised - Enhanced Landslide Detection Using Deep Learning With Recurrent Neural Networks For Temporal Monitoring
6 pages
Lab 1 - Machine Learning with Python - ML Engineering مهم
No ratings yet
Lab 1 - Machine Learning with Python - ML Engineering مهم
10 pages
Capstone Project
No ratings yet
Capstone Project
6 pages
MultiEmo: Language-Agnostic Sentiment Analysis
No ratings yet
MultiEmo: Language-Agnostic Sentiment Analysis
7 pages
Enhancing Multi-Objective Optimisation Through Machine Learning-Supported Multiphysics Simulation
No ratings yet
Enhancing Multi-Objective Optimisation Through Machine Learning-Supported Multiphysics Simulation
16 pages
PCR - S2023 03 053
No ratings yet
PCR - S2023 03 053
5 pages

Assignment1 921275

Uploaded by

Assignment1 921275

Uploaded by

Purnima Gosain

1. Data Preparation and Preprocessing Steps

Class Imbalance Handling:

Synthetic Minority Over-sampling Technique (SMOTE) outperformed better on the imbalance

80% training and 20% testing** of the dataset.

2. Model Performance Metrics

Logistic Regression Model

Precision: The fraction of predicted frauds which were actually fraud.

F1 Score: it is Harmonic mean of precision and recall.

ROC-AUC Score: Represents overall classification performance

3. Key Features of the Front-End Application

Using real-time transaction monitoring in a production system

You might also like