0% found this document useful (0 votes)

35 views17 pages

Project Stage I Report

The document describes a project to develop a machine learning model for predicting loan defaults. The objectives are to minimize risk of default and classify borrowers. The methodology involves data cleaning, exploratory analysis, and building models like KNN, Random Forest and XGBoost. Key performance metrics are accuracy, recall, precision and F1 score. Feature selection and handling imbalanced data are techniques to improve efficiency. The project timeline outlines phases from topic selection to deployment over 7 months.

Uploaded by

ravenharley1863

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views17 pages

Project Stage I Report

Uploaded by

ravenharley1863

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 17

K. K.

WAGH INSTITUTE OF ENGINEERING EDUCATION & RESEARCH Click icon to add picture

LOAN DEFAULTER PREDICTION USING

SUPERVISED MACHINE LEARNING
ALGORITHMS

Group Id 02
Internal Guide: Prof. Smita Patil
TEAM MEMBERS
Division. Name of the Email ID Contact Number
/ student
Roll No.
05 Rajshree Thakare [email protected] 9763914017

06 Manasi Barge [email protected] 8805539779

07 Sanket Padwal [email protected] 9404688630

08 Pratik Desale [email protected] 7588801537

Problem Definition

 To develop a model for Loan Defaulter Prediction. However, there are some customers behave
negatively after their application are approved. To prevent this situation, banks have to find some
methods to predict customers’ behaviours using Machine learning algorithms.
Objectives

 Objective
 To minimize the risk of borrowers defaulting the loans using created model.
 Create predicative model to classify each borrower as defaulter or not using the
data collected when the loan has been given. Determining probability of user
liability.
 Creating an interactive UI that will take users input and return an output

 Scope
 The goal of this project is to build a machine learning model that can predict if a
person will default on the loan based on the loan and personal information
provided.
 The model is intended to be used as a reference tool for the client and his financial
institution to help make decisions on issuing loans, so that the risk can be lowered,
and the profit can be maximized.
Literature Review

Research Paper Year Author Content

Loan prediction by using machine 2020 Supriya P, Pavani M, They started their analysis with data
learning models Saisushma N cleaning pre-processing, missing value
imputation, then exploratory data analysis,
and finally model building and evaluation.

Implementation of decision tree 2019 Amin R K, Indwiarti and The maximum precision value achieved
using C4.5 algorithm in decision Sibaroni Zhou, was 78.08% with data partition of 90:10 and
making of loan application by debtor the biggest recall value was 86% with data
partition of 80:20.
An exploratory data analysis for loan 2018 Sumathi V P and Sri J S They classify and examine the nature of
prediction based on nature of the loan applicants andconcluded that most
clients loan applicants preferred short-term loans.

Credit risk analysis and prediction 2017 G. Sudhamathy Banks hold huge volumes of customer
modelling of bank loans behaviour related data from which they are
unable to arrive at a judgement if an
applicant can be defaulter or not.
Requirement Specification

 Functional requirements

 The system should be able to build Users profile and maintain the record.

 The system will predict a users performance on the basis of the previous record.

 On the basis of previous record the system should be able to notify about the users, that user
is good or bad in that particular Loan Facility.

 Determining probability of user liability.

Requirement Specification

 Non-functional requirements
 Availability
 The system gives advice or alerts user immediately
 The system gives accurate results
 Interactive, minimal delays, safe information transmission

 Reliability
 Predictability
 Accuracy
 Usability
 Interoperability
 Efficiency
Methodology

 Data Cleaning and Pre-processing

 Take Dataset as a input.
 Give Training Dataset and Testing Dataset.
 Data Preprocessing step contains data cleaning process.

 Exploratory Data Analysis

 Finding meaningful patterns
 Statistical measured.

 Model building contain algorithm work

 KNN
 Random Forest
 XGBoost
Algorithms

 KKN
 The KNN algorithm is used for both classification and regression problems. How- ever, the KNN is
more widely used in classification problems in the industry and thus will be used in doing
classification and predictive analysis in this paper. The KNN is a simple algorithm that stores all
available cases and classifies new cases by a majority vote of its k neighbors.

 Random Forest
 This is a tree based ensemble model which helps in improving the accuracy of the model . It
combines a large number of Decision trees to build a powerful predicting model. It takes a random
sample of rows and features of each individual tree to prepare a decision tree model. Final prediction
class is either the mode of all the predictors or the mean of all the predictors.

 XGBooost
 This algorithm only works with the quantitative variable. It is a gradient boosting algorithm which
forms strong rules for the model by boosting weak learners to a strong learner. It is a fast and
efficient algorithm which recently dominated machine learning because of its high performance
and speed.
Detailed Design
Experimental Setup / Simulation

 Expectations
 To achieve a F1 score of training around 90% and F1 score of testing around 85-90%.

 Datasets Used
 The dataset we used is derived from the Kaggle.
 It contains more than 115,000 original loan data of users with 102 attributes.
 Training - 50%, Testing – 25%, Validation – 25%.
Experimental Setup / Simulation

 Operating System: - Windows 7/8/10

 Application Server :- Apache Tomcat 7/8/9
 Front End :- HTML, CSS
 Database : -Mysql
 Programming Language :- Python
 Processor : Intel i3/i5/i7
 Hard Disk :- 5 GB
 Memory:- 1GB RAM
Performance Parameters

 Confusion Matrix

• Accuracy
• Accuracy is defined as the ratio of the number of samples correctly classified by the classifier to the total
number of samples for a given test data set.
Performance Parameters

 F1-score
 F1-score, also called a balanced F Score, is defined as the balanced average of Precision and
recall.

 Recall

 Precision
Efficiency Issues

 Recursive Feature Elimination method to select 30 features with the strongest

correlation with the target variable, and eliminated the features step by step to achieve
the first dimensionality reduction, with the independent variable reduced .

 The target variable ‘loans status’ has a large difference in the number of normal and
default categories, which will cause trouble to model learning.
Project Planning (5)
7/22/2021 9/10/2021 10/30/2021 12/19/2021 2/7/2022 3/29/2022

Topic Searching and Paper Finding

Project Topic Approval

Problem Defination

Literature Review

Collecting Datasets

Understanding Required Technique

Implementation

Testing Dataset

Integration with Framework

Testing and Deployment

THANK YOU !!

EBX Documentation Advanced
100% (2)
EBX Documentation Advanced
664 pages
Wa0000.
No ratings yet
Wa0000.
58 pages
Loan-Prediction Using Machine Learning
No ratings yet
Loan-Prediction Using Machine Learning
31 pages
For Loan Approval Prediction
100% (1)
For Loan Approval Prediction
14 pages
Loan Approval - PPT
No ratings yet
Loan Approval - PPT
19 pages
Kisutsa - Loan Default Prediction Using Machine Learning, A Case of Mobile Based Lending
No ratings yet
Kisutsa - Loan Default Prediction Using Machine Learning, A Case of Mobile Based Lending
55 pages
1822 B.E Cse Batchno 6
No ratings yet
1822 B.E Cse Batchno 6
60 pages
SSRN 5088929
No ratings yet
SSRN 5088929
11 pages
IJSRDV8I80146
No ratings yet
IJSRDV8I80146
6 pages
Loan Eligibility Prediction
No ratings yet
Loan Eligibility Prediction
14 pages
Paper 1
No ratings yet
Paper 1
10 pages
Paper 4
No ratings yet
Paper 4
9 pages
Engineers Reference Manual
No ratings yet
Engineers Reference Manual
506 pages
Project Review I Final Pid 02
No ratings yet
Project Review I Final Pid 02
9 pages
Sat - 6.Pdf - Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Sat - 6.Pdf - Prediction of Modernized Loan Approval System Based On Machine Learning Approach
11 pages
The Loan Prediction Using Machine Learning
No ratings yet
The Loan Prediction Using Machine Learning
9 pages
ML Report1
No ratings yet
ML Report1
19 pages
1 s2.0 S2666307423000293 Main
No ratings yet
1 s2.0 S2666307423000293 Main
13 pages
Ranvijay 12203409
No ratings yet
Ranvijay 12203409
13 pages
Research Paper ALAS
No ratings yet
Research Paper ALAS
4 pages
Research Report
No ratings yet
Research Report
8 pages
Anu Internshipreport
No ratings yet
Anu Internshipreport
28 pages
Credit Card Fraud Detection
No ratings yet
Credit Card Fraud Detection
89 pages
minipptPOWER 1pdf
No ratings yet
minipptPOWER 1pdf
16 pages
Loan Approval Model Prediction
No ratings yet
Loan Approval Model Prediction
10 pages
Wa0001.
No ratings yet
Wa0001.
8 pages
Edafinal 1
No ratings yet
Edafinal 1
32 pages
Gupta 2020
No ratings yet
Gupta 2020
4 pages
IJNRD2407179
No ratings yet
IJNRD2407179
7 pages
Ieeexplore Ieee
No ratings yet
Ieeexplore Ieee
2 pages
Credit Score Kisutsa - Loan Default Prediction Using Machine Learning, A Case of Mobile Based Lending
No ratings yet
Credit Score Kisutsa - Loan Default Prediction Using Machine Learning, A Case of Mobile Based Lending
51 pages
Paper 3
No ratings yet
Paper 3
5 pages
MLIBISc Syllabus 2024 2025
No ratings yet
MLIBISc Syllabus 2024 2025
78 pages
Reasearchby AK0102
No ratings yet
Reasearchby AK0102
7 pages
SSRN Id4532468
No ratings yet
SSRN Id4532468
13 pages
Loan Prediction Using Artificial Intelligence and Machine Learning
No ratings yet
Loan Prediction Using Artificial Intelligence and Machine Learning
24 pages
Unit 1
No ratings yet
Unit 1
61 pages
Loan Prediction System
No ratings yet
Loan Prediction System
8 pages
Loan Approval Prediction Using Supervised Learning Algorithm
No ratings yet
Loan Approval Prediction Using Supervised Learning Algorithm
11 pages
(IJCST-V9I3P21) :sanket Bhattad, Sumit Bawane, Shweta Agrawal, Unnati Ramteke, Dr. P. B. Ambhore
No ratings yet
(IJCST-V9I3P21) :sanket Bhattad, Sumit Bawane, Shweta Agrawal, Unnati Ramteke, Dr. P. B. Ambhore
4 pages
2022 V13i876
No ratings yet
2022 V13i876
9 pages
Research Paper
No ratings yet
Research Paper
14 pages
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
11 pages
Fin Irjmets1651834789
No ratings yet
Fin Irjmets1651834789
8 pages
Project Lit Final1
No ratings yet
Project Lit Final1
15 pages
Paper 14014
No ratings yet
Paper 14014
9 pages
Loan Prediction 10
No ratings yet
Loan Prediction 10
10 pages
MP Paper
No ratings yet
MP Paper
4 pages
Loan Approval Prediction Based On Machine Learning Approach: Kumar Arun, Garg Ishan, Kaur Sanmeet
No ratings yet
Loan Approval Prediction Based On Machine Learning Approach: Kumar Arun, Garg Ishan, Kaur Sanmeet
4 pages
B2 19bec113 19bec116 Loan Prediction
No ratings yet
B2 19bec113 19bec116 Loan Prediction
3 pages
Synopsis of Lep 01
No ratings yet
Synopsis of Lep 01
8 pages
Wa0003.
No ratings yet
Wa0003.
6 pages
Assessment Report Richa
No ratings yet
Assessment Report Richa
12 pages
School of Information Technology and Engineering M.Tech Software Engineering (Integrated) FALL SEMESTER 2020 - 2021
No ratings yet
School of Information Technology and Engineering M.Tech Software Engineering (Integrated) FALL SEMESTER 2020 - 2021
36 pages
2022 V13i1198
No ratings yet
2022 V13i1198
12 pages
Yoast SEO For WordPress BE 2 2 Yoast SEO Metabox
100% (1)
Yoast SEO For WordPress BE 2 2 Yoast SEO Metabox
9 pages
Finance Project Proposal
No ratings yet
Finance Project Proposal
7 pages
Unit II
No ratings yet
Unit II
31 pages
06 Database, Security, CDN, and EI Services
No ratings yet
06 Database, Security, CDN, and EI Services
90 pages
ABSTRACT
No ratings yet
ABSTRACT
7 pages
Experiment No: 05 Aim: Theory:: What Is A USE Case Diagram?
No ratings yet
Experiment No: 05 Aim: Theory:: What Is A USE Case Diagram?
30 pages
Loan Eligibility Prediction
No ratings yet
Loan Eligibility Prediction
12 pages
EMC Data Domain Retention Lock WP
No ratings yet
EMC Data Domain Retention Lock WP
22 pages
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
22 pages
Viva Data Mining Lab
No ratings yet
Viva Data Mining Lab
11 pages
ML and Ai Synopsis
No ratings yet
ML and Ai Synopsis
8 pages
6 HMI Human Machine Interaction
No ratings yet
6 HMI Human Machine Interaction
17 pages
CB19442 DT
0% (1)
CB19442 DT
1 page
Unit-1 DBMS LECTURE-1
No ratings yet
Unit-1 DBMS LECTURE-1
28 pages
SongBookEditor 13
No ratings yet
SongBookEditor 13
28 pages
Juan Rosas - Case Study 1 - GitMeal
No ratings yet
Juan Rosas - Case Study 1 - GitMeal
28 pages
Snowflake Cortex
No ratings yet
Snowflake Cortex
9 pages
Software Testing QUESTION BANK ANSWERS
No ratings yet
Software Testing QUESTION BANK ANSWERS
9 pages
Literature Survey
No ratings yet
Literature Survey
3 pages
Evolution of WWW
No ratings yet
Evolution of WWW
24 pages
Loan Prediction
No ratings yet
Loan Prediction
3 pages
TE Comp 2019 I AY23-24 DBMS UT1
No ratings yet
TE Comp 2019 I AY23-24 DBMS UT1
1 page
CH 13
No ratings yet
CH 13
12 pages
Final Report Analysis (2) .
No ratings yet
Final Report Analysis (2) .
16 pages
Adebayo
No ratings yet
Adebayo
5 pages
Lab Report-01DB
No ratings yet
Lab Report-01DB
7 pages
CRM Unit 5
No ratings yet
CRM Unit 5
4 pages
Edward Tufte. Envisioning Information
0% (3)
Edward Tufte. Envisioning Information
1 page
Textbook Exercise G7 M3
No ratings yet
Textbook Exercise G7 M3
4 pages
Call For Papers - IJAIKE Inaugural Issues - Rev3
No ratings yet
Call For Papers - IJAIKE Inaugural Issues - Rev3
2 pages
Business Semantics Management
No ratings yet
Business Semantics Management
3 pages
SQLSTATE (42S02) - Base Table or View Not Found - 1146 Table - Laravel - Io
No ratings yet
SQLSTATE (42S02) - Base Table or View Not Found - 1146 Table - Laravel - Io
6 pages
Databases For Big Data: 2 Exam, FAMNIT
No ratings yet
Databases For Big Data: 2 Exam, FAMNIT
5 pages

Project Stage I Report

Uploaded by

Project Stage I Report

Uploaded by

K. K.

LOAN DEFAULTER PREDICTION USING

06 Manasi Barge [email protected] 8805539779

07 Sanket Padwal [email protected] 9404688630

08 Pratik Desale [email protected] 7588801537

Research Paper Year Author Content

 Determining probability of user liability.

 Data Cleaning and Pre-processing

 Exploratory Data Analysis

 Model building contain algorithm work

 Operating System: - Windows 7/8/10

 Recursive Feature Elimination method to select 30 features with the strongest

Topic Searching and Paper Finding

Project Topic Approval

Understanding Required Technique

Integration with Framework

Testing and Deployment

You might also like