0% found this document useful (0 votes)

5 views31 pages

Loan-Prediction Using Machine Learning

Uploaded by

ak7735205

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views31 pages

Loan-Prediction Using Machine Learning

Uploaded by

ak7735205

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 31

Loan Prediction using

Machine Learning

By
K. Vikramaditya Reddy
Mtech ACS
194609
Content
 Introduction
 The classification problem
 Steps involved in machine learning
 Features
 Labels
 Visualizing data using Google Colab
 Explanation of the Code using Google Colab
 Models of training and testing the dataset
1. Loan prediction using logistic regression
2. Loan prediction using random forest classification
3. Loan prediction using decision tree classification
 Loan Prediction models Comparison
INTRODUCTION
 Loan-Prediction
 Understanding the problem statement is the first and foremost step.
This would help you give an intuition of what you will face ahead of
time. Let us see the problem statement.
 Dream Housing Finance company deals in all home loans. They have
presence across all urban, semi urban and rural areas. Customer first
apply for home loan after that company validates the customer
eligibility for loan. Company wants to automate the loan eligibility
process (real time) based on customer detail provided while filling
online application form. These details are Gender, Marital Status,
Education, Number of Dependents, Income, Loan Amount, Credit
History and others. To automate this process, they have given a
problem to identify the customers segments, those are eligible for loan
amount so that they can specifically target these customers.
The Classification problem
 It is a classification problem where we have to predict whether a
loan would be approved or not. In a classification problem, we
have to predict discrete values based on a given set of
independent variable(s). Classification can be of two types:
 Binary Classification : In this classification we have to predict
either of the two given classes. For example: classifying the
gender as male or female, predicting the result as win or loss, etc.
Multiclass Classification : Here we have to classify the data into
three or more classes. For example: classifying a movie's genre as
comedy, action or romantic, classify fruits as oranges, apples, or
pears, etc.
 Loan prediction is a very common real-life problem that each retail
bank faces atleast once in its lifetime. If done correctly, it can save
a lot of man hours at the end of a retail bank.
Steps involved in machine learning

1 - Data Collection
 The quantity & quality of your data dictate how accurate our model is
 The outcome of this step is generally a representation of data (Guo
simplifies to specifying a table) which we will use for training
 Using pre-collected data, by way of datasets from Kaggle, UCI, etc.,
still fits into this step

2 - Data Preparation
 Wrangle data and prepare it for training
 Clean that which may require it (remove duplicates, correct errors,
deal with missing values, normalization, data type conversions, etc.)
 Randomize data, which erases the effects of the particular order in
which we collected and/or otherwise prepared our data.
Steps involved in machine learning

3 - Choose a Model
 Different algorithms are for different tasks; choose the right
one

4 - Train the Model
 The goal of training is to answer a question or make a
prediction correctly as often as possible
 Linear regression example: algorithm would need to learn
values for m (or W) and b (x is input, y is output)
 Each iteration of process is a training step
Steps involved in machine learning

5 - Evaluate the Model

 Uses some metric or combination of metrics to "measure"
objective performance of model
 Test the model against previously unseen data
 This unseen data is meant to be somewhat representative of
model performance in the real world, but still helps tune the
model (as opposed to test data, which does not)
 Good train/evaluate split 80/20, 70/30, or similar, depending
on domain, data availability, dataset particulars, etc.
Steps involved in machine learning

6 - Parameter Tuning
 This step refers to hyper-parameter tuning, which is an "art form" as
opposed to a science
 Tune model parameters for improved performance
 Simple model hyper-parameters may include: number of training
steps, learning rate, initialization values and distribution, etc.

7 - Make Predictions
 Using further (test set) data which have, until this point, been
withheld from the model (and for which class labels are known), are
used to test the model; a better approximation of how the model will
perform in the real world.
DATASETS

 Here we have two datasets. First is train_dataset.csv,

test_dataset.csv.
 These are datasets of loan approval applications which are
featured with annual income, married or not, dependents are
there or not, educated or not, credit history present or not, loan
amount etc.
 The outcome of the dataset is represented by loan status in the
train dataset.
 This column is absent in test_dataset.csv as we need to assign
loan status with the help of training dataset.
FEATURES PRESENT IN LOAN
PREDICTION
 Loan_ID – The ID number generated by the bank which is giving loan.
 Gender – Whether the person taking loan is male or female.
 Married – Whether the person is married or unmarried.
 Dependents – Family members who stay with the person.
 Education – Educational qualification of the person taking loan.
 Self_Employed – Whether the person is self-employed or not.
 ApplicantIncome – The basic salary or income of the applicant per month.
 CoapplicantIncome – The basic income or family members.
 LoanAmount – The amount of loan for which loan is applied.
 Loan_Amount_Term – How much time does the loan applicant take to pay the loan.
 Credit_History – Whether the loan applicant has taken loan previously from same
bank.
 Property_Area – This is about the area where the person stays ( Rural/Urban).
Labels

 LOAN_STATUS – Based on the mentioned features, the machine

learning algorithm decides whether the person should be give loan or
not.
Visualizing data using google Colab
Visualizing data using google Colab
Visualizing data using google Colab
Visualizing data using google Colab
Visualizing data using google Colab
Visualizing data using google Colab
Explanation of the Code using Google
Colab
 The dataset is trained and tested with 3 methods
1. Loan prediction using logistic regression
2. Loan prediction using random forest classification
3. Loan prediction using decision tree classification
Loan prediction using Logistic Regression
 # take a look at the top 5 rows of the train set, notice the column "Loan_Status"
 train.head()

Applica Coappli Loan_A

Marrie Depend Educati Self_Em LoanAm Credit_ Propert Loan_St
Loan_ID Gender ntIncom cantInc mount_
d ents on ployed ount History y_Area atus
e ome Term

LP00100 Graduat
Male No 0 No 5849 0.0 NaN 360.0 1.0 Urban Y
2 e

LP00100 Graduat
Male Yes 1 No 4583 1508.0 128.0 360.0 1.0 Rural N
3 e

LP00100 Graduat
Male Yes 0 Yes 3000 0.0 66.0 360.0 1.0 Urban Y
5 e

Not
LP00100
Male Yes 0 Graduat No 2583 2358.0 120.0 360.0 1.0 Urban Y
6
e

LP00100 Graduat
Male No 0 No 6000 0.0 141.0 360.0 1.0 Urban Y
8 e
Loan prediction using Logistic Regression
• # take a look at the top 5 rows of the test set, notice the absense of "Loa
n_Status" that we will predict
• test.head()

Applica Coapplic Loan_A

Depend Educati Self_Em LoanAm Credit_H Propert
Loan_ID Gender Married ntIncom antInco mount_T
ents on ployed ount istory y_Area
e me erm

LP00101
Male Yes 0 Graduate No 5720 0 110.0 360.0 1.0 Urban
5

LP00102
Male Yes 1 Graduate No 3076 1500 126.0 360.0 1.0 Urban
2

LP00103
Male Yes 2 Graduate No 5000 1800 208.0 360.0 1.0 Urban
1

LP00103
Male Yes 2 Graduate No 2340 2546 100.0 360.0 NaN Urban
5

LP00105 Not
Male No 0 No 3276 0 78.0 360.0 1.0 Urban
1 Graduate
Loan prediction using Logistic Regression
 # Printing values of whether loan is accepted or rejected
 y_pred [:100]
Loan prediction using Logistic Regression

 Confusion Matrix
Loan prediction using Logistic Regression
# Check Accuracy
from sklearn.metrics import accuracy_score
accuracy_score(y_test,y_pred)

0.8373983739837398

# Applying k-Fold Cross Validation

from sklearn.model_selection import cross_val_score
accuracies = cross_val_score(estimator = classifier, X = X_train, y = y_train, cv = 10)
accuracies.mean()
# accuracies.std()

0.8024081632653062
Loan prediction using random forest classification

 # Printing values of whether loan is accepted or rejected

 y_pred [:100]
Loan prediction using random forest classification

 Confusion matrix
Loan prediction using random forest classification
# Check Accuracy
from sklearn.metrics import accuracy_score
accuracy_score(y_test,y_pred)

0.6910569105691057
# Applying k-Fold Cross Validation
from sklearn.model_selection import cross_val_score

accuracies = cross_val_score(estimator = classifier, X = X_train, y = y_train, cv = 10

)

accuracies.mean()
# accuracies.std()
Loan Prediction using Decision Tree
Classification
# Printing values of whether loan is accepted or rejected
 y_pred[:100]
Loan Prediction using Decision Tree Classification
 Confusion Matrix
Loan Prediction using Decision Tree
Classification
# Check Accuracy
from sklearn.metrics import accuracy_score
accuracy_score(y_test,y_pred)

0.8292682926829268
# Applying k-Fold Cross Validation
from sklearn.model_selection import cross_val_score
accuracies = cross_val_score(estimator = classifier, X = X_train, y = y_train, cv = 10)

accuracies.mean()
# accuracies.std()

0.7922448979591836
Loan prediction models comparison
Loan Prediction Accuracy Accuracy using K-fold
Cross Validation

Using Logistic Regression 0.8373983739837398 0.8024081632653062

Using Random Forest 0.6910569105691057 0.7148163265306122

Classification

Using Decision Tree 0.8292682926829268 0.7922448979591836

Classification

This means that from the above accuracy table, we can conclude that logistic regression
is best model for the loan prediction problem.
THANK YOU

Practical Exam Instructions: Canadian Welding Bureau
No ratings yet
Practical Exam Instructions: Canadian Welding Bureau
4 pages
Loan Prediction Using Machine Learning
No ratings yet
Loan Prediction Using Machine Learning
29 pages
Loan Eligibility Prediction: Machine Learning
100% (1)
Loan Eligibility Prediction: Machine Learning
8 pages
Loan Approval
No ratings yet
Loan Approval
12 pages
Project Stage I Report
No ratings yet
Project Stage I Report
17 pages
Loan Prediction
No ratings yet
Loan Prediction
20 pages
Paper 3
No ratings yet
Paper 3
5 pages
Ihic-2022 PPT Paper - Id 100
No ratings yet
Ihic-2022 PPT Paper - Id 100
11 pages
Loan Eligibility Prediction
No ratings yet
Loan Eligibility Prediction
12 pages
Loan Approval - PPT
No ratings yet
Loan Approval - PPT
19 pages
Paper 1
No ratings yet
Paper 1
10 pages
For Loan Approval Prediction
100% (1)
For Loan Approval Prediction
14 pages
School of Information Technology and Engineering M.Tech Software Engineering (Integrated) FALL SEMESTER 2020 - 2021
No ratings yet
School of Information Technology and Engineering M.Tech Software Engineering (Integrated) FALL SEMESTER 2020 - 2021
36 pages
Research Paper ALAS
No ratings yet
Research Paper ALAS
4 pages
Loan Eligibility Prediction
No ratings yet
Loan Eligibility Prediction
14 pages
Synopsis of Lep 01
No ratings yet
Synopsis of Lep 01
8 pages
Loan Approval Model Prediction
No ratings yet
Loan Approval Model Prediction
10 pages
minipptPOWER 1pdf
No ratings yet
minipptPOWER 1pdf
16 pages
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
No ratings yet
Prediction of Modernized Loan Approval System Based On Machine Learning Approach
22 pages
Loan Prediction Using Artificial Intelligence and Machine Learning
No ratings yet
Loan Prediction Using Artificial Intelligence and Machine Learning
24 pages
B2 19bec113 19bec116 Loan Prediction
No ratings yet
B2 19bec113 19bec116 Loan Prediction
3 pages
Research Paper
No ratings yet
Research Paper
14 pages
IJNRD2407179
No ratings yet
IJNRD2407179
7 pages
Ranvijay 12203409
No ratings yet
Ranvijay 12203409
13 pages
Predicting Personal Loan Approval Using Machine Learning Handbook
No ratings yet
Predicting Personal Loan Approval Using Machine Learning Handbook
31 pages
Loan Prediction Using Artificial Intelligence and Machine Learning
No ratings yet
Loan Prediction Using Artificial Intelligence and Machine Learning
23 pages
Research Report
No ratings yet
Research Report
8 pages
Loan Approval Prediction Using Machine Learning
No ratings yet
Loan Approval Prediction Using Machine Learning
8 pages
Assessment Report Richa
No ratings yet
Assessment Report Richa
12 pages
Project Review I Final Pid 02
No ratings yet
Project Review I Final Pid 02
9 pages
D.sce Project
No ratings yet
D.sce Project
28 pages
Loan Prediction Project Report
No ratings yet
Loan Prediction Project Report
3 pages
ML Report1
No ratings yet
ML Report1
19 pages
2022 V13i876
No ratings yet
2022 V13i876
9 pages
Reasearchby AK0102
No ratings yet
Reasearchby AK0102
7 pages
Finance Project Proposal
No ratings yet
Finance Project Proposal
7 pages
IJSRDV8I80146
No ratings yet
IJSRDV8I80146
6 pages
Synopsis: Loan Prediction Stsyem
No ratings yet
Synopsis: Loan Prediction Stsyem
8 pages
Anu Internshipreport
No ratings yet
Anu Internshipreport
28 pages
Report
No ratings yet
Report
15 pages
SSRN Id4532468
No ratings yet
SSRN Id4532468
13 pages
The Loan Prediction Using Machine Learning
No ratings yet
The Loan Prediction Using Machine Learning
9 pages
Loan
No ratings yet
Loan
4 pages
Unit 1 ML PDF
No ratings yet
Unit 1 ML PDF
19 pages
ML and Ai Synopsis
No ratings yet
ML and Ai Synopsis
8 pages
Machine Learning Part: Domain Overview
No ratings yet
Machine Learning Part: Domain Overview
20 pages
Loan Prediction 10
No ratings yet
Loan Prediction 10
10 pages
Loan Approval Prediction: Internship Project Report On
No ratings yet
Loan Approval Prediction: Internship Project Report On
22 pages
Wa0001.
No ratings yet
Wa0001.
8 pages
Python Code For Loan Default Prediction
No ratings yet
Python Code For Loan Default Prediction
4 pages
Paper 4
No ratings yet
Paper 4
9 pages
Presentation 13
No ratings yet
Presentation 13
8 pages
Loan Prediction
No ratings yet
Loan Prediction
3 pages
Literature Survey
No ratings yet
Literature Survey
3 pages
Report 2
No ratings yet
Report 2
26 pages
SSRN 5088929
No ratings yet
SSRN 5088929
11 pages
Perform Prediction Using Regression Algorithm: Ex No: 1 Date
No ratings yet
Perform Prediction Using Regression Algorithm: Ex No: 1 Date
13 pages
(IJCST-V9I3P21) :sanket Bhattad, Sumit Bawane, Shweta Agrawal, Unnati Ramteke, Dr. P. B. Ambhore
No ratings yet
(IJCST-V9I3P21) :sanket Bhattad, Sumit Bawane, Shweta Agrawal, Unnati Ramteke, Dr. P. B. Ambhore
4 pages
Fin Irjmets1651834789
No ratings yet
Fin Irjmets1651834789
8 pages
Gupta 2020
No ratings yet
Gupta 2020
4 pages
JSON Web Token Complete Self-Assessment Guide
From Everand
JSON Web Token Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Blue Modern Simple Corporate Presentation
No ratings yet
Blue Modern Simple Corporate Presentation
7 pages
Resume 1
No ratings yet
Resume 1
1 page
Irfan Resume
No ratings yet
Irfan Resume
1 page
Jake S Resume Anonymous
No ratings yet
Jake S Resume Anonymous
1 page
The Fit of Hollands RIASEC Model To US Occupation
No ratings yet
The Fit of Hollands RIASEC Model To US Occupation
23 pages
BSC Sem 3 & 4 (Major-Minor-MDC-SEC) Medical Laboratory Syllabus From 2024-25 (DT 13-05-2024)
No ratings yet
BSC Sem 3 & 4 (Major-Minor-MDC-SEC) Medical Laboratory Syllabus From 2024-25 (DT 13-05-2024)
24 pages
Diarrhea
No ratings yet
Diarrhea
35 pages
Chapt 1
No ratings yet
Chapt 1
38 pages
Venkatesh Resume
No ratings yet
Venkatesh Resume
2 pages
Theories of Morality Chart
No ratings yet
Theories of Morality Chart
1 page
RMCT Assignment
100% (1)
RMCT Assignment
10 pages
Pedoman Penulisan Karya Ilmiah UPI 2013
No ratings yet
Pedoman Penulisan Karya Ilmiah UPI 2013
6 pages
Homework s13
No ratings yet
Homework s13
14 pages
Grade Thresholds - June 2024: Cambridge IGCSE Physics (0625)
No ratings yet
Grade Thresholds - June 2024: Cambridge IGCSE Physics (0625)
2 pages
Change Management or Organization Development
No ratings yet
Change Management or Organization Development
3 pages
Mobile Robot SLAM Methods Improved For Adapting To Search and Rescue Environments
No ratings yet
Mobile Robot SLAM Methods Improved For Adapting To Search and Rescue Environments
6 pages
5 Resources For English Language Teachers - Cambridge English
No ratings yet
5 Resources For English Language Teachers - Cambridge English
7 pages
Image Segmentation DeepLearning
No ratings yet
Image Segmentation DeepLearning
18 pages
Nomination Form18
No ratings yet
Nomination Form18
6 pages
Aim High 6 Teachers Book
No ratings yet
Aim High 6 Teachers Book
129 pages
Questionq and Answers 2024
No ratings yet
Questionq and Answers 2024
15 pages
Aamir Resume Retail
0% (1)
Aamir Resume Retail
3 pages
Assignment (JELLA MAE YCALINA)
No ratings yet
Assignment (JELLA MAE YCALINA)
2 pages
Consul Personality
No ratings yet
Consul Personality
19 pages
CHAPTER 7: Managing Change and Disruptive Innovation
No ratings yet
CHAPTER 7: Managing Change and Disruptive Innovation
7 pages
A Historiographical Survey of Literacy in Britain Between 1780 and 1830 Devon Lemire
No ratings yet
A Historiographical Survey of Literacy in Britain Between 1780 and 1830 Devon Lemire
14 pages
The 221 - Systematic Theology I Syllabus - Fall 2015
No ratings yet
The 221 - Systematic Theology I Syllabus - Fall 2015
2 pages
Principles and Practice of Pedodontics 2nd Edition by Arathi Rao ISBN 8184483457 9788184483451 PDF Download
No ratings yet
Principles and Practice of Pedodontics 2nd Edition by Arathi Rao ISBN 8184483457 9788184483451 PDF Download
83 pages
PedagogySyllabus F11
No ratings yet
PedagogySyllabus F11
3 pages
Microsoft Windows Server 2016 Licensing
No ratings yet
Microsoft Windows Server 2016 Licensing
2 pages
The Problem and Its Background: Thesis Title: Learning Virtues Through Literary Selections in English
No ratings yet
The Problem and Its Background: Thesis Title: Learning Virtues Through Literary Selections in English
12 pages
Unit 2 Univariate Data Unit Plan
No ratings yet
Unit 2 Univariate Data Unit Plan
5 pages
Test Automation Framework & Design For XXXXX Project: Author: XXXXXX
No ratings yet
Test Automation Framework & Design For XXXXX Project: Author: XXXXXX
14 pages

Loan-Prediction Using Machine Learning

Uploaded by

Loan-Prediction Using Machine Learning

Uploaded by

Loan Prediction using

5 - Evaluate the Model

 Here we have two datasets. First is train_dataset.csv,

 LOAN_STATUS – Based on the mentioned features, the machine

Applica Coappli Loan_A

Applica Coapplic Loan_A

# Applying k-Fold Cross Validation

 # Printing values of whether loan is accepted or rejected

accuracies = cross_val_score(estimator = classifier, X = X_train, y = y_train, cv = 10

Using Logistic Regression 0.8373983739837398 0.8024081632653062

Using Random Forest 0.6910569105691057 0.7148163265306122

Using Decision Tree 0.8292682926829268 0.7922448979591836

You might also like