Microsoft Azure Machine Learning Project To Predict Likelihood of Good Credit of Customer

The document summarizes a machine learning project that analyzed customer data to predict credit status. It used Python, R and SQL to clean the data by removing unnecessary columns. A two-class decision forest model was trained on the cleaned data with bagging, 50 decision trees of maximum depth 32, 32 random splits per node, and 4 minimum samples per leaf node. The model achieved an accuracy of 77.9% in predicting customer credit.

Uploaded by

api-355102227

Microsoft Azure Machine Learning Project

By: Shubham Dwivedi

Report:

Model Output
The model predicts whether a customer is likely to have good or bad credit with an accuracy of 77.9%.
Python, R and SQL were used for data modification and feature engineering.

Inputs:
The data set contains data on 950 customer cases. Its columns are 20 features (data columns
which can be used to train a machine learning model) and the label (the column indicating
the actual credit status of the customers).

Selecting the second column, labeled Duration, displays some properties of that feature
(data column) on the right side of the display. These properties include summary statistics
and the data type, as shown here:
Label: CreditStatus (0,1)
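
Summary statistics of this kind can also be computed outside the studio. The sketch below is only an illustration: the Duration values are hypothetical placeholders, not taken from the data set, and the choice of statistics is an assumption about what is useful to inspect.

```python
import statistics


def summarize(values):
    """Return basic summary statistics and the value type for one column."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "dtype": type(values[0]).__name__,
    }


# Hypothetical Duration values, used only to exercise the function.
duration = [6, 12, 12, 24, 36, 48]
summary = summarize(duration)
print(summary)
```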

Data Transformation:

As part of data transformation, the following columns are removed: Housing, SexAndStatus,
OtherDetorsGuarantors, OtherInstalments and ExistingCreditsAtBank.

I used a Python script and an R script to drop the mentioned columns; the SQL query selects the columns that are kept.

Python Code:

def azureml_main(creditframe):
    # Columns to remove from the input data frame
    drop_cols = ['SexAndStatus',
                 'OtherDetorsGuarantors']
    creditframe.drop(drop_cols, axis = 1, inplace = True)
    return creditframe

R Code:

credit.frame <- maml.mapInputPort(1)
# Columns to remove from the input data frame
drop.cols <- c('OtherInstalments',
               'ExistingCreditsAtBank')
out.frame <- credit.frame[, !(names(credit.frame) %in% drop.cols)]
maml.mapOutputPort("out.frame")

SQL:

select
    CheckingAcctStat,
    Duration,
    CreditHistory,
    Purpose,
    Savings,
    Employment,
    InstallmentRatePecnt,
    PresentResidenceTime,
    Property,
    Age,
    Telephone,
    CreditStatus
from t1;
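
The effect of the select statement can be checked locally. The following is a minimal sketch using Python's built-in sqlite3 module; it assumes a table named t1 as in the query, and the single row of placeholder values and the choice of Housing as the extra column are hypothetical, used only to show that unlisted columns do not appear in the result.

```python
import sqlite3

# Columns kept by the select statement in the report.
KEEP_COLS = [
    "CheckingAcctStat", "Duration", "CreditHistory", "Purpose", "Savings",
    "Employment", "InstallmentRatePecnt", "PresentResidenceTime",
    "Property", "Age", "Telephone", "CreditStatus",
]
# Stands in for the columns that the query leaves out.
EXTRA_COLS = ["Housing"]

all_cols = KEEP_COLS + EXTRA_COLS
conn = sqlite3.connect(":memory:")
conn.execute(f"create table t1 ({', '.join(all_cols)})")

# One hypothetical row: every value is a placeholder.
placeholders = ", ".join("?" for _ in all_cols)
conn.execute(f"insert into t1 values ({placeholders})", list(range(len(all_cols))))

cursor = conn.execute(f"select {', '.join(KEEP_COLS)} from t1")
result_cols = [column[0] for column in cursor.description]
rows = cursor.fetchall()
conn.close()

print(result_cols)
```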

Creating and Evaluating a Machine Learning Model

The Two-Class Decision Forest module (classification) is used to train the model, with the following settings:

Resampling method: Bagging
Create trainer mode: Single Parameter
Number of Decision trees: 50
Maximum depth of the decision tree: 32
Number of random splits per node: 32
Minimum number of samples per leaf node: 4
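
To make the Bagging setting concrete, here is a rough sketch of the idea rather than the module's implementation: each tree is trained on a bootstrap sample (rows drawn with replacement), and the trees' outputs are then combined. Tree training itself is omitted, the simple majority vote is an assumed simplification of the aggregation step, and the seed and vote counts are hypothetical.

```python
import random
from collections import Counter

NUM_TREES = 50   # "Number of Decision trees" from the settings above
NUM_CASES = 950  # customer cases in the data set


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement (one bagging resample)."""
    return [rng.choice(rows) for _ in rows]


def majority_vote(votes):
    """Return the most common class label among the votes."""
    return Counter(votes).most_common(1)[0][0]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
row_ids = list(range(NUM_CASES))

# One resample per tree; each tree would be trained on its own sample.
samples = [bootstrap_sample(row_ids, rng) for _ in range(NUM_TREES)]

# Hypothetical votes from 50 trees for a single customer.
votes = [1] * 30 + [0] * 20
prediction = majority_vote(votes)
print(prediction)
```

Because rows are drawn with replacement, each sample repeats some cases and leaves others out, which is what makes the individual trees differ from one another.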
Once the model is trained, it is scored and evaluated. The experiment looks like this:

Evaluation Results:
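
Accuracy is the share of scored cases whose predicted label matches the actual CreditStatus. A minimal sketch of the calculation follows; the confusion-matrix counts are hypothetical and are not the results of this experiment.

```python
def accuracy(tp, tn, fp, fn):
    """Fraction of correct predictions from confusion-matrix counts."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("no scored cases")
    return (tp + tn) / total


# Hypothetical counts: 80 correct predictions out of 100 scored cases.
example = accuracy(tp=50, tn=30, fp=12, fn=8)
print(example)  # 0.8
```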
