Microsoft Azure Machine Learning Project To Predict Likelihood of Good Credit of Customer
Microsoft Azure Machine Learning Project To Predict Likelihood of Good Credit of Customer
Report:
Model Output
Predicted with an accuracy of 77.9% whether customer is likely to have good or bad credit.
Used Python, R, SQL for data modification and feature engineering.
Inputs:
Analyzed the data set noting that it contains data on 950 customer cases. There are column
headers- 20 features (data columns which can be used to train a machine learning model)
and the label (the column indicating the actual credit status of the customers).
The second column labeled Duration, which will display some properties of that feature
(data column) on the right side of the display. These properties include summary statistics
and the data type, as shown here:
Label: CreditStatus (0,1)
Data Transformation:
As part of Data transformation, we will be removing some of the columns which are
as follows: Housing, SexAndStatus , OtherDetorsGuarantors, OtherInstalments and
ExistingCreditsAtBank
I have used the python Scripts and R script to drop the mentioned columns
Python Code:
def azureml_main(creditframe):
drop_cols = ['SexAndStatus',
'OtherDetorsGuarantors']
creditframe.drop(drop_cols, axis = 1, inplace = True)
return creditframe
R Code:
credit.frame <- maml.mapInputPort(1)
drop.cols <- c('OtherInstalments',
'ExistingCreditsAtBank')
out.frame <- credit.frame[, !(names(credit.frame) %in% drop.cols)]
maml.mapOutputPort("out.frame")
SQL:
select
CheckingAcctStat,
Duration,
CreditHistory,
Purpose,
Savings,
Employment,
InstallmentRatePecnt,
PresentResidenceTime,
Property,
Age,
Telephone,
CreditStatus
from t1;