0% found this document useful (0 votes)

48 views25 pages

Assignment - 3 - Data Analytics

Uploaded by

Learners Hub

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views25 pages

Assignment - 3 - Data Analytics

Uploaded by

Learners Hub

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 25

Credit Score

Model Airtime
for

Loans
USING MACHINE LEARNING
TECHNIQUES
Group
Roll No Name
21PGPEX-02 Abhishek Kumar

21PGPEX-19 Ketan Jain

21PGPEX-21 Kritika Sharma

21PGPEX-23 Maharaja E
Introduction

Literature Review

Agenda Methodology

Results

Conclusions
Introduction
AIRTIME LOAN – A NEW BUSINESS OPPORTUNITY FOR MOBILE NETWORK
OPERATORS:
• Airtime is becoming a basic commodity in developing countries.

• Failure to have sufficient air time is a challenge to many customers.

• Opportunity to offer short-term airtime loans @ 10%.

• Risk of default need to be analysed.

• Risk transcends to 3rd party loan providers / MNOs.

Default on Loan: Risk Mitigation

To mitigate this risk, credit scoring models are required to assess the capability of the
customer to pay a certain amount within the specified period.
Credit Score Models
( Estimated using a variety of historical personal and financial data obtained from customers. )

Advantages : Challenges :
• Enables faster credit decisions. • Large population of unbanked adults, data
• Reduces the cost of credit analysis. are not readily available.
• Need to search for alternative datasets in
• Monitors the portfolio of existing
order to determine whether a customer.
accounts.
• Available Data - Customer’s calls and
recharge history.
Airtime Lending Industry
COMZAFRICA ( A Micro lending firm , Africa)

Airtime Credit Service (ACS) allows users to easily access airtime on a credit basis from wherever
they are at anytime, day or night.
Literature Review
Predictor factors in credit scoring model
Challenges
• Customer details are not available due to customer privacy
constraints.
• Selection of appropriate data for model to predict effectively
– cross validation of data is needed.
• Building a model without customer details is a challenge.

• Limitation of study due to factors availability only for loan

details and customer behaviour.
• Earlier models did not consider – Multiple loans taken by a
customer, loan duration, age in network.
Methodology
Feature selection in Model
Predictor factors considered for study:
• Loan amount
• Number of recharges for each
• Usage amount
• Activation date
• Date when loan was taken
• Date of loan payment
• Total amount used every month
Feature Construction
• Loan count (how many loans the customer has at any time).

• Loan duration (how long the customer took to repay the loan).

• Age on network (how long the customer has been with the MNO).

• Loan month (the month that the loan was taken).

Evaluation techniques in model
Machine learning models

• Logistic regression (LR) – Linear model

• Decision Tree (DT) – Non-linear model
• Random Forest (RF) – Non-linear model
Evaluation
• Cost of default is much greater than benefit of customer re-paying – Factor of 10.

• Accuracy not a relevant performance metric in the model.

• Most important to correct predict, when customer defaults.

• Specificity is key performance metric for model.

• Specificity = TN/(TN + FP)
Cross validation considerations

• Out of sample data for model is recommended. Test dataset is not part of building model
from train data.
• Two prediction classes highly imbalance – repaid & default (low percentage of defaults).

• To avoid bias in model, train & test datasets need to have equal representation 50-50% of
both prediction classes – repaid & default records.
Cross validation scenarios
CV1
• Loans are divided in a ratio of 70:30 randomly without considering any variable.
• Some loans of same customer can be considered in both train & test data.
• loan in future can be used to predict a loan default status in past.
• Default records is very small representation in train & test data.
• This will create bias in model to predict customer defaults (TN).
CV2
• Loans are divided in a ratio of 70:30 randomly based on customer.
• Customer bias from CV1 is eliminated from model.
• Does not address the time issue of loan (past versus future).
• Default records is very small representation in train & test data.
• This will create bias in model to predict customer defaults (TN).
CV3

• Loans for each customer are segregated and latest loan status taken in train / test data.
Ratio of default & non-default customers are maintained 50-50% to create balanced
dataset and then split into 70:30.
• No repeat customers & no time continuity problem.
Results
Model CV1 & CV2
• High Accuracy

• Low Specificity

• Unable to predict the

customers who will default.
Model CV3
• Accuracy lower than Model C1 and C2.
• High Specificity.
• DT and RF outperform LR because of non-linearity in
model.
• the predictions for loans repaid is correct for 85% and
incorrect for 15%.
• The predictions for loans defaulted are correct for 80%
and incorrect for 20%.
Business Implications
Default Rate as low as 0.01% -> accept all the loan requests.
When default rate increases > 2%, company can generate more
profits by using the model.
Without Model : company breaks even at zero profit at a default
rate of 8%.
With Model: company can leverage profits to loan defaults >=
32% (Tolerance limit is increased).
Conclusions
• Obtaining customer details from the MNOs would improve performance.

• For a classification problem with an imbalanced number of categories, specificity is a

better measure.

• For credit scoring, correct handling of the time of loan disbursement and customer
identity are crucial to avoid over-fitting and unrealistically high estimates of accuracy.

• Random forest was the best classifier with an accuracy of 82.3% which showed that
nonlinearity and an ensemble approach was superior.
Conclusions
• When the default rate is low, it is better to offer the loans to every customer.

• With increasing default rates, a point is eventually reached whereby the model will outperform
this simple approach of offering loans to everyone.

• The maximum tolerable default rate is increased by the optimal model to 32% compared with 8%
when the company does not use a model.

• The methodology and approach studied in this paper are also relevant for a wide range of pay-as-
you-go mobile products where credit is offered for basic services: electricity tokens, smart water
meters; smart cooking devices and solar energy.
Thank you

Non Graded Music 0 Q3 W8 03 15 22
No ratings yet
Non Graded Music 0 Q3 W8 03 15 22
11 pages
Driving School Monitoring System
No ratings yet
Driving School Monitoring System
54 pages
Cranial Nerves: in Health and Disease
No ratings yet
Cranial Nerves: in Health and Disease
2 pages
MGEB02-2019 Syllabus
100% (1)
MGEB02-2019 Syllabus
6 pages
Inclusive Physical Education - Preschool
No ratings yet
Inclusive Physical Education - Preschool
16 pages
Use of Machine Learning Techniques To Create A Credit Score Model For Airtime Loans
No ratings yet
Use of Machine Learning Techniques To Create A Credit Score Model For Airtime Loans
11 pages
Dictionary of Credit Risk Business Terms - EXTRACT
From Everand
Dictionary of Credit Risk Business Terms - EXTRACT
Steve Preece
No ratings yet
Xtreme Boosting Machine
No ratings yet
Xtreme Boosting Machine
5 pages
CUSTOMER CENTRICITY & GLOBALISATION: PROJECT MANAGEMENT: MANUFACTURING & IT SERVICES
From Everand
CUSTOMER CENTRICITY & GLOBALISATION: PROJECT MANAGEMENT: MANUFACTURING & IT SERVICES
Chandra Sekar
No ratings yet
Credit Risk Management Using ML
No ratings yet
Credit Risk Management Using ML
4 pages
Coser Al. Crisan Albu (T)
No ratings yet
Coser Al. Crisan Albu (T)
17 pages
Ajol-File-Journals 543 Articles 255840 650d5184b77f4
No ratings yet
Ajol-File-Journals 543 Articles 255840 650d5184b77f4
14 pages
Hp1047, Vmr286 Loan Default Prediction Final Report
No ratings yet
Hp1047, Vmr286 Loan Default Prediction Final Report
8 pages
Loan Approval Prediction Using DM Techniques: Pusendra Chaudhary, Sumit Chaudhary, Arpan Mahatra
No ratings yet
Loan Approval Prediction Using DM Techniques: Pusendra Chaudhary, Sumit Chaudhary, Arpan Mahatra
8 pages
Project Documents
No ratings yet
Project Documents
9 pages
Ajol-File-Journals 387 Articles 263414 65b236d58cc5e
No ratings yet
Ajol-File-Journals 387 Articles 263414 65b236d58cc5e
8 pages
Tax Delcon Research Paper
No ratings yet
Tax Delcon Research Paper
10 pages
Globalisation Trends
From Everand
Globalisation Trends
Chandra Sekar
No ratings yet
Credit Loan Default Prediction Based On Data Mining
No ratings yet
Credit Loan Default Prediction Based On Data Mining
4 pages
Algorithm Comparison For Data Mining Classification: Assessing Bank Customer Credit Scoring Default Risk
No ratings yet
Algorithm Comparison For Data Mining Classification: Assessing Bank Customer Credit Scoring Default Risk
10 pages
金融违约笔记
No ratings yet
金融违约笔记
10 pages
Network-Aware Credit Scoring System For Telecom Subscribers Using Machine Learning and Network Analysis
No ratings yet
Network-Aware Credit Scoring System For Telecom Subscribers Using Machine Learning and Network Analysis
21 pages
Nazreen - CIA 2 Applied Data Mining and Big Data
No ratings yet
Nazreen - CIA 2 Applied Data Mining and Big Data
5 pages
Behavior Revealed in Mobile Phone Usage Predicts Credit Repayment
No ratings yet
Behavior Revealed in Mobile Phone Usage Predicts Credit Repayment
28 pages
1 PB
No ratings yet
1 PB
13 pages
ML Implementation in Lending and Credit Scoring in Rural Areas
No ratings yet
ML Implementation in Lending and Credit Scoring in Rural Areas
24 pages
Project Stage I Report
No ratings yet
Project Stage I Report
17 pages
An Automatic Credit Analysis Model
No ratings yet
An Automatic Credit Analysis Model
12 pages
A Comparative Study of Forecasting Corporate Credit Ratings Using Neural Networks, Support Vector Machines, and Decision Trees
No ratings yet
A Comparative Study of Forecasting Corporate Credit Ratings Using Neural Networks, Support Vector Machines, and Decision Trees
40 pages
November 2010)
No ratings yet
November 2010)
6 pages
Loan Default Prediction System
No ratings yet
Loan Default Prediction System
44 pages
Machinelearning
No ratings yet
Machinelearning
24 pages
Credit Scoring For Microfinance Using Behavioral Data in Emerging Markets
No ratings yet
Credit Scoring For Microfinance Using Behavioral Data in Emerging Markets
25 pages
Loan Prediction System Using Machine Learning
No ratings yet
Loan Prediction System Using Machine Learning
4 pages
The VIth International Conference Advanced Information Systems and Technologies, AIST 2018
No ratings yet
The VIth International Conference Advanced Information Systems and Technologies, AIST 2018
4 pages
Final - Bank Customer Response Prediction Model
No ratings yet
Final - Bank Customer Response Prediction Model
23 pages
Performance Evaluation of Credit Risk Models
No ratings yet
Performance Evaluation of Credit Risk Models
11 pages
Lending Club Data Analysis PDF
No ratings yet
Lending Club Data Analysis PDF
3 pages
Case Studies 2024 - 2025 ODD SEM
No ratings yet
Case Studies 2024 - 2025 ODD SEM
61 pages
Irjet V12i425
No ratings yet
Irjet V12i425
7 pages
Survival Analysis
No ratings yet
Survival Analysis
6 pages
Transition Matrix Models of Consumer Credit Ratings
No ratings yet
Transition Matrix Models of Consumer Credit Ratings
27 pages
Capstone Project Report v1 - Abhishek Bihani
No ratings yet
Capstone Project Report v1 - Abhishek Bihani
16 pages
Presentation - Women Micro Bank
No ratings yet
Presentation - Women Micro Bank
16 pages
Project Lit Final1
No ratings yet
Project Lit Final1
15 pages
ssrn-5363683
No ratings yet
ssrn-5363683
14 pages
Credit Scoring For VN Retail Banking
No ratings yet
Credit Scoring For VN Retail Banking
36 pages
Predicting Consumer Default
No ratings yet
Predicting Consumer Default
71 pages
Qtmfinalpresentationpaper
No ratings yet
Qtmfinalpresentationpaper
19 pages
Bank Alliance
No ratings yet
Bank Alliance
18 pages
10.3934 Dsfe.2024009
No ratings yet
10.3934 Dsfe.2024009
14 pages
CustomerChurnPrediction ProjectReport 2555425555
No ratings yet
CustomerChurnPrediction ProjectReport 2555425555
19 pages
Loan Default Risk Assessment Using Supervised Learning
No ratings yet
Loan Default Risk Assessment Using Supervised Learning
7 pages
Modelling Credit Risk of Portfolio of Consumer Loans - 2010
No ratings yet
Modelling Credit Risk of Portfolio of Consumer Loans - 2010
11 pages
SSRN Id3769854
No ratings yet
SSRN Id3769854
8 pages
Sat - 90.Pdf - Prediction of Bank Customer Churn Using Machine Learning Technique
No ratings yet
Sat - 90.Pdf - Prediction of Bank Customer Churn Using Machine Learning Technique
11 pages
JRFM 18 00023
No ratings yet
JRFM 18 00023
20 pages
Evaluation of Using Big Data For Credit Ratings
No ratings yet
Evaluation of Using Big Data For Credit Ratings
22 pages
J Ijsd 20241002 11
No ratings yet
J Ijsd 20241002 11
9 pages
Credit Risk Analysis in Peer-to-Peer Lending System: September 2016
No ratings yet
Credit Risk Analysis in Peer-to-Peer Lending System: September 2016
5 pages
2022 V13i1198
No ratings yet
2022 V13i1198
12 pages
2818-Article Text-5218-1-10-20210411
No ratings yet
2818-Article Text-5218-1-10-20210411
5 pages
Credit Scoring Through Data Mining Approach A Case Study of Mortgage Loan in Indonesia
No ratings yet
Credit Scoring Through Data Mining Approach A Case Study of Mortgage Loan in Indonesia
5 pages
PA v0.7
No ratings yet
PA v0.7
15 pages
The Art of Maximizing Debt Collections: Digitization, Analytics, AI, Machine Learning and Performance Management
From Everand
The Art of Maximizing Debt Collections: Digitization, Analytics, AI, Machine Learning and Performance Management
Darryl D'Souza
No ratings yet
Cloud Cost Estimator
No ratings yet
Cloud Cost Estimator
2 pages
Animal+Behaviour Practicle File
No ratings yet
Animal+Behaviour Practicle File
130 pages
Ellis, R. (2005) Instructed Language Learning
100% (1)
Ellis, R. (2005) Instructed Language Learning
16 pages
Induction Program Schedule 2025 2026 1
No ratings yet
Induction Program Schedule 2025 2026 1
2 pages
Department of Education Division City Schools Manila: National Capital Region
No ratings yet
Department of Education Division City Schools Manila: National Capital Region
8 pages
Nursing Care Plan - Risk For Falls (Antepartum)
No ratings yet
Nursing Care Plan - Risk For Falls (Antepartum)
2 pages
Vlsi Technology Lesson Plan
No ratings yet
Vlsi Technology Lesson Plan
4 pages
A Critical Theory of Medical Discourse
No ratings yet
A Critical Theory of Medical Discourse
21 pages
Waa Aniga Iyo Adiga
No ratings yet
Waa Aniga Iyo Adiga
3 pages
Improve WT 1 - 5 - Maps - Changes - KEY
No ratings yet
Improve WT 1 - 5 - Maps - Changes - KEY
3 pages
Group-F Yogesh
No ratings yet
Group-F Yogesh
170 pages
CV-Ismail Zabi Ull
No ratings yet
CV-Ismail Zabi Ull
2 pages
Curriculum - Vitae: Career Objective
No ratings yet
Curriculum - Vitae: Career Objective
3 pages
Zahoor's Resume 27 Courses
No ratings yet
Zahoor's Resume 27 Courses
3 pages
Unit 6 Know How To Support Clients Who Take Part in Exercise and Physical Activity
No ratings yet
Unit 6 Know How To Support Clients Who Take Part in Exercise and Physical Activity
6 pages
Motives For Adult Participation in Physical Activity: Type of Activity, Age, and Gender
No ratings yet
Motives For Adult Participation in Physical Activity: Type of Activity, Age, and Gender
12 pages
한국어 어휘 교육 방안 연구의 동향 분석 : 학위논문을 중심으로
No ratings yet
한국어 어휘 교육 방안 연구의 동향 분석 : 학위논문을 중심으로
27 pages
The Balanced Scorecard: A Foundation For The Strategic Management of Information Systems
No ratings yet
The Balanced Scorecard: A Foundation For The Strategic Management of Information Systems
18 pages
Mits II Sem Result 2009
No ratings yet
Mits II Sem Result 2009
137 pages
Oral Communication Remedial Exam
100% (8)
Oral Communication Remedial Exam
1 page
UGX Tuition Structure 2023
No ratings yet
UGX Tuition Structure 2023
1 page
Understanding Roles, Responsibilities and Relationships in Education and Training
No ratings yet
Understanding Roles, Responsibilities and Relationships in Education and Training
12 pages
Speech Restructuring Programmes
No ratings yet
Speech Restructuring Programmes
10 pages
Financial Markets and Institutions 7th Edition Frederic S Mishkin
No ratings yet
Financial Markets and Institutions 7th Edition Frederic S Mishkin
316 pages
bk978 0 7503 4957 4ch0
No ratings yet
bk978 0 7503 4957 4ch0
14 pages