The document discusses building and evaluating a decision tree model to predict loan defaults. It covers preparing training and test data by random sampling, using the C5.0 algorithm to train a decision tree model on the training data, and then evaluating the model's performance on the test data by calculating accuracy and error rates.
Decision Tree Using R
Decision Tree
Exploring and preparing the data
Data preparation – creating random training and test datasets

• If the data had been sorted in a random order, we could simply divide the dataset into two portions, taking the first 90 percent of records for training and the remaining 10 percent for testing.
• In contrast, the credit dataset is not randomly ordered, making the prior approach unwise.
• Suppose that the bank had sorted the data by the loan amount, with the largest loans at the end of the file.
• If we used the first 90 percent for training and the remaining 10 percent for testing, we would be training a model on only the small loans and testing it on the big loans. Obviously, this could be problematic.
• We'll solve this problem by using a random sample of the credit data for training.
• A random sample is simply a subset of records selected at random.
• In R, the sample() function is used to perform random sampling.
• However, before putting it into action, a common practice is to set a seed value, which causes the randomization process to follow a sequence that can be replicated later on if desired.

Training a model on the data

• We will use the C5.0 algorithm in the C50 package to train our decision tree model.
• For the first iteration of our credit approval model, we'll use the default C5.0 configuration.
• The 17th column in credit_train is the default class variable, so we need to exclude it from the training data frame, but supply it as the target factor vector for classification.

The first branches of the resulting tree can be read in plain language:

1. If the checking account balance is unknown or greater than 200 DM, then classify as "not likely to default."
2. Otherwise, if the checking account balance is less than zero DM or between one and 200 DM,
3. and the credit history is perfect or very good, then classify as "likely to default."
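The split-and-train steps above can be sketched in R as follows. This is a minimal illustration, not the document's exact code: the object names credit, credit_train, credit_test, and credit_model, the seed value, and a 1,000-record dataset split 900/100 are assumptions inferred from the surrounding slides.

```r
# Assumes 'credit' is a data frame of 1,000 loan records whose 17th
# column, 'default', is the class factor (assumption from the text).
library(C50)

set.seed(123)                      # make the random split reproducible
train_sample <- sample(1000, 900)  # 900 row indices chosen at random

credit_train <- credit[train_sample, ]   # 90 percent for training
credit_test  <- credit[-train_sample, ]  # remaining 10 percent for testing

# Exclude column 17 (the class variable) from the predictors and
# supply it separately as the target factor vector.
credit_model <- C5.0(credit_train[-17], credit_train$default)

summary(credit_model)  # displays the tree's decision rules
```

Setting the seed before calling sample() is what makes the "random" split repeatable across sessions, which matters when comparing model iterations.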
Evaluating model performance

• To apply the decision tree to the test dataset, we use the predict() function:
  credit_pred <- predict(credit_model, credit_test)
• This creates a vector of predicted class values, which we can compare to the actual class values using the CrossTable() function in the gmodels package.

Results

• Out of the 100 test loan application records, our model correctly predicted that 59 did not default and 14 did default, resulting in an accuracy of 73 percent and an error rate of 27 percent.
• Also note that the model correctly predicted only 14 of the 33 actual loan defaults in the test data, or 42 percent.
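The evaluation step described above can be sketched as follows, assuming the credit_model and credit_test objects from the training slides and a class column named default (names are assumptions, not confirmed by the source):

```r
library(gmodels)

# Generate predicted classes for the held-out test records.
credit_pred <- predict(credit_model, credit_test)

# Cross-tabulate actual vs. predicted classes; the diagonal cells
# count correct predictions, the off-diagonal cells count errors.
# The prop.* = FALSE options suppress the chi-square contribution
# and the column/row proportions to keep the table readable.
CrossTable(credit_test$default, credit_pred,
           prop.chisq = FALSE, prop.c = FALSE, prop.r = FALSE,
           dnn = c("actual default", "predicted default"))
```

Accuracy is then the sum of the diagonal cells divided by the total number of test records; with 59 + 14 correct out of 100, that gives the 73 percent figure quoted above.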