
Interpretable Machine Learning


Assignment I

Author: Munir Eberhardt Hiabu
Published: March 19, 2024

Part 1 (Training, Validating and Testing)
We will work with the “credit-g” dataset. The dataset classifies people described by a set of attributes as good or bad credit risks. See https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data for more details. We will fetch the data from https://www.openml.org/d/31. We will rely on the mlr3 environment (https://mlr3.mlr-org.com/). As classifier, we will use logistic regression with elastic net.

Load necessary packages.

library(mlr3)
library(mlr3learners)
library(mlr3tuning)
library(mlr3mbo)
library(glmnet)
library(OpenML)
library(mlr3pipelines)

### If parallelization is wanted, also:


library(future)
future::plan("multisession")

(a) (Default fitting)


Fetch the data and create a task.

credit_data = getOMLDataSet(data.id = 31)


task = as_task_classif(credit_data$data, target = "class")

Split your data into a training and test set.

Build a graph where

first: Dummy encode variables via po(“encode”),

second: Standardize via po(“scale”),


third: Logistic Regression with default settings is applied.

Train your model on the training set and evaluate it on the test set.
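For orientation, here is a minimal sketch of these steps, assuming the task defined above; the split ratio and the object names split and graph_learner are my own choices, not prescribed by the assignment.

set.seed(1)
split = partition(task, ratio = 0.7)          # hold out a test set

graph = po("encode") %>>%                     # dummy encode factor variables
  po("scale") %>>%                            # standardize features
  lrn("classif.log_reg")                      # logistic regression with default settings
graph_learner = as_learner(graph)

graph_learner$train(task, row_ids = split$train)
prediction = graph_learner$predict(task, row_ids = split$test)
prediction$score(msr("classif.ce"))           # test classification error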

(b) (Cross Validation)


In this part we want to train a logistic regression with elastic net. The tuning parameters are alpha and s (here, s plays the same role as the penalty parameter lambda).

Build a new graph

First and second step as before in Exercise I


Third: Use learner classif.glmnet with tunable parameters:

s = to_tune(0, 1)
alpha = to_tune(0, 1)

Note: Tuning s manually via cross-validation is not optimal with respect to computational efficiency, and a more efficient solution is implemented directly in the classif.glmnet learner. In this exercise, for learning purposes, we will do cross-validation manually.

Tune your hyperparameters via the tune function with:

tuner: random search


resampling: 5-fold cross validation
measure: classification error
terminator: 50 evaluations

What is the CV error of the best configuration?

What is the test error of the best configuration?
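One possible way to set this up is sketched below; the argument names follow recent versions of mlr3tuning and may differ slightly in older releases, and the object names match those referred to in the hint that follows.

graph_elastic_net = po("encode") %>>%
  po("scale") %>>%
  lrn("classif.glmnet",
      s     = to_tune(0, 1),
      alpha = to_tune(0, 1))
graph_learner_elastic_net = as_learner(graph_elastic_net)

instance = tune(
  tuner      = tnr("random_search"),
  task       = task,                          # ideally restricted to the training rows
  learner    = graph_learner_elastic_net,
  resampling = rsmp("cv", folds = 5),
  measures   = msr("classif.ce"),
  term_evals = 50
)
instance$result_y                             # CV error of the best configuration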

It can be helpful to use

graph_learner_elastic_net$param_set$values = instance$result_learner_param_vals
graph_learner_elastic_net$param_set$values$classif.glmnet.lambda =
  graph_learner_elastic_net$param_set$values$classif.glmnet.s

Here graph_learner_elastic_net is the name of my graph and instance is the name of my tuned instance.

Print the beta values via

graph_learner_elastic_net$model$classif.glmnet$model$beta

You may want to try out different tuning configurations, e.g. tuner: “mbo” or other measures.

(c) (Nested Cross Validation)


Try out nested cross-validation by
defining your graph as an auto_tuner with
tuner: random search,
resampling: 5-fold cross validation,
measure: classification error,
terminator: 50 evaluations.

Use resample to run 5-fold cross validation.


Print out the nested cross validation error.
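A sketch of the nested setup, assuming graph_learner_elastic_net from part (b) with its tunable parameters:

at_elastic_net = auto_tuner(
  tuner      = tnr("random_search"),
  learner    = graph_learner_elastic_net,
  resampling = rsmp("cv", folds = 5),         # inner resampling used for tuning
  measure    = msr("classif.ce"),
  term_evals = 50
)
rr = resample(task, at_elastic_net, rsmp("cv", folds = 5))   # outer 5-fold CV
rr$aggregate(msr("classif.ce"))               # nested cross-validation error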

Part 2 (Frisch–Waugh–Lovell theorem)
In this part we will study the Frisch–Waugh–Lovell theorem. The implied algorithm, with machine learning used in place of linear regression, was introduced a few years ago as Double Machine Learning. Since then it has gained a lot of attention and popularity. Here, we want to verify the result via a coding exercise.

Load necessary packages.

library(mlr3)
library(mlr3learners)
library(OpenML)
library(mlr3pipelines)


Fetch and edit data.

bike_data = getOMLDataSet(data.id = 42713)


bike_data$data <- bike_data$data[,-c(7,13,14)] ## remove casual and registered

### convert dates to factors


bike_data$data$year <- factor(bike_data$data$year)
bike_data$data$month <- factor(bike_data$data$month)
bike_data$data$hour <- factor(bike_data$data$hour)
bike_data$data$weekday <- factor(bike_data$data$weekday)

a. Run simple least squares linear regression with response being count and predictor equal to windspeed. Report the coefficient you get.

b. Run least squares linear regression with response being count and predictor
being all remaining variables. Don’t forget to create a graph to dummy-
encode the factor variables. Report the coefficient you get for windspeed .

c. Do the following steps

Run least squares linear regression with response being count and predictor being all remaining variables except windspeed. Calculate the residuals and call that variable count_residuals.
Run least squares linear regression with response being windspeed and predictor being all remaining variables except count. Calculate the residuals and call that variable windspeed_residuals.
Run simple least squares linear regression with response being count_residuals and predictor windspeed_residuals.
Report the regression coefficient you get.

d. Verify that the coefficients in Steps (b) and (c) are the same.
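To see the theorem in action, here is a compact sketch of steps (a)–(d) using base R lm(); the assignment intends mlr3 graphs with explicit dummy encoding, but lm() encodes factors automatically, which keeps the check short.

dat = bike_data$data

coef(lm(count ~ windspeed, data = dat))["windspeed"]                  # (a) simple regression

coef(lm(count ~ ., data = dat))["windspeed"]                          # (b) adjusted regression

count_residuals     = resid(lm(count ~ . - windspeed, data = dat))    # (c) residualize count
windspeed_residuals = resid(lm(windspeed ~ . - count, data = dat))    #     residualize windspeed
coef(lm(count_residuals ~ windspeed_residuals))["windspeed_residuals"]

# (d) the coefficients from (b) and (c) should coincide up to numerical precision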

e. Replace the simple linear regression model in the second-to-last step of part (c) with an auto-tuned k-nearest neighbors regression. Visualize the fit (by plotting windspeed_residuals against observed and predicted count_residuals) and compare it to the previous simple linear regression fit. Discuss the result.
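A possible sketch for (e), assuming count_residuals and windspeed_residuals from part (c); the learner regr.kknn (which requires the kknn package), the tuning range for k, and the plotting style are my own choices.

library(mlr3tuning)

resid_task = as_task_regr(
  data.frame(count_residuals     = count_residuals,
             windspeed_residuals = windspeed_residuals),
  target = "count_residuals")

at_knn = auto_tuner(
  tuner      = tnr("random_search"),
  learner    = lrn("regr.kknn", k = to_tune(1, 100)),
  resampling = rsmp("cv", folds = 5),
  measure    = msr("regr.mse"),
  term_evals = 25
)
at_knn$train(resid_task)
pred = at_knn$predict(resid_task)

plot(windspeed_residuals, count_residuals, col = "grey")              # observed residuals
points(windspeed_residuals, pred$response, col = "red", pch = 16)     # k-NN fit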

Part 3 (Tree-based models)


Load necessary packages.

library(mlr3)
library(mlr3learners)
library(mlr3tuning)
library(OpenML)
library(mlr3pipelines)
library(future)
future::plan("multisession")

Fetch data. This is the same data as in Part 1.

# load credit-g data and define task


credit_data = getOMLDataSet(data.id = 31)
task = as_task_classif(credit_data$data, target = "class")

(a)
Use the learner classif.rpart with predict_type = "prob" and train it on
the task. Visualize the learned tree via

# extract the trained rpart object and plot the tree

full_tree_trained <- full_tree$model$classif.rpart$model
plot(full_tree_trained, compress = TRUE, margin = 0.1)
text(full_tree_trained, use.n = TRUE, cex = 0.8)

Here full_tree is the graph you trained.
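For reference, one minimal way to define and train such a graph; the trivial wrapping via as_graph() is my own choice so that the extraction code above works as written.

full_tree = as_learner(as_graph(lrn("classif.rpart", predict_type = "prob")))
full_tree$train(task)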

(b)
We now aim to find a penalty parameter α that results in a pruned tree with strong predictive power. To this end, we define a tree learner that runs the weakest link algorithm and thereafter 5-fold cross validation to compare the performance between different trees.

# define a CART learner that also runs 5-fold cross-validation internally


my_cart_learner_cv = lrn("classif.rpart", xval = 5, predict_type = "prob")

You can run the following commands on the rpart object in order to see the CV results. Hint: If unsure how to extract the rpart object from your trained graph, check how this was done in part (a) above.

# inspect the cross-validation results of the fitted rpart object


rpart::plotcp(cart_trained_cv)
rpart::printcp(cart_trained_cv)
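Mirroring part (a), one way to obtain cart_trained_cv (the object names here are my own):

my_cart_graph_cv = as_learner(as_graph(my_cart_learner_cv))
my_cart_graph_cv$train(task)
cart_trained_cv = my_cart_graph_cv$model$classif.rpart$model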

(c)
Pick an α that is big enough and also has a low error. In the rpart package
vignette, the following advice is given:

A plot of α versus risk often has an initial sharp drop followed by a relatively flat plateau and then a slow rise. The choice of α among those models on the plateau can be essentially random. To avoid this, both an estimate of the risk and its standard error are computed during the cross-validation. Any risk within one standard error of the achieved minimum is marked as being equivalent to the minimum (i.e. considered to be part of the flat plateau). Then the simplest model, among all those “tied” on the plateau, is chosen.

Train and then visualize the tree with the chosen α. (The relevant parameter is called cp.)
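A sketch of this step; the value assigned to cp below is only a placeholder, not the answer — read your choice off the plotcp/printcp output from (b).

chosen_cp = 0.02                              # placeholder: replace with the α chosen in (c)
pruned_tree = as_learner(as_graph(
  lrn("classif.rpart", cp = chosen_cp, predict_type = "prob")))
pruned_tree$train(task)

pruned_tree_trained = pruned_tree$model$classif.rpart$model
plot(pruned_tree_trained, compress = TRUE, margin = 0.1)
text(pruned_tree_trained, use.n = TRUE, cex = 0.8)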

(d)
Using the benchmark function, compare the predictive performance of the following five algorithms:

A baseline model that uses no features (classif.featureless)
A non-pruned CART tree
A pruned CART tree with α as chosen in part (c).
An auto-tuned xgboost (classif.xgboost). You could for example tune parameters in the following way:
eta = to_tune(0, 0.5),
nrounds = to_tune(10, 5000),
max_depth = to_tune(1, 10).
An auto-tuned random forest (classif.ranger). You could for example tune parameters in the following way:
mtry.ratio = to_tune(0.1, 1),
min.node.size = to_tune(1, 50).

You may want to look at more than just the classification error. You can, for example, run:

# aggregate several performance measures over the benchmark result


res$aggregate(list(msr("classif.ce"),
msr("classif.acc"),
msr("classif.auc"),
msr("classif.fpr"),
msr("classif.fnr")))

Here res is the calculated benchmark object.
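A sketch of the benchmark setup; the learner objects are assumed to exist already: full_tree and pruned_tree as above, and at_xgboost / at_ranger built with auto_tuner() as in Part 1(c).

design = benchmark_grid(
  tasks       = task,
  learners    = list(lrn("classif.featureless", predict_type = "prob"),
                     full_tree,               # non-pruned CART
                     pruned_tree,             # pruned CART with the chosen cp
                     at_xgboost,              # auto-tuned classif.xgboost
                     at_ranger),              # auto-tuned classif.ranger
  resamplings = rsmp("cv", folds = 5)
)
res = benchmark(design)
res$aggregate(msr("classif.ce"))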

Remark
Note that we are not comparing how an optimally pruned decision tree compares to other algorithms that are optimally tuned. While this would be possible (one would just need to decide explicitly how to choose the optimal α and define an auto-tuner accordingly), the purpose of this task is a different one. While we expect a single decision tree to perform worse than tree ensembles, we would like to know how large the performance loss is if we choose to employ an interpretable decision tree. It is therefore also essential that the α chosen in (c) is large enough to lead to a sufficiently small tree.

(e)
The German Credit dataset comes with a cost matrix, see https://www.openml.org/search?type=data&id=31&sort=runs&status=active.

                Good (predicted)   Bad (predicted)
Good (actual)          0                  1
Bad (actual)           5                  0

Use classif.costs(costs = mycosts) to define a measure with the given cost. Here, mycosts is the transpose of the cost matrix. Use the calculated benchmark object from (d) to see how the algorithms compare for this new measure.
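A sketch of the cost measure; I assume the class labels are "good" and "bad" (check task$class_names) and follow the assignment in transposing the table above, so that rows are predictions (response) and columns are the truth.

mycosts = matrix(c(0, 1, 5, 0), nrow = 2,
                 dimnames = list(response = c("good", "bad"),
                                 truth    = c("good", "bad")))
cost_measure = msr("classif.costs", costs = mycosts)
res$aggregate(cost_measure)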

(f)
If time allows, you can re-run part (d) with the auto-tuned objects optimized via the measure defined in part (e) and see how much the results change.
