FALL 2023
MIDTERM EXAM
Name:
Email:
• The exam is closed book and closed notes, except for two handwritten pages of notes.
• Please show all of your work. Answers without appropriate justification will receive very little
credit. If you need extra space, use the back of the previous page.
Problem 1:
(a) You solve a logistic regression problem with two features, and you use no offset. You find
    ŵ = (3, 2).
• Draw the set of points that corresponds to the decision boundary, i.e., the set of points for which this
logistic regression classifier assigns a 50% chance of being 1 and a 50% chance of being 0. Justify/explain
your answer.
(b) In the problem above, also indicate the region where the classifier assigns a higher probability of being
a “1” and the region where it assigns a higher probability of being a “0.”
(c) Now suppose we use feature augmentation, and add the features X3 = X1^2 and X4 = X2^2. Suppose that
we solve the logistic regression problem, and now we use an offset. Thus we compute five values:
w0 for the offset, and w1, w2, w3, and w4 for the four features. If w0 = −4, w1 = w2 = 0, and
w3 = w4 = 1, draw the set of points that corresponds to the decision boundary in the (X1, X2)-space.
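
A quick numerical sketch, in Python, of the models in parts (a) and (c): it assumes the standard
parameterization P(Y = 1 | X = x) = σ(w·x) (plus the offset for part (c)) and flags grid points whose
predicted probability is near 1/2, which is one way to sanity-check a hand-drawn answer; the grid
range and tolerance are arbitrary choices of the sketch.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Part (a): no offset, w = (3, 2).
w = np.array([3.0, 2.0])

# Part (c): offset w0 = -4, w1 = w2 = 0, w3 = w4 = 1, with X3 = X1^2 and X4 = X2^2.
w0, w3, w4 = -4.0, 1.0, 1.0

xs = np.linspace(-3.0, 3.0, 601)
X1, X2 = np.meshgrid(xs, xs)

p_a = sigmoid(w[0] * X1 + w[1] * X2)           # model from part (a)
p_c = sigmoid(w0 + w3 * X1**2 + w4 * X2**2)    # model from part (c)

# Grid points whose predicted probability is (numerically) close to 1/2.
print("part (a):", int(np.sum(np.abs(p_a - 0.5) < 0.01)), "grid points near p = 1/2")
print("part (c):", int(np.sum(np.abs(p_c - 0.5) < 0.01)), "grid points near p = 1/2")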
Problem 2 (25 pnts):
True/False and Multiple Choice: circle your answer, and provide a brief justification.
2. If X1 and Y are uncorrelated, then we can discard X1 and doing so will never hurt the training or
testing error. (True. False.)
3. Logarithmic transformations of the features (assuming the values of the features are strictly positive
so that log is defined) do not change the training loss for decision trees, but they can improve the
testing error. (True. False.) (A related experiment is sketched after this list.)
4. Suppose we have a regression problem with 2 features. If we are using linear regression and we add
the feature X3 = X1 − 3X2 , then it’s possible that the training error might be strictly reduced. (True.
False.)
5. Suppose we have a regression problem with 2 features. If we are using depth 2 regression trees and
we add the feature X3 = X1 − 3X2 , then it’s possible that the training error might be strictly reduced.
(True. False.)
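
A small experiment one could run while thinking about statement 3, assuming scikit-learn is available;
the synthetic data below are purely illustrative. It fits the same depth-limited decision tree on the raw
features and on their logarithms and compares the training accuracy of the two fits.

import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.uniform(0.1, 10.0, size=(200, 2))    # strictly positive features, so log is defined
y = (X[:, 0] * X[:, 1] > 10).astype(int)     # an arbitrary nonlinear labeling rule

# Same tree, fit once on the raw features and once on the log-transformed features.
tree_raw = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
tree_log = DecisionTreeClassifier(max_depth=3, random_state=0).fit(np.log(X), y)

print("training accuracy, raw features:", tree_raw.score(X, y))
print("training accuracy, log features:", tree_log.score(np.log(X), y))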
Problem 3 (20 pnts):
Consider the following binary classification problem. For this problem, we want to use the exponential loss:
exp(−ŷ y), where ŷ is given by h(x), for a function h of our choice.
Table 1: Data
x(1)   x(2)    y
0.2    0.6     1
0.3    0.65    1
0.7    0.4    -1
0.3    0.4    -1
0.6    0.55   -1
0.8    0.7     1
(a) Find the best decision stump for this problem. Assume that you can only set leaf values to be in the
range [−5, 5].
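
One way to explore part (a) is a brute-force search. The sketch below is only one possible approach: it
assumes an axis-aligned stump of the form h(x) = ℓ1 if x(j) < t, else ℓ2, uses the exponential loss, and
clips the closed-form leaf values to the allowed range [−5, 5]; the candidate thresholds (taken at the data
values) are a choice of the sketch, not part of the problem statement.

import numpy as np

# Data from Table 1.
X = np.array([[0.2, 0.60],
              [0.3, 0.65],
              [0.7, 0.40],
              [0.3, 0.40],
              [0.6, 0.55],
              [0.8, 0.70]])
y = np.array([1.0, 1.0, -1.0, -1.0, -1.0, 1.0])

def best_leaf(y_side):
    # For exponential loss, the unconstrained minimizer over a leaf is (1/2) ln(n_pos / n_neg);
    # clip it to [-5, 5] to respect the constraint in the problem.
    n_pos, n_neg = np.sum(y_side == 1), np.sum(y_side == -1)
    if n_neg == 0:
        return 5.0
    if n_pos == 0:
        return -5.0
    return float(np.clip(0.5 * np.log(n_pos / n_neg), -5.0, 5.0))

best = None
for j in range(2):                                # which feature to split on
    for t in np.unique(X[:, j]):                  # candidate thresholds at the data values
        left, right = X[:, j] < t, X[:, j] >= t
        if left.sum() == 0 or right.sum() == 0:
            continue
        l1, l2 = best_leaf(y[left]), best_leaf(y[right])
        pred = np.where(left, l1, l2)
        loss = float(np.sum(np.exp(-y * pred)))
        if best is None or loss < best[0]:
            best = (loss, j + 1, t, l1, l2)
print("loss = %.3f, split on x(%d) at threshold %.2f, leaves (%.2f, %.2f)" % best)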
(b) Suppose we fit a stump, and you split on x(1) ≥ 0.4, and assign leaf values ℓ1 and ℓ2 , so that if
x(1) < 0.4 it is assigned ℓ1 , and if x(1) ≥ 0.4 it is assigned ℓ2 . Write down the value of the loss
function. This should not be a numerical value, but a function of ℓ1 and ℓ2 .
(c) For the stump above (same splitting rule as given above), suppose we set the values ℓ1 = (ln 4)/2 and
ℓ2 = −(ln 4)/2. Call this stump h1. Suppose we wish to use the AdaBoost framework to boost the
stump above with a linear function of the form h2(x) = β1 x(1) + β2 x(2). This is done by solving a
minimization problem whose objective is a sum of six terms, one for each of the data points. Write down
the first term. This should be an expression involving β1 and β2, but it should not contain other variables.
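
A numerical sketch of the objective described in part (c), assuming the usual additive form
exp(−y (h1(x) + h2(x))) for the boosted exponential loss; the data are copied from Table 1 and h1 is the
stump defined above.

import numpy as np

# Data from Table 1.
X = np.array([[0.2, 0.60],
              [0.3, 0.65],
              [0.7, 0.40],
              [0.3, 0.40],
              [0.6, 0.55],
              [0.8, 0.70]])
y = np.array([1.0, 1.0, -1.0, -1.0, -1.0, 1.0])

def h1(x):
    # The stump from part (c): value (ln 4)/2 if x(1) < 0.4, and -(ln 4)/2 if x(1) >= 0.4.
    return np.where(x[:, 0] < 0.4, np.log(4) / 2, -np.log(4) / 2)

def boosted_exp_loss(beta1, beta2):
    # Sum of six terms exp(-y_i * (h1(x_i) + beta1 * x_i(1) + beta2 * x_i(2))).
    h2 = beta1 * X[:, 0] + beta2 * X[:, 1]
    return float(np.sum(np.exp(-y * (h1(X) + h2))))

# For example, the objective with h2 = 0 (only the stump h1 acting):
print(boosted_exp_loss(0.0, 0.0))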
(d) If we boost more decision trees, it could be possible to (circle all that apply):
Problem 4:
(a) Suppose we have a data set with two features. Imagine that we have a solution to the linear logistic
regression problem with an offset. We draw the region where the model says P (Y = 1 | X = x) =
1/2, and we find that all the points are on one side, i.e., all the 1’s and all the 0’s are on the same side
of the region. What is the highest the AUC of this logistic regression model could be? Give a score,
and justify your answer.
(b) Consider a classifier (not necessarily the one described above). Suppose that it is 99% accurate.
Provide two examples: one that shows that the AUC of this very accurate classifier could be very close
to 1, and one that shows that the AUC could be very close to 1/2.
(c) For a dataset, a model predicts probabilities {0.25, 0.3, 0.4, 0.5, 0.8, 0.9} and the true corresponding
labels are y = {0, 0, 0, 1, 0, 1}. Draw the ROC curve and compute the AUC for these predictions.
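
For cross-checking a hand-drawn ROC curve and a hand-computed AUC on part (c), a minimal sketch
assuming scikit-learn is available:

from sklearn.metrics import roc_curve, roc_auc_score

scores = [0.25, 0.3, 0.4, 0.5, 0.8, 0.9]   # predicted probabilities from part (c)
labels = [0, 0, 0, 1, 0, 1]                # true labels from part (c)

fpr, tpr, thresholds = roc_curve(labels, scores)
print("FPR:", fpr)
print("TPR:", tpr)
print("AUC:", roc_auc_score(labels, scores))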