Midterm - APS1070 - 2020 - 05 Summer

Uploaded by

The document contains 6 multiple choice or short answer questions related to machine learning concepts like decision trees, feature selection, correlation, and classification metrics. The questions assess understanding of topics like overfitting decision trees, improving model performance, removing correlated features, calculating AUC and choosing the best model based on a given metric like F1 score.

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Midterm - APS1070 - 2020 - 05 Summer

Uploaded by

Michael Ye

0% found this document useful (0 votes)

51 views2 pages

Original Title

midterm - APS1070 - 2020_05 Summer

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

0% found this document useful (0 votes)

51 views2 pages

Midterm - APS1070 - 2020 - 05 Summer

Uploaded by

Michael Ye

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

Jump to Page

You are on page 1of 2

Search inside document

1.

Please read this statement and Agree/Disagree below:

“In submitting this assessment, I confirm that my conduct during this quiz adheres to the Code
of Behaviour on Academic Matters. I confirm that I did NOT act in such a way that would
constitute cheating, misrepresentation, or unfairness, including but not limited to, using
unauthorized aids and assistance, impersonating another person, and committing plagiarism. I
pledge upon my honour that I have not violated the Faculty of Applied Science & Engineering’s
Honour Code during this assessment.”

2. [2] Which of the following statements is false?

a. Basis vectors forming an orthogonal basis are always orthonormal.
b. Basis vectors forming an orthonormal basis are always orthogonal.
c. Basis vectors forming an orthonormal basis are always normal.
d. All vectors in an orthonormal basis has length 1.

3. [2] In lecture, we discussed decision trees – an intuitive classification model that splits on
different attributes, creating a tree-like structure. A data scientist is given a large data set and
uses part of the data to train a really big decision tree with many branches and nodes, that
perfectly fits the data. When they apply it to the validation data, overall accuracy is only 78%.
a. Why is test performance so poor?
b. What can the data scientist do to improve the model?

4. [2] A data scientist has a data set with a lot of features and chooses to use some of these
features to train a model on training data and evaluate performance on testing data. They find
that both training and testing accuracy is poor. What would you recommend (i) removing a few
features or (ii) adding more features? Explain.

5. [2] A data set with 4 features has the following covariance matrix:

A B C D
A 0.5 0.018 0.11 0.048
B 0.018 0.01 0.0025 0.14
C 0.11 0.0025 0.023 0.0055
D 0.048 0.14 0.0055 6

You’re asked to remove a highly correlated feature from the data set. Which one would you
remove?
6. [4] You have two binary classification models (P_1 and P_2), that use a series of features to
predict the probability of emails being spam. The computed probabilities are shown in the table
below, along with actual labels, for six validation data.

Label P_1 P_2

1 0 0.1 0.1
2 0 0.4 0.5
3 0 0.3 0.5
4 1 0.5 0.4
5 1 0.4 0.8
6 1 0.8 0.6

a. Calculate the AUC for each model.

b. Assuming you value F1-score, which model would you choose?
c. What is the precision, recall, accuracy and confusion matrix for this best model?

Artea - Data
Document1,266 pages
Artea - Data
Anh Tú
No ratings yet
Test Automation Estimate Template: Automation Type Project Name
Document6 pages
Test Automation Estimate Template: Automation Type Project Name
Santosh Prasad Ulpi
No ratings yet
Josiah Genao 1.01 Exploring Life Lab Report
Document4 pages
Josiah Genao 1.01 Exploring Life Lab Report
Josiah Paulino
No ratings yet
Syed Wajahat Abbas 10466
Document8 pages
Syed Wajahat Abbas 10466
ammar abbas
No ratings yet
Assignment #2 - For Statistical Software
Document4 pages
Assignment #2 - For Statistical Software
Nhatty Wero
No ratings yet
Question Bank Topic 6 - Foundations of Portfolio Theory
Document8 pages
Question Bank Topic 6 - Foundations of Portfolio Theory
mile
No ratings yet
R06 Time-Series Analysis
Document16 pages
R06 Time-Series Analysis
Indonesian Pro
0% (2)
Company Analysis: Case Abstract
Document4 pages
Company Analysis: Case Abstract
Aimira Aimagambetova
No ratings yet
Me Summer 2019
Document2 pages
Me Summer 2019
Ritik Mitra
No ratings yet
Superior University Lahore: The Main Objectives of This Ordering System Are
Document4 pages
Superior University Lahore: The Main Objectives of This Ordering System Are
Abubakar Chaudhry
No ratings yet
AF304 WEEK 11 Additional Tut Questions+tables
Document7 pages
AF304 WEEK 11 Additional Tut Questions+tables
Shivanjani Prasad
No ratings yet
Annotated Follow-Along Guide - Construct A Naive Bayes Model With Python
Document9 pages
Annotated Follow-Along Guide - Construct A Naive Bayes Model With Python
Trần Hoàng Thuý Vy
No ratings yet
MIS602 - Assessment 3 - 20240603
Document5 pages
MIS602 - Assessment 3 - 20240603
EMMANUEL FADHILI
No ratings yet
Chapter 5 - Data Analysis and Findings
Document10 pages
Chapter 5 - Data Analysis and Findings
priyanka
No ratings yet
FIN213 - Semester Test 1 20240406 Solutions Memo
Document11 pages
FIN213 - Semester Test 1 20240406 Solutions Memo
gaming.og122
No ratings yet
July 2018 545
Document6 pages
July 2018 545
Nur Adriana binti Abdul Aziz
No ratings yet
Module 1 Quiz (AEA) - Correct
Document6 pages
Module 1 Quiz (AEA) - Correct
anjibalaji52
No ratings yet
CA Assignment Group 1 RBA
Document17 pages
CA Assignment Group 1 RBA
Pashmeen Kaur
No ratings yet
Grade 12 Information and Communication Technology 2nd Term Test Paper With Answers 2020 North Western Province
Document20 pages
Grade 12 Information and Communication Technology 2nd Term Test Paper With Answers 2020 North Western Province
Manupa Perera
No ratings yet
MapR Certified Data Analyst (MCDA) Study Guide 16Skmxd
Document34 pages
MapR Certified Data Analyst (MCDA) Study Guide 16Skmxd
mrinal570
No ratings yet
PQ1 - Attempt Review PDF
Document6 pages
PQ1 - Attempt Review PDF
Peter Eclevia
No ratings yet
Application Lifecycle MGT - FQuiz 2
Document4 pages
Application Lifecycle MGT - FQuiz 2
JaniceRemateNoble
No ratings yet
Practice - Quality Management
Document8 pages
Practice - Quality Management
Dexter Khoo
No ratings yet
S14 Report and Citation
Document12 pages
S14 Report and Citation
Daksh Aneja
No ratings yet
Lal Bahadur Shastri Institute of Management, Delhi: PGDM - (General/R&BA/Finance Term-III) End-Term Exam, April 2021
Document4 pages
Lal Bahadur Shastri Institute of Management, Delhi: PGDM - (General/R&BA/Finance Term-III) End-Term Exam, April 2021
Nishit Srivastav
No ratings yet
My Courses 2022 Second Summer CSC 7333 For Jianhua Chen Final Exam Final Exam
Document16 pages
My Courses 2022 Second Summer CSC 7333 For Jianhua Chen Final Exam Final Exam
Dasari naveen
No ratings yet
Study Guide 1
Document3 pages
Study Guide 1
zubairsalmanpk
No ratings yet
Machine Learning Extended Project
Document3 pages
Machine Learning Extended Project
Krishnameera python
No ratings yet
a1948a9392c842949001ebd059aa752e
Document5 pages
a1948a9392c842949001ebd059aa752e
naveen
No ratings yet
BITI 1113 12223 Lab Assessment
Document4 pages
BITI 1113 12223 Lab Assessment
Jack frost
No ratings yet
Alm Final Exam!
Document20 pages
Alm Final Exam!
Nicole Tenoria
No ratings yet
Ife Matrix Boeing
Document5 pages
Ife Matrix Boeing
Daniela Peña
100% (1)
Malic acid problem
Document4 pages
Malic acid problem
aparmarhpt
No ratings yet
Ell784 Aq
Document2 pages
Ell784 Aq
lovlesh roy
No ratings yet
Question Paper OM, EPGP-13 (Sec B)
Document5 pages
Question Paper OM, EPGP-13 (Sec B)
Akshay Singh
0% (1)
Logistic Regression: Prof. Andy Field
Document34 pages
Logistic Regression: Prof. Andy Field
Syed
No ratings yet
Multiple Choice Quiz
Document2 pages
Multiple Choice Quiz
Jasdeep Singh Deepu
100% (2)
Operations Management Case Study (Mark: 40%)
Document3 pages
Operations Management Case Study (Mark: 40%)
Bedri M Ahmedu
No ratings yet
Sample Exam Questions
Document5 pages
Sample Exam Questions
dinesh
No ratings yet
6014 Question Paper
Document2 pages
6014 Question Paper
rahulgupta32005
No ratings yet
Optimax USER'S MANUAL v0.6.3: 2.1 Scalar Variables
Document21 pages
Optimax USER'S MANUAL v0.6.3: 2.1 Scalar Variables
Reza Aldavood
No ratings yet
Assign1 s2 2024
Document5 pages
Assign1 s2 2024
wiremu casey
No ratings yet
C TBW45 70 Sample
Document6 pages
C TBW45 70 Sample
Konduru Prashanth
No ratings yet
1617sem1 Ie5203
Document5 pages
1617sem1 Ie5203
apple pie
No ratings yet
Six Sigma Sample Question Answers
Document15 pages
Six Sigma Sample Question Answers
Tanveer Siddique
No ratings yet
Workshop - Data Mining & Explosion
Document11 pages
Workshop - Data Mining & Explosion
javediqbal78.uk
No ratings yet
Download Complete Statistics for Engineers and Scientists 5th Edition William Navidi PDF for All Chapters
Document52 pages
Download Complete Statistics for Engineers and Scientists 5th Edition William Navidi PDF for All Chapters
paulsemalte
100% (1)
Problem (Objective 17-5) in Auditing The Valuation of Inventory, The Auditor, Claire Butler, Decided To Use
Document3 pages
Problem (Objective 17-5) in Auditing The Valuation of Inventory, The Auditor, Claire Butler, Decided To Use
Angelina Simanjuntak
No ratings yet
Decision Science
Document4 pages
Decision Science
Vinita
No ratings yet
SLA Mid-termV2 Soln
Document5 pages
SLA Mid-termV2 Soln
cadi0761
No ratings yet
Application Lifecycle MGT - Final Exam
Document20 pages
Application Lifecycle MGT - Final Exam
JaniceRemateNoble
No ratings yet
2009 06 SSBB Rev 2 Col Sample Exam
Document15 pages
2009 06 SSBB Rev 2 Col Sample Exam
Prasoon Verma
No ratings yet
Risk Profile: UTILIDAD ESPERADA (2.5) 2900 27.3312
Document3 pages
Risk Profile: UTILIDAD ESPERADA (2.5) 2900 27.3312
César Vallejo
No ratings yet
Where Can Buy Statistics Using Technology Second Edition Kathryn Kozak Ebook With Cheap Price
Document74 pages
Where Can Buy Statistics Using Technology Second Edition Kathryn Kozak Ebook With Cheap Price
jeffrymuck
100% (5)
COS10022 Data Science Assignment 1 Question
Document3 pages
COS10022 Data Science Assignment 1 Question
j22037228
No ratings yet
Introduction to Statistics Through Resampling Methods and Microsoft Office Excel
From Everand
Introduction to Statistics Through Resampling Methods and Microsoft Office Excel
Phillip I. Good
No ratings yet
Math Practice Simplified: Decimals & Percents (Book H): Practicing the Concepts of Decimals and Percentages
From Everand
Math Practice Simplified: Decimals & Percents (Book H): Practicing the Concepts of Decimals and Percentages
Ann Cassill Sofge
Rating: 5 out of 5 stars
5/5 (3)
Quantitative Finance: Its Development, Mathematical Foundations, and Current Scope
From Everand
Quantitative Finance: Its Development, Mathematical Foundations, and Current Scope
T. Wake Epps
No ratings yet
Service Science
From Everand
Service Science
Mark S. Daskin
No ratings yet
The Data Science Workshop: A New, Interactive Approach to Learning Data Science
From Everand
The Data Science Workshop: A New, Interactive Approach to Learning Data Science
Anthony So
No ratings yet
Amy Mini
Document57 pages
Amy Mini
burakozlu
No ratings yet
People v. Estrada Part 2
Document10 pages
People v. Estrada Part 2
Bryce King
No ratings yet
8 Analysis and Synthesis
Document7 pages
8 Analysis and Synthesis
fiahstone
No ratings yet
SP60PL Specialized 3-2
Document1 page
SP60PL Specialized 3-2
Ahmad Sabra
No ratings yet
NCP302035 Integrated Driver and Mosfet: Description
Document17 pages
NCP302035 Integrated Driver and Mosfet: Description
Yenco Barliza Diaz
No ratings yet
Ifix Services - Ind 100Ft Road: MR Lokranjith P
Document1 page
Ifix Services - Ind 100Ft Road: MR Lokranjith P
Anonymous J7kMk6A7T
No ratings yet
Use of Pet Coke in Cement Manufacturing and Its Comparitve Propreties With Coal
Document23 pages
Use of Pet Coke in Cement Manufacturing and Its Comparitve Propreties With Coal
nitesh1985
100% (1)
2016 NatHazard PhillipinesTyphoon Haiyan
Document13 pages
2016 NatHazard PhillipinesTyphoon Haiyan
John Jerald Villamanca
No ratings yet
JUNOS Secure BGP Template V 1 PDF
Document12 pages
JUNOS Secure BGP Template V 1 PDF
Salman Alfarisi
No ratings yet
Second Periodical Test in Science Vi Table of Specification Objectives No. of Days Taught Percent No. of Items Item Placement
Document14 pages
Second Periodical Test in Science Vi Table of Specification Objectives No. of Days Taught Percent No. of Items Item Placement
Nida Espinas Francisco
No ratings yet
Proposed PBL 2.0 Curriculum
Document30 pages
Proposed PBL 2.0 Curriculum
Rodel Ebal
No ratings yet
2021 Barangay Nutrition Action Plan (Bnap) : Poblacion
Document6 pages
2021 Barangay Nutrition Action Plan (Bnap) : Poblacion
Julius Espiga Elmedorial
100% (3)
Application Form For Acr I-Card Renewal
Document2 pages
Application Form For Acr I-Card Renewal
mikhail81
No ratings yet
Important Tips On The Calibration and Adjustment of The Testo 270
Document1 page
Important Tips On The Calibration and Adjustment of The Testo 270
Abu Alif
No ratings yet
TVS Jupiter RMC - 19.3.19
Document60 pages
TVS Jupiter RMC - 19.3.19
vinod
No ratings yet
Calico Rules
Document16 pages
Calico Rules
Tomás Sánchez
No ratings yet
119403-2003-Magsalin - v. - National - Organization - of - Working
Document7 pages
119403-2003-Magsalin - v. - National - Organization - of - Working
Carol Terrado
No ratings yet
Define COPA Mapping: Requirements
Document4 pages
Define COPA Mapping: Requirements
GK SK
No ratings yet
Sop
Document19 pages
Sop
Abdul Razzaq
No ratings yet
Frequencies: Notes
Document36 pages
Frequencies: Notes
anggi purnamasari
No ratings yet
A Cantilever Side Table C Table End Table
Document23 pages
A Cantilever Side Table C Table End Table
Danilo Rocha
No ratings yet
VZW Credo
Document1 page
VZW Credo
fruitfuck
No ratings yet
Baseband M-Ary PAM
Document12 pages
Baseband M-Ary PAM
Danish Soonka
No ratings yet
Pearson Catalog Bioscience 2019 Final
Document96 pages
Pearson Catalog Bioscience 2019 Final
Jeetu Rao
No ratings yet
A-130JACK: Operating Instruction Water Pump
Document1 page
A-130JACK: Operating Instruction Water Pump
James Choong
No ratings yet
Chemical Reactions Balancing Equations Activity
Document3 pages
Chemical Reactions Balancing Equations Activity
Katy Ospina Polania
No ratings yet
APTIS Writing Part 4
Document11 pages
APTIS Writing Part 4
Lara
No ratings yet
Research Paper
Document4 pages
Research Paper
PranjalSharma
No ratings yet
Letter To A Young Scientist
Document2 pages
Letter To A Young Scientist
Shunah Uporash
No ratings yet
Elton Mayo Theory Defication
Document25 pages
Elton Mayo Theory Defication
Monday
No ratings yet