Homework3

Uploaded by

salah.abdo.tech

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Homework3

Uploaded by

salah.abdo.tech

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

For Questions 1- 4, please submit a word file or a PDF file;

For Question 5 (programming question), please submit an .ipynb file.

Question 1: [4 points] Explain what is the bias-variance trade-off? Describe few techniques to
reduce bias and variance respectively.

Question 2: [6 points] Assume the following confusion matrix of a classifier. Please compute its
1) precision,
2) recall, and
3) F1-score.

Predicted results
Actual values

Class 1 Class 2
Class 1 50 30
Class 2 40 60

Question 3: [10 points] Build a decision tree using the following training instances (using
information gain approach):

Applied Machine Learning – CPE 695 © 2 0 2 1 - 1 -

ST E VEN S I N ST I T U T E oƒ T EC H N O L O G Y
Question 4. [10 points] The naïve Bayes method is an ensemble method as we learned in
Module 5. Assuming we have 3 classifiers, and their predicted results are given in the table 1.
The confusion matrix of each classifier is given in table 2. Please give the final decision using the
Naïve Bayes method:

Table 1 Predicted results of each classifier

Sample x Result

Classifier 1 Class 1

Classifier 2 Class 1

Classifier 3 Class 2

Table 2 Confusion matrix of each classifier

i) Classifier 1 ii) Classifier 2 iii) Classifier 3

Class1 Class2 Class1 Class2 Class1 Class2

Class1 40 10 Class1 20 30 Class1 50 0

Class2 30 20 Class2 20 30 Class2 40 10

Question 5: Programming (40 points):

Use decision tree and random forest to train the titanic.csv dataset included in the assignment.

Step 1: Read in Titanic.csv and observe a few samples, some features are categorical, and
others are numerical. If some features are missing, fill them in using the average of the same
feature of other samples. Take a random 80% samples for training and the rest 20% for test.

Step 2: Fit a decision tree model using independent variables ‘pclass + sex + age + sibsp’ and
dependent variable ‘survived’. Plot the full tree. Make sure ‘survived’ is a qualitative variable

ST E VEN S I N ST I T U T E oƒ T EC H N O L O G Y
taking 1 (yes) or 0 (no) in your code. You may see a tree similar to this one (the actual structure
and size of your tree can be different):

Step 3: Use the GridSearchCV() function to find the best parameter max_leaf_nodes to prune the
tree. Plot the pruned tree which shall be smaller than the tree you obtained in Step 2.

Step 4: For the pruned tree, report its accuracy on the test set for the following:

percent survivors correctly predicted (on test set)

percent fatalities correctly predicted (on test set)

Step 5: Use the RandomForestClassifier() function to train a random forest using the value of
max_leaf_nodes you found in Step 3. You can set n_estimators as 50. Report the accuracy of
random forest on the test set for the following:

percent survivors correctly predicted (on test set)

percent fatalities correctly predicted (on test set)

Check whether there is improvement as compared to a single tree obtained in Step 4.

ST E VEN S I N ST I T U T E oƒ T EC H N O L O G Y

Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
HUAWEI Final Written Exam 3333
50% (2)
HUAWEI Final Written Exam 3333
13 pages
Capstone - Project - Final - Report - Churn - Prediction
100% (3)
Capstone - Project - Final - Report - Churn - Prediction
28 pages
DWM_EXP4
No ratings yet
DWM_EXP4
5 pages
P02 DecisionTrees SolutionNotes
No ratings yet
P02 DecisionTrees SolutionNotes
3 pages
DWM_EXP3_63
No ratings yet
DWM_EXP3_63
7 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Data Mining Assignment No. 1
No ratings yet
Data Mining Assignment No. 1
7 pages
4.1.3.5 Lab - Decision Tree Classification
No ratings yet
4.1.3.5 Lab - Decision Tree Classification
11 pages
DIT865 2018 Mar Solution
No ratings yet
DIT865 2018 Mar Solution
9 pages
Data Science and ML - End Term
No ratings yet
Data Science and ML - End Term
4 pages
DT RF
No ratings yet
DT RF
7 pages
Ass3 v1
No ratings yet
Ass3 v1
4 pages
UCS622
No ratings yet
UCS622
1 page
Machine Learning CA 2
No ratings yet
Machine Learning CA 2
19 pages
AML ML Practical List
No ratings yet
AML ML Practical List
10 pages
P02 DecisionTrees
No ratings yet
P02 DecisionTrees
2 pages
practical 15 python
No ratings yet
practical 15 python
6 pages
MachineLearning MidTerm UMT Spring 2021
100% (1)
MachineLearning MidTerm UMT Spring 2021
12 pages
Merging Result-Merged
No ratings yet
Merging Result-Merged
14 pages
Soft Computing Lab Practical Assignment 2
No ratings yet
Soft Computing Lab Practical Assignment 2
10 pages
Skit Learn Cheatsheet
No ratings yet
Skit Learn Cheatsheet
11 pages
Decision Tree
No ratings yet
Decision Tree
4 pages
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
No ratings yet
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
20 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
10 pages
ML Important Questions.docx
No ratings yet
ML Important Questions.docx
7 pages
Project Occupancy Alfonso Vicente Aragues
No ratings yet
Project Occupancy Alfonso Vicente Aragues
18 pages
Machine learning with Titanic dataset tutorial
No ratings yet
Machine learning with Titanic dataset tutorial
7 pages
8 To 12 Jaimeen
No ratings yet
8 To 12 Jaimeen
34 pages
CS2B Nov 24 QP
No ratings yet
CS2B Nov 24 QP
5 pages
ML New record (5)
No ratings yet
ML New record (5)
51 pages
Machine Learning
100% (1)
Machine Learning
62 pages
Unit 3 Classification - Dr. Vidyut D
No ratings yet
Unit 3 Classification - Dr. Vidyut D
72 pages
COL774_A5
No ratings yet
COL774_A5
6 pages
Lecture 7.2 - DTC Algorithm Implementation
No ratings yet
Lecture 7.2 - DTC Algorithm Implementation
7 pages
ML Unit-Ii Notes
No ratings yet
ML Unit-Ii Notes
17 pages
Machine Learning Practical
No ratings yet
Machine Learning Practical
59 pages
Classification Problems
No ratings yet
Classification Problems
53 pages
Titanic
No ratings yet
Titanic
1 page
MLP Question Bank of AI and ML and NLP
No ratings yet
MLP Question Bank of AI and ML and NLP
7 pages
Decision Trees in Sklearn Decision Trees in Sklearn
No ratings yet
Decision Trees in Sklearn Decision Trees in Sklearn
7 pages
Types of Pruning Techniques
No ratings yet
Types of Pruning Techniques
10 pages
National Institute of Technology Rourkela: Department of Computer Science and Engineering
No ratings yet
National Institute of Technology Rourkela: Department of Computer Science and Engineering
2 pages
dwm_06
No ratings yet
dwm_06
4 pages
COL 774 - Machine Learning - Assignment 5
No ratings yet
COL 774 - Machine Learning - Assignment 5
6 pages
AI+and+ML Assigment 03
No ratings yet
AI+and+ML Assigment 03
4 pages
ML_Prac1-10
No ratings yet
ML_Prac1-10
32 pages
CE802 Report
No ratings yet
CE802 Report
7 pages
Expt7_ML2025_250306_143857
No ratings yet
Expt7_ML2025_250306_143857
5 pages
PUT MLT
No ratings yet
PUT MLT
12 pages
Ml Ese 031223 Openbook
No ratings yet
Ml Ese 031223 Openbook
4 pages
Slide 3
No ratings yet
Slide 3
23 pages
Experiment No 4 Vanraj
No ratings yet
Experiment No 4 Vanraj
2 pages
Title: Implementation of Decision Tree Classification: Department of Computer Science and Engineering
No ratings yet
Title: Implementation of Decision Tree Classification: Department of Computer Science and Engineering
8 pages
Ifjo 320 Fy 98324 Fo 3 F 2 Ifr
No ratings yet
Ifjo 320 Fy 98324 Fo 3 F 2 Ifr
6 pages
ML Mid Sem Sep2023 Paper
No ratings yet
ML Mid Sem Sep2023 Paper
3 pages
Unit 2
No ratings yet
Unit 2
11 pages
تمارین درس داده کاوی فصل طبقه بندی
No ratings yet
تمارین درس داده کاوی فصل طبقه بندی
7 pages
MISY 631 Final Review Calculators Will Be Provided For The Exam
No ratings yet
MISY 631 Final Review Calculators Will Be Provided For The Exam
9 pages
601 sp09 Midterm Solutions
No ratings yet
601 sp09 Midterm Solutions
14 pages
Redis Certified Developer - Exam Practice Tests
From Everand
Redis Certified Developer - Exam Practice Tests
Cristian Scutaru
No ratings yet
fake review detection
No ratings yet
fake review detection
9 pages
JPNR 2022 S01 126
No ratings yet
JPNR 2022 S01 126
8 pages
Reading 4 Big Data Projects
No ratings yet
Reading 4 Big Data Projects
4 pages
Evaluation Metrics-ML
No ratings yet
Evaluation Metrics-ML
16 pages
Web Image Re-Ranking Using Query-Specific Semantic Signatures
No ratings yet
Web Image Re-Ranking Using Query-Specific Semantic Signatures
3 pages
Paper 6
No ratings yet
Paper 6
8 pages
I Unit
No ratings yet
I Unit
43 pages
Locality-Sensitive Binary Codes From Shift-Invariant Kernels
No ratings yet
Locality-Sensitive Binary Codes From Shift-Invariant Kernels
9 pages
Customer Classification by Past Purchase Data Analysis
No ratings yet
Customer Classification by Past Purchase Data Analysis
4 pages
Unit-1 Chapter 1
No ratings yet
Unit-1 Chapter 1
44 pages
Name:Fedrick Samuel W Reg No: 19MIS1112 Course: Machine Learning (SWE4012) Slot: L11 + L12 Faculty: Dr.M. Premalatha
No ratings yet
Name:Fedrick Samuel W Reg No: 19MIS1112 Course: Machine Learning (SWE4012) Slot: L11 + L12 Faculty: Dr.M. Premalatha
30 pages
applsci-15-05930
No ratings yet
applsci-15-05930
29 pages
Sentiment Analysis of Nepali Text Using Naïve Bayes Under The Supervision of
No ratings yet
Sentiment Analysis of Nepali Text Using Naïve Bayes Under The Supervision of
36 pages
Asss
100% (4)
Asss
2 pages
sample paper
No ratings yet
sample paper
12 pages
Early Predicting of Students Performance in Higher
No ratings yet
Early Predicting of Students Performance in Higher
12 pages
Lecture9_ML-Algorithms
No ratings yet
Lecture9_ML-Algorithms
22 pages
License Plate Detection Using YOLOv8
No ratings yet
License Plate Detection Using YOLOv8
36 pages
ML notes
No ratings yet
ML notes
16 pages
Content Based ML Repo
No ratings yet
Content Based ML Repo
36 pages
DCNN-a novel binary and multi-class network intrusion detection model via deep convolutional neural network
No ratings yet
DCNN-a novel binary and multi-class network intrusion detection model via deep convolutional neural network
23 pages
Yolo-V7 Object Detection Assessment
No ratings yet
Yolo-V7 Object Detection Assessment
15 pages
FYPppt
No ratings yet
FYPppt
40 pages
Machine Learning Based Education Data Mining Through Student Session Streams
No ratings yet
Machine Learning Based Education Data Mining Through Student Session Streams
12 pages
An Anaya
No ratings yet
An Anaya
40 pages
Integrating_Handcrafted_Features_with_Machine_Lear
No ratings yet
Integrating_Handcrafted_Features_with_Machine_Lear
13 pages
Ijarcce 2023 12530
No ratings yet
Ijarcce 2023 12530
7 pages
SonarQube Rules
No ratings yet
SonarQube Rules
11 pages
6D286504-DE31-11EF-98E1-880324B5EAEC
No ratings yet
6D286504-DE31-11EF-98E1-880324B5EAEC
21 pages