Assignment 8 (Sol.)
Introduction to Machine Learning
Prof. B. Ravindran
1. Which of the following is/are true about bagging?
Sol. a, c, d
In bagging we combine the outputs of multiple classifiers trained on different samples of the
training data. This helps in reducing overall variance. Due to the reduction in variance,
normally unstable classifiers can be made robust with the help of bagging.
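For concreteness, below is a minimal sketch of bagging using scikit-learn; the synthetic dataset, base learner, and parameter values are illustrative assumptions and are not part of the assignment.

```python
# Bagging sketch (illustrative): each tree is trained on a bootstrap sample of
# the training data and the predictions are combined by majority vote, which
# reduces the variance of an unstable base learner.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

bagger = BaggingClassifier(
    DecisionTreeClassifier(),  # unstable (high-variance) base classifier
    n_estimators=50,           # number of bootstrap samples / trees
    bootstrap=True,            # sample the training data with replacement
    random_state=0,
)
bagger.fit(X, y)
print(bagger.score(X, y))
```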
2. Which among the following prevents overfitting when we perform bagging?
Sol. b
Over-training (which leads to overfitting) is generally not a problem with weak classifiers. For example, a decision stump, i.e., a decision tree with only one node (the root node), has essentially no scope for overfitting. This helps the combined classifier, which aggregates the outputs of such weak classifiers, avoid overfitting.
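As an illustration (not part of the original solution), a decision stump can be written as a depth-1 decision tree:

```python
# Decision stump sketch: with a single split at the root, the model has almost
# no capacity to overfit the training data.
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

stump = DecisionTreeClassifier(max_depth=1)  # root node with one split only
stump.fit(X, y)
print(stump.score(X, y))
```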
3. Consider an alternative way of learning a Random Forest where instead of randomly sampling
the attributes at each node, we sample a subset of attributes for each tree and build the tree
on these features. Would you prefer this method over the original or not, and why?
(a) Yes, because it reduces the correlation between the resultant trees
(b) Yes, because it reduces the time taken to build the trees due to the decrease in the
attributes considered
(c) No, because many of the trees will be bad classifiers due to the absence of critical features
considered in the construction of some of the trees
Sol. c
The availability of all attributes (at possibly differing levels of the tree) allows the original random forest approach to construct the combined classifier from relatively good individual classifiers. In the proposed approach, many of the constituent classifiers will exhibit very poor performance, which degrades the performance of the overall random forest classifier.
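A rough sketch of the two approaches is given below, assuming scikit-learn and an illustrative synthetic dataset with only a few informative features; the per-tree variant is a hand-rolled approximation of the scheme described in the question.

```python
# Contrast: standard random forest (features re-sampled at every split) vs.
# the proposed variant (a fixed random feature subset per tree).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, n_informative=3,
                           random_state=0)

# Standard approach: a random subset of features is considered at each node,
# so every tree still gets to use the informative features at some level.
rf = RandomForestClassifier(n_estimators=50, max_features="sqrt", random_state=0)
rf.fit(X, y)

# Proposed variant: each tree only ever sees a fixed random subset of features.
# A tree whose subset misses the informative features is a poor classifier.
rng = np.random.default_rng(0)
trees, subsets = [], []
for _ in range(50):
    idx = rng.choice(X.shape[1], size=4, replace=False)
    trees.append(DecisionTreeClassifier(random_state=0).fit(X[:, idx], y))
    subsets.append(idx)

# Majority vote over the per-tree predictions.
votes = np.stack([t.predict(X[:, idx]) for t, idx in zip(trees, subsets)])
variant_pred = (votes.mean(axis=0) > 0.5).astype(int)
```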
4. In case of limited training data, which technique, bagging or stacking, would be preferred, and
why?
(a) Bagging, because we can combine as many classifiers as we want by training each on a
different sample of the training data
(b) Bagging, because we use the same classification algorithms on all samples of the training
data
(c) Stacking, because each classifier is trained on all of the available data
(d) Stacking, because we can use different classification algorithms on the training data
Sol. c
When data is at a premium, we would ideally prefer to train all models on all of the available
training data.
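As an illustrative sketch (assuming scikit-learn's StackingClassifier; the particular base learners and dataset are arbitrary choices), every base learner is trained on all of the available data and a meta-learner combines their outputs:

```python
# Stacking sketch: base learners each see the full training set; a meta-learner
# (here logistic regression) is trained to combine their predictions.
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

stack = StackingClassifier(
    estimators=[("tree", DecisionTreeClassifier()), ("svm", SVC())],
    final_estimator=LogisticRegression(),  # meta-learner on base-learner outputs
)
stack.fit(X, y)
print(stack.score(X, y))
```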
5. Is AdaBoost sensitive to outliers?
(a) Yes
(b) No
Sol. a
See solution to question 7.
6. Considering the AdaBoost algorithm, which among the following statements is true?
(a) In each stage, we try to train a classifier which makes accurate predictions on any subset
of the data points where the subset size is at least half the size of the data set
(b) In each stage, we try to train a classifier which makes accurate predictions on a subset of
the data points where the subset contains more of the data points which were misclassified
in earlier stages
(c) The weight assigned to an individual classifier depends upon the number of data points
correctly classified by the classifier
(d) The weight assigned to an individual classifier depends upon the weighted sum error of
misclassified points for that classifier
Sol. b, d
The classifier chosen at each stage is the one that minimises the weighted error at that stage.
The weight of a point is high if it has been misclassified repeatedly in the previous iterations.
Thus, the weighted error is minimised primarily by trying to correctly predict the points which
were misclassified in earlier iterations. Also, each classifier is assigned a weight depending
upon its accuracy, which in turn depends upon the weighted error (for that classifier).
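A minimal sketch of one AdaBoost stage is given below, using the standard weight-update formulas; it assumes labels in {-1, +1} and a weighted error strictly between 0 and 1.

```python
# One AdaBoost stage: compute the weighted error eps, the classifier weight
# alpha, and the re-weighted (normalised) point weights.
import numpy as np

def adaboost_update(w, y_true, y_pred):
    """Return the classifier weight and updated point weights for one stage."""
    eps = np.sum(w * (y_pred != y_true)) / np.sum(w)  # weighted error
    alpha = 0.5 * np.log((1 - eps) / eps)             # classifier weight
    w = w * np.exp(-alpha * y_true * y_pred)          # up-weight misclassified points
    return alpha, w / np.sum(w)                       # normalise the weights

# Tiny usage example with one misclassified point (index 1).
w = np.full(5, 0.2)
y_true = np.array([1, 1, -1, -1, 1])
y_pred = np.array([1, -1, -1, -1, 1])
alpha, w = adaboost_update(w, y_true, y_pred)
```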
7. In AdaBoost, we re-weight points giving points misclassified in previous iterations more weight.
Suppose we introduced a limit or cap on the weight that any point can take (for example, say
we introduce a restriction that prevents any point’s weight from exceeding a value of 10).
Which among the following would be an effect of such a modification?
(a) We may observe the performance of the classifier degrade as the number of stages increases
(b) It makes the final classifier robust to outliers
(c) It may result in lower overall performance
Sol. b, c
Outliers tend to get misclassified. As the number of iterations increases, the weights corresponding
to outlier points can become very large, causing subsequent classifier models to focus on
classifying the outlier points correctly. This generally has an adverse effect on the overall
classifier. Restricting the weights is one way of mitigating this problem. However, it can also
lower the performance of the classifier.
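One possible reading of the proposed modification is sketched below on unnormalised weights; whether and where normalisation is applied (and hence what a cap of 10 means exactly) is an assumption here, not something specified in the question.

```python
# AdaBoost stage with a cap on the (unnormalised) point weights, so outlier
# weights cannot grow without bound across iterations.
import numpy as np

def adaboost_update_capped(w, y_true, y_pred, cap=10.0):
    """One AdaBoost stage with capped point weights (illustrative variant)."""
    eps = np.sum(w * (y_pred != y_true)) / np.sum(w)
    alpha = 0.5 * np.log((1 - eps) / eps)
    w = w * np.exp(-alpha * y_true * y_pred)
    return alpha, np.minimum(w, cap)  # cap limits the influence of outliers
```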
8. Which among the following are some of the differences between bagging and boosting?
(a) In bagging we use the same classification algorithm for training on each sample of the data,
whereas in boosting, we use different classification algorithms on the different training
data samples
(b) Bagging is easy to parallelise whereas boosting is inherently a sequential process
(c) In bagging we typically use sampling with replacement whereas in boosting, we typically
use weighted sampling techniques
(d) In comparison with the performance of a base classifier on a particular data set, bagging
will generally not increase the error whereas boosting may lead to an increase in the
error
Sol. b, c, d
With regard to the last option, boosting can result in an increase in error over a base classifier
due to over-emphasis on noisy data points in later iterations.
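An illustrative contrast using scikit-learn (the dataset and parameter values are arbitrary): the members of a bagged ensemble are independent and can be fit in parallel, while the stages of boosting depend on the weights produced by earlier stages and must be fit sequentially.

```python
# Bagging vs. boosting: parallel, independent fits vs. sequential, dependent stages.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# Bagging: bootstrap samples are drawn with replacement; trees fit in parallel.
bag = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50, n_jobs=-1)
bag.fit(X, y)

# Boosting: each stage reweights the points misclassified by earlier stages,
# so the stages cannot be trained independently of one another.
boost = AdaBoostClassifier(n_estimators=50)
boost.fit(X, y)
```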