Week7 Assignment
Week7 Assignment
1. Which of the following option is / are correct regarding the benefits of ensemble model?
1. Better performance
2. More generalized model
3. Better interpretability
A) 1 and 3
B) 2 and 3
C) 1 and 2
D) 1, 2 and 3
Answer: C (1 and 2 are the benefits of ensemble modelling. Option 3 is incorrect because
when we ensemble multiple models, we lose interpretability of the models).
A) We may observe the performance of the classifier reduce as the number of stages
increase.
B) It makes the final classifier robust to outliers.
C) It may result in lower overall performance.
D) None of these.
Answer: B, C (Outliers tend to get misclassified. As the number of iterations increase, the
weight corresponding to outlier points can become very large resulting in subsequent
classifier models trying to classify the outlier points correctly. This generally has an adverse
effect on the overall classifier. Restricting the weights is one way of mitigating this
problem. However, this can also lower the performance of the classifier).
3. Which among the following are some of the differences between bagging and boosting?
A) In bagging we use the same classification algorithm for training on each sample of
the data, whereas in boosting, we use different classification algorithms on the
different training data samples.
NPTEL Online Certification Courses
Indian Institute of Technology Kharagpur
A) 3
B) 4
C) 5
D) 6
5. Considering the AdaBoost algorithm, which among the following statements is true?
A) In each stage, we try to train a classifier which makes accurate predictions on any
subset of the data points where the subset size is at least half the size of the data set.
B) In each stage, we try to train a classifier which makes accurate predictions on
a subset of the data points where the subset contains more of the data points which
were misclassified in earlier stages.
C) The weight assigned to an individual classifier depends upon the number of data
points correctly classified by the classifier.
D) The weight assigned to an individual classifier depends upon the weighted sum
error of misclassified points for that classifier.
Answer: B, D (The classifier chosen at each stage is the one that minimizes the weighted
error at that stage. The weight of a point is high if it has been misclassified more number
of times in the previous iterations. Thus, maximum error minimization is performed by
trying to correctly predict the points which were misclassified in earlier iterations. Also,
weights are assigned to the classifiers depending upon their accuracy which again depends
upon the weighted error (for that classifier).
NPTEL Online Certification Courses
Indian Institute of Technology Kharagpur
6. Suppose the VC dimension of a hypothesis space is 6. Which of the following are true?
7. Ensembles will yield bad results when there is a significant diversity among the models.
Write True or False.
A) True
B) False
Answer: B
Ensemble is a collection of diverse set of learners to improve the stability and the
performance of the algorithm. So, more diverse the models are, the better will be the
performance of ensemble.
Answer: D.
Decision trees do not aggregate the results of multiple trees, so it is not an ensemble
algorithm.