Week 7 solution
1. Which of the following measures is NOT used for attribute selection in decision trees?
A) Entropy
B) Information Gain
C) Chi-Square
D) K-Nearest Neighbors (KNN)
Answer: D) K-Nearest Neighbors (KNN)
Explanation:
Decision trees use measures like Entropy, Information Gain, and Chi-Square to determine the best
split at each node. KNN is a classification algorithm and is not used for attribute selection in decision
trees.
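As a quick illustration (not part of the original solution), the sketch below shows how the entropy measure used for attribute selection can be computed; the class counts are made up for the example:
```python
# A minimal sketch of the entropy measure decision trees use to score
# candidate splits. The class counts below are hypothetical.
from math import log2

def entropy(counts):
    """Shannon entropy (in bits) of a list of class counts."""
    total = sum(counts)
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)

print(entropy([5, 5]))   # 1.0 -- a maximally impure node
print(entropy([10, 0]))  # 0.0 -- a pure node (Python may print -0.0)
```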
2. What is the key difference between a binary split and a multiway split in decision trees?
A) Binary splits divide the data into two groups, while multiway splits create multiple child nodes.
B) Multiway splits are used only for numerical attributes, whereas binary splits are for categorical
attributes.
C) Binary splits use entropy, while multiway splits use Gini index.
D) Multiway splits always result in better accuracy than binary splits.
Answer: A) Binary splits divide the data into two groups, while multiway splits create multiple child
nodes.
Explanation:
Binary splits create two branches from a node, dividing the data into two groups.
Multiway splits allow multiple branches, creating more than two child nodes.
Both binary and multiway splits can be used for numerical or categorical attributes,
depending on the decision tree implementation.
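As an illustration of the difference (with made-up data, not from the original solution), the sketch below splits the same categorical attribute both ways:
```python
# Hypothetical rows of (attribute value, class label).
rows = [("red", 1), ("blue", 0), ("green", 1), ("red", 0), ("blue", 0)]

# Multiway split: one child node per attribute value.
multiway = {}
for color, label in rows:
    multiway.setdefault(color, []).append(label)
print(multiway)  # {'red': [1, 0], 'blue': [0, 0], 'green': [1]}

# Binary split: the same values grouped into two sets (CART-style).
left = [label for color, label in rows if color == "red"]
right = [label for color, label in rows if color != "red"]
print(left, right)  # [1, 0] [0, 1, 0]
```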
3. In decision tree pruning, which technique removes unnecessary nodes AFTER the tree has
been fully grown?
A) Pre-Pruning
B) Post-Pruning
C) Overfitting Pruning
D) Random Forest
Answer: B) Post-Pruning
Explanation:
Post-pruning (for example, Reduced Error Pruning) removes nodes after the full tree has
been built.
It evaluates subtrees and removes branches that do not significantly improve accuracy,
helping to reduce overfitting.
Pre-pruning, in contrast, stops tree growth early based on conditions like minimum samples
per split.
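Reduced Error Pruning itself is not built into scikit-learn, but cost-complexity pruning (the ccp_alpha parameter) follows the same grow-then-prune idea; the sketch below is one possible illustration, not part of the original solution (the dataset and alpha value are arbitrary choices):
```python
# Post-pruning sketch: grow a full tree, then compare it with a tree
# pruned via cost-complexity pruning (ccp_alpha).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

full = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)
pruned = DecisionTreeClassifier(ccp_alpha=0.02, random_state=0).fit(X_tr, y_tr)

print(full.tree_.node_count, pruned.tree_.node_count)    # pruned tree is smaller
print(full.score(X_te, y_te), pruned.score(X_te, y_te))  # compare test accuracy
```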
4. How is the Chi-Square test used for decision tree splitting?
A) It calculates entropy to determine the best split.
B) It measures the statistical significance of differences between parent and child nodes.
C) It ensures all splits are binary.
D) It helps reduce the number of categorical features.
Answer: B) It measures the statistical significance of differences between parent and child nodes.
Explanation:
The Chi-Square test measures whether a split significantly improves classification by checking
differences in observed vs. expected frequencies of target variables. A higher Chi-Square value
means a better split.
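As a sketch of the same idea (hypothetical counts, not part of the original solution), SciPy's chi-square test can be applied to the contingency table of class counts in the child nodes:
```python
from scipy.stats import chi2_contingency

# Rows = child nodes after a candidate split, columns = class counts.
observed = [[30, 10],   # left child: mostly class A
            [5, 25]]    # right child: mostly class B

chi2, p_value, dof, expected = chi2_contingency(observed)
print(chi2, p_value)  # a large chi2 / small p-value suggests a significant split
```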
5. What is the main disadvantage of decision trees compared to other machine learning
algorithms?
A) Decision trees are difficult to interpret.
B) They always underfit the data.
C) They are prone to overfitting, especially with deep trees.
D) They require extensive data cleaning.
Answer: C) They are prone to overfitting, especially with deep trees.
Explanation:
Decision trees tend to overfit when they become too complex, learning noise in the training
data.
This issue can be addressed using pruning or ensemble methods like Random Forest to
improve generalization.
MCQs (Decision Tree)
1. Given entropy of parent = 1, weights of children = (3/4, 1/4), and entropies of children =
(0.9, 0). What is the information gain?
a) 0.675
b) 0.75
c) 0.325
d) 0.1
Ans: c)
Explanation: Information Gain = Entropy(Parent) − ∑ (weight × Entropy(Child))
= 1 − (3/4 × 0.9 + 1/4 × 0) = 1 − 0.675 = 0.325.
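The arithmetic can be checked in a couple of lines of Python (an illustration, not part of the original solution):
```python
parent_entropy = 1.0
weights = (3/4, 1/4)
child_entropies = (0.9, 0.0)

info_gain = parent_entropy - sum(w * e for w, e in zip(weights, child_entropies))
print(round(info_gain, 3))  # 0.325 -> option c)
```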
2. If a dataset has three classes with probabilities 0.2, 0.3, and 0.5, what is the Gini
index?
a) 0.50
b) 0.62
c) 0.42
d) 0.38
Ans: b)
Explanation: Gini = 1 − ((0.2)² + (0.3)² + (0.5)²) = 1 − (0.04 + 0.09 + 0.25) = 1 − 0.38 = 0.62
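Again, the computation is easy to verify in Python (an illustration, not part of the original solution):
```python
probs = (0.2, 0.3, 0.5)
gini = 1 - sum(p * p for p in probs)
print(round(gini, 2))  # 0.62 -> option b)
```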