What Is An ID3 Algorithm?
H(S) = ∑_{c ∈ C} −p(c) log2 p(c)
Where,
● S - The current dataset for which entropy is being calculated (it changes on every iteration of the ID3 algorithm).
● C - The set of classes in S (for example, C = {yes, no}).
● p(c) - The proportion of the number of elements in class c to the number of elements in set S.
In ID3, entropy is calculated for each remaining attribute. The attribute with the smallest
entropy is used to split the set S on that particular iteration.
Entropy = 0 implies the set is of a pure class, meaning all of its elements belong to the same category.
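To make the definition concrete, here is a minimal Python sketch of the entropy formula (the function name `entropy` is just for illustration):

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy H(S) of a collection of class labels."""
    n = len(labels)
    return sum(-(count / n) * log2(count / n)
               for count in Counter(labels).values())

print(entropy(["yes", "yes", "yes"]))  # pure class -> 0.0
print(entropy(["yes", "no"]))          # 50/50 split -> 1.0
```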
Information Gain IG(A) tells us how much uncertainty in S was reduced after splitting set S
on attribute A. The mathematical representation of information gain is:
IG(A, S) = H(S) − ∑_{t ∈ T} p(t) H(t)
Where,
● H(S) - Entropy of set S.
● T - The subsets created from splitting set S by attribute A such that
S = ⋃_{t ∈ T} t
● p(t) - The proportion of the number of elements in t to the number of elements in set
S.
● H(t) - Entropy of subset t.
In ID3, information gain can be calculated (instead of entropy) for each remaining attribute.
The attribute with the largest information gain is used to split the set S on that particular
iteration.
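The same formula can be sketched in Python. The `information_gain` function below groups the labels by the value of attribute A and subtracts the weighted subset entropies from H(S) (the names are illustrative):

```python
from collections import Counter, defaultdict
from math import log2

def entropy(labels):
    """Shannon entropy H(S) of a collection of class labels."""
    n = len(labels)
    return sum(-(c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(values, labels):
    """IG(A, S) = H(S) - sum over subsets t of p(t) * H(t),
    where each subset t holds the labels sharing one value of attribute A."""
    subsets = defaultdict(list)
    for value, label in zip(values, labels):
        subsets[value].append(label)
    n = len(labels)
    remainder = sum(len(t) / n * entropy(t) for t in subsets.values())
    return entropy(labels) - remainder

# A perfectly informative attribute removes all uncertainty:
wind = ["weak", "weak", "strong", "strong"]
play = ["yes", "yes", "no", "no"]
print(information_gain(wind, play))  # -> 1.0
```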
Here, the dataset has binary classes (yes and no), where 9 out of 14 examples are "yes" and 5 out of 14 are "no".
The complete entropy of the dataset, using the formula H(S) = ∑_{c ∈ C} −p(c) log2 p(c), is:
H(S) = - p(yes) * log2(p(yes)) - p(no) * log2(p(no))
= - (9/14) * log2(9/14) - (5/14) * log2(5/14)
= 0.41 + 0.53
= 0.94
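The arithmetic above can be checked directly:

```python
from math import log2

# Entropy of the full dataset: 9 "yes" and 5 "no" out of 14 examples
h_s = -(9 / 14) * log2(9 / 14) - (5 / 14) * log2(5 / 14)
print(round(h_s, 2))  # -> 0.94
```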
For each attribute of the dataset, let's follow step 2 of the pseudocode:
First Attribute - Outlook
H(Outlook=sunny) = -(2/5)*log2(2/5) - (3/5)*log2(3/5) = 0.971
H(Outlook=overcast) = -(4/4)*log2(4/4) - 0 = 0
H(Outlook=rain) = -(3/5)*log2(3/5) - (2/5)*log2(2/5) = 0.971
Average entropy information for Outlook:
I(Outlook) = (5/14)*0.971 + (4/14)*0 + (5/14)*0.971
= 0.693
Information Gain = H(S) - I(Outlook)
= 0.94 - 0.693
= 0.247
Second Attribute - Temperature
H(Temperature=hot) = -(2/4)*log2(2/4) - (2/4)*log2(2/4) = 1
H(Temperature=mild) = -(4/6)*log2(4/6) - (2/6)*log2(2/6) = 0.918
H(Temperature=cool) = -(3/4)*log2(3/4) - (1/4)*log2(1/4) = 0.811
Average entropy information for Temperature:
I(Temperature) = (4/14)*1 + (6/14)*0.918 + (4/14)*0.811
= 0.9108
Information Gain = H(S) - I(Temperature)
= 0.94 - 0.9108
= 0.0292
Third Attribute - Humidity
H(Humidity=high) = -(3/7)*log2(3/7) - (4/7)*log2(4/7) = 0.985
H(Humidity=normal) = -(6/7)*log2(6/7) - (1/7)*log2(1/7) = 0.591
Average entropy information for Humidity:
I(Humidity) = (7/14)*0.985 + (7/14)*0.591
= 0.788
Information Gain = H(S) - I(Humidity)
= 0.94 - 0.788
= 0.152
Fourth Attribute - Wind
H(Wind=weak) = -(6/8)*log2(6/8) - (2/8)*log2(2/8) = 0.811
H(Wind=strong) = -(3/6)*log2(3/6) - (3/6)*log2(3/6) = 1
Average entropy information for Wind:
I(Wind) = (8/14)*0.811 + (6/14)*1
= 0.892
Information Gain = H(S) - I(Wind)
= 0.94 - 0.892
= 0.048
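The four gains can be reproduced programmatically. The sketch below assumes the classic 14-row play-tennis dataset, whose class counts match those used above; differences in the last decimal place come from rounding intermediate entropies by hand.

```python
from collections import Counter, defaultdict
from math import log2

def entropy(labels):
    n = len(labels)
    return sum(-(c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(values, labels):
    subsets = defaultdict(list)
    for v, lab in zip(values, labels):
        subsets[v].append(lab)
    n = len(labels)
    return entropy(labels) - sum(len(t) / n * entropy(t) for t in subsets.values())

# Classic play-tennis dataset (rows 1-14): outlook, temperature, humidity, wind, play
rows = [
    ("sunny",    "hot",  "high",   "weak",   "no"),
    ("sunny",    "hot",  "high",   "strong", "no"),
    ("overcast", "hot",  "high",   "weak",   "yes"),
    ("rain",     "mild", "high",   "weak",   "yes"),
    ("rain",     "cool", "normal", "weak",   "yes"),
    ("rain",     "cool", "normal", "strong", "no"),
    ("overcast", "cool", "normal", "strong", "yes"),
    ("sunny",    "mild", "high",   "weak",   "no"),
    ("sunny",    "cool", "normal", "weak",   "yes"),
    ("rain",     "mild", "normal", "weak",   "yes"),
    ("sunny",    "mild", "normal", "strong", "yes"),
    ("overcast", "mild", "high",   "strong", "yes"),
    ("overcast", "hot",  "normal", "weak",   "yes"),
    ("rain",     "mild", "high",   "strong", "no"),
]
labels = [r[4] for r in rows]
gains = {name: information_gain([r[i] for r in rows], labels)
         for i, name in enumerate(["Outlook", "Temperature", "Humidity", "Wind"])}
for name, ig in gains.items():
    print(f"{name}: {ig:.3f}")
# Outlook: 0.247, Temperature: 0.029, Humidity: 0.152, Wind: 0.048
```

Outlook has the largest gain, confirming the choice of root attribute.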
Here, the attribute with the maximum information gain is Outlook, so it becomes the root of the
decision tree.
When Outlook = overcast, the subset is a pure class (all "yes").
Now, we repeat the same procedure for the rows with Outlook = Sunny, and then for the rows with
Outlook = Rain.
Now, finding the best attribute for splitting the data with Outlook = Sunny (dataset rows
[1, 2, 8, 9, 11]):
Complete entropy of the Sunny subset (2 "yes", 3 "no"):
H(S_sunny) = -(2/5)*log2(2/5) - (3/5)*log2(3/5)
= 0.971
First Attribute - Temperature
H(Temperature=hot) = -0 - (2/2)*log2(2/2) = 0
H(Temperature=mild) = -(1/2)*log2(1/2) - (1/2)*log2(1/2) = 1
H(Temperature=cool) = -(1/1)*log2(1/1) - 0 = 0
Average entropy information for Temperature:
I(Temperature) = (2/5)*0 + (2/5)*1 + (1/5)*0
= 0.4
Information Gain = 0.971 - 0.4
= 0.571
Second Attribute - Humidity
H(Humidity=high) = -0 - (3/3)*log2(3/3) = 0
H(Humidity=normal) = -(2/2)*log2(2/2) - 0 = 0
Average entropy information for Humidity:
I(Humidity) = (3/5)*0 + (2/5)*0
= 0
Information Gain = 0.971 - 0
= 0.971
Third Attribute - Wind
H(Wind=weak) = -(1/3)*log2(1/3) - (2/3)*log2(2/3) = 0.918
H(Wind=strong) = -(1/2)*log2(1/2) - (1/2)*log2(1/2) = 1
Average entropy information for Wind:
I(Wind) = (3/5)*0.918 + (2/5)*1
= 0.9508
Information Gain = 0.971 - 0.9508
= 0.0202
Here, the attribute with the maximum information gain is Humidity, so the Sunny branch splits on
Humidity. When Outlook = Sunny and Humidity = High, the subset is a pure class of category "no".
And when Outlook = Sunny and Humidity = Normal, it is again a pure class, of category "yes".
Therefore, no further calculations are needed on this branch.
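Restricting the gain computation to the five Sunny rows confirms that Humidity wins on this branch (row values assume the classic play-tennis dataset):

```python
from collections import Counter, defaultdict
from math import log2

def entropy(labels):
    n = len(labels)
    return sum(-(c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(values, labels):
    subsets = defaultdict(list)
    for v, lab in zip(values, labels):
        subsets[v].append(lab)
    n = len(labels)
    return entropy(labels) - sum(len(t) / n * entropy(t) for t in subsets.values())

# Sunny subset (rows 1, 2, 8, 9, 11): temperature, humidity, wind, play
sunny = [
    ("hot",  "high",   "weak",   "no"),
    ("hot",  "high",   "strong", "no"),
    ("mild", "high",   "weak",   "no"),
    ("cool", "normal", "weak",   "yes"),
    ("mild", "normal", "strong", "yes"),
]
labels = [r[3] for r in sunny]
for i, name in enumerate(["Temperature", "Humidity", "Wind"]):
    print(f"{name}: {information_gain([r[i] for r in sunny], labels):.3f}")
# Temperature: 0.571, Humidity: 0.971, Wind: 0.020
```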
Now, finding the best attribute for splitting the data with Outlook = Rain (dataset rows
[4, 5, 6, 10, 14]):
Complete entropy of the Rain subset (3 "yes", 2 "no"):
H(S_rain) = -(3/5)*log2(3/5) - (2/5)*log2(2/5)
= 0.971
First Attribute - Temperature
H(Temperature=mild) = -(2/3)*log2(2/3) - (1/3)*log2(1/3) = 0.918
H(Temperature=cool) = -(1/2)*log2(1/2) - (1/2)*log2(1/2) = 1
Average entropy information for Temperature:
I(Temperature) = (2/5)*1 + (3/5)*0.918
= 0.9508
Information Gain = 0.971 - 0.9508
= 0.0202
Second Attribute - Humidity
H(Humidity=high) = -(1/2)*log2(1/2) - (1/2)*log2(1/2) = 1
H(Humidity=normal) = -(2/3)*log2(2/3) - (1/3)*log2(1/3) = 0.918
Average entropy information for Humidity:
I(Humidity) = (2/5)*1 + (3/5)*0.918
= 0.9508
Information Gain = 0.971 - 0.9508
= 0.0202
Third Attribute - Wind
H(Wind=weak) = -(3/3)*log2(3/3) - 0 = 0
H(Wind=strong) = -0 - (2/2)*log2(2/2) = 0
Average entropy information for Wind:
I(Wind) = (3/5)*0 + (2/5)*0
= 0
Information Gain = 0.971 - 0
= 0.971
Here, the attribute with the maximum information gain is Wind, so the Rain branch splits on Wind.
When Outlook = Rain and Wind = Strong, the subset is a pure class of category "no". And when
Outlook = Rain and Wind = Weak, it is again a pure class, of category "yes".
And this is our final desired tree for the given dataset.
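Putting the pieces together, the whole walkthrough can be reproduced with a short recursive sketch of ID3 (again assuming the classic play-tennis rows; a production version would also need majority-vote leaves for impure exhausted branches and explicit tie-breaking):

```python
from collections import Counter, defaultdict
from math import log2

def entropy(labels):
    n = len(labels)
    return sum(-(c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(values, labels):
    subsets = defaultdict(list)
    for v, lab in zip(values, labels):
        subsets[v].append(lab)
    n = len(labels)
    return entropy(labels) - sum(len(t) / n * entropy(t) for t in subsets.values())

ATTRS = ["Outlook", "Temperature", "Humidity", "Wind"]

# Classic play-tennis dataset (rows 1-14); last column is the class label.
rows = [
    ("sunny",    "hot",  "high",   "weak",   "no"),
    ("sunny",    "hot",  "high",   "strong", "no"),
    ("overcast", "hot",  "high",   "weak",   "yes"),
    ("rain",     "mild", "high",   "weak",   "yes"),
    ("rain",     "cool", "normal", "weak",   "yes"),
    ("rain",     "cool", "normal", "strong", "no"),
    ("overcast", "cool", "normal", "strong", "yes"),
    ("sunny",    "mild", "high",   "weak",   "no"),
    ("sunny",    "cool", "normal", "weak",   "yes"),
    ("rain",     "mild", "normal", "weak",   "yes"),
    ("sunny",    "mild", "normal", "strong", "yes"),
    ("overcast", "mild", "high",   "strong", "yes"),
    ("overcast", "hot",  "normal", "weak",   "yes"),
    ("rain",     "mild", "high",   "strong", "no"),
]

def id3(rows, attrs):
    labels = [r[-1] for r in rows]
    if len(set(labels)) == 1:      # pure subset -> leaf
        return labels[0]
    if not attrs:                  # no attributes left -> majority leaf
        return Counter(labels).most_common(1)[0][0]
    # split on the attribute with the largest information gain
    best = max(attrs, key=lambda a: information_gain(
        [r[ATTRS.index(a)] for r in rows], labels))
    subsets = defaultdict(list)
    for r in rows:
        subsets[r[ATTRS.index(best)]].append(r)
    rest = [a for a in attrs if a != best]
    return {best: {value: id3(subset, rest) for value, subset in subsets.items()}}

tree = id3(rows, ATTRS)
print(tree)
# The root splits on Outlook; the sunny branch splits on Humidity,
# overcast is a "yes" leaf, and the rain branch splits on Wind.
```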