
Essentials of Machine Learning

Lesson 02 - Probability

Thushari Silva, PhD


What is probability?

• Quantification of uncertainty
• Frequentist interpretation: long run frequencies of events
e.g.: The probability of a particular coin landing heads up is 0.5
• Bayesian interpretation: quantify our degrees of belief about something
e.g.: the probability of it raining tomorrow is 0.3
• Not possible to repeat “tomorrow” many times
• Basic rules of probability are the same, no matter which interpretation is
adopted



Random Variables

• A random variable (RV), X, denotes a quantity that is subject to variations due to chance
• May denote the result of an experiment (e.g. flipping a coin) or the
measurement of a real-world fluctuating quantity (e.g. temperature)
• Use capital letters to denote random variables and lower case letters to denote
values that they take, e.g. p(X = x)
• A discrete variable takes on values from a finite or countably infinite set
• Probability mass function p(X = x) for discrete random variables



Random Variables – Examples

• Examples:
• Colour of a car: blue, green, red
• Number of children in a family: 0, 1, 2, 3, 4, 5, 6, > 6
• Toss two coins, let X = (number of heads)². X can take on the values 0, 1 and 4.
• Example: p(Colour = red) = 0.3
• Σ_x P(x) = 1
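
A minimal sketch of the two-coin example as a probability mass function in Python (assuming fair, independent coins; the dictionary name pmf is an illustrative choice):

    # X = (number of heads)^2 for two fair coin tosses
    # heads = 0 with prob 1/4, heads = 1 with prob 1/2, heads = 2 with prob 1/4
    pmf = {0: 0.25, 1: 0.50, 4: 0.25}

    # a valid pmf assigns non-negative mass that sums to 1
    assert abs(sum(pmf.values()) - 1.0) < 1e-12

    print(pmf[1])   # P(X = 1) = 0.5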



Continuous Random Variables

• Continuous RVs take on values that vary continuously within one or more real
intervals
• Probability density function (pdf) p(x) for a continuous random variable X
P(a ≤ X ≤ b) = ∫_a^b p(x) dx
therefore
P(x ≤ X ≤ x + δx) ≅ p(x) δx
• ∫ p(x) dx = 1 (but values of p(x) can be greater than 1)
• Examples (coming soon): Gaussian, Gamma, Exponential, Beta
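
A small numerical sketch of P(a ≤ X ≤ b) = ∫_a^b p(x) dx and of the approximation P(x ≤ X ≤ x + δx) ≅ p(x) δx, using an Exponential(1) density as an illustrative pdf (the choice of distribution and grid size are assumptions, not from the slides):

    import numpy as np

    lam = 1.0                              # rate of an illustrative Exponential(lam) distribution
    p = lambda x: lam * np.exp(-lam * x)   # its probability density function

    a, b = 1.0, 2.0
    xs = np.linspace(a, b, 100_001)
    dx = xs[1] - xs[0]

    approx = np.sum(p(xs)) * dx            # P(a <= X <= b) ~ sum of p(x) * dx over a fine grid
    exact = np.exp(-lam * a) - np.exp(-lam * b)
    print(approx, exact)                   # both close to 0.2325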



Expectation

• Consider a function f(x) mapping from x onto numerical values


• E[f(x)] = Σ_x f(x) P(x)
          = ∫ f(x) p(x) dx

for discrete and continuous variables respectively.


• f(x) = x: we obtain the mean, μ_X
• f(x) = (x − μ_X)²: we obtain the variance
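
A short sketch of E[f(x)] = Σ_x f(x) P(x) for a discrete variable, reusing the two-coin pmf from the earlier example (the helper name expectation is illustrative):

    def expectation(f, pmf):
        # E[f(X)] = sum over x of f(x) * P(x)
        return sum(f(x) * p for x, p in pmf.items())

    pmf = {0: 0.25, 1: 0.50, 4: 0.25}                   # X = (number of heads)^2, two fair coins
    mean = expectation(lambda x: x, pmf)                # f(x) = x gives the mean
    var = expectation(lambda x: (x - mean) ** 2, pmf)   # f(x) = (x - mu)^2 gives the variance
    print(mean, var)                                    # 1.5 and 2.25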



Joint distributions

• Properties of several random variables are important for modelling complex problems
• P(X_1 = x_1, X_2 = x_2, …, X_D = x_D)
• “,” is read as “and”
• Examples about Grade and Intelligence (from Koller and Friedman, 2009)

             Intelligence = low   Intelligence = high
Grade = A          0.07                 0.18
Grade = B          0.28                 0.09
Grade = C          0.35                 0.03
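
The joint table above could be held in code as a simple dictionary keyed by (Grade, Intelligence) pairs; a minimal sketch (the layout is an illustrative choice):

    # joint distribution P(Grade, Intelligence) from the table above
    joint = {
        ('A', 'low'): 0.07, ('A', 'high'): 0.18,
        ('B', 'low'): 0.28, ('B', 'high'): 0.09,
        ('C', 'low'): 0.35, ('C', 'high'): 0.03,
    }

    # the entries of a joint distribution sum to 1
    assert abs(sum(joint.values()) - 1.0) < 1e-12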



Marginal Probability

• The sum rule


P(x) = Σ_y p(x, y)
• p(Grade = A) ?? (worked out below)

• Replace sum by an integral for continuous RVs
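
As a worked answer to the question above, marginalising Intelligence out of the grades table with the sum rule:

p(Grade = A) = p(Grade = A, Intelligence = low) + p(Grade = A, Intelligence = high) = 0.07 + 0.18 = 0.25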



Conditional Probability

• Let X and Y be two disjoint groups of variables, such that p(Y = y) > 0. Then the conditional
probability distribution (CPD) of X given Y = y is given by:
p(X = x | Y = y) = p(x | y) = p(x, y) / p(y)
• Product rule
p(X, Y) = p(X) p(Y | X) = p(Y) p(X | Y)
• Example: In the grades example, what is p(Intelligence = high | Grade = A)? (worked out below)
• Σ_x p(X = x | Y = y) = 1 for all y
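
As a worked answer to the example above, using the marginal p(Grade = A) = 0.25 computed earlier:

p(Intelligence = high | Grade = A) = p(Grade = A, Intelligence = high) / p(Grade = A) = 0.18 / 0.25 = 0.72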



Chain Rule

• The chain rule is derived by repeated application of the product rule


p(X_1, X_2, …, X_D) = p(X_1, X_2, …, X_{D−1}) p(X_D | X_1, X_2, …, X_{D−1})
                    = p(X_1, X_2, …, X_{D−2}) p(X_{D−1} | X_1, X_2, …, X_{D−2}) p(X_D | X_1, X_2, …, X_{D−1})
                    = …
                    = p(X_1) ∏_{i=2}^{D} p(X_i | X_1, X_2, …, X_{i−1})

Exercise: give decompositions of p(x, y, z) using the chain rule (one worked decomposition is shown below)
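
One such decomposition (a worked example; others follow by reordering the variables): p(x, y, z) = p(x) p(y | x) p(z | x, y).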



Bayes' Rule

• From the product rule,

P(X | Y) = P(Y | X) P(X) / P(Y)
         = P(Y | X) P(X) / Σ_X P(Y | X) P(X)



Bayes’ rule example

• Consider the following medical diagnosis problem.


Suppose you decide to have a medical test for cancer. If the test is positive, what is the probability that you have cancer? The test has a sensitivity of 80% and the prior probability of having cancer is 0.004.
Assume that false positives are quite likely, i.e. p(x = 1 | y = 0) = 0.1
p(x = 1 | y = 1) = 0.8, p(y = 1 | x = 1) = ??
p(y = 1 | x = 1) = p(x = 1 | y = 1) p(y = 1) / [p(x = 1 | y = 1) p(y = 1) + p(x = 1 | y = 0) p(y = 0)]
                 = (0.8 × 0.004) / (0.8 × 0.004 + 0.1 × 0.996)
                 ≅ 0.031, i.e. about 3%
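
A small sketch of the same calculation in Python (the variable names are illustrative; the numbers are those given above):

    # sensitivity p(x=1 | y=1), false-positive rate p(x=1 | y=0), prior p(y=1)
    sens, fpr, prior = 0.8, 0.1, 0.004

    # Bayes' rule: p(y=1 | x=1) = p(x=1 | y=1) p(y=1) / p(x=1)
    evidence = sens * prior + fpr * (1 - prior)
    posterior = sens * prior / evidence
    print(round(posterior, 3))   # 0.031, i.e. about 3%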



Probabilistic Inference using Bayes' Rule

• Tuberculosis (TB) and a skin test (Test)


• p(TB = yes) = 0.001 (for subjects who get tested)
• p(Test = yes | TB = yes) = 0.95
• p(Test = no | TB = no) = 0.95

• Person gets a positive test result. What is p(TB = yes |Test = yes)?
P(TB = yes | Test = yes) = P(Test = yes | TB = yes) P(TB = yes) / P(Test = yes)
                         = (0.95 × 0.001) / (0.95 × 0.001 + 0.05 × 0.999)
                         ≅ 0.0187
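
The same pattern can be wrapped in a small helper and applied to the TB numbers; a sketch (the function name posterior is an illustrative choice):

    def posterior(lik_pos, lik_neg, prior):
        # p(H = yes | E = yes) via Bayes' rule for a two-valued hypothesis H
        evidence = lik_pos * prior + lik_neg * (1 - prior)
        return lik_pos * prior / evidence

    # p(Test=yes | TB=yes) = 0.95, p(Test=yes | TB=no) = 1 - 0.95 = 0.05, p(TB=yes) = 0.001
    print(round(posterior(0.95, 0.05, 0.001), 4))   # 0.0187
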
Independence

• Let X and Y be two disjoint groups of variables. Then X is said to be independent of Y if and only if
p(X | Y) = p(X) for all possible values x and y of X and Y;
otherwise X is said to be dependent on Y
• Using the definition of conditional probability, we get an equivalent expression
for the independence condition
p(X, Y) = p(X) p(Y)
• X independent of Y ⇔ Y independent of X (independence is symmetric)
• Independence of a set of variables: X_1, …, X_D are independent iff
p(X_1, X_2, …, X_D) = ∏_{i=1}^{D} p(X_i)
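
As an illustrative check (not from the slides), one can test whether the grades table factorises as p(Grade) p(Intelligence); it does not, so Grade and Intelligence are dependent:

    joint = {
        ('A', 'low'): 0.07, ('A', 'high'): 0.18,
        ('B', 'low'): 0.28, ('B', 'high'): 0.09,
        ('C', 'low'): 0.35, ('C', 'high'): 0.03,
    }

    # marginals via the sum rule
    p_grade = {g: sum(p for (gg, _), p in joint.items() if gg == g) for g in 'ABC'}
    p_intel = {i: sum(p for (_, ii), p in joint.items() if ii == i) for i in ('low', 'high')}

    # independence would require p(g, i) == p(g) * p(i) in every cell
    independent = all(abs(joint[(g, i)] - p_grade[g] * p_intel[i]) < 1e-9
                      for g in 'ABC' for i in ('low', 'high'))
    print(independent)   # False: Grade and Intelligence are dependent
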
Conditional Independence

• Let X, Y and Z be three disjoint groups of variables. X is said to be conditionally independent of Y given Z iff:
p(x | y, z) = p(x | z)
for all possible values of x, y and z.
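
Equivalently, by the product rule, conditional independence can be written as p(x, y | z) = p(x | z) p(y | z) for all possible values of x, y and z.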

