Level-Up Probability and Statistics

The document outlines an agenda for a workshop on probability theory and statistics, beginning with an overview of probability theory including probability distributions, expected value, variance, conditional probability, and independence. It then discusses statistics topics such as data types, common distributions like the normal distribution, and the central limit theorem. The document provides examples and explanations of key probability concepts to help introduce participants to probability and statistics.


Level-up!

Probability and Statistics


Morris Alper – ITC Level-up December 2020 cohort

Agenda

Probability Theory
● Probability and random variables
● Expected value, variance, and standard deviation
● Conditional probability
● Independence

Statistics
● Data types
● Common distributions
● Central limit theorem
Probability Theory

The probability of an event is a number between 0 and 1 that describes how likely that event is to occur.

For example, I can ask:

● What is the probability of getting heads when I flip a fair coin? 0.5
● What is the probability of getting HH when I flip a fair coin twice? 0.25
● What is the probability of getting all heads when I flip a fair coin 100 times? 1/2^100 ≈ 7.9 × 10^−31
Probability Theory

For discrete, equally likely events we can measure the probabilities of outcomes by counting.

Example: Probability of heads when flipping one coin:

Positive outcome(s): H (1 possibility)
Negative outcome(s): T (1 possibility)
P(H) = 1/(1+1) = 0.5

Reproduced from https://en.wikipedia.org/wiki/Probability
Probability Theory

Example: Probability of HH when flipping two coins:

Positive outcome(s): HH (1 possibility)
Negative outcome(s): TT, HT, TH (3 possibilities)
P(HH) = 1/(1+3) = 0.25
Probability Theory

Example: Probability of all heads when flipping 100 coins:

Positive outcome(s): HHH… (1 possibility)
Negative outcome(s): 2^100 − 1 possibilities
P(all heads) = 1/2^100 ≈ 7.9 × 10^−31
Probability Theory

Example: Probability of getting total K when rolling two dice:

P(K = 2) = 1/36 = 0.0277…
P(K = 7) = 6/36 = 0.1666…
etc.
Probability Theory

A random variable is any quantity whose value is random and can be sampled.

A probability distribution is a function that represents the probability of the random variable taking some value(s).

For a random variable X with discrete values, we write P(X = k) to mean the probability that X takes value k. As a function of k, this is called the Probability Mass Function (PMF).
Probability Theory

Example 1: The random variable X is the result of a coin toss:

P(X = heads) = 0.5
P(X = tails) = 0.5

Example 2: The random variable X is the age bracket of a random Israeli:

P(X = “15-24 years”) = 0.16

Source: https://www.indexmundi.com/israel/age_structure.html
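A discrete PMF like the ones above can be sketched as a plain Python dictionary mapping values to probabilities (a minimal illustration; the function name pmf_prob is my own, not from the slides):

```python
# A discrete PMF as a dict: value -> probability.
coin_pmf = {"heads": 0.5, "tails": 0.5}

# A valid PMF is non-negative and sums to 1.
assert all(p >= 0 for p in coin_pmf.values())
assert abs(sum(coin_pmf.values()) - 1.0) < 1e-12

def pmf_prob(pmf, value):
    """Look up P(X = value); values outside the support have probability 0."""
    return pmf.get(value, 0.0)
```

Values not in the dictionary (outside the support) get probability 0 by convention.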
Probability Theory

For a random variable X with continuous values, we write

P(a < X < b) = ∫ₐᵇ p_X(t) dt

where p_X(t) is the Probability Density Function (PDF) of X.
Probability Theory

Example 3: The random variable X is the height (cm) of a random adult male.

The PDF is approximately a normal curve (explanation to come…):

p_X(t) ≈ (1 / (8√(2π))) e^(−(t − 177)² / 128)

Integrating this gives the probability of an adult being in a height range.
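The integral of a normal PDF has a closed form in terms of the error function, so this probability can be computed without numerical integration. A sketch using the parameters in the formula above (μ = 177, σ = 8):

```python
import math

MU, SIGMA = 177.0, 8.0  # mean and standard deviation from the height example

def normal_cdf(x, mu=MU, sigma=SIGMA):
    """CDF of the normal distribution, via the error function."""
    return 0.5 * (1 + math.erf((x - mu) / (sigma * math.sqrt(2))))

def prob_between(a, b, mu=MU, sigma=SIGMA):
    """P(a < X < b): integral of the normal PDF from a to b."""
    return normal_cdf(b, mu, sigma) - normal_cdf(a, mu, sigma)

# About 68% of heights fall within one standard deviation of the mean.
p = prob_between(MU - SIGMA, MU + SIGMA)
```

For example, prob_between(169, 185) gives the familiar one-sigma probability of roughly 0.68.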
Probability Theory

Definition: The expected value of a random variable X is defined to be

• E[X] = Σ t · P(X = t)   (if X is discrete)
• E[X] = ∫ t · p_X(t) dt   (if X is continuous)
Probability Theory

Example:

Let X be the payoff upon playing the lottery, with probability p = 5.7 × 10^−9 of winning $2 million.

The expected payoff is:

E[X] = p · 2×10^6 + (1 − p) · 0 ≈ $0.0114

So on average you can expect to win over one cent every time you play! 🤑 🤑 🤑
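The lottery calculation above is a one-liner in code (numbers taken from the slide):

```python
# Expected payoff of the lottery: one prize, every other outcome pays 0.
p_win = 5.7e-9          # probability of winning (from the slide)
prize = 2_000_000       # $2 million prize

# E[X] = p * prize + (1 - p) * 0
expected_payoff = p_win * prize + (1 - p_win) * 0
```

This gives roughly $0.0114 — slightly over one cent per play, as stated.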
Probability Theory

Different probability distributions can have the same expected value but be more or less spread out.

Q: How can we measure this?
Probability Theory

A: Variance, defined as:

Var(X) = E[(X − μ)²]

where μ = E[X] is the expected value of X.

Standard deviation is defined as the square root of variance:

σ_X = √Var(X)
Probability Theory

Example 1:

For a fair coin (P(X = 0) = P(X = 1) = 0.5), we have

E[X] = 0.5 · 0 + 0.5 · 1 = 0.5
Var(X) = E[(X − 0.5)²] = 0.5 · (−0.5)² + 0.5 · (0.5)² = 0.25
σ_X = √0.25 = 0.5
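The discrete formulas for expected value and variance translate directly into code; a minimal sketch (helper names are my own):

```python
import math

def expected_value(pmf):
    """E[X] = sum of t * P(X = t) over the support."""
    return sum(t * p for t, p in pmf.items())

def variance(pmf):
    """Var(X) = E[(X - mu)^2], with mu = E[X]."""
    mu = expected_value(pmf)
    return sum((t - mu) ** 2 * p for t, p in pmf.items())

# The fair-coin example: P(X = 0) = P(X = 1) = 0.5
fair_coin = {0: 0.5, 1: 0.5}
mu = expected_value(fair_coin)   # 0.5
var = variance(fair_coin)        # 0.25
sigma = math.sqrt(var)           # 0.5
```

The three computed values match the worked example above.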
Probability Theory

Example 2:

Suppose X ~ Unif(4, 6) is selected from a uniform distribution on the interval [4, 6]. Its PDF is

p_X(t) = 1/2 for 4 ≤ t ≤ 6, and 0 otherwise

(Figure: the flat uniform PDF, reproduced from https://en.wikipedia.org/wiki/Uniform_distribution_(continuous))
Probability Theory

Example 2:

Try sampling from X~Unif(4, 6) yourself:
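One way to try this, using only the standard library (the seed is fixed just for reproducibility):

```python
import random

random.seed(0)  # fixed seed so the run is reproducible

# Draw many samples from Unif(4, 6) and look at the empirical mean.
samples = [random.uniform(4, 6) for _ in range(100_000)]
empirical_mean = sum(samples) / len(samples)
# The empirical mean should be close to the true expected value, 5.
```

With 100,000 samples, the empirical mean typically lands within a few thousandths of 5.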


Probability Theory

Example 2 (continued):

E[X] = ∫₄⁶ t p_X(t) dt = (1/2) ∫₄⁶ t dt = (1/2)(6²/2 − 4²/2) = 5
Probability Theory

Example 2 (continued):

Var(X) = E[(X − E[X])²] = E[(X − 5)²] = ∫₄⁶ (t − 5)² p_X(t) dt
= (1/2) ∫₄⁶ (t − 5)² dt = (1/2)[(1/3)(6 − 5)³ − (1/3)(4 − 5)³] = 1/3 = 0.333…

σ_X = √Var(X) = √(1/3) = 0.577…


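The variance integral can be double-checked numerically, e.g. with a midpoint-rule Riemann sum over [4, 6] (a sketch, not part of the slides):

```python
# Numerically integrate (t - 5)^2 * p_X(t) over [4, 6], where p_X(t) = 1/2,
# using the midpoint rule with N subintervals.
N = 100_000
width = (6 - 4) / N
var = sum(
    ((4 + (i + 0.5) * width) - 5) ** 2 * 0.5 * width  # (t - 5)^2 * p_X(t) * dt
    for i in range(N)
)
# var should agree with the exact answer, 1/3.
```

The midpoint rule converges quickly here, so the sum matches 1/3 to many decimal places.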
Probability Theory

Given two events A and B, the conditional probability P(A | B) is defined as:

P(A | B) = P(A ∩ B) / P(B)

where P(A ∩ B) is the probability of A and B both occurring.
Probability Theory

Example: Suppose in our clinic we observe the following distribution of people and symptoms:

              Healthy   Sick
Not coughing    0.72    0.06
Coughing        0.08    0.14
Probability Theory

Q1: If someone is sick, what is the probability that they are coughing?

A1: P(coughing | sick) = P(coughing ∩ sick) / P(sick) = 0.14 / (0.14 + 0.06) = 0.7
Probability Theory

Q2: If someone is coughing, what is the probability that they are sick?

A2: P(sick | coughing) = P(coughing ∩ sick) / P(coughing) = 0.14 / (0.08 + 0.14) ≈ 0.64
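Both conditional probabilities can be computed mechanically from the joint table; a sketch with the table encoded as a dict (the helper `marginal` is my own naming):

```python
# Joint distribution from the clinic example: (cough status, health) -> probability.
joint = {
    ("not coughing", "healthy"): 0.72, ("not coughing", "sick"): 0.06,
    ("coughing", "healthy"): 0.08,     ("coughing", "sick"): 0.14,
}

def marginal(joint, axis, value):
    """P(one variable = value), summing the joint table over the other variable."""
    return sum(p for key, p in joint.items() if key[axis] == value)

# P(coughing | sick) = P(coughing and sick) / P(sick)
p_cough_given_sick = joint[("coughing", "sick")] / marginal(joint, 1, "sick")
# P(sick | coughing) = P(coughing and sick) / P(coughing)
p_sick_given_cough = joint[("coughing", "sick")] / marginal(joint, 0, "coughing")
```

This reproduces A1 (0.7) and A2 (about 0.64).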
Probability Theory

Two random variables X and Y are independent if, for all values x and y,

P(X = x ∩ Y = y) = P(X = x) P(Y = y)

This means that the outcomes of X and Y do not affect each other.

Otherwise, X and Y are dependent.
Probability Theory

Example 1: I roll two dice; X represents the outcome of the first die and Y the outcome of the second die.

X and Y are independent (they do not affect each other):

P(X = 4 ∩ Y = 5) = P(X = 4) P(Y = 5) = (1/6) · (1/6) = 1/36
Probability Theory

Example 2: Let X and Y be as before and let Z = X + Y be the sum of the values on the two dice.

X and Z are dependent:

P(X = 4 ∩ Z = 5) = 1/36 (can only happen if X = 4 and Y = 1)

But P(X = 4) = 1/6 and P(Z = 5) = 4/36, so P(X = 4 ∩ Z = 5) is not equal to P(X = 4) P(Z = 5) = 1/54.


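Both claims can be verified by brute-force enumeration of the 36 equally likely outcomes, using exact fractions (a sketch; the helper `prob` is my own):

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes (first die, second die).
outcomes = list(product(range(1, 7), repeat=2))
P_EACH = Fraction(1, 36)

def prob(event):
    """Probability that predicate `event` holds, by counting outcomes."""
    return sum(P_EACH for o in outcomes if event(o))

# X = first die, Y = second die, Z = X + Y.
p_x4 = prob(lambda o: o[0] == 4)                        # 1/6
p_y5 = prob(lambda o: o[1] == 5)                        # 1/6
p_x4_and_y5 = prob(lambda o: o[0] == 4 and o[1] == 5)   # 1/36
p_z5 = prob(lambda o: o[0] + o[1] == 5)                 # 4/36
p_x4_and_z5 = prob(lambda o: o[0] == 4 and o[0] + o[1] == 5)  # 1/36

independent_xy = p_x4_and_y5 == p_x4 * p_y5  # True: X and Y independent
independent_xz = p_x4_and_z5 == p_x4 * p_z5  # False: X and Z dependent
```

Using Fraction avoids any floating-point rounding, so the equality tests are exact.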
Statistics

Numerical Data        Categorical Data
Heights               Months (Jan, May, Dec)
Temperature           Gender (male, female, other)
Number of children    Nationality (Israel, USA, France)
Age                   Age brackets (18-25, 26-35, …)

Statistics

⚠️ Be careful!

A feature containing numbers might be either numerical data or categorical data.

Example: family data collected on subjects split into five different groups.

Number of siblings (0, 1, 2, 3, 4): numerical
Group number (0, 1, 2, 3, 4): categorical
Statistics

The Bernoulli distribution is the discrete probability distribution of a random variable that can take two values (normally 0 and 1):

X ~ Bernoulli(p)
P(X = 1) = p
P(X = 0) = 1 − p
Statistics

Example 1: flipping a coin
p ≈ 0.5, q ≈ 0.5

Example 2: winning the lottery
p ≈ 5.7 × 10^−9, q = 1 − p
Statistics

For X ~ Bernoulli(p):

E[X] = p
Var(X) = p(1 − p)
σ_X = √(p(1 − p))
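These formulas are easy to sanity-check by simulation; a sketch with an arbitrary p = 0.3 (my choice, not from the slides) and a fixed seed for reproducibility:

```python
import random

random.seed(1)
p = 0.3  # arbitrary Bernoulli parameter for illustration

# Sample X ~ Bernoulli(p) many times: 1 with probability p, else 0.
samples = [1 if random.random() < p else 0 for _ in range(200_000)]

mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)
# mean should approach E[X] = p = 0.3
# var should approach p * (1 - p) = 0.21
```

Both empirical values land within about a hundredth of the theoretical ones at this sample size.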
Statistics

The normal distribution X ~ 𝒩(μ, σ²) has PDF

p_X(t) = (1 / (σ√(2π))) e^(−(t − μ)² / (2σ²))

E[X] = μ
Var(X) = σ²
σ_X = σ

The shape of the normal PDF is known as a “bell curve”.

Q: Why is this a useful definition?
Statistics

A: The Central Limit Theorem states that, under reasonable conditions, the sum of independent, identically distributed trials asymptotically approaches a normal distribution.

Lots of real-life data includes many nearly independent sources of random noise added together.
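The theorem can be seen in a quick simulation: sum many independent Unif(0, 1) draws and check that the sums behave like a normal distribution (a sketch; the sample sizes are arbitrary choices of mine):

```python
import math
import random

random.seed(2)

# Sum n i.i.d. Unif(0, 1) draws. By the CLT, the sum is approximately
# normal with mean n/2 and variance n/12.
n, trials = 48, 20_000
sums = [sum(random.random() for _ in range(n)) for _ in range(trials)]

mean = sum(sums) / trials                                  # expect ~ 24
std = math.sqrt(sum((s - mean) ** 2 for s in sums) / trials)  # expect ~ 2

# For a normal distribution, about 68% of mass lies within one sigma.
within_1sigma = sum(1 for s in sums if abs(s - mean) < std) / trials
```

The one-sigma fraction coming out near 0.68 is the bell-curve signature the slide describes.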
Statistics

Examples:
• Number of heads when flipping one million coins (approximately)
• Height in human population
• Birth weight of newborn babies
• Variation in outdoor temperature from monthly average

See https://galtonboard.com/probabilityexamplesinlife for more examples.
Further Reading

● Introduction to Probability from MIT OCW
● An Introduction to Statistics by Keone Hon
● The Probability Cheatsheet
● The Statistics Cheatsheet
● Common Probability Distributions by Sean Owens
● Central Limit Theorem Explained