
Probability

Lecture 6

Centre for Data Science, ITER


Siksha ‘O’ Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India.

Contents

1 Introduction
2 Dependence and independence of events
3 Conditional Probability
4 Bayes’s Theorem
5 Random Variables
6 Continuous Distributions
7 Probability Density Function
8 Cumulative Distribution Function
9 The Normal Distribution
10 The Central Limit Theorem

Introduction

Probability is a way of quantifying the uncertainty associated with events chosen from some universe of events.
Notationally, we write P(E) to mean “the probability of the event E.”

Dependence and independence of events

Mathematically, we say that two events E and F are independent if the probability that they both happen is the product of the probabilities that each one happens:
P(E, F) = P(E) P(F)
For instance, if we flip a fair coin twice, knowing whether the first flip is heads gives us no information about whether the second flip is heads. These events are independent.
On the other hand, knowing whether the first flip is heads certainly gives us information about whether both flips are tails. (If the first flip is heads, then definitely it’s not the case that both flips are tails.) These two events are dependent.
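As a quick illustration, here is a minimal simulation sketch (assuming only Python’s standard random module; the seed and sample size are our own choices) that estimates these probabilities for two fair coin flips and checks that P(E, F) ≈ P(E) P(F):

import random

random.seed(0)  # illustrative seed, for reproducibility
n = 100_000
first_heads = second_heads = both_heads = 0

for _ in range(n):
    first = random.random() < 0.5   # first flip is heads
    second = random.random() < 0.5  # second flip is heads
    first_heads += first
    second_heads += second
    both_heads += first and second

# For independent events the joint probability factors:
print(both_heads / n)                          # ~0.25
print((first_heads / n) * (second_heads / n))  # ~0.25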

Conditional Probability

If two events E and F are not necessarily independent (and if the probability of F is not zero), then we define the probability of E “conditional on F” as:
P(E | F) = P(E, F) / P(F)
We can say that this is the probability that E happens, given that we know that F happens.
We often rewrite this as:
P(E, F) = P(E | F) P(F)
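As an illustrative sketch (again assuming the standard random module; seed and sample size are our own choices), we can estimate a conditional probability for dependent events, here E = “both flips are heads” given F = “the first flip is heads”:

import random

random.seed(0)  # illustrative seed
n = 100_000
first = both = 0

for _ in range(n):
    flip1 = random.random() < 0.5
    flip2 = random.random() < 0.5
    first += flip1
    both += flip1 and flip2

# P(E | F) = P(E, F) / P(F) is ~0.5 here, not P(E) = 0.25
print(both / first)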

Conditional Probability (Contd.)

When E and F are independent, you can check that this gives:
P(E | F) = P(E)
which is the mathematical way of expressing that knowing F
occurred gives us no additional information about whether E
occurred.

Bayes’s Theorem

Bayes’s theorem is a way of “reversing” conditional probabilities.
Let’s say we need to know the probability of some event E conditional on some other event F occurring. But we only have information about the probability of F conditional on E occurring.
Using the definition of conditional probability twice tells us that:
P(E | F) = P(E, F) / P(F) = P(F | E) P(E) / P(F)

Bayes’s Theorem (Contd.)

The event F can be split into the two mutually exclusive events “F and E” and “F and not E.” If we write ¬E for “not E” (i.e., “E doesn’t happen”), then:
P(F) = P(F, E) + P(F, ¬E)
so that:
P(E | F) = P(F | E) P(E) / [P(F | E) P(E) + P(F | ¬E) P(¬E)]
which is how Bayes’s theorem is often stated.
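As a sketch of the formula in code (the function name and the disease-test numbers are illustrative assumptions, not from the slides): consider a disease affecting 1 in 10,000 people and a test that is 99% accurate, so P(F | E) = 0.99, P(F | ¬E) = 0.01, and P(E) = 0.0001:

def bayes(p_f_given_e: float, p_e: float, p_f_given_not_e: float) -> float:
    # P(E | F) = P(F | E) P(E) / [P(F | E) P(E) + P(F | ¬E) P(¬E)]
    numerator = p_f_given_e * p_e
    return numerator / (numerator + p_f_given_not_e * (1 - p_e))

print(bayes(0.99, 0.0001, 0.01))  # ~0.0098: a positive test still means under a 1% chance of disease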

Random Variables

A random variable is a variable whose possible values have an associated probability distribution.
Eg: A very simple random variable equals 1 if a coin flip turns up heads and 0 if the flip turns up tails.
The expected value of a random variable is the average of its values weighted by their probabilities.
Eg: The coin flip variable has an expected value of 1/2 (= 0 * 1/2 + 1 * 1/2), and a random variable that equals a value chosen uniformly from range(10) has an expected value of 4.5.
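A minimal sketch of this computation (the helper expected_value is our own, not from the slides), using (value, probability) pairs:

def expected_value(outcomes) -> float:
    # Average of the values weighted by their probabilities
    return sum(value * prob for value, prob in outcomes)

print(expected_value([(0, 0.5), (1, 0.5)]))              # 0.5, the coin flip
print(expected_value([(x, 1 / 10) for x in range(10)]))  # 4.5, uniform on range(10)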

Continuous Distributions

A coin flip corresponds to a discrete distribution: one that associates positive probability with discrete outcomes.
A continuous distribution describes the probabilities of the possible values of a continuous random variable, i.e., a random variable whose set of possible values is infinite and uncountable.
Eg: The uniform distribution puts equal weight on all the numbers between 0 and 1.

Probability Density Function

Because there are infinitely many numbers between 0 and 1, the weight it assigns to individual points must necessarily be zero.
For this reason, we represent a continuous distribution with a probability density function (PDF) such that the probability of seeing a value in a certain interval equals the integral of the density function over the interval.
The density function for the uniform distribution is just:

def uniform_pdf(x: float) -> float:
    return 1 if 0 <= x < 1 else 0
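As a sanity check (the Riemann-sum approximation below is our own illustration), the probability of a value in [0.2, 0.5] should equal the integral of the density over that interval, which is 0.3 here:

# Approximate the integral of uniform_pdf over [0.2, 0.5]
steps = 10_000
width = (0.5 - 0.2) / steps
prob = sum(uniform_pdf(0.2 + i * width) * width for i in range(steps))
print(prob)  # ~0.3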

Cumulative Distribution Function

We will often be more interested in the cumulative distribution function (CDF), which gives the probability that a random variable is less than or equal to a certain value.
The CDF for the uniform distribution will be:

def uniform_cdf(x: float) -> float:
    if x < 0: return 0    # a uniform random value is never below 0
    elif x < 1: return x  # e.g. P(X <= 0.4) = 0.4
    else: return 1        # a uniform random value is always below 1
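A few spot checks (the inputs are illustrative choices of our own):

print(uniform_cdf(-0.5))  # 0
print(uniform_cdf(0.4))   # 0.4
print(uniform_cdf(2.0))   # 1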

The Normal Distribution

The normal distribution is the classic bell curve-shaped distribution and is completely determined by two parameters: its mean µ (mu) and its standard deviation σ (sigma).
The mean indicates where the bell is centered, and the standard deviation how “wide” it is.
It has the PDF:
f(x | µ, σ) = (1 / (√(2π) σ)) exp(−(x − µ)² / (2σ²))

Normal Distribution (Contd.)

It can be implemented as:

import math
SQRT_TWO_PI = math.sqrt(2 * math.pi)

def normal_pdf(x: float, mu: float = 0, sigma: float = 1) -> float:
    return (math.exp(-(x - mu) ** 2 / 2 / sigma ** 2) / (SQRT_TWO_PI * sigma))

When µ = 0 and σ = 1, it’s called the standard normal distribution.
If Z is a standard normal random variable, then it turns out that X = σZ + µ is also normal, but with mean µ and standard deviation σ.
Conversely, if X is a normal random variable with mean µ and standard deviation σ, then Z = (X − µ)/σ is a standard normal variable.
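As an illustrative check of this relationship (a sketch assuming Python’s random.gauss, which draws normal samples; seed and parameters are our own choices), scaling and shifting standard normal draws should reproduce the target mean and standard deviation:

import math
import random

random.seed(0)     # illustrative seed
mu, sigma = 10, 2  # illustrative target parameters
xs = [sigma * random.gauss(0, 1) + mu for _ in range(100_000)]  # X = σZ + µ

mean = sum(xs) / len(xs)
std = math.sqrt(sum((x - mean) ** 2 for x in xs) / len(xs))
print(mean, std)  # ~10 and ~2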

Normal Distribution (Contd.)

The CDF for the normal distribution cannot be written in an “elementary” manner, but we can write it using Python’s math.erf error function:

def normal_cdf(x: float, mu: float = 0, sigma: float = 1) -> float:
    return (1 + math.erf((x - mu) / math.sqrt(2) / sigma)) / 2
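A couple of spot checks against standard facts about the normal distribution:

print(normal_cdf(0))                   # 0.5: half the mass lies below the mean
print(normal_cdf(1) - normal_cdf(-1))  # ~0.6827: about 68% lies within one standard deviation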

The Central Limit Theorem

If x1, ..., xn are independent and identically distributed random variables with mean µ and standard deviation σ, and if n is large, then:
(1/n)(x1 + x2 + ... + xn)
is approximately normally distributed with mean µ and standard deviation σ/√n.
Equivalently (but often more usefully),
((x1 + x2 + ... + xn) − µn) / (σ√n)
is approximately normally distributed with mean 0 and standard deviation 1.
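A minimal simulation sketch of this (assuming the standard random module; the choice of Uniform(0, 1) summands, seed, and sample sizes is our own) showing that the standardized sum has mean ~0 and standard deviation ~1:

import math
import random

random.seed(0)                      # illustrative seed
n = 1000                            # summands per sample
mu, sigma = 0.5, math.sqrt(1 / 12)  # mean and std of Uniform(0, 1)

def standardized_sum() -> float:
    s = sum(random.random() for _ in range(n))
    return (s - mu * n) / (sigma * math.sqrt(n))

zs = [standardized_sum() for _ in range(10_000)]
mean = sum(zs) / len(zs)
std = math.sqrt(sum((z - mean) ** 2 for z in zs) / len(zs))
print(mean, std)  # ~0 and ~1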

Central Limit Theorem (Contd.)

A Binomial(n, p) random variable is simply the sum of n independent Bernoulli(p) random variables, each of which equals 1 with probability p and 0 with probability 1 − p:

import random

def bernoulli_trial(p: float) -> int:
    # Returns 1 with probability p and 0 with probability 1 - p
    return 1 if random.random() < p else 0

def binomial(n: int, p: float) -> int:
    # Returns the sum of n bernoulli_trial(p) outcomes
    return sum(bernoulli_trial(p) for _ in range(n))

Central Limit Theorem (Contd.)

The mean of a Bernoulli(p) variable is p, and its standard deviation is √(p(1 − p)).
The central limit theorem says that as n gets large, a Binomial(n, p) variable is approximately a normal random variable with mean µ = np and standard deviation σ = √(np(1 − p)).
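As an illustrative check of this approximation (the parameters and sample size are our own choices), the empirical mean and standard deviation of many binomial(100, 0.5) draws should be close to np = 50 and √(np(1 − p)) = 5:

import math

draws = [binomial(100, 0.5) for _ in range(10_000)]
mean = sum(draws) / len(draws)
std = math.sqrt(sum((d - mean) ** 2 for d in draws) / len(draws))
print(mean, std)  # ~50 and ~5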

References

[1] Data Science from Scratch: First Principles with Python by Joel Grus

Thank You
Any Questions?

