
STAT 311: LECTURE 13

Heavily based on lecture notes from Martina Morris

Random Variables
Logistics

• Homework due today
• Midterms to be passed back Friday
• Shiqing will be giving lectures on Monday and Wednesday
• Sam's office hours will be on Friday from 1-3


Random Variables
From probabilities of specific events
To describing the full probability distribution



What is a random variable?

Formally:
• An event in a sample space
• That takes a value
  • Discrete or
  • Continuous
• With a certain probability

• Can be described by the PDF: P(X = k)
• Or by the CDF: P(X ≤ k), P(X > k), P(j < X < k)
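As an aside (not from the original slides), here is a minimal Python sketch of this idea: a discrete RV described by a made-up PMF, with CDF-style probabilities derived from it.

```python
# Minimal sketch: describe a discrete RV by its PMF, then derive
# CDF-style probabilities P(X <= k), P(X > k), P(j < X < k) from it.
# The PMF below is an arbitrary example, not from the lecture.
pmf = {0: 0.1, 1: 0.2, 2: 0.4, 3: 0.2, 4: 0.1}

def p_le(k):
    """P(X <= k): sum the PMF over all values up to k."""
    return sum(p for x, p in pmf.items() if x <= k)

def p_between(j, k):
    """P(j < X < k): sum the PMF over values strictly between j and k."""
    return sum(p for x, p in pmf.items() if j < x < k)

print(p_le(2))           # P(X <= 2) = 0.7
print(1 - p_le(2))       # P(X > 2)  = 0.3
print(p_between(0, 3))   # P(0 < X < 3) = 0.6
```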
Examples

• Flip a coin: heads or tails?
• How many cars will drive through an intersection in an hour?
• What is the height of a random individual?
• How much time until I get my next text message?


Random variable notation
• Random variables are denoted by capital letters
  • For example, X or Y
• The value the RV takes in a specific case is called a "realization" and is denoted by a lowercase letter
  • For example: x, y or k
• P(X = k)
  • "The probability that the random variable X takes the value k"
• The sample space (set of all possible outcomes) is denoted by Ω


Empirical vs Theoretical Distributions

• We have seen distributions of observed data. These are often called empirical distributions, because they are what we empirically observe
• Today we will begin to formally discuss theoretical distributions, which are mathematical constructs used to model real-world situations
• These distributions typically come in families, specified by a mathematical equation and governed by a set of parameters
Notation: statistics vs. parameters

Sample statistics:
• Mean: x̄ = (1/n) Σ xi, summing over i = 1, …, n
• Variance: s_X² = (1/(n − 1)) Σ (xi − x̄)²
• Each observation has equal weight (1/n)

Theoretical parameters:
• Expected value: μ_X = E(X) = Σ k·P(X = k), summing over all outcomes k
• Variance: σ_X² = Var(X) = Σ (k − μ_X)²·P(X = k)
• Each possible outcome in the sample space receives its own weight (pi)
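A minimal Python sketch of the contrast above, using made-up data and a fair-coin PMF (both illustrative assumptions, not from the lecture):

```python
# Sample statistics weight each observation by 1/n, while theoretical
# parameters weight each outcome k by its probability P(X = k).
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)

xbar = sum(data) / n                                  # sample mean
s2 = sum((x - xbar) ** 2 for x in data) / (n - 1)     # sample variance

pmf = {0: 0.5, 1: 0.5}                                # a fair coin (0/1)
mu = sum(k * p for k, p in pmf.items())               # E(X) = sum k*P(X=k)
sigma2 = sum((k - mu) ** 2 * p for k, p in pmf.items())  # Var(X)

print(xbar, s2)       # 5.0 4.571...
print(mu, sigma2)     # 0.5 0.25
```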


Example

• If I roll a single fair die, X takes the values {1, 2, 3, 4, 5, 6}, each with probability 1/6:
  • E(X) = (1 + 2 + 3 + 4 + 5 + 6)/6 = 3.5
  • Var(X) = Σ (k − 3.5)²·(1/6) = 35/12 ≈ 2.92
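A hedged Python sketch of this example, checking the theoretical values by simulation:

```python
import random

# Theoretical E(X) and Var(X) for one fair die, checked by simulation
# (the simulated mean should land near 3.5).
faces = [1, 2, 3, 4, 5, 6]
mu = sum(k / 6 for k in faces)                  # E(X) = 3.5
var = sum((k - mu) ** 2 / 6 for k in faces)     # Var(X) = 35/12 ≈ 2.92

rolls = [random.choice(faces) for _ in range(100_000)]
print(mu, var, sum(rolls) / len(rolls))
```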


What comes next
• Deriving expectations and variances for different distributions
  • Discrete (Bernoulli, Binomial, Poisson)
  • Continuous (Normal, Uniform, Exponential)
• For example: with coin tosses
  • Each toss is a random variable with outcomes {0, 1}
  • The sum of these outcomes over n tosses is a linear combination of the random variables for each toss, with outcome space {0, 1, 2, …, n}
• We start with the rules of expectations and variances for linear transformations and combinations of RVs
Rules of expectations and variances
For linear transformations and combinations of random variables


Rules of expectations and variances

1. Variance-Mean relationship

Var(X) = E[(X − μ_X)²] = E(X²) − [E(X)]²

Note the theoretical or population variance formula here, not the sample variance.


Rules of expectations and variances
Transformations and combinations of RVs

• Examples:
  • Transformation: converting degrees F to degrees C
  • Combination: adding your midterm and final exam scores
• Linear transformations and combinations have simple expressions for their expected values and variances
  • Linear transformations: Y = a + X, Y = bX, Y = a + bX
  • Linear combinations: Z = X + Y, Z = aX + bY


Rules of expectations and variances

2. Linear Transformations of RVs

• If a and b are constants, and X is a random variable, then:

E(a) = a                     Var(a) = 0
E(bX) = b·E(X)               Var(bX) = b²·Var(X)
E(a + bX) = a + b·E(X)       Var(a + bX) = b²·Var(X)

• These are easily proven if you start with the definitions on slide 8 and work through the algebra (try it…).
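A small Python sketch of rule 2, using the F-to-C conversion from the earlier slide as the linear transformation (the simulated Fahrenheit temperatures are an illustrative assumption):

```python
import random

# Checking E(a + bX) = a + b*E(X) and Var(a + bX) = b^2 * Var(X),
# with C = (5/9)*(F - 32), i.e. a = -160/9 and b = 5/9.
a, b = -160 / 9, 5 / 9

temps_f = [random.gauss(68, 10) for _ in range(100_000)]
temps_c = [a + b * f for f in temps_f]

def mean(xs):
    return sum(xs) / len(xs)

def var(xs):
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

print(mean(temps_c), a + b * mean(temps_f))   # equal up to floating error
print(var(temps_c), b ** 2 * var(temps_f))    # equal up to floating error
```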


Rules of expectations and variances
3. Linear combinations of independent RVs

UH covers this:
Let a, b and c be constants, and X and Y be independent random variables. Then:

E(X + Y) = E(X) + E(Y)
E(X − Y) = E(X) − E(Y)
E(a + bX + cY) = a + b·E(X) + c·E(Y)

Var(X + Y) = Var(X) + Var(Y)
Var(X − Y) = Var(X) + Var(Y)
Var(a + bX + cY) = b²·Var(X) + c²·Var(Y)

Again, these are easily proven.
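A quick Python check of rule 3 by simulation, with arbitrarily chosen independent normal RVs:

```python
import random

# For independent X and Y, both Var(X + Y) and Var(X - Y) should come
# out close to Var(X) + Var(Y).
X = [random.gauss(0, 2) for _ in range(100_000)]   # Var(X) ≈ 4
Y = [random.gauss(0, 3) for _ in range(100_000)]   # Var(Y) ≈ 9

def var(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

print(var([x + y for x, y in zip(X, Y)]))   # ≈ 13
print(var([x - y for x, y in zip(X, Y)]))   # ≈ 13
```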
Rules of expectations and variances
4. Linear combinations of dependent RVs

• Not covered in UH, but straightforward

Let a and b be constants, and X and Y be (possibly dependent) random variables. Then:

E(X + Y) = E(X) + E(Y)                                 No difference in the mean
Var(X + Y) = Var(X) + Var(Y) + 2·Cov(X, Y)             But the variance changes
Var(X − Y) = Var(X) + Var(Y) − 2·Cov(X, Y)
Var(aX + bY) = a²·Var(X) + b²·Var(Y) + 2ab·Cov(X, Y)
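A quick Python check of rule 4, where Y is constructed to depend on X (an illustrative construction, not from the slides):

```python
import random

# When X and Y are dependent, the covariance term matters. Here
# Y = X + noise, so Cov(X, Y) > 0 and Var(X + Y) exceeds Var(X) + Var(Y).
X = [random.gauss(0, 1) for _ in range(100_000)]
Y = [x + random.gauss(0, 1) for x in X]          # Y depends on X

def mean(xs):
    return sum(xs) / len(xs)

def var(xs):
    m = mean(xs)
    return sum((v - m) ** 2 for v in xs) / len(xs)

def cov(xs, ys):
    mx, my = mean(xs), mean(ys)
    return sum((a - mx) * (b - my) for a, b in zip(xs, ys)) / len(xs)

lhs = var([a + b for a, b in zip(X, Y)])
rhs = var(X) + var(Y) + 2 * cov(X, Y)
print(lhs, rhs)   # both ≈ 5: Var(X)=1, Var(Y)=2, Cov(X,Y)=1
```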


Rules of expectations and variances

Why the difference for correlated RVs?


• Suppose 10 individuals are deciding whether or not to show up to a party. Each has a 50/50 chance of going
• If the individuals all decide independently, we tend to end up with very few extreme events (i.e., 0 show up or all 10 show up) and usually will have around 5 individuals
• If the individuals all text each other and decide to either all show up or all not show up (attendance for each individual is now dependent), we will always have either 0 or 10 individuals. The average is still 5, but the outcomes are more extreme
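A Python sketch of this party example, comparing the two scenarios by simulation:

```python
import random

# 10 people, each with a 50/50 chance. Independent decisions vs.
# perfectly coordinated decisions have the same mean but very
# different spreads.
def independent_party():
    return sum(random.random() < 0.5 for _ in range(10))

def coordinated_party():
    return 10 if random.random() < 0.5 else 0    # all or nothing

def mean_var(draws):
    m = sum(draws) / len(draws)
    v = sum((d - m) ** 2 for d in draws) / len(draws)
    return m, v

ind = [independent_party() for _ in range(100_000)]
cor = [coordinated_party() for _ in range(100_000)]
print(mean_var(ind))   # mean ≈ 5, variance ≈ 2.5  (= 10 * 0.5 * 0.5)
print(mean_var(cor))   # mean ≈ 5, variance ≈ 25   (outcomes only 0 or 10)
```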
Summary
• There are simple rules for expectations and variances
  • When we transform or combine random variables
  • As long as the transformation/combination is linear
• And these are the foundation for what comes next
  • Deriving expected values and variances for some common distributions


Discrete random variables
Exploring the derivation of discrete probability distributions and their properties: Bernoulli, Binomial and Poisson


The goal
• To define a theoretical Probability Density Function for a random variable:

f(k; p) = P(X = k | p)    "Probability that the RV X = k, given p"

X ~ f(p)                  "The RV X is distributed as f with parameter p"

where p is one or more parameters that determine the distribution of the random variable.

• And to use the PDF to derive expected values, variances, and probabilities

With discrete distributions, f(x) is technically called a "probability mass function" or PMF, but PDF is also used, and we will use it here.
Example

• Coin tosses:
  • Each individual toss is an RV with 2 outcomes
  • Let X be the random variable for each toss: Ω = {H, T}
• Pass Stat 311:
  • Each individual student is an RV with 2 outcomes
  • Let X be the random variable for each student: Ω = {Pass, No Pass}


The Bernoulli distribution
• Let X be a random variable with two outcomes: Ω = {0, 1} (we have to decide what is 1 and what is 0)

X ~ Bernoulli(p)

P(X = 1) = p,  P(X = 0) = 1 − p


Derivation of E(X) and Var(X)

X ~ Bernoulli(p)

E(X) = Σ k·P(X = k) = 0·(1 − p) + 1·p = p
E(X²) = 0²·(1 − p) + 1²·p = p
Var(X) = E(X²) − [E(X)]² = p − p² = p(1 − p)
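A quick Python check of these results by simulation (p = 0.3 is an arbitrary choice):

```python
import random

# A Bernoulli(p) variable takes value 1 with probability p. Simulated
# mean and variance should be close to p and p(1 - p).
p = 0.3
draws = [1 if random.random() < p else 0 for _ in range(100_000)]

m = sum(draws) / len(draws)
v = sum((d - m) ** 2 for d in draws) / len(draws)
print(m, p)              # ≈ 0.3
print(v, p * (1 - p))    # ≈ 0.21
```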


Getting more complicated
• If there are 50 students in a class, how many total students will show up to class?
  • Assume each student shows up with probability 0.8
  • Assume the attendance of each student is independent of other students
• Each student's attendance is a Bernoulli trial
• We want to know the sum of the Bernoulli trials
• How can we do this? (A quick simulation sketch follows.)
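As referenced above, a quick simulation sketch in Python:

```python
import random

# Each of 50 students shows up independently with probability 0.8, so
# attendance is a sum of 50 Bernoulli(0.8) trials (a Binomial, as the
# next slides show).
def attendance(n=50, p=0.8):
    return sum(random.random() < p for _ in range(n))

draws = [attendance() for _ in range(100_000)]
print(sum(draws) / len(draws))   # ≈ 40 (= n * p)
```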


Repeated Bernoulli trials: the Binomial

• Define a general probability distribution for the sum of n independent Bernoulli trials
• Let X be the count of the number of 1's, with outcome space {0, 1, 2, …, n}

X ~ Binomial(n; p)

Example: 3 coin tosses with H = 1

Value of X:    0      1                2                3
Outcomes:      TTT    HTT, THT, TTH    HHT, HTH, THH    HHH
Probability:   1/8    3/8              3/8              1/8
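A Python sketch that reproduces this table by enumerating all 2³ equally likely sequences:

```python
from itertools import product

# Enumerate every toss sequence and tally how many heads each contains.
counts = {}
for seq in product("HT", repeat=3):
    k = seq.count("H")
    counts[k] = counts.get(k, 0) + 1

for k in sorted(counts):
    print(k, counts[k], counts[k] / 8)   # 0:1/8, 1:3/8, 2:3/8, 3:1/8
```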


The Binomial distribution


• The Binomial describes the probability distribution of counts of successful trials
• It is a linear combination (sum) of Bernoulli RVs
  • So both the number of trials, n, and the probability of success on each trial, p, influence the result
• And the outcome space is now {0, 1, 2, …, n}


Binomial probabilities

There are 3 elements to the calculation:

1. Define the probability of each individual outcome in the sample space.
2. Identify the number of outcomes in the set of interest (i.e., that satisfy the condition X = k).
3. Multiply the probability of the outcome by the number in the set.


What is the probability of each outcome?

• Start with a single trial (n = 1):
  • What is the probability of each outcome y?

    p^y (1 − p)^(1−y)        Our friend the Bernoulli

• What about n = 2?
  • What is the probability of each outcome with k successes?

    p^k (1 − p)^(2−k)

• And in general, for n trials:

    p^k (1 − p)^(n−k)
How many outcomes satisfy X = k?

• This is a counting problem
• How many ways are there to get k successes in n trials when order does not matter?
• Out of our n trials, we need to choose k trials to be the successes
• Use the combination rule:

    (n choose k) = n! / (k!(n − k)!)
The “binomial coefficient”

The number of outcomes that satisfy the condition, for the 3-toss example:

Value of X:    0      1                2                3
Outcomes:      TTT    HTT, THT, TTH    HHT, HTH, THH    HHH

(n choose k):  (3 choose 0) = 1,  (3 choose 1) = 3,  (3 choose 2) = 3,  (3 choose 3) = 1

• (n choose k) is referred to as the binomial coefficient in this context
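For reference, Python's math.comb computes the same coefficients:

```python
from math import comb

# math.comb(n, k) gives the binomial coefficient "n choose k".
print([comb(3, k) for k in range(4)])   # [1, 3, 3, 1]
```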


The Binomial distribution

Putting these all together:

Let Y be a random variable with two outcomes, {0, 1}.
Let X be the number of successes in n trials, and p be the probability of success on each trial. Then:

X ~ Bin(n; p)        P(X = k) = (n choose k) p^k (1 − p)^(n−k)

where (n choose k) is the number of outcome combinations that have k successes, and p^k (1 − p)^(n−k) is the probability of each n-trial outcome with k successes.
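A minimal Python sketch assembling the PMF from these two pieces and reproducing the 3-toss table:

```python
from math import comb

# The Binomial PMF: combinations count times per-outcome probability.
def binom_pmf(k, n, p):
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

# 3 fair coin tosses reproduce the earlier table:
print([binom_pmf(k, 3, 0.5) for k in range(4)])   # [0.125, 0.375, 0.375, 0.125]
```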


Derivation of E(X) and Var(X)

X ~ Bin(n; p) is the sum of n independent Bernoulli trials: X = Σ Yi, summing over i = 1, …, n

E(X) = μ_X = E(Σ Yi) = Σ E(Yi)*  = n·μ_Y = np

Var(X) = σ_X² = Var(Σ Yi) = n·σ_Y²**  = np(1 − p)

* Using E(Z + W) = E(Z) + E(W)
** Using Var(Z + W) = Var(Z) + Var(W) for independent RVs Z and W
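A numeric check of this derivation in Python, computing E(X) and Var(X) directly from the PMF and comparing against np and np(1 − p), using the earlier 50-student example:

```python
from math import comb

# Mean and variance computed from the full Binomial PMF should match
# the closed forms np and np(1 - p).
n, p = 50, 0.8
pmf = [comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)]

mu = sum(k * pk for k, pk in enumerate(pmf))
var = sum((k - mu) ** 2 * pk for k, pk in enumerate(pmf))
print(mu, n * p)              # 40.0
print(var, n * p * (1 - p))   # 8.0
```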
Binomial Summary
• For any repeated Bernoulli trial, the count of successes, X, has a Binomial distribution:

X ~ Bin(n; p)

f(x; n, p) = (n choose x) p^x (1 − p)^(n−x)
μ_X = np
σ_X² = np(1 − p)

We can calculate the mean, variance, and the probability of any value of X from just two values: n and p


Other models
• Suppose I know that on average 10 cars pass through a specific intersection near my house each hour
• The number of cars that pass through in a given hour is random
• It is discrete (we're only considering whole cars)
• There is no set number of trials, and no maximum value that this RV can take
• How might we describe this process?


Poisson Distribution
• Poisson: used to model counts of events that occur at some rate
  • n (the number of trials) is large and not fixed
  • λ approximates a rate of events (e.g., per time unit, or per capita)

f(x; λ) = P(X = x) = e^(−λ) λ^x / x!,   for x ≥ 0

μ_X = σ_X² = λ
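A minimal Python sketch of the Poisson PMF (λ = 10 is an illustrative choice):

```python
from math import exp, factorial

# The Poisson PMF: f(x; lam) = e^(-lam) * lam^x / x!.
def poisson_pmf(x, lam):
    return exp(-lam) * lam ** x / factorial(x)

# With lam = 10, the PMF peaks near 10 and sums to 1 over all x:
print(poisson_pmf(10, 10))                          # ≈ 0.125
print(sum(poisson_pmf(x, 10) for x in range(100)))  # ≈ 1.0
```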


Poisson Distribution
• The numerator is always positive, so there is a positive probability for all x ≥ 0, although it gets very small as x becomes large
• We assume that there is a constant rate
• We assume that all "arrivals" are independent of each other


Example
• Suppose I know that on average 10 cars pass through a specific intersection near my house each day
  • What is λ?
  • What is the probability that 8 cars pass through the intersection in an hour?
  • What is the probability that 15 cars pass through the intersection in an hour?
• Why might a Poisson assumption be wrong for this model?
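One hedged reading of this example in Python: if the 10 cars are per day, the hourly rate would be λ = 10/24 (this unit conversion is our assumption, not stated on the slide):

```python
from math import exp, factorial

# Assumption: lam = 10 cars/day divided by 24 hours ≈ 0.417 per hour.
lam = 10 / 24

def poisson_pmf(x, lam):
    return exp(-lam) * lam ** x / factorial(x)

print(poisson_pmf(8, lam))    # P(8 cars in an hour): tiny, ≈ 1.5e-8
print(poisson_pmf(15, lam))   # P(15 cars in an hour): smaller still
```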
Summary:
• A discrete random variable takes specific values
  • Typically integer counts
  • Each with a certain probability
• If you can represent the underlying stochastic process as a mathematical function
  • You can calculate almost anything you want for the RV
  • E(X), Var(X), P(X = k), P(X ≤ k) or P(i < X < j)
• The distribution is defined by the stochastic process
  • The details of the process matter, and are reflected in the formal definition of the distribution
