0% found this document useful (0 votes)
61 views87 pages

Return of The Lectures: Unit 4

The document discusses discrete probability concepts including sample spaces, probability properties, discrete random variables, expected value, variance and standard deviation. It provides examples and explanations of these concepts.

Uploaded by

richard langley
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
61 views87 pages

Return of The Lectures: Unit 4

The document discusses discrete probability concepts including sample spaces, probability properties, discrete random variables, expected value, variance and standard deviation. It provides examples and explanations of these concepts.

Uploaded by

richard langley
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 87

UNIT 4: RETURN OF THE LECTURES

PROBABILITY IS HERE

MATHS METHODS
FT. BRACKETS
& BRACKETS
& BRACKETS
NO
SPAGHETTI
HERE
LECTURE OUTLINE
Integration

Discrete Probability

Binomial Distributions

Normal Distributions

Continuous Probability

Statistics
LEARNING OUTCOMES

Antidifferentiation and Integration


Computations

Discrete Probability
Probability properties
Sample spaces
Conditional probability, independence and mutual
exclusiveness
Discrete random variables
Expectancy and Variance
INTEGRATION

Integral, heh, to our learning of continuous probability.

Is the opposite function to the derivative


= +

Dont forget and dont forget +


BASIC RULES
RULE EXAMPLE
1
= +1 + 2 = 2 +
+1


1 +1 2
1 3
+ = + + 2 + 3 = 2 + 3 +
+1 6

1
= + 2 2 = 2 +

1 3
= log + = 3 log +

1
sin = cos + 2 sin = 2 cos +

1 1
cos = sin + cos 3 = sin 3 +
3
INTEGRATION

Integrating between two bounds is called integration

We apply the fundamental theorem of calculus


= ()

Where () is an antiderivative of ()

We can drop the + but not the


INTEGRATION EXAMPLE

2
sin 4 + 4 3
0

1 2
= cos 4 + 4
4 0
1 4 4 1
= cos + cos 0 + 0 4
4 2 2 4
1 4 1
= + +
4 16 4
4
=
16
PROBABILITY

Probability is the extent to which an event is likely to occur

Is measured bye the ratio of favourable outcomes to total


outcomes

number of
Pr =
total

We measure probability in decimals such that

0 Pr() 1
SAMPLE SPACE

A sample space is a list of the total possible outcomes

Consider two fair six sided dice, the sample space is as


follows:

1,1 , 1,2 , 1,3 6,6

Consider a trio of fair coins being tossed, the sample space


is as follows:

, , , , , , , , , , ,
PROBABILITY PROPERTIES

We represent common entities between sample spaces as


intersections

Pr

We represent all of the entities between sample spaces as


the union

Pr = Pr + Pr Pr( )
VENN DIAGRAMS

Pr(
)
KARNAUGH MAPS
Pr = 1 Pr

Read as: A complement, meaning all but

Pr( ) Pr( ) Pr()

Pr( ) Pr( ) Pr( )

Pr() Pr( ) 1
KARNAUGH MAPS
In a sample space:

Pr = 0.1 and Pr = 0.2 and Pr A = 0.1

0.1 0.1 0.2

0 0.8 0.8

0.1 0.9 1
TREE DIAGRAMS

Tree diagrams are useful for multi-step probability


problems, that is, problems that require multiple stages,
such as flipping a coin and then flipping another one.

Pr = 0.5 0.3 = 0.15


0.3


0.5
0.7
Pr = 0.5 0.7 = 0.35

0.5

PROBABILITY PROPERTIES

The probability of an event occurring, given that a certain


event has already happened is called conditional

Pr( )
Pr =
Pr()

If two events are mutually exclusive, then they have no


intersection, thus their union is their added probabilities

Pr = Pr + Pr()
PROBABILITY PROPERTIES

If two events are independent, that is one event occurring


has no impact on the other, they have the following
properties

Pr = Pr()

Pr = Pr Pr()

Mutual exclusivity should not be confused with


independence, they have different properties and one
existing does not necessarily mean the other does as well
DISCRETE RANDOM VARIABLES

A discrete random variable has the following properties:

It can only take countable values


The probabilities add to one

Which of the following are discrete variables?

Money Gender Postcodes

Months in progress Number of houses Height


DISCRETE RANDOM VARIABLES

Heres what a discrete random variable can look like:

1 2 2.5 3
Pr( = ) 0.1 0.2 0.3 0.4

From this we can deduce that:

0 Pr( = ) 1

Pr = = 1
DETERMINING PROBABILITIES
1 2 2.5 3
Pr( = ) 0.1 0.2 0.3 0.4

The probability of an event occurring can be established


straight from the table

Pr = 2.5 = 0.3

The probability that spans more than one discrete value can
be established through addition

Pr > 2 = Pr = 2.5 + Pr = 3 = 0.7


EXAMPLE
For the following discrete random variable, find all possible
values of and calculate Pr( 2) .
1 2 2.5 3
Pr( = ) 0.2 3 0.4

The sum of the probabilities must equal one

+ 2 + 0.2 + 0.4 = 1
= 0.1

Determine the probability by adding the required values

Pr 2 = Pr = 2 + Pr = 2.5 + Pr( = 3)
= 0.2 + 0.3 + 0.4 = 0.9
EXPECTED VALUE (MEAN)
Measures the centre of the distribution

E = = Pr( = )

Is calculated by summing the multiples of the random


variables and their respective probabilities

1 2 2.5 3
Pr( = ) 0.1 0.2 0.3 0.4

E = 1 0.1 + 2 0.2 + 2.5 0.3 + 3 0.4 = 2.45


VARIANCE

Measures the spread of the distribution

Var = 2 = E 2 E X 2

Is calculated by summing the squares of the variables and


their respective probabilities, less the square of the mean
1 2 2.5 3
Pr( = ) 0.1 0.2 0.3 0.4

E 2 = 12 0.1 + 22 0.2 + 2.52 0.3 + 32 0.4 = 6.375


Var = 6.375 2.452 = 0.3725
STANDARD DEVIATION

Is also a measure of spread of a distribution

= = Var() = E 2 E X 2

Is calculated by taking the square root of the variance

Notice this means that the variance can never be negative

= Var = 0.610
EXAMPLE
For the following discrete random variable find the mean and
the standard deviation.

2 0 2 4
Pr( = ) 0.1 0.3 0.1 0.5

E = 0.2 + 0.2 + 2 = 2

E 2 = 0.4 + 0.4 + 8 = 8.6

Var X = 8.6 22 = 4.6

= 4.6 = 2.144
EXPECTATION THEORIES

Expectation theories allow us to work with the mean and


variance of a random discrete distribution without re-writing
the distribution

E + = E +

Var + = 2 Var()

The variance has no horizontal translation, why do you think


this is?
EXAMPLE
2 0 2 4
Pr( = ) 0.1 0.3 0.1 0.5

E =2 and Var = 4.6

Given that = 3 + 2, Find Var , E() and its standard


deviation

E =32+2=8

Var = 32 4.6 = 41.4

= 41.4 = 6.434
APPLICATION
The amount of cars that pass through a certain drive-thru on
any particular day follows the discrete random variable, .
0 6 9 12
Pr( = ) 0.1 0.3 0.1 0.5

A. What is the average amount of cars that pass through?

B. On a particular day, a few cars have passed through the


drive-thru, what is the probability that the amount of cars is
greater than 6?

C. What is the probability that no cars pass, 5 days in a row?


APPLICATION (A)

0 6 9 12
Pr( = ) 0.1 0.3 0.1 0.5

What is the average amount of cars that pass through?

E = 6 0.3 + 9 0.1 + 12 0.5 = 8.7

It is expected that, on any given day, that 8.7 cars will pass
through the drive thru.
APPLICATION (B)

0 6 9 12
Pr( = ) 0.1 0.3 0.1 0.5

On a particular day, a few cars have passed through the drive-


thru, what is the probability that the amount of cars is greater
than 6?

This is a conditional probability question.

Pr( > 6) 0.6 2


Pr > 6 > 0 = = =
Pr( > 0) 0.9 3
APPLICATION (C)

0 6 9 12
Pr( = ) 0.1 0.3 0.1 0.5

What is the probability that no cars pass, 5 days in a row?

Each day is independent of each other, so we determine the


probability of it occurring on day one and raise it to the power
of 5.

Pr = 0 = 0.1

Pr 0,0,0,0,0 = 0.15 = 0.00001


EXAM TIPS
You must know all of the properties of probability before
heading into your first exam, without them you will not be
able to answer most questions

The terminology, and their respective formulae (union,


independence, mutual exclusivity, conditional, etc.) are the
same for all distributions.

Express your answers in the required form, especially when


dealing with expected value.

Discrete probability questions always appear in exam 1


SUMMARY

Integrations basic rules

Definition of a probability
Sample space
Terminology
Union, intersect
Probability properties
Independent, conditional and mutually exclusive
Discrete random variables
Mean, variance, standard deviation and
expectation theories
LEARNING OUTCOMES

Binomial Distributions
Factorial and combinations
Bernoulli sequences
Expectancy and Variance

Continuous Probability
Continuous random variables
Measures of centre
Measure of spread
BERNOULLI SEQUENCES

A Bernoulli sequence is a sequence of trials with the


following properties:
Each trial result has one of two outcomes, either
success or failure
The success of a trial, , is constant for all trials,
and each failure of a trial, 1 , is also constant
The trials are independent, such that any one trail
does not affect any other trial

An example could be a coin toss where heads is success,


as each event is independent and the chance for success
and failure is the same throughout all trials
BINOMIAL DISTRIBUTION

If , is the number of successes in a Bernoulli sequence,


each with a success probability of , then , is called a
binomial random variable, and has a binomial distribution.
We write any binomial random variable as follows:

~ ,

The difference between a binomial distribution and a


discrete one is that order doesnt matter.

For example, obtaining 5 favourable outcomes out of 10


trials, can all occur at the start or towards the end.
FACTORIAL AND COMBINATION

Factorial is another operator and is represented using the


exclamation mark such that:

! = 1 2 2 1

The combination formula represents the amount of groups


of size , that can be formed from entities

!
=
! !
EXAMPLE

Evaluate 6!.

6! = 6 5 4 3 2 1 = 720

How many different combination of 4 students can be chosen


from a group of 15?

15 15! 15 14 13 12
= = = 1365
4 4! 11! 4321

There are 1365 different combinations.


DERIVATATION OF FORMULA

If we were to observe trials and obtain successes, of


probability,, regardless of order, we would:

Pr 3 success, 2 fails = 1 1

Pr success, fails = 1


Ways we can arrange the amount of successes =


Pr = = 1

EXAMPLE
Given a soccer player, in practice, has a 90% chance of kicking
a goal, what is the probability that he makes all his shots if he
takes 10? What is the probability he makes half his shots?

~(10, 0.9)

10
Pr = 10 = 0.910 0.10
10
= 0.910

10
Pr( = 5) = 0.95 0.15
5
= 252 0.95 0.15
= 0.001 (3d. p)
EXAMPLE CONTINUED

What is the probability that he makes at least 1 of his shots?

~(10, 0.9)

In scenarios like this, you can add all the probabilities from 1
to 10, however, as the sum of all probabilities must total 1, we
can take the complement and subtract it from the total
probability.

Pr 0 = 1 Pr = 0 = 1 0.110 = 0.999
EXPECTANCY AND VARIANCE

Expected value, or mean, is the average amount of


successes we can expect given our distribution

E =

Variance measures the spread of the distribution

Var = (1 )
EXAMPLE

Stephen Curry makes 55% of his three point shots in a typical


game. Given that he takes 16 shots in a particular game what
is the expected amount of shots he will make? What is the
variance of Currys shooting distribution?

~(16, 0.55)

E = = 0.55 16 = 8.8

Var = 1 = 0.55 16 0.45 = 3.96


CALCULATORS

Being able to utilise a calculator is extremely important. It is


integral, sorry, as we most of probability is on exam 2.

Speed and efficiency is the most important.

The following information will be for the TI-nspire CX CAS

For all other calculators, make sure you know shortcuts to


bring up the screens required. Usually they are titled
Binomial CDF or Binomial PDF.
EXAMPLE CONTINUED
On game 5 of the NBA finals, Curry makes a few shots, what is
the probability that he made two of his shots, given that he
shot the ball 9 times?

~(9, 0.55)

Pr( = 2)
Pr = 2 > 0 =
Pr( > 0)

0.041
=
0.999
= 0.041
Curry has a 4.1% chance to make 2 of his 9 shots.
CONTINOUS PROBABILITY
A continuous variable is one that can take any real number
on a number line
A continuous random variable has the following properties:
0 for all
The area enclosed by the graph and the
must equal to one
Which of the following are continuous variables?

Money Gender Postcodes

Months in progress Number of houses Height


EXAMPLE
Is the follow a continuous random variable?

3 2
= 01
0 elsewhere

Is () 0 for all ?

3 2 increases as does, 0 = 0 0 for all

Is the area under the graph 1?


1
= 1
0
DETERMINING PROBABILTIIES

Probabilities for continuous functions only occur over


intervals, that is we cannot have Pr( = ) .

We determine probabilities using integration


Pr < < =

Whether or not the signs are inclusive is irrelevant, we


obtain the same answer
EXAMPLE
2
=3 01
0 elsewhere

Pr 0 < < 1 =
Is a C. R. V. so Pr 0 < < 1 = 1
1
Pr 0 < < =
2
1/2
3 2 = 0.125
0
1
Pr <<1 =
2
1
1 Pr 0 < < = 0.875
2
EXPECTANCY

The expected value, or mean, of a continuous random


variable, , is given by:


= E =

If () is a function of then the expected value of () is


given by:


E = ()

MEDIAN

The median is another measure of centre and is the middle


most value such that 50% of the distribution lies above the
median, and 50% below.

The median, , is calculated as follows:


= 0.5

In general, the median is different to the mean.


VARIANCE AND STANDARD DEVIATION

Variance and standard deviation are measures of spread

Variance is calculated as follows

Var = E X 2 2

E X 2 = 2

Standard deviation is the square root of variance

= V(X)
INTERQUARTILE RANGE

The interquartile range is the difference between the 75th


percentile and the 25th percentile, that is, it is the range in
which the middle 50% of values exist.

IQR =


= 0.25 and = 0.75

EXAMPLE
Consider the following probability density function:

+ 2 , 01
=
0,

2
If E X = find and .
3

1 1
2
+ 2 = 1 and + 2 =
0 0 3
1
= and = 2
3
EXAMPLE CONTINUED
1 2,
= 3 + 2 01
0,

Evaluate Var(X)

Var X = E X 2 2
1 2
1 2
= 2 2
+ 2
0 3 3
23 4
=
45 9
1
=
15
EXAMPLE CONTINUED
1 2,
= 3 + 2 01
0,

Find the median for the distribution.


1
+ 2 2 = 0.5
0 3
25 3
+ = 0.5
5 9
= 0.995

The median is 0.995, correct to 3 decimal places.


EXAMPLE CONTINUED
1 2
= 3 + 2 , 01
0,

Find the interquartile range for this distribution.


= 0.25 and = 0.75

= 0.5 and = 0.881

IQR = = 0.381
EXAMPLE CONTINUED
1 2,
= 3 + 2 01
0,

Find Pr( > 0.2| < 0.7)

Pr(0.2 < < 0.7)


Pr( > 0.2 < 0.7 =
Pr( < 0.7)
0.39
=
0.462

= 0.844
EXAM TIPS
Binomial probability rarely appears in exam 1, if it does it
will be scenarios that involve small numbers.

Students often have trouble using the new screens on their


calculator, get practice on it, speed test yourself, this is
what separates students.

Rounding is also an issue, students are careless when


answering questions, round your answers to correct decimal
places.

Note in exam 1, if your answer is incomputable or is difficult


to compute, you most likely messed up.
EXAM TIPS

Continuous probability rarely appears in exam 1, if it does it


would be a simple integration question, or the final question
on the exam, such as in 2016.

That means, get to know your calculator, speed and


accuracy is what sets students apart.

This is the easiest of the all the probabilities as no new


knowledge is required to compute problems, no new
screens to worry about. This should be the quickest section
in your exam.
SUMMARY
Binomial Distributions
Properties of a binomial distribution
Determining probabilities
Expected value and variance
Understanding the binomial distribution formula
Using your calculator

Continuous distributions
Properties of a P.D.F
Determining probabilities
Mean, median, variance, standard deviation and
IQR
LEARNING OUTCOMES

Normal Distributions
Standardisation and the 68 95 99.7% rule
Determining normal probabilities
Solving problems
Central limit theorem

Statistics
Populations and samples
Large and small populations
Distribution of sample proportion
Confidence intervals
NORMAL DISTRIBUTION

When something is normally, or approximately, normally


distributed, it means the data is centred around a mean
and follows a bell curve.

The normal distribution is a special type of continuous


probability distribution and is written like:

~(, )

1 1 2

= 2
2
THE BELL CURVE

Bertrand,. (2016, 20/5). Miller Analogies Test Bell Curve. https://magoosh.com/mat/miller-analogies-test-bell-curve.


BELL CURVE EXPLAINED

Most data lies around the centre

The further out you go, the less data

Each line represents a standard deviation from the mean,


the more standard deviations the further out

68% of the data lie within 1 standard deviation of the mean


95% within 2
99.7% within 3
68 95 99.7% RULE
If the age of students in a particular awesomely spectacular
methods lecture was normally distributed with a mean of 17
and a standard deviation of 0.3:

Over what range, do we expect 68% of the students to lie?

~(17, 0.32 )

Using our rule, 68% of data lies within 1 standard deviation of


the mean:

16.7 and 17.3


EXAMPLE CONTINUED

~(17, 0.32 )

Using the rule, find such that Pr < < 17 = 0.475

We have discovered that the value for is two standard


deviations below the mean

= 17 2 0.3 = 16.4
THE STANDARD NORMAL

The bell curve you just saw was one of many, it was the
standard normal distribution and has special properties

~(0, 1)

We can standardise any normal distribution:

The standardised value tells us the number of standard


deviations the value lies from the mean
EXAMPLE
~(17, 0.32 )

How many standard deviations from the mean is the age 17.6?
17.6 17
= =2
0.3

How many standard deviations from the mean is the age 16.7?
16.7 17
= = 1, =1
0.3

Hence evaluate Pr(16.7 < < 17.6) using the rule.


0.68 0.95
Pr 16.7 < < 17.6 = + = 0.815
2 2
EXACT VALUES

The rule is only approximate, for example, not exactly 68%


of the data lies between 1 standard deviation of the mean,
but its close enough

In order to determine the exact values we will need


assistance from our calculators

We will use the Normal CDF function on our calculators


for this purpose
EXAMPLE

Find the exact percentage of the amount of data that lies


within one standard deviation of the mean.

normCdf(1, 1, 0, 1)

= 0.682689

This was performed on a TI - nspire CX CAS

Notice that it is not exactly 68%, this is true for the other
percentages as well.
EXAMPLE

The amounts of spins per minute on a fidget spinner is


normally distributed with a mean of 120 and a standard
deviation of 10. Find the probability that the amount of spins
is greater than 117.

~(120, 102 )

normCdf(117, , 120, 10)

= 0.6179
INVERSE NORMAL
The inverse normally, as the name may suggest, is the
opposite of the normalCdf function.

When we used the normalCdf before we plugged in the


lower and upper bound to give us the area

The inverse normal we can provide the area and it will give
us the upper bound, ASSUMING, that the lower bound is
.

This is for the TI - nspire CX CAS, some calculators may


allow you to set both the lower and upper bound
EXAMPLE
The amounts of spins per minute on a fidget spinner is
normally distributed with a mean of 120 and a standard
deviation of 10.

It is found that, 10% of fidget spinners are defective. A spinner


is defective is it more than amount of spins. Find . (3dp)

Pr > = 0.1
Pr < = 0.9
invNorm(0.9, 120, 10)
= 132.8155

= 132.816
DETERMINING AND

Sometimes we are not given the standard deviation or the


mean, however we can determine these using the inverse
normal function. Recall that:

By standardising any normal distribution we can use the


inverse function as we know what the mean and standard
deviation is

~(0, 1)
EXAMPLE
is normally distributed with a mean and standard
deviation . It is known that 35% of the data lies under = 6.
And 45% lies above = 10. Find and .

Pr < = 0.35 Pr > = 0.45


6 10
Pr < = 0.35 Pr > = 0.45

6 10
Pr < = 0.35 Pr < = 0.55

invNorm(0.35 , 0, 1) invNorm(0.55, 0, 1)
6 10
= 0.385 = 0.126

= 9.016 = 7.828
STATISTICS

Statistics is taking all about taking a sample from a


population

In the real world, the population is too large, so we work


with statistics

A population can be defined as anything, for example, this


lecture room can be a population and the people at the
front are a sample.
SAMPLE PROPORTION

A sample proportion is the ratio of desired outcomes to


total outcomes in a certain sample

For example, if I was interested in red balls, and I take a


sample of 10 balls, where in which 5 of them were red, then
1
my sample proportion is .
2

The sample proportion is denoted with .


The distribution of sample proportion is .
SMALL POPULATION

Sample proportions are all based on the binomial


distribution

A small population, is a population of size about equal or


less to approximately 20.

When establishing the distribution of sample proportions


we use counting methods.
EXAMPLE

In a population of 10 batteries, it is known that 6 of these are


defective. Determine the sample proportion distribution of
defective batteries if I take a sample of 3.

3 6 4 3 3
Pr = 1 = =
1 10 9 8 10
0 1 2 3
Pr( = ) 1 3 1 1
6 10 2 30

0 1 2 1
3 3
Pr( = ) 1 3 1 1
6 10 2 30
LARGE POPULATION

A large population has an uncountable amount of entities,


so instead of using counting methods we must use the
binomial distribution

For a large sample you will be given a population


proportion, . This will be our success rate.

We will still take a sample size to create a sample propotion


distribution
EXAMPLE

A shipment of ten million fidget spinners has arrived. It is


known that 20% of all fidget spinners are red. We take a
sample of 100 spinners.

Evaluate Pr > 0.1

Pr > 0.1 = Pr > 10


= 0.994
EXAMPLE CONTINUED

Evaluate Pr( 0.2| < 0.3)

Pr(20 < 30)


Pr 0.2 < 0.3 =
Pr( < 30)

0.5398
=
0.9939

= 0.5431
CENTRAL LIMIT THEOREM

When we have a large population the distribution can be


approximately normal

Which means we have methods of determining the


standard deviation and mean

= =

(1 )
=

CONFIDENCE INTERVAL
A confidence interval is determined from a sample

It is the estimate, such that a certain percentage of all


sample proportions will lie within this confidence interval

In general the confidence interval is:

1 1
, +


Where Pr < < =
100
EXAMPLE
A certain sample of 100 Spotify users were taken, out of these
80 were premium users. Determine a 95% confidence interval
for .
80
= = 1.96
100

80 0.8 0.2 80 0.8 0.2


: 1.96 , + 1.96
100 100 100 100

451 549
: ,
625 625
SUMMARY
Normal Distributions
68 95 99.7% Rule
Determining exact and approximate areas
Determining inverse normals
Using your calculator

Statistics
Sampling from small and large populations
Distributions of sample proportions
Expected value and standard deviation
Confidence intervals
Questions?

You might also like