0% found this document useful (0 votes)

9 views11 pages

Tutorial 2 - Questions.

Uploaded by

bhattibaba118

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views11 pages

Tutorial 2 - Questions.

Uploaded by

bhattibaba118

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

ICT583

7 Mar 2023

ICT583 Data Science Applications

Tutorial 2
Mathematical Preliminaries

Scenarios
Many instances of binomial distributions can be found in real life.
- For example, if a new drug is introduced to cure a disease, it either cures the disease
(it’s successful) or it doesn’t cure the disease (it’s a failure).
- If you purchase a lottery ticket, you’re either going to win money, or you aren’t.
Basically, anything you can think of that can only be a success or a failure can be represented by
a binomial distribution.
A binomial distribution can be thought of as simply the probability of a SUCCESS or FAILURE
outcome in an experiment or survey that is repeated multiple times.

Maths
X ~ Binomial(n, p)
- For binomial distribution of the outcomes, n = number of observations, p = probability
Question: it is a discrete probability distribution or continues one? – discrete

Coding
In these exercises, you will practice using rbinom() function to generate random “flips” that are
either “heads” (1) or “tails” (0), to simulate random data
https://stat.ethz.ch/R-manual/R-devel/library/stats/html/Binomial.html
rbinom(n, size, prob)
generates required number of random values of given probability from a given sample.
n - number of observations
size - number of trials
prob - probability of success of each trial.

1
ICT583
7 Mar 2023

Part One: Simulating coin flips

1.1 Flipping a coin in R
# you will simulate 10 coin flips, each with a 30% chance of coming up “heads”:
- 10 coin flips = 10 observations
- Flipping a coin
- 30% probability of head
rbinom(10, 1, .3)

Flipping multiple coins in R

# Generate 100 occurrences of flipping 10 coins, each with 30% probability
- 100 observations
- Flipping 10 coins,
- 30% chance of getting head
- rbinom(100, 10, .3)

how about increasing the number of observations, as well as the number of trials in each
observation?

rbinom(100, 3, .5) %>% hist

rbinom(100, 6, .5) %>% hist

rbinom(1000, 6, .5) %>% hist

rbinom(1000, 9, .5) %>% hist

rbinom(10000, 9, .5) %>% hist

rbinom(10000, 12, .5) %>% hist

f (event) = prob
rbinom(10000, 12, .5) %>% hist(freq = F)

2
ICT583
7 Mar 2023

1.2 Calculating density of a binomial

by hands
calculate the probability that 2 are heads, out of 10 trials, using the binomial probability mass
function (binomial PMF) formula
- n = number of trials = 10
- k = number of desired outcomes = 2
- p = probability = 0.3

First, compute the binomial coefficient

10!/ (8! 2!)
= (10 × 9 × 8 × 7 × 6 × 5 × 4 × 3 × 2 × 1) / (8 × 7 × 6 × 5 × 4 × 3 × 2 × 1) × (2 × 1)
= (10 * 9) / (2 * 1)
= 90 / 2
= 45
Then calculate the probability when k = 2
P(X = 2)
= 45 * (0.3 ^ 2) * (0.7 ^ 8) = 0.2334744

Coding
dbinom(x, size, prob)
gives the probability density distribution at each point.
x - vector of numbers (specify where you want to evaluate the binomial density)
If you flip 10 coins each with a 30% probability of coming up heads, what is the probability
exactly 2 of them are heads?
# Calculate the probability that 2 are heads using dbinom
dbinom(2, 10, .3)
plot(1:10, dbinom(1:10, size=10, prob=.3), type='h')

3
ICT583
7 Mar 2023

# Confirm your answer with a simulation using rbinom

For example, you will observe 100 times, the random deviates are
r = rbinom(100, 10, .3)

# which of the results exactly have 2 heads?

R <- r == 2

# to compute the proportion of these logical results,

mean(R)

# how about increasing the number of observations?

# what do you observe?

# we know the chance of head is 0.5, so, what is the probability of getting 2 heads given 10
trials per observation?
dbinom(2, 10, .5)

# how about 8 heads?

dbinom(8, 10, .5)

4
ICT583
7 Mar 2023

1.3 Calculating cumulative density of a binomial

pbinom(x, size, prob)
gives the cumulative probability of an event.

Scenario
If you flip ten coins that each have a 30% probability of heads, what is the probability at least
five are heads?

# Calculate the probability that at least five coins are heads

# we know the cumulative density of less than five heads is
r = pbinom(4, 10, .3)

# cumulative density curve

lapply(1:10, function(x) pbinom(x, 10, .3)) %>% unlist%>% plot
plot(1:10, pbinom(1:10, 10, .3), type="h")

# the cumulative density of five heads or more will be

1-r

# Confirm your answer with a simulation using rbinom, with 10000 observations
mean( rbinom(10000, 10, .3) >= 5 )

Try to simulate 100, 1000, 10000, 100000 observations.

Which is closest to the exact answer?

5
ICT583
7 Mar 2023

1.4 Expected values and variance for binomial distribution

Expected values
- e.g., we expect the chance of getting head for flipping a coin for unlimited number of
times will be 0.5
Calculate the expected value using the exact formula
Expect value = n * p
- n = number of trials
- p = probability
# What is the expected value of a binomial distribution where 1 coin is flipped, having 50%
chance of head?
1 * 0.5
# What is the expected value of a binomial distribution where 25 coins are flipped, each having a
30% chance of heads?
25 * .3 = 7.5
# Confirm with a simulation using rbinom, assuming 10000 observations
mean(rbinom(10000, 1, 0.5))
mean(rbinom(10000, 25, 0.3))

Variance
What is the variance of a binomial distribution where n coins are flipped, each having a p chance
of heads?
Var= n * p * (1-p)
When n = 1, p = .5, var = 0.25, SD = sqrt(.25)
When n = 25, p = .3, var = 5.25 , SD = 2.291288
# Confirm with a simulation using rbinom
r = rbinom(10000, 25, 0.3)
var(r)
sd(r)

6
ICT583
7 Mar 2023

Part Two Probability of compound events

If events A and B are independent, and
- A has a 40% chance of happening, and
- event B has a 20% chance of happening,

by hands
what is the probability they will both happen?
Joint probability: P(A ⋂ B) = P(A) * P(B) # intersection
= .4 * .2
what is the probability either A or B will come up heads?
Union probability: P(A ⋃ B) # union
= P(A) + P(B) – P(A ⋂ B)
= .4+.2 - .4 * .2

Coding
Assuming 100000 observations done, one trial,
A <- rbinom(100000, 1, .4)
B <- rbinom(100000, 1, .2)
a = mean(A)
b = mean(B)

j = a * b
u = a + b – j

# or
mean(A & B)
mean(A | B)

7
ICT583
7 Mar 2023

Part Three: Normal distribution

- For continuous variable
Suppose you flipped 1000 coins, each with a 20% chance of being heads.
What would be the mean and variance of the binomial distribution?
# Mean
=n*p
= 1000 * 0.2 = 200
# Variance
= n * p * (1-p)
= 1000 * 0.2 * 0.8 = 160

3.1 Simulating from binomial and normal

rnorm(n, mean, sd)
mean - mean value of the sample data. It's default value is zero.
sd - standard deviation. It's default value is 1.

When a random variable X is normally distributed with mean mu and standard deviation sigma.
# Draw a random sample of 100,000 from the Binomial(1000, .2) distribution
b <- rbinom(100000, 1000, .2)
plot(hist(b, breaks=30))
hist(b, breaks=50, main = "my binomial dist")

# Draw a random sample of 100,000 from the normal approximation

g <- rnorm(100000, 200, sqrt(160))
plot(hist(g, breaks=30))
hist(g, breaks=50, main = "Gaussian dist")

8
ICT583
7 Mar 2023

# probability density
g <- rnorm(100000, 0, sqrt(1))
plot(hist(g, breaks=30))
hist(g, breaks=50, main = "Gaussian dist", freq = F)

9
ICT583
7 Mar 2023

3.2 Comparing cumulative density of the binomial

pnorm(x, mean, sd)
gives the probability of a normally distributed random number to be less that the value of a
given number. It is also called "Cumulative Distribution Function".

# Simulations from the normal and binomial distributions

b <- rbinom(100000, 1000, .2)
g <- rnorm(100000, 200, sqrt(160))

# Use binom_sample to estimate the probability of <= 190 heads

mean(b <= 190)

# Use normal_sample to estimate the probability of <= 190 heads

mean(g <= 190)

# Calculate the probability of <= 190 heads with pbinom

pbinom(190, 1000, .2)

# Calculate the probability of <= 190 heads with pnorm

pnorm(190, 200, sqrt(160))

10
ICT583
7 Mar 2023

Expected value and variance for random variables

# dice
dice = 1:6
p = 1/6
EV = sum(dice)*p
var = map(dice, function(x) p * ( x - EV)^2 ) %>% unlist %>% sum
sd = sqrt(var)

# blood type
A couple has a 25% (p) chance of a having a child with type O
blood. What is the chance that three (X) of their five (n) kids
have type O blood?

dbinom(3, 5, .25)
p= map(0:5, function(x) dbinom(x, 5, .25))
EV = map2(0:5, p, function(x, y) x*y ) %>% unlist %>% sum
var= pmap(
list(
as.list(0:5),
EV,
p
) ,
\(x,y,z) z * (x-y)^2
) %>% unlist %>%sum

n = 5
p = .25
1-p = .75

5*.25
5*.25*.75

Guo S Manuals SOA Exam C PDF
No ratings yet
Guo S Manuals SOA Exam C PDF
284 pages
Binomial Distribution Powerpoint 1
100% (2)
Binomial Distribution Powerpoint 1
17 pages
2 - Probability and Queueing Theory
No ratings yet
2 - Probability and Queueing Theory
178 pages
Lecture 13 (Discrete Probability Distribution)
No ratings yet
Lecture 13 (Discrete Probability Distribution)
14 pages
UNIT 4 - Part B
No ratings yet
UNIT 4 - Part B
15 pages
Ma 11 Syllabus
No ratings yet
Ma 11 Syllabus
36 pages
Lec 4
No ratings yet
Lec 4
69 pages
Module 5 Common Discrete Probability Distribution - Latest
No ratings yet
Module 5 Common Discrete Probability Distribution - Latest
45 pages
5221 Basic Probability Distributions in R MCA MMS 20MCA2CC9
No ratings yet
5221 Basic Probability Distributions in R MCA MMS 20MCA2CC9
30 pages
Maximum Likelihood Estimation
No ratings yet
Maximum Likelihood Estimation
8 pages
Statistical Thermodynamics: Advanced Physical Chemistry
No ratings yet
Statistical Thermodynamics: Advanced Physical Chemistry
97 pages
Binomial Normal Distribution
No ratings yet
Binomial Normal Distribution
47 pages
ChapterStat 2
No ratings yet
ChapterStat 2
77 pages
5 Probability Distributions
No ratings yet
5 Probability Distributions
88 pages
Discrete Probability Distribution
No ratings yet
Discrete Probability Distribution
34 pages
Random Variables
No ratings yet
Random Variables
68 pages
Some Important Theoretical Distributions: 3.1 Binomial Distribution
No ratings yet
Some Important Theoretical Distributions: 3.1 Binomial Distribution
35 pages
Richman Moorman 2000 Physiological Time Series Analysis Using Approximate Entropy and Sample Entropy
No ratings yet
Richman Moorman 2000 Physiological Time Series Analysis Using Approximate Entropy and Sample Entropy
11 pages
O Level Maths Book 4 24-25
No ratings yet
O Level Maths Book 4 24-25
216 pages
Module 5 - Discrete Probability Distributions Upd
No ratings yet
Module 5 - Discrete Probability Distributions Upd
68 pages
ExamQuestions Probabaility
No ratings yet
ExamQuestions Probabaility
118 pages
Probability Distributions.
No ratings yet
Probability Distributions.
46 pages
Class 1 - Binomial
No ratings yet
Class 1 - Binomial
25 pages
Chapter 7 Eng
No ratings yet
Chapter 7 Eng
59 pages
5 Random Variables
No ratings yet
5 Random Variables
116 pages
Foundations of Probability With R
No ratings yet
Foundations of Probability With R
70 pages
Probability Distributions in R
No ratings yet
Probability Distributions in R
42 pages
Binomial Probability Distribution
No ratings yet
Binomial Probability Distribution
23 pages
SB 2023 Lecture5
No ratings yet
SB 2023 Lecture5
62 pages
RV 1
No ratings yet
RV 1
22 pages
Chapter 3 (Part 2) of The Book of Why: From Evidence To Causes - Reverend Bayes Meets Mr. Holmes
No ratings yet
Chapter 3 (Part 2) of The Book of Why: From Evidence To Causes - Reverend Bayes Meets Mr. Holmes
29 pages
Chapter 3: Discrete Distributions: Probability and Statistics For Science and Engineering With Examples in R Hongshik Ahn
No ratings yet
Chapter 3: Discrete Distributions: Probability and Statistics For Science and Engineering With Examples in R Hongshik Ahn
44 pages
Probability Distribution
No ratings yet
Probability Distribution
20 pages
L2 - Mathematical Preliminaries.
No ratings yet
L2 - Mathematical Preliminaries.
42 pages
S5 Prob Dist
No ratings yet
S5 Prob Dist
30 pages
Astro MS Dist
No ratings yet
Astro MS Dist
49 pages
Chapter 13: Policy Gradient Methods: by Richard Sutton and Andrew Barto
No ratings yet
Chapter 13: Policy Gradient Methods: by Richard Sutton and Andrew Barto
35 pages
ETM Week 2 Rev
No ratings yet
ETM Week 2 Rev
61 pages
Sta 111 Lecture Note 2
No ratings yet
Sta 111 Lecture Note 2
19 pages
Binomial Distribution
No ratings yet
Binomial Distribution
20 pages
Comm 214 Chapter 6 - Part 1 - Discrete Probability Distributions
No ratings yet
Comm 214 Chapter 6 - Part 1 - Discrete Probability Distributions
38 pages
AEM Lecture 5
No ratings yet
AEM Lecture 5
52 pages
BBS11 PPT ch05
No ratings yet
BBS11 PPT ch05
36 pages
Topic 5
No ratings yet
Topic 5
29 pages
Classifiers: Numerical Problems and Solutions
No ratings yet
Classifiers: Numerical Problems and Solutions
13 pages
Xtabs
No ratings yet
Xtabs
25 pages
Distrubition 20 Questions
No ratings yet
Distrubition 20 Questions
15 pages
Probability 2024
No ratings yet
Probability 2024
14 pages
Raghunath Chatterjee - Binomial Distribution - Lecture
No ratings yet
Raghunath Chatterjee - Binomial Distribution - Lecture
37 pages
Binomial Distribution Y With Examples
No ratings yet
Binomial Distribution Y With Examples
12 pages
Topic 05-Effective Visual Design
No ratings yet
Topic 05-Effective Visual Design
43 pages
ICT622 Topic 6 Workshop Slides 2024
No ratings yet
ICT622 Topic 6 Workshop Slides 2024
40 pages
Topic 09-Presenting BA
No ratings yet
Topic 09-Presenting BA
37 pages
ICT582 Topic 08
No ratings yet
ICT582 Topic 08
37 pages
PGD DSHCS TSG Lecture 241117
No ratings yet
PGD DSHCS TSG Lecture 241117
44 pages
R-Prog Unit-5
No ratings yet
R-Prog Unit-5
23 pages
11 Binomial Distribution
No ratings yet
11 Binomial Distribution
21 pages
Topic 6
No ratings yet
Topic 6
32 pages
1 Markov Chains: Indian Institute of Technology Bombay
No ratings yet
1 Markov Chains: Indian Institute of Technology Bombay
15 pages
ICT622 Topic 10 Lecture Slides 2024
No ratings yet
ICT622 Topic 10 Lecture Slides 2024
30 pages
Lecture 7
No ratings yet
Lecture 7
32 pages
Unit 5
No ratings yet
Unit 5
15 pages
Lecture 10 - 5 - Expectation of Binomial Variable
No ratings yet
Lecture 10 - 5 - Expectation of Binomial Variable
36 pages
Arvore de Falha PDF
No ratings yet
Arvore de Falha PDF
13 pages
Statistics Handout CH 1&2
No ratings yet
Statistics Handout CH 1&2
20 pages
Topic 8
No ratings yet
Topic 8
25 pages
1 0DiscreteRandomVariables
No ratings yet
1 0DiscreteRandomVariables
26 pages
Topic 10-Data Mining
No ratings yet
Topic 10-Data Mining
24 pages
Contact Session 4 (Discrete Random Variable & Binomial Distribution)
No ratings yet
Contact Session 4 (Discrete Random Variable & Binomial Distribution)
14 pages
Probability Definitions of Statistics
No ratings yet
Probability Definitions of Statistics
19 pages
Binomial Geometric Practice
No ratings yet
Binomial Geometric Practice
13 pages
Assignment1 PC Template
No ratings yet
Assignment1 PC Template
12 pages
Representation of Data Solved From 2020-2022
No ratings yet
Representation of Data Solved From 2020-2022
49 pages
Normal Distribution - Workbook
No ratings yet
Normal Distribution - Workbook
21 pages
Computers Education: Chiu-Liang Chen, Cheng-Chih Wu
No ratings yet
Computers Education: Chiu-Liang Chen, Cheng-Chih Wu
18 pages
Topic 3
No ratings yet
Topic 3
18 pages
Bivariate Dynamic Cumulative
No ratings yet
Bivariate Dynamic Cumulative
21 pages
r059210501 Probability and Statistics
No ratings yet
r059210501 Probability and Statistics
8 pages
SM025 Chapter 9 2018
No ratings yet
SM025 Chapter 9 2018
35 pages
R Programming 1
No ratings yet
R Programming 1
21 pages
Topic 7
No ratings yet
Topic 7
16 pages
Some Typical Use of Bayes Theorem
No ratings yet
Some Typical Use of Bayes Theorem
13 pages
4 Discrete Probability Distribution
No ratings yet
4 Discrete Probability Distribution
11 pages
SLG 18.3 Random Variable and Its Probability Distribution, Part 2 - Normal Approximation To The Binomial, Practice Problems
No ratings yet
SLG 18.3 Random Variable and Its Probability Distribution, Part 2 - Normal Approximation To The Binomial, Practice Problems
5 pages
TLG 18.1 Random Variable and Its Probability Distribution, Part 2 - Binomial Random Variable and Its Distribution
No ratings yet
TLG 18.1 Random Variable and Its Probability Distribution, Part 2 - Binomial Random Variable and Its Distribution
6 pages
Bab 8 Probablity Distribution
No ratings yet
Bab 8 Probablity Distribution
10 pages
Son2
No ratings yet
Son2
6 pages
ICT583 Case Study (1) (1) .Edited
No ratings yet
ICT583 Case Study (1) (1) .Edited
9 pages
Failure Analysis and Risk Concepts
No ratings yet
Failure Analysis and Risk Concepts
23 pages
Probability Tutorial
No ratings yet
Probability Tutorial
8 pages
ECE069 Module 9
No ratings yet
ECE069 Module 9
25 pages
Statistic 6.4 Lesson and Assignment
No ratings yet
Statistic 6.4 Lesson and Assignment
7 pages
Ac 213 Random Variables
No ratings yet
Ac 213 Random Variables
14 pages
Lesson 6.2 The Binomial Distribution COMPLETE
No ratings yet
Lesson 6.2 The Binomial Distribution COMPLETE
6 pages
Binomial Distribution
No ratings yet
Binomial Distribution
6 pages
Chapter 13 Notes Part-1
No ratings yet
Chapter 13 Notes Part-1
6 pages
Binomial Poisson
No ratings yet
Binomial Poisson
5 pages
3 BMGT 220 Binomial Distribution
No ratings yet
3 BMGT 220 Binomial Distribution
3 pages
Math 10 Draft Exam
No ratings yet
Math 10 Draft Exam
4 pages
Experiment 5
No ratings yet
Experiment 5
4 pages
Business Analytics Module 2 Summary
No ratings yet
Business Analytics Module 2 Summary
3 pages
Assignment 2 Data Science Application Project
No ratings yet
Assignment 2 Data Science Application Project
3 pages
A Binomial Random Variable and Its Distribution: DEF Examples
No ratings yet
A Binomial Random Variable and Its Distribution: DEF Examples
3 pages
Topic 1
No ratings yet
Topic 1
3 pages
Week 04
No ratings yet
Week 04
2 pages
ICT515 Assignment1
No ratings yet
ICT515 Assignment1
2 pages
Chapter6project Katieashley
No ratings yet
Chapter6project Katieashley
2 pages
Structured Decision Making
From Everand
Structured Decision Making
Andreas Michael Theodorou
No ratings yet
Functions and Probability for Sixth Graders
From Everand
Functions and Probability for Sixth Graders
Home School Brew
No ratings yet

Tutorial 2 - Questions.

Uploaded by

Tutorial 2 - Questions.

Uploaded by

ICT583

ICT583 Data Science Applications

Part One: Simulating coin flips

Flipping multiple coins in R

rbinom(100, 3, .5) %>% hist

rbinom(1000, 6, .5) %>% hist

rbinom(10000, 9, .5) %>% hist

1.2 Calculating density of a binomial

First, compute the binomial coefficient

# Confirm your answer with a simulation using rbinom

# which of the results exactly have 2 heads?

# to compute the proportion of these logical results,

# how about increasing the number of observations?

# what do you observe?

# how about 8 heads?

1.3 Calculating cumulative density of a binomial

# Calculate the probability that at least five coins are heads

# cumulative density curve

# the cumulative density of five heads or more will be

Try to simulate 100, 1000, 10000, 100000 observations.

Which is closest to the exact answer?

1.4 Expected values and variance for binomial distribution

Part Two Probability of compound events

Part Three: Normal distribution

3.1 Simulating from binomial and normal

# Draw a random sample of 100,000 from the normal approximation

3.2 Comparing cumulative density of the binomial

# Simulations from the normal and binomial distributions

# Use binom_sample to estimate the probability of <= 190 heads

# Use normal_sample to estimate the probability of <= 190 heads

# Calculate the probability of <= 190 heads with pbinom

# Calculate the probability of <= 190 heads with pnorm

Expected value and variance for random variables

You might also like