0% found this document useful (0 votes)

12 views4 pages

Lab 6 Activities

Lab #6 involves R programming exercises focused on simulating probabilities using the sample() function. Students are required to perform simulations related to a spinner and dice rolls, calculate proportions, and compare them to theoretical probabilities. The lab emphasizes the importance of sample size in achieving accurate probability estimates.

Uploaded by

Mai Anh Đào

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views4 pages

Lab 6 Activities

Uploaded by

Mai Anh Đào

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Lab #6 Activities

Anh Dao Mai

Instructions: fill in the code chunks below and answer the questions with text responses. Your responses
must use code that was covered in class; other methods to solve the problems will not be accepted. Submit
your knit pdf file to Crowdmark.
A reminder that the R code we have covered in class is available on our STAT 2150 A01 UM Learn page,
under Content > Course Material. It is recommended that you knit to pdf after you fill in each code chunk.
Introduction:
Recall that a probability is a theoretical value of a proportion after an infinitely long series of trials. We
can easily make a long series of trials with R, stored in some vector, and estimate probabilities with the
proportion of times an event occurs in the series of trials.
The sample() function in R has the following syntax: sample(x, size, replace = FALSE, prob) where
x is an R object containing values (like a vector or a matrix) from which we select a sample, size is the
desired sample size, replace is set to FALSE by default indicating sampling is done without replacement
(that is, once an element of the population has been sampled, it cannot be sampled again), and prob is a
vector of probabilities (created on-the-fly or stored in the environment) of the same length as x, where each
element of prob is the probability for the corresponding element of the x vector. If the prob vector is not
specified, then each element of x will be sampled with equal probability.
For example, we can simulate rolling a die 120,000 times with the following code that randomly selects a
number between 1 and 6, where the probability of obtaining each of the numbers between 1 and 6 is one-
sixth. We sample with replacement since if you get a 1, for example, on a die roll, you can still get a 1 again
on a future die roll.

data = sample(1:6,120000,replace=TRUE,prob=rep(1/6,6))
data[1:30] # See the first 30 die rolls

## [1] 4 6 1 5 4 1 3 6 6 3 6 3 3 6 2 1 6 4 3 6 1 6 3 2 4 3 1 4 5 6

table(data)

## data
## 1 2 3 4 5 6
## 19971 20133 19872 19929 20050 20045

(Each time you knit this .Rmd file, you will get different results because you will get a different simulation
of 120,000 die rolls.) Note that when we write the number 120,000 in R, we cannot use a comma because
commas are used to separate arguments of a function. As we can see in the output of the table() function
above, roughly one-sixth of the die rolls (approximately 20,000) land on each of the numbers between 1 and
6.
Also, we could have obtained the proportion of times each of the possible outcomes occur by dividing the
output of the table() function by 120,000:

1
table(data)/120000

## data
## 1 2 3 4 5 6
## 0.1664250 0.1677750 0.1656000 0.1660750 0.1670833 0.1670417

We see that each of the six numbers occur about one-sixth of the time.
Question 1:
Suppose we have a spinner in the shape of a regular hexagon as seen in the image in Crowdmark. We can
easily see the probabilities that the spinner will land on 1, 2, or 3. Using the sample() function, simulate
spinning the spinner 180 times and obtain counts of how many times the numbers 1, 2, and 3 occurred.
Write your code after the set.seed(11111) below.

set.seed(11111)
spinner = sample(1:3,180,replace=TRUE,prob=c(1/6, 1/3, 1/2))
table(spinner)

## spinner
## 1 2 3
## 26 61 93

What proportion of the 180 simulated spins landed on 1? Enter your calculation into the below code chunk
so that the knit pdf will show the result of the calculation.

table(spinner)[1] / 180

## 1
## 0.1444444

Now simulate spinning the spinner 180,000 times and calculate the proportion of spins that land on 1. Write
your code after the set.seed(11111) below.

set.seed(11111)
spinner_large = sample(1:3, 180000, replace=TRUE, prob=c(1/6, 1/3, 1/2))
table(spinner_large)[1] / 180000

## 1
## 0.16705

Comment on the two sample proportions you have calculated compared to the theoretical probability of
landing on 1.
The results showed that the sample proportion of 1s in the 180,000-spin simulation was closer to the theoretical
probability (1/6) than in the 180-spin simulation, clearly demonstrating that a larger sample size provides a
more accurate estimate of the true probability.
Question 2:
When we roll two dice, there are 36 possible outcomes. We can obtain the sum showing on the two dice for
those 36 outcomes in a 6 x 6 matrix with the outer() function:

2
x = 1:6
y = 1:6
z = outer(x,y,FUN="+")
z

## [,1] [,2] [,3] [,4] [,5] [,6]

## [1,] 2 3 4 5 6 7
## [2,] 3 4 5 6 7 8
## [3,] 4 5 6 7 8 9
## [4,] 5 6 7 8 9 10
## [5,] 6 7 8 9 10 11
## [6,] 7 8 9 10 11 12

The following code will create a vector of the possible sums:

sums = unique(as.vector(z))

And the following code creates a vector probs of the probabilities of each of the possible sums:

dice = sort(as.vector(z))
dice

## [1] 2 3 3 4 4 4 5 5 5 5 6 6 6 6 6 7 7 7 7 7 7 8 8 8 8
## [26] 8 9 9 9 9 10 10 10 11 11 12

probs = table(dice)/36
probs

## dice
## 2 3 4 5 6 7 8
## 0.02777778 0.05555556 0.08333333 0.11111111 0.13888889 0.16666667 0.13888889
## 9 10 11 12
## 0.11111111 0.08333333 0.05555556 0.02777778

(a) Let X be the sum of the dice rolls. Use the probs vector to show that P(X is greater than or equal to
9) equals 5/18 (approximately 0.2778).

sum(probs[as.numeric(names(probs)) >= 9])

## [1] 0.2777778

(b) We can estimate the 5/18 probability in (a) by sampling from the sums vector. Simulate 30 rolls of
the two dice. Note that the output of the sample() function will not be the outcome of the dice rolls
(like (3,4)), but rather the sum of the two dice rolls (like 7). Then calculate what proportion of dice
rolls result in a sum greater than or equal to 9. This should be an estimate of the 0.2778 probability.
Write your code after the set.seed(11111) below.

set.seed(11111)
dice_rolls = sample(sums, 30, replace=TRUE, prob=probs)
mean(dice_rolls >= 9)

3
## [1] 0.3

(c) Repeat part (b) with 600 dice rolls (copy/paste your code from above and make any necessary changes).
Write your code after the set.seed(11111) below.

set.seed(11111)
dice_rolls = sample(sums, 600, replace=TRUE, prob=probs)
mean(dice_rolls >= 9)

## [1] 0.2683333

Compare the proportion of dice rolls where the sum is greater than or equal to 9 when there are 30 dice rolls
or 600 dice rolls, compared to the theoretical probability of 0.2777.
For 30 rolls, the estimated probability can differ significantly from 0.2778 due to sample variability. However,
with 600 rolls, the estimate stabilizes and aligns more closely with the theoretical probability. This illustrates
that increasing the sample size reduces randomness in probability estimation.

SM 13
100% (2)
SM 13
71 pages
Cloud Computing Chapter3 2
0% (1)
Cloud Computing Chapter3 2
36 pages
Solutions
No ratings yet
Solutions
323 pages
03 - CT3S Introduction To Probability Simulation and Gibbs Sampling With R Solutions
100% (1)
03 - CT3S Introduction To Probability Simulation and Gibbs Sampling With R Solutions
270 pages
STATISTICS AND PROBABILITY Until T Distribution 2
No ratings yet
STATISTICS AND PROBABILITY Until T Distribution 2
214 pages
SM 11
100% (1)
SM 11
75 pages
DBMS File
No ratings yet
DBMS File
96 pages
4 Word Processor
No ratings yet
4 Word Processor
22 pages
Dice Roll - Familiarizing With R Ecosystem
No ratings yet
Dice Roll - Familiarizing With R Ecosystem
6 pages
BIOB20 Notes
No ratings yet
BIOB20 Notes
45 pages
Garvit 102216089
No ratings yet
Garvit 102216089
24 pages
10 TheBoxModel
No ratings yet
10 TheBoxModel
47 pages
R Programming 1
No ratings yet
R Programming 1
21 pages
Unit 4 Probability Distributions-MA231CT
No ratings yet
Unit 4 Probability Distributions-MA231CT
21 pages
Penjelasan Listing Program
No ratings yet
Penjelasan Listing Program
63 pages
Probabilities
No ratings yet
Probabilities
7 pages
1 0DiscreteRandomVariables
No ratings yet
1 0DiscreteRandomVariables
26 pages
Stat 361 Unit 2 (Autosaved)
No ratings yet
Stat 361 Unit 2 (Autosaved)
58 pages
Assignment-4 STA351
No ratings yet
Assignment-4 STA351
8 pages
Installation: Order No.: Customer: Equipment: Converter Type: Document: 3BHS213774E01 ACS 1000 W
No ratings yet
Installation: Order No.: Customer: Equipment: Converter Type: Document: 3BHS213774E01 ACS 1000 W
73 pages
IEC 61850-Introduction-Sv
No ratings yet
IEC 61850-Introduction-Sv
32 pages
TOPIC8. Random Variables and Probability Distributions
100% (1)
TOPIC8. Random Variables and Probability Distributions
8 pages
CH - 5. Memory Management
No ratings yet
CH - 5. Memory Management
86 pages
Exam 2 Practice Problems
No ratings yet
Exam 2 Practice Problems
14 pages
Bombardier 2022 Financial Report en
No ratings yet
Bombardier 2022 Financial Report en
176 pages
Main
No ratings yet
Main
13 pages
Movie Recommendation System
No ratings yet
Movie Recommendation System
28 pages
9.02 Probability
No ratings yet
9.02 Probability
22 pages
STA124 Complete Note (Edward Cares)
No ratings yet
STA124 Complete Note (Edward Cares)
41 pages
Cosc 416
No ratings yet
Cosc 416
6 pages
SM 05
No ratings yet
SM 05
92 pages
Discrete Probability 2023 N
No ratings yet
Discrete Probability 2023 N
91 pages
Virtual Reality As An Empirical Research Tool - Exploring User Experience in A Real Building and A Corresponding Virtual Model
No ratings yet
Virtual Reality As An Empirical Research Tool - Exploring User Experience in A Real Building and A Corresponding Virtual Model
3 pages
OperatingSystemConcepts 3 OperatingSystemStructures
No ratings yet
OperatingSystemConcepts 3 OperatingSystemStructures
30 pages
Discrete Random Variables
No ratings yet
Discrete Random Variables
5 pages
SM 06
No ratings yet
SM 06
73 pages
SM 14
No ratings yet
SM 14
67 pages
7probability Distributions (Binomial, Poisson and Normal)
No ratings yet
7probability Distributions (Binomial, Poisson and Normal)
33 pages
Leviat - Ancon - AUS Coupler BR - 2024
No ratings yet
Leviat - Ancon - AUS Coupler BR - 2024
24 pages
SM 10
No ratings yet
SM 10
62 pages
Probst at Lab
No ratings yet
Probst at Lab
9 pages
IG Job Posting (CPA Intern Co-Op or Summer) - Asper Spring FINAL
No ratings yet
IG Job Posting (CPA Intern Co-Op or Summer) - Asper Spring FINAL
2 pages
Part 3 Simulation With R
No ratings yet
Part 3 Simulation With R
42 pages
Introduction To Probability
No ratings yet
Introduction To Probability
510 pages
PTSP Lab Record
No ratings yet
PTSP Lab Record
27 pages
00 Lab Notes
No ratings yet
00 Lab Notes
10 pages
Week One Notes
No ratings yet
Week One Notes
10 pages
hw4 Sol PDF
100% (2)
hw4 Sol PDF
23 pages
Config WCM
100% (1)
Config WCM
17 pages
Data Science Probability
No ratings yet
Data Science Probability
97 pages
Lab 8
No ratings yet
Lab 8
5 pages
Data Science - Probability
No ratings yet
Data Science - Probability
53 pages
ACC 1100 Day 08&09 Ratio and Comparative Analysis
No ratings yet
ACC 1100 Day 08&09 Ratio and Comparative Analysis
32 pages
ACC 1100 Day 12&13 Inventory
No ratings yet
ACC 1100 Day 12&13 Inventory
36 pages
SM 07
No ratings yet
SM 07
40 pages
Stat. W2D2
No ratings yet
Stat. W2D2
5 pages
Tutorial3 Q&A 2023-09-08 04 - 44 - 25
No ratings yet
Tutorial3 Q&A 2023-09-08 04 - 44 - 25
6 pages
LM8 Summary
No ratings yet
LM8 Summary
1 page
SlidesCourse 21 Oct
No ratings yet
SlidesCourse 21 Oct
10 pages
Cisco Intersight Infrastructure Service Data Sheet
No ratings yet
Cisco Intersight Infrastructure Service Data Sheet
15 pages
21bce0427 VL2022230503921 Ast03
No ratings yet
21bce0427 VL2022230503921 Ast03
13 pages
3 Lec1cCh1ques
No ratings yet
3 Lec1cCh1ques
21 pages
ACC 1100 Day 10 Cash and Bank Account
No ratings yet
ACC 1100 Day 10 Cash and Bank Account
17 pages
AssignmentProblems Ago18 2021
No ratings yet
AssignmentProblems Ago18 2021
4 pages
CSF213 OOP Handout 2023 24 Sem I
No ratings yet
CSF213 OOP Handout 2023 24 Sem I
3 pages
Unit 3 Probability Distributions - 21MA41
No ratings yet
Unit 3 Probability Distributions - 21MA41
17 pages
Extended Essay BM IB
No ratings yet
Extended Essay BM IB
51 pages
Practice Operations Simulation - Reflection Template
No ratings yet
Practice Operations Simulation - Reflection Template
1 page
Building Internet Brands: Brand Equity and Brand Image Creating A Strong Brand On The Internet
No ratings yet
Building Internet Brands: Brand Equity and Brand Image Creating A Strong Brand On The Internet
22 pages
02 Probability
No ratings yet
02 Probability
5 pages
The Business of Intellectual Property A Literature Review of IP Management Research
No ratings yet
The Business of Intellectual Property A Literature Review of IP Management Research
20 pages
Probability Distributions in R
No ratings yet
Probability Distributions in R
42 pages
Lab-6-Binomail and Poisson Distribution
100% (1)
Lab-6-Binomail and Poisson Distribution
13 pages
Tutorial 7 - Questions
No ratings yet
Tutorial 7 - Questions
4 pages
Research Shows That Business Meetings, Discussions and Training Are Happening Online Nowadays. Do The Advantages Outweigh The Disadvantages?
No ratings yet
Research Shows That Business Meetings, Discussions and Training Are Happening Online Nowadays. Do The Advantages Outweigh The Disadvantages?
3 pages
LT08
No ratings yet
LT08
5 pages
Technical Information Sheet: Lashing Points
No ratings yet
Technical Information Sheet: Lashing Points
2 pages
ADR Sabre
No ratings yet
ADR Sabre
2 pages
Maths Methods Probability Notes
100% (1)
Maths Methods Probability Notes
10 pages
Technical Manual: Includes
No ratings yet
Technical Manual: Includes
13 pages
Problem Set #4 Due: 1:00pm On Wednesday, February 19: Written Problems
No ratings yet
Problem Set #4 Due: 1:00pm On Wednesday, February 19: Written Problems
5 pages
More Employment Opportunities For People With High Income in Many Countries
No ratings yet
More Employment Opportunities For People With High Income in Many Countries
1 page
Lab3 Fitting and Plotting of Binomial Distribution & Poisson Distribution (Challenging Experiment 2 (A) and 2 (B) ) Aim
No ratings yet
Lab3 Fitting and Plotting of Binomial Distribution & Poisson Distribution (Challenging Experiment 2 (A) and 2 (B) ) Aim
18 pages
Think and Decide Think and Observe: 3 Quarter Week 1 Lesson Plan Mathematics 4 I. Objectives
100% (1)
Think and Decide Think and Observe: 3 Quarter Week 1 Lesson Plan Mathematics 4 I. Objectives
3 pages
Pointers Reviewer For Second Periodical Exam
No ratings yet
Pointers Reviewer For Second Periodical Exam
2 pages
Bf2 Flanger Eng
No ratings yet
Bf2 Flanger Eng
5 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Combined Voltage and Current Post Insulator Sensors: Ordering Table Part Number Sequence 96AB/CDEFGH Where
No ratings yet
Combined Voltage and Current Post Insulator Sensors: Ordering Table Part Number Sequence 96AB/CDEFGH Where
2 pages
Exponential & Logarithmic Equations
100% (1)
Exponential & Logarithmic Equations
8 pages
Stats
No ratings yet
Stats
2 pages
Simulation: Programming in R For Data Science Anders Stockmarr, Kasper Kristensen, Anders Nielsen
No ratings yet
Simulation: Programming in R For Data Science Anders Stockmarr, Kasper Kristensen, Anders Nielsen
19 pages
Stat Project
No ratings yet
Stat Project
10 pages
Probability Quiz
No ratings yet
Probability Quiz
5 pages
Saint Andrew'S Junior College: 2008 Preliminary Examination Mathematics Higher 2
No ratings yet
Saint Andrew'S Junior College: 2008 Preliminary Examination Mathematics Higher 2
5 pages
1 Solving Probability Problems With R: # We Can Use The Built-In Function "Sample" As Follows
No ratings yet
1 Solving Probability Problems With R: # We Can Use The Built-In Function "Sample" As Follows
4 pages
Answer: (A) PR (B
No ratings yet
Answer: (A) PR (B
14 pages
Monte Carlo Simulations and MATLAB
No ratings yet
Monte Carlo Simulations and MATLAB
7 pages
"SCILAB - An Open Source Substitute For MATLAB": Organized By: JNTUH College of Engineering, Sultanpur
No ratings yet
"SCILAB - An Open Source Substitute For MATLAB": Organized By: JNTUH College of Engineering, Sultanpur
4 pages
Java Frame
No ratings yet
Java Frame
3 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
2.5/5 (2)
Chapter #19 Solutions - Engineering Economy, 7 TH Editionleland Blank and Anthony Tarquin
No ratings yet
Chapter #19 Solutions - Engineering Economy, 7 TH Editionleland Blank and Anthony Tarquin
15 pages
Chapter 1 Probability
No ratings yet
Chapter 1 Probability
13 pages
Jupyter Notebook Viewer
No ratings yet
Jupyter Notebook Viewer
32 pages
Fast mental calculation tricks
From Everand
Fast mental calculation tricks
EasyMath
No ratings yet
Painless Pre-Algebra
From Everand
Painless Pre-Algebra
Barron's Educational Series
3/5 (2)

Lab 6 Activities

Uploaded by

Lab 6 Activities

Uploaded by

Lab #6 Activities

Anh Dao Mai

## [,1] [,2] [,3] [,4] [,5] [,6]

The following code will create a vector of the possible sums:

sum(probs[as.numeric(names(probs)) >= 9])

You might also like