0% found this document useful (0 votes)

74 views5 pages

BES - R Lab

Uploaded by

Viem Anh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

74 views5 pages

BES - R Lab

Uploaded by

Viem Anh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

STA 2 - R LAB 2

Review on Basics of R (cont.) + Paired-Samples t Test

1. Objectives
 Basic graphical and tabular methods
 Editing graphs
 Paired-samples t-test
2. Basic tabular methods
Remember that before we apply a graphical or statistical method on a variable that is to be treated as a
categorical variable, we should be sure that it has been converted into a factor.
We can convert a character variable into a factor by the factor() function. For example,
 data1 <- read.table("mtcars.csv",header=TRUE,sep=",",quote="\"",
stringsAsFactors = FALSE) #Read data into R
 data1$am <- factor(data1$am,ordered=FALSE,levels=c(0,1),labels =
c("automatic","manual")) #Convert into categorical variable

Exercise 1:
a. Explain why the conversion of the data1$am variable is necessary in the above code.
b. Let’s check whether the data1$am variable has been converted into a factor correctly.
c. Go back to the code for importing a text file. What happens if we use stringAsFactors =
TRUE? Try this:
 data2 <- read.table("mtcars.csv",header=TRUE, sep=",",quote="\"",
stringsAsFactors=TRUE)

Now, let’s create a frequency table. We can try:

 am.table <- table(data1$am)
 prop.table(am.table)
 prop.table(am.table)*100

To create a contingency table, use the following format of the table() function:
tableName <- table(row variable, column variable)

Exercise 2: Create a contingency table named gearVSam.table2 showing the relationship between
gear and am. Are you happy with the output? Let’s discuss how to improve it.

3. Basic graphical methods

3.1 Simple bar graph
Based on the frequency table produced previously, we can now produce a simple bar graph. The
following listing shows different ways to plot a bar graph.
 am.table <- table(data1$am)
 barplot(am.table)

1|P a ge
STA 2 - R LAB 2

 barplot(am.table, main="Bar graph of Transmission", xlab="Types of

Transmission", ylab="Frequency",ylim=c(0,20))
 barplot(am.table, main="Bar graph of Transmission", xlab="Types of
Transmission", ylab="Frequency", horiz=TRUE)
 barplot(am.table, col="skyblue",main="Bar graph of Transmission ",
xlab=" Types of Transmission ", ylab="Frequency")

3.2 Clustered bar graph

Let’s type in the following code:
 trans.vs.gear<-table(data1$am,data1$gear)
 trans.vs.gear
 barplot(trans.vs.gear, beside=TRUE)
 barplot(trans.vs.gear, col=c("red", "yellow"),
beside=TRUE,ylim=c(0,20))

Exercise 3: For the above clustered bar graph

a. Add a title for the graph and labels for the two axes.
b. Use different colors for the bars
c. Convert gear variable to a factor before producing a contingency table and clustered bar graph
and observe the difference.
3.3 Stacked bar graph
The code below will produce a stacked bar graph
 barplot(trans.vs.gear,col=c("red","yellow"))
We can use the spineplot() function to produce a spine plot, a generalized version of the stacked bar
graph. Let’s observe how the spine plot differs from the previous stacked bar graph.
 spineplot(trans.vs.gear, col=c("blue", "green", "pink"))

3.4 Stem and leaf display

 mpg <- data1$mpg
 stem(mpg)

3.5 Histogram
 hist(mpg)
 hist(mpg,breaks=5,col="red")
 hist(mpg, freq=FALSE, breaks=5,col="red")

3.6 Boxplot
The following command is to work with boxplot (for numerical data):
 boxplot(data1$mpg)
 boxplot.stats(data1$mpg)
 boxplot(data1$mpg ~ data1$gear)

2|P a ge
STA 2 - R LAB 2

4. Editing graphs
4.1 Adding title and axis labels
The function title() adds title and axis labels to a graph. The general format is:
title(main="my title", sub="my sub-title", xlab="x-axis label", ylab="y-
axis label")

The title() function works with the currently active graph.

4.2 Adding a box outside the graph
Use the box()function

5. Paired-samples t Test
We can use the following code to conduct a paired-samples t test to see if the population mean
difference is not zero:
t.test (y1, y2, paired=TRUE, alternative = …,conf.level=0.95)

where y1 and y2 are numeric vectors for the two matched groups and conf.level argument allows us
to specify the confidence level of the reported CI.
Exercise 4. Load the GolfScores.csv dataset. The dataset contains scores of the first and final rounds
for a sample of 20 golfers who completed in PGA tournaments. Suppose you would like to determine
if the mean score for the first round of a PGA Tour event is significantly different than the mean score
for the fourth and final round. Use R to generate the test output. Use  = 0.1.
a) What is the mean difference between in scores for the two rounds? For which round is the
sample mean score lower?
b) What is the p-value? Was the mean score significantly different for the two rounds?
c) What is the 90% confidence interval estimate for the difference between two population
means? Does this CI support your conclusion in part (b) (Does the interval include 0)?
d) Remember that in practice we have to check assumptions for each test we perform. Is the data
distribution for the paired differences reasonably normal?
Note: To check the normality of a dataset, a histogram can be used (but a QQ plot is more useful). In
case of small sample size, however, it is better to use the stem and leaf display and the qq plot to check
if data is normally distributed. The R code for a qq plot is as follows.
 qqnorm(data)#Compare quantiles of our data with theoretical normal
quantiles
 qqline(data)# Add a line to a normal quantile-quantile plot passing
through the first and third quartiles

If the data is normally distributed, the data points should fall in a straight line. Departures from the line
are indicative of a lack of normality.

3|P a ge
STA 2 - R LAB 2

The R output for this exercise is provided below. You are expected to write R code that produces the
same output:
Paired t-test

data: data3$First and data3$Final

t = -1.416, df = 19, p-value = 0.173
alternative hypothesis: true difference in means is not equal to 0
90 percent confidence interval:
-2.3322058 0.2322058
sample estimates:
mean of the differences
-1.05

Stem and Leaf Display of Golf Score Differences

The decimal point is 1 digit(s) to the right of the |

-0 | 7765
-0 | 43221
0 | 00111112234

Note: If the assumption of normality is violated, the t test may provide misleading results (you should
refer to the practical guidelines regarding how to use one-sample t-test in the Probability and Statistics
course). In such cases, we should use a nonparametric test (to be taught later in this course).
Exercise 5. Load the PriceChange.csv dataset. In early 2009, the economy was experiencing a
recession. The dataset contains data price per share of stock for a sample of 15 companies on January 1
and April 30 (The Wall Street Journal, May 1, 2009).

4|P a ge
STA 2 - R LAB 2

a. What is the change in the mean price per share of stock over the four-month period?
b. Provide a 90% confidence interval estimate of the change in the mean price per share of stock.
Interprete the results.
c. How was the recession affecting the stock market? Use  = .1
The R output for this exercise is provided below. You are expected to write R code that produces the
same output:

Paired t-test

data: data4$Jan and data4$Apr

t = 2.0043, df = 14, p-value = 0.06478
alternative hypothesis: true difference in means is not equal to 0
90 percent confidence interval:
0.2970457 4.6029543
sample estimates:
mean of the differences
2.45
Stem and Leaf Display of Price Changes
The decimal point is 1 digit(s) to the right of the |

-0 | 432211
0 | 12344
0 | 778
1 | 2

5|P a ge

Electrostatic Handbook 2003
100% (9)
Electrostatic Handbook 2003
228 pages
Daily Activity Booklet
No ratings yet
Daily Activity Booklet
143 pages
Intro To R
No ratings yet
Intro To R
18 pages
Chapter 9 Real Mortgage
100% (3)
Chapter 9 Real Mortgage
6 pages
2003 Peugeot 807 65093 PDF
No ratings yet
2003 Peugeot 807 65093 PDF
184 pages
R Intro 2011
No ratings yet
R Intro 2011
115 pages
2023 Tutorial 12
No ratings yet
2023 Tutorial 12
6 pages
Transformando La Movilidad Urbana en Mexico2
No ratings yet
Transformando La Movilidad Urbana en Mexico2
4 pages
Module2 BDA
No ratings yet
Module2 BDA
44 pages
00 Lab Notes
No ratings yet
00 Lab Notes
8 pages
RDocumentation - Func (Ttest)
No ratings yet
RDocumentation - Func (Ttest)
3 pages
Advanced Statistical Methods Using R Notes
No ratings yet
Advanced Statistical Methods Using R Notes
55 pages
R Module 11 - Statistics
No ratings yet
R Module 11 - Statistics
35 pages
Workshop Activity: X Seq y Length
No ratings yet
Workshop Activity: X Seq y Length
3 pages
Which Test When: 1 Exploratory Tests
No ratings yet
Which Test When: 1 Exploratory Tests
5 pages
Stat 362 UNIT 4
No ratings yet
Stat 362 UNIT 4
30 pages
Aditya Garg DMDW
No ratings yet
Aditya Garg DMDW
40 pages
DEV Lab Manual
No ratings yet
DEV Lab Manual
27 pages
R Programming Slides
No ratings yet
R Programming Slides
73 pages
Descriptive and Inferential Statistics With R
No ratings yet
Descriptive and Inferential Statistics With R
6 pages
ProbList2 24 SLN
No ratings yet
ProbList2 24 SLN
20 pages
Session 6-15 - Unit II & III: Probability and Distribution, Classical Tests
No ratings yet
Session 6-15 - Unit II & III: Probability and Distribution, Classical Tests
34 pages
Hypothesis
No ratings yet
Hypothesis
16 pages
Lecture 10 R
No ratings yet
Lecture 10 R
117 pages
R For Data Exploration
No ratings yet
R For Data Exploration
52 pages
Final Cost Practical
No ratings yet
Final Cost Practical
29 pages
Commands For Data Analysis Using R
No ratings yet
Commands For Data Analysis Using R
11 pages
STAT 1000 - Worksheet 2
No ratings yet
STAT 1000 - Worksheet 2
14 pages
R Console
No ratings yet
R Console
6 pages
Modelling in R
No ratings yet
Modelling in R
47 pages
Unit4 R
No ratings yet
Unit4 R
21 pages
IBS Sample I
No ratings yet
IBS Sample I
10 pages
R Regression Commands
No ratings yet
R Regression Commands
5 pages
Type I and Type II Errors Type I Error
No ratings yet
Type I and Type II Errors Type I Error
7 pages
R Commands
No ratings yet
R Commands
5 pages
Statistical Hypothesis Testing
No ratings yet
Statistical Hypothesis Testing
20 pages
Data Analyses R Manual NYTS
No ratings yet
Data Analyses R Manual NYTS
24 pages
STAT 1000 - Worksheet 2
No ratings yet
STAT 1000 - Worksheet 2
14 pages
R Cheat Sheet
No ratings yet
R Cheat Sheet
9 pages
Unit 2 R
No ratings yet
Unit 2 R
16 pages
Statistical Computing by Using R
100% (1)
Statistical Computing by Using R
11 pages
All v2 Basic Statistics Using R
No ratings yet
All v2 Basic Statistics Using R
241 pages
Unit3-Data Science
No ratings yet
Unit3-Data Science
37 pages
STAT 1000 - Worksheet 2
No ratings yet
STAT 1000 - Worksheet 2
14 pages
R Manual PDF
No ratings yet
R Manual PDF
78 pages
Capital Gains
No ratings yet
Capital Gains
8 pages
Unit Ii DS LM
No ratings yet
Unit Ii DS LM
20 pages
Edar M-5
No ratings yet
Edar M-5
32 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
Lab0 R Tutorial EHS
No ratings yet
Lab0 R Tutorial EHS
9 pages
R Lab Manual
No ratings yet
R Lab Manual
31 pages
Assignment 1
No ratings yet
Assignment 1
7 pages
Analysis Using Statistical: Introduction & Data Exploration
No ratings yet
Analysis Using Statistical: Introduction & Data Exploration
23 pages
STAT359 Study Guide
No ratings yet
STAT359 Study Guide
7 pages
Using R For Basic Statistical Analysis
No ratings yet
Using R For Basic Statistical Analysis
11 pages
STA1007S Lab 3: Plots (II) and Sub-Setting: "Sample"
No ratings yet
STA1007S Lab 3: Plots (II) and Sub-Setting: "Sample"
10 pages
Rdias FDP
No ratings yet
Rdias FDP
50 pages
Acts 372 Unit 6
No ratings yet
Acts 372 Unit 6
40 pages
R Questions With Solution
No ratings yet
R Questions With Solution
11 pages
Statistics Cheat Sheet
100% (1)
Statistics Cheat Sheet
4 pages
RBasics Handout
No ratings yet
RBasics Handout
6 pages
An R Companion To Statistical Thinking For The 21st Century
No ratings yet
An R Companion To Statistical Thinking For The 21st Century
159 pages
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Basic Exercises for Competitive Programming: Python
From Everand
Basic Exercises for Competitive Programming: Python
Jan Pol
No ratings yet
What Is The Main Function of Financial Markets?: Tutorial 1 Overview of Financial System Part 1. Questions For Review
No ratings yet
What Is The Main Function of Financial Markets?: Tutorial 1 Overview of Financial System Part 1. Questions For Review
16 pages
19040400095- Nguyễn Hữu Phú- Lab03 Model Design 21092020
No ratings yet
19040400095- Nguyễn Hữu Phú- Lab03 Model Design 21092020
5 pages
Business Modeling Lab Room Activities Lab03 - Model Design Activity 1 - Management Fee
No ratings yet
Business Modeling Lab Room Activities Lab03 - Model Design Activity 1 - Management Fee
4 pages
Threshold Marginal % Base Tax: Calculation Section
No ratings yet
Threshold Marginal % Base Tax: Calculation Section
1 page
Exercises For Lab 3: Complete The Following Exercises in Your Textbook by Hand
No ratings yet
Exercises For Lab 3: Complete The Following Exercises in Your Textbook by Hand
1 page
Business Modeling Lab Room Activities Lab03 - Model Design Activity 1 - Management Fee
No ratings yet
Business Modeling Lab Room Activities Lab03 - Model Design Activity 1 - Management Fee
4 pages
Threshold Marginal % Base Tax: Calculation Section
No ratings yet
Threshold Marginal % Base Tax: Calculation Section
1 page
BES - R Lab 1
No ratings yet
BES - R Lab 1
4 pages
Assignment MCA 103
No ratings yet
Assignment MCA 103
4 pages
Pseudocode - 2
No ratings yet
Pseudocode - 2
106 pages
Tabel Ses
No ratings yet
Tabel Ses
6 pages
Interview Questions With Answers On All Topics (Rev1)
No ratings yet
Interview Questions With Answers On All Topics (Rev1)
41 pages
Column Layout Plan: Trims International (BD) LTD
No ratings yet
Column Layout Plan: Trims International (BD) LTD
1 page
GLM vs. Machine Leaning: - With Case Studies in Pricing
No ratings yet
GLM vs. Machine Leaning: - With Case Studies in Pricing
28 pages
Winglets Brochure 2009
No ratings yet
Winglets Brochure 2009
4 pages
BOQ - Zallaf South Refinery Project - CAMP & TSF
No ratings yet
BOQ - Zallaf South Refinery Project - CAMP & TSF
18 pages
Tap Magic Eco Oil Sds en Us 2023pdf
No ratings yet
Tap Magic Eco Oil Sds en Us 2023pdf
8 pages
Mid Semester Theory Exam17079936871961
No ratings yet
Mid Semester Theory Exam17079936871961
17 pages
Aguinaldo Industries V CIR - Peralta
No ratings yet
Aguinaldo Industries V CIR - Peralta
2 pages
Puboff-Torredo Vs Villamor
No ratings yet
Puboff-Torredo Vs Villamor
6 pages
Blueberries: Growing Beyond Production Challenges
No ratings yet
Blueberries: Growing Beyond Production Challenges
12 pages
Wa0005.
No ratings yet
Wa0005.
17 pages
MATULAC Activity 1 MidTerm
No ratings yet
MATULAC Activity 1 MidTerm
3 pages
Event Action Script Call Equivalents
No ratings yet
Event Action Script Call Equivalents
17 pages
Southpoint School & College: Time: 30 Mins Subject: Computer Studies (Objectives) Full Marks: 30
No ratings yet
Southpoint School & College: Time: 30 Mins Subject: Computer Studies (Objectives) Full Marks: 30
2 pages
Expectancy Theory Overview
100% (3)
Expectancy Theory Overview
27 pages
Cataloge E&H Weld-In Adapter and Flanges
No ratings yet
Cataloge E&H Weld-In Adapter and Flanges
40 pages
Linearization OpenFAST
No ratings yet
Linearization OpenFAST
13 pages
FMX / Cruiso / BW 8-12: Ganzeboom Transmission Parts & Torque Converters
No ratings yet
FMX / Cruiso / BW 8-12: Ganzeboom Transmission Parts & Torque Converters
2 pages
Impact of Covid-19 in Business
0% (1)
Impact of Covid-19 in Business
17 pages
TCP Ip Multimedia
No ratings yet
TCP Ip Multimedia
87 pages
5 Open Source Wi-Fi Hotspot Solutions
No ratings yet
5 Open Source Wi-Fi Hotspot Solutions
3 pages
V.K.S 7233 16.12.23
No ratings yet
V.K.S 7233 16.12.23
1 page
Print Money Receipt
No ratings yet
Print Money Receipt
3 pages

BES - R Lab

Uploaded by

BES - R Lab

Uploaded by

STA 2 - R LAB 2

Review on Basics of R (cont.) + Paired-Samples t Test

Now, let’s create a frequency table. We can try:

3. Basic graphical methods

 barplot(am.table, main="Bar graph of Transmission", xlab="Types of

3.2 Clustered bar graph

Exercise 3: For the above clustered bar graph

3.4 Stem and leaf display

The title() function works with the currently active graph.

data: data3$First and data3$Final

Stem and Leaf Display of Golf Score Differences

data: data4$Jan and data4$Apr

You might also like