0% found this document useful (0 votes)

37 views3 pages

Summary of R Commands For Statistics 100

This document summarizes R commands that will be used in Statistics 100, including commands for reading and manipulating data, descriptive statistics, graphics, probability distributions, random sampling, statistical inference, and regression. Some key commands are read.table() for importing data, summary() for data summaries, plot() for scatter plots, t.test() for t-tests, lm() for linear regression and glm() for logistic regression. Help pages for each command can be accessed with help(command).

Uploaded by

Simon Maya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views3 pages

Summary of R Commands For Statistics 100

Uploaded by

Simon Maya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Summary of R commands for Statistics 100

Statistics 100 – Fall 2011

Professor Mark E. Glickman

The following is a summary of R commands we will be using throughout Statistics 100, and maybe
a few extras we will not end up using. Please refer to the homework and course notes for examples
of their usage, including the appropriate arguments of the commands. In the descriptions below,
fnc is an arbitrary R command.

Reading, viewing, and assigning data in R:

y = fnc(x) − assigns the results of the function fnc evaluated at x to the variable y.

file.choose() − navigates to a data file on your computer.

read.table(fname) − reads data into R from file fname.

read.csv(fname) − reads data into R from a comma-separated value file fname

data.frame(...) − creates a data frame within R.

View(x) − view data frame x within R. Can also just type the name of the data frame at the
prompt.

help(fnc) − help page for function “fnc”.

Descriptive statistics:

summary(x) − data summary of x.

mean(x) − sample mean of x.

sd(x) − sample standard deviation of x.

length(x) − number of values in x.

table(x) − for categorical variable x, creates vector of counts of each unique category.

cor(x,y) − correlation between x and y.

by(y,x,fnc) − with categorical x and function fnc, carry out fnc(y) for each level of x.

1
Graphics:

hist(x) − histogram of data in x.

stem(x) − stem and leaf plot of data in x.

plot(x,y) − scatter plot of y against x.

lines(supsmu(x,y)) − add smoother to existing scatter plot.

boxplot(list(x1,x2,...)) − side-by-side boxplots of variables x1, x2, etc.

boxplot(y ~ x) − alternative method for boxplots if y is quantitative and x is categorical.

barplot(x) − barplot of x (where x contains the heights of the bars).

abline(a,b) − add the line y = a + bx to an existing plot.

abline(h=a) − add a horizontal line at y = a to an existing plot.

abline(v=a) − add a vertical line at x = a to an existing plot.

abline(model.fit) − add a regression line based on the model model.fit to an existing plot.

qqnorm(x) − normal probability plot of data in x.

qqline(x) − adds a line to a normal probability plot passing through 1Q and 3Q

Probability distribution computations:

dbinom(x, n, p) − P(X = x) where X ∼ B(n, p)

pnorm(x, mean, sd) − P(X < x) where X ∼ N(mean, sd)

qnorm(p, mean, sd) − the value of x in p = P(X < x), where X ∼ N(mean, sd)

pt(x, df) − P(X < x) where X ∼ t(df)

qt(p, df) − the value of x in p = P(T < x), where T ∼ t(df)

pchisq(x, df) − P(X 2 < x) where X 2 ∼ χ2 (df)

Random sampling (without replacement):

sample(n) − a random arrangement of the first n positive integers.

sample(n, size) − a random sample of size values from among the first n positive integers.

2
Statistical inference:

t.test(x, mu) − one-sample t-test or confidence interval with data in x, with null hypothesized
value mu.

t.test(x1, x2) − two-sample t-test or confidence interval for difference in means with data in
x1 and x2

t.test(y ~ x, data=data.df) − alternative method for two-sample t-test; y is the quantitative

response and x is binary categorical variable in data frame data.df.

prop.test(x, n, p) − one-sample z-test or confidence interval for a Binomial probability, with

x successes in a sample size of n, and a hypothesized probability p.

prop.test(x, n) − two-sample z-test or confidence interval for difference in Binomial probabili-

ties, with x containing two counts of successes, and n containing two sample sizes.

mcnemar.test(x) − McNemar’s test for difference in Binomial probabilities with paired data,
with x containing 2 × 2 data frame.

aov(y ~ x, data=data.df) − analysis of variance of response y on categorical variable x con-

tained in data frame data.df.

lm(y~x1+x2+x3+..., data=data.df) − least-squares regression of y on x1, x2, etc., within data

frame data.df.

glm(y~x1+x2+x3+..., family=binomial, data=data.df) − logistic regression of y on x1, x2,

etc., within data frame data.df.

summary(model.fit) − summarize model.fit, the results of either analysis of variance, least-

squares regression, or logistic regression.

step(model.fit) − stepwise variable selection for least-squares or logistic regressions, with largest
model in model.fit.

predict(model.fit, newdata=newdata.df) − prediction of least-squares or logistic regression

model in model.fit using data in newdata.df.

fitted(model.fit) − fitted values from model.fit.

residuals(model.fit) − residuals from model.fit.

chisq.test(x, p) − chi-squared goodness-of-fit test, with vector of counts in x and vector of

probabilities in p.

chisq.test(x) − chi-squared test of independence, with counts in x as a data frame.

R Questions With Solution
No ratings yet
R Questions With Solution
11 pages
Analysing Data Using Linear Models 5th Ed January 2021
No ratings yet
Analysing Data Using Linear Models 5th Ed January 2021
388 pages
R-Web-Appendix of Foundations of Statistics For Data Scientists
No ratings yet
R-Web-Appendix of Foundations of Statistics For Data Scientists
122 pages
R语言学习笔记
No ratings yet
R语言学习笔记
78 pages
Essential R
No ratings yet
Essential R
261 pages
Lucero R Tutorial 2016
No ratings yet
Lucero R Tutorial 2016
135 pages
Boulder Handout 2019
No ratings yet
Boulder Handout 2019
187 pages
ComputerLabNotes 2024
No ratings yet
ComputerLabNotes 2024
109 pages
Shipunov Visual Statistics
No ratings yet
Shipunov Visual Statistics
429 pages
Econometrics I - R Summary (Maite Cabeza-Gutes)
No ratings yet
Econometrics I - R Summary (Maite Cabeza-Gutes)
77 pages
STAT319 Lab Manual Based On R - Final Version
No ratings yet
STAT319 Lab Manual Based On R - Final Version
127 pages
Visual Statistics Use R
No ratings yet
Visual Statistics Use R
451 pages
R Practice
No ratings yet
R Practice
38 pages
Mathematical Computations Using R
No ratings yet
Mathematical Computations Using R
53 pages
STAT-1000---Worksheet-2 (1)
No ratings yet
STAT-1000---Worksheet-2 (1)
14 pages
STAT-1000---Worksheet-2
No ratings yet
STAT-1000---Worksheet-2
14 pages
R Code
No ratings yet
R Code
13 pages
Krijnen IntroBioInfStatistics
No ratings yet
Krijnen IntroBioInfStatistics
278 pages
unit3_R[1] (1)
No ratings yet
unit3_R[1] (1)
30 pages
R Manual PDF
No ratings yet
R Manual PDF
78 pages
Statistics With R Programming PDF
No ratings yet
Statistics With R Programming PDF
53 pages
Applied Statistics For Bioinformatics PDF
No ratings yet
Applied Statistics For Bioinformatics PDF
278 pages
Intro To R Software
No ratings yet
Intro To R Software
7 pages
R For Data Exploration
No ratings yet
R For Data Exploration
52 pages
Questions With No Solutions
No ratings yet
Questions With No Solutions
20 pages
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
No ratings yet
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
50 pages
AP Statistics Michel Liao
No ratings yet
AP Statistics Michel Liao
20 pages
R Intro 2011
No ratings yet
R Intro 2011
115 pages
R CODES
No ratings yet
R CODES
5 pages
Difference Between (Median, Mean, Mode, Range, Midrange) (Descriptive Statistics)
No ratings yet
Difference Between (Median, Mean, Mode, Range, Midrange) (Descriptive Statistics)
11 pages
McCright A.M. (Ed.), Clark T.N. (Ed.) - Community and Ecology, Volume 10 - Dynamics of Place, Sustainability, and Politics (Research in Urban Policy) (2006) PDF
100% (1)
McCright A.M. (Ed.), Clark T.N. (Ed.) - Community and Ecology, Volume 10 - Dynamics of Place, Sustainability, and Politics (Research in Urban Policy) (2006) PDF
319 pages
Unit3__R
No ratings yet
Unit3__R
19 pages
r file code
No ratings yet
r file code
16 pages
Lecture 2 - R Graphics PDF
No ratings yet
Lecture 2 - R Graphics PDF
68 pages
Ap Stats Cram Sheet: Symmetric - When The Left Half Is
No ratings yet
Ap Stats Cram Sheet: Symmetric - When The Left Half Is
7 pages
Common Stat 101 Commands For Rstudio: 1 One Categorical Variable
No ratings yet
Common Stat 101 Commands For Rstudio: 1 One Categorical Variable
5 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
Week13 - LAQs - SWR
No ratings yet
Week13 - LAQs - SWR
2 pages
Z-Chart & Loss Function v05
No ratings yet
Z-Chart & Loss Function v05
1 page
Complementary Error Function Table: X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X)
No ratings yet
Complementary Error Function Table: X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X) X Erfc (X)
1 page
R Functions List
No ratings yet
R Functions List
8 pages
Useful R Functions-1
No ratings yet
Useful R Functions-1
4 pages
Chapter - 3 Common Statistical Procedure
No ratings yet
Chapter - 3 Common Statistical Procedure
20 pages
Statistics Cheat Sheet
100% (1)
Statistics Cheat Sheet
4 pages
R Console
No ratings yet
R Console
6 pages
Francis Martin Fungi
No ratings yet
Francis Martin Fungi
474 pages
Resumo Adp
No ratings yet
Resumo Adp
5 pages
R
No ratings yet
R
4 pages
R Course
No ratings yet
R Course
7 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
11 pages
R Commands: Appendix B
No ratings yet
R Commands: Appendix B
5 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
8 pages
Manual
No ratings yet
Manual
52 pages
R Commands
No ratings yet
R Commands
5 pages
R Studio Cheat Sheet
No ratings yet
R Studio Cheat Sheet
6 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
UL2
No ratings yet
UL2
2 pages
BAN5
No ratings yet
BAN5
2 pages
Stat and Prob - Q3 Week 6 - Mod 6
No ratings yet
Stat and Prob - Q3 Week 6 - Mod 6
26 pages
Sim R
No ratings yet
Sim R
6 pages
CP&M - Lec 11-Pert
No ratings yet
CP&M - Lec 11-Pert
22 pages
(Focus on Biotechnology 3C) S. R. Weijers (auth.), Spiros N. Agathos, Walter Reineke (eds.) - Biotechnology for the Environment_ Wastewater Treatment and Modeling, Waste Gas Handling-Springer Netherla
No ratings yet
(Focus on Biotechnology 3C) S. R. Weijers (auth.), Spiros N. Agathos, Walter Reineke (eds.) - Biotechnology for the Environment_ Wastewater Treatment and Modeling, Waste Gas Handling-Springer Netherla
275 pages
Customer Churn Case Answers
No ratings yet
Customer Churn Case Answers
8 pages
Unit VI Stochastic Processes: Dr. Nita V. Patil Date:27/July/2021
0% (1)
Unit VI Stochastic Processes: Dr. Nita V. Patil Date:27/July/2021
50 pages
Normal Probability Curve: By: Keerthi Samuel.K, Lecturer Vijay Marie College of Nursing
No ratings yet
Normal Probability Curve: By: Keerthi Samuel.K, Lecturer Vijay Marie College of Nursing
22 pages
Prmia II
No ratings yet
Prmia II
49 pages
Locally Weighted Regression
No ratings yet
Locally Weighted Regression
17 pages
MAT102 - Statistics For Business - UEH-ISB - T3 2022 - Unit Guide - DR Chon Le
No ratings yet
MAT102 - Statistics For Business - UEH-ISB - T3 2022 - Unit Guide - DR Chon Le
12 pages
Iep 213 Handouts 3
No ratings yet
Iep 213 Handouts 3
6 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
downloadMathsA LevelS3Papers EdexcelJune20201220MS20 20S320Edexcel PDF
No ratings yet
downloadMathsA LevelS3Papers EdexcelJune20201220MS20 20S320Edexcel PDF
12 pages
Ba Yes Thinking W FM
No ratings yet
Ba Yes Thinking W FM
5 pages
Summative 3 SP
No ratings yet
Summative 3 SP
4 pages
Spring 2023 Signature Assignment ADA MAT 152 With Rubric Excel Bonanno 2
No ratings yet
Spring 2023 Signature Assignment ADA MAT 152 With Rubric Excel Bonanno 2
4 pages
Central Limit Theorem Examples and Exercises
No ratings yet
Central Limit Theorem Examples and Exercises
4 pages
The Black-Scholes Model: Liuren Wu
No ratings yet
The Black-Scholes Model: Liuren Wu
17 pages
Leese 1983
No ratings yet
Leese 1983
5 pages
Sta361: Time Series Analysis: T T T T
No ratings yet
Sta361: Time Series Analysis: T T T T
3 pages
Lampiran 5 Hasil Analisis SPSS 20 1. Karakteristik Responden
No ratings yet
Lampiran 5 Hasil Analisis SPSS 20 1. Karakteristik Responden
4 pages
Modelos Graficos
No ratings yet
Modelos Graficos
4 pages
Completely Randomized Design (One-Way ANOVA)
No ratings yet
Completely Randomized Design (One-Way ANOVA)
5 pages
Lecture 7 & 8 Brief Lecture Notes On Probability Distributions: Binomial, Poisson and Normal Distribution
No ratings yet
Lecture 7 & 8 Brief Lecture Notes On Probability Distributions: Binomial, Poisson and Normal Distribution
17 pages
Hypothesis Testing: Cee 3040 - Uncertainty Analysis in Engineering
No ratings yet
Hypothesis Testing: Cee 3040 - Uncertainty Analysis in Engineering
1 page
Econometrics and Softwar Applications (Econ 7031) Assignment
No ratings yet
Econometrics and Softwar Applications (Econ 7031) Assignment
8 pages
What Is Sample Size
No ratings yet
What Is Sample Size
6 pages
Logistic Regression Quiz: Pandas Version: 1.0.5 Seaborn Version: 0.10.1 Matplotlib Version: 3.2.1 Sklearn Version: 0.23.1
50% (2)
Logistic Regression Quiz: Pandas Version: 1.0.5 Seaborn Version: 0.10.1 Matplotlib Version: 3.2.1 Sklearn Version: 0.23.1
1 page
Aps U9 Test Review Key
No ratings yet
Aps U9 Test Review Key
5 pages
Fram Compre
No ratings yet
Fram Compre
2 pages
Oooo 1.: 5. The Sum of The Percent Frequencies For All Classes Will Always Equal
No ratings yet
Oooo 1.: 5. The Sum of The Percent Frequencies For All Classes Will Always Equal
6 pages
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
Hexagon Number Sense
From Everand
Hexagon Number Sense
Christopher Casey
No ratings yet
Calculus I Essentials
From Everand
Calculus I Essentials
Editors of REA
1/5 (1)
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)

Summary of R Commands For Statistics 100

Uploaded by

Summary of R Commands For Statistics 100

Uploaded by

Summary of R commands for Statistics 100

Statistics 100 – Fall 2011

Reading, viewing, and assigning data in R:

file.choose() − navigates to a data file on your computer.

read.table(fname) − reads data into R from file fname.

read.csv(fname) − reads data into R from a comma-separated value file fname

data.frame(...) − creates a data frame within R.

help(fnc) − help page for function “fnc”.

summary(x) − data summary of x.

mean(x) − sample mean of x.

sd(x) − sample standard deviation of x.

length(x) − number of values in x.

cor(x,y) − correlation between x and y.

hist(x) − histogram of data in x.

stem(x) − stem and leaf plot of data in x.

plot(x,y) − scatter plot of y against x.

lines(supsmu(x,y)) − add smoother to existing scatter plot.

boxplot(list(x1,x2,...)) − side-by-side boxplots of variables x1, x2, etc.

boxplot(y ~ x) − alternative method for boxplots if y is quantitative and x is categorical.

barplot(x) − barplot of x (where x contains the heights of the bars).

abline(a,b) − add the line y = a + bx to an existing plot.

abline(h=a) − add a horizontal line at y = a to an existing plot.

abline(v=a) − add a vertical line at x = a to an existing plot.

qqnorm(x) − normal probability plot of data in x.

qqline(x) − adds a line to a normal probability plot passing through 1Q and 3Q

Probability distribution computations:

dbinom(x, n, p) − P(X = x) where X ∼ B(n, p)

pnorm(x, mean, sd) − P(X < x) where X ∼ N(mean, sd)

pt(x, df) − P(X < x) where X ∼ t(df)

qt(p, df) − the value of x in p = P(T < x), where T ∼ t(df)

pchisq(x, df) − P(X 2 < x) where X 2 ∼ χ2 (df)

Random sampling (without replacement):

sample(n) − a random arrangement of the first n positive integers.

t.test(y ~ x, data=data.df) − alternative method for two-sample t-test; y is the quantitative

prop.test(x, n, p) − one-sample z-test or confidence interval for a Binomial probability, with

prop.test(x, n) − two-sample z-test or confidence interval for difference in Binomial probabili-

aov(y ~ x, data=data.df) − analysis of variance of response y on categorical variable x con-

lm(y~x1+x2+x3+..., data=data.df) − least-squares regression of y on x1, x2, etc., within data

glm(y~x1+x2+x3+..., family=binomial, data=data.df) − logistic regression of y on x1, x2,

summary(model.fit) − summarize model.fit, the results of either analysis of variance, least-

predict(model.fit, newdata=newdata.df) − prediction of least-squares or logistic regression

fitted(model.fit) − fitted values from model.fit.

residuals(model.fit) − residuals from model.fit.

chisq.test(x, p) − chi-squared goodness-of-fit test, with vector of counts in x and vector of

chisq.test(x) − chi-squared test of independence, with counts in x as a data frame.

You might also like