0% found this document useful (0 votes)

14 views9 pages

R Cheat Sheet

This document is an R cheat sheet that provides generic syntax for common R functions and analyses, including data input/output, manipulation, summary statistics, basic graphics, classical bivariate tests, and modeling techniques. It covers essential topics such as reading data, creating factors, performing t-tests and chi-squared tests, and fitting linear and logistic regression models. The cheat sheet is structured with sections and subsections for easy reference and includes example code snippets for practical application.

Uploaded by

hayeg2024

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views9 pages

R Cheat Sheet

Uploaded by

hayeg2024

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

R cheat sheet: C.

Hurst

R cheat sheet

Contents
1 Data I/O and manipulation 2
1.1 Reading in data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 (very) Basic manipulation of a data frame . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Creating factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

2 Data summary 3
2.1 Summary statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.1.1 Summarizing categorical variables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.1.2 Desriptive statistics for a continuous variable . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.2 Basic graphics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

3 Classical bivariate tests 5

3.1 χ2 test of independance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2 Independant and paired t-tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.3 Correlation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

4 Modelling 6
4.1 Linear regression and the general linear model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
4.2 Logistic regression and other GLMs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4.3 Longitudinal and other correlated outcomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4.3.1 Continuous longitudinal or otherwise correlated outcomes . . . . . . . . . . . . . . . . . . . . . 8
4.3.2 Categorical longitudinal or otherwise correlated outcomes . . . . . . . . . . . . . . . . . . . . . 8

Preamble
This document is to provide generic syntax for many of the more common R functions and analyses you are likely to
use. Any idenitfier (name) prefixed with ’my” is generic and is not to be taken literately (i.e. Your code shouldn’t
have these names in it), instead you should adapt the code for your own purposes. Specifically:
• my.y, my.x1 and my.x2 are three continuous variables with my.y assumed to be the outome (endpoint) variable
and my.x1 and my.x2 assumed to be predictors.

• my.a and my.b are two categorical variables that are initially just number coded (i.e. R doesn’t yet know they
are categorical)
• my.a.fac and my.b.fac are the ”Factors” correponding to my.a and my.b (see code below)
• my.c.fac.within is a within-subject effect (e.g. for longitudinal data)

• my.pat.id is a variable that idenitfies the patient (for longitudinal data)

• myfoo.csv is some comma delimited texte file (can be created by excel) that stores our data
• mydata.df is an R data frame that holds all of the above variables

1
R cheat sheet: C.Hurst

1 Data I/O and manipulation

1.1 Reading in data
There are MANY ways to input data in R. Personally, I like to keep it simple. I tend to just use comma delimited
text file, or ”csv” files that I have created in R.

READING IN DATA

##Read in data from a comma delimited file##

#1. Set working directory (note forwardslash)

setwd("C:/myrdata")

#2. Read data into mydata.df (dataframe)

mydata.df<-read.csv("somedatafile.csv")

Also, see the foreign library help to see how to read in files from other statistics packages like Stata or SPSS.

1.2 (very) Basic manipulation of a data frame

Now let’s do some very simple manipulation of the data. Note that R is VERY powerful when it comes to data
manipulation (subsetting etc). I will only touch on it VERY briefly here. Look up some of the free texts available for
R for MUCH more in this area.

BASIC SUBSETTING

#1a. New data frame with subset of variables (cols 1 to 3 only)

mynew.df<-mydata.df[,c(1:3)]

#1b. New df with subset of variables (excluding cols 1 to 3)

mynew.df<-mydata.df[,-c(1:3)]

#1c. Alternatively, use the variable names

mynew.df<-mydata.df[,c("age", "sex", "bmi")]

#2a. Now a subset of row (e.g. males only)

mynew.df<-mydata.df[mydata.df$sex==0,]

#2b. Or only patients less than 60 years old

mynew.df<-mydata.df[mydata.df$age<60,]

2
R cheat sheet: C.Hurst

1.3 Creating factors

Factors are useful because they are our way of telling R that we want a variable to be considered as categorical. If
a variable is a text variable (e.g. ”Male” and ”Female”), R automatically makes it a factor. BUT if a variable is
coded numerically (e.g. Sex = 0 and 1), we have to let R know that this is not a continuous variable We do this using
factors. This is also a useful way of providing labels for our variables. Now how do we create these factors.

CREATING FACTORS

#1. Create a factor for gender

# (currently coded as 0[males] and 1[females])
sex.fac<-factor(sex, labels=c("Males", "Females"))

#2. Now see if it works

table(sex.fac)

2 Data summary
2.1 Summary statistics
2.1.1 Summarizing categorical variables
To tabulate a single categorical variable (frequency table):

FREQUENCY TABLES

#1. Generate feq table for A factor

my.tab<-table(mydata.df$mya.fac)
my.tab

To cross tabulate two categorical variable

CROSS TABULATIONS

#1. Generate X-tab of A by B factor variables

my.tab<-table(mydata.df$mya.fac, mydata.df$myb.fac)
my.tab

3
R cheat sheet: C.Hurst

2.1.2 Desriptive statistics for a continuous variable

Many ways to do this, but I think it is easiest to use the R library psych (You may need to download this from CRAN
first).

#Load psych library into memory

library(psych)

#Generate descriptive stats for the continuous variable ’myx’

describe(mydata.df$myx)

#Generate descriptive stats for ’myx’ by groups (e.g.by gender)

describeBy(mydata.df$myx, group=mydata.df$mya.fac)

Note the describe function gives you most summary stats (mean, median, sd, IQR, range, min, max, n, nmiss etc)

2.2 Basic graphics

Now for continuous variable (univariate). To get a histogram and a boxplot for a single variable

UNIVARIATE GRAPHICS FOR A CONTINUOUS VARIABLE

#1. Histogram of continuous variable

hist(mydata.df$my.y)

#2. Boxplot of continuous variable

boxplot(mydata.df$my.y)

and now the bivariate relationship between a continuous outcome in terms of a categorical explanatory (factor)

BIVARIATE CONTINUOUS Y AND CATEGORICAL X

#1. Side-by-side boxplot

boxplot(my.y~my.a, data=mydata.df)

Now for two continuous variables.

BIVARIATE CONTINUOUS Y AND CONTINUOUS X

4
R cheat sheet: C.Hurst

#1. Scatter plot

plot(x=mydata.df$my.x, y=mydata.df$my.y)

3 Classical bivariate tests

Now for the classical bivariate tests. I will consider three:
1. χ2 test of independance

2. t-tests (both independant and paired)

3. Correlation (Pearson’s and Spearman’s)

3.1 χ2 test of independance

χ2 TEST OF INDEPENDANCE

#1. Generate X-tab of A by B factor variables

my.tab<-table(mydata.df$mya.fac, mydata.df$myb.fac)

#2. Input table into the chi.sq test

chisq.test(my.tab)

3.2 Independant and paired t-tests

T-TESTS

#1. Indpendant t-test

t.test(my.y ~ my.a, data = mydata.df)

#2. Paired t-test

t.test(y1, y2, paired=TRUE, data = mynewdata.df)
#Note data needs to be in wide format for this type of test

3.3 Correlation
CORRELATION

5
R cheat sheet: C.Hurst

#1. Pearson correlation

cor(mydata.df$my.x1, mydata.df$my.x2 , method ="pearson")
#Note Pearson’s is default so you could just write
cor(mydata.df$my.x1,mydata.df$my.x2)

#1b. Above only generates coeffeicent, need to tests using

cor.test(mydata.df$my.x1,mydata.df$my.x2)

2. Spearmans
cor(mydata.df$my.x1, mydata.df$my.x2 , method ="spearman")

4 Modelling
One of the things you will notice about R is that modelling is very simple. Once you understand the basics of mod-
eling in one situation (e.g. a continous outcome), extending to other types and outcomes and situations is very easy.
Modelling in R is based on the concept of a formula:

y ∼ x1 + x2
in a linear regression (assumeing y, x1 and x2 are continuous) implies:

y = β0 + β1 x 1 + β2 x 2

If this same formula is used (for example) in a Poisson regression (for count data) using a log link then

y ∼ x1 + x2
implies

y = eβ0 +β1 x1 +β2 x2

Regardless, the same basic form of the formula is used throughout all R modelling

4.1 Linear regression and the general linear model

Linear regression and the general linear model use exactly the same model (lm(), they only differ in that factors
(categorical predictors) will occur in the general linear model

LINEAR REGRESSION AND THE GENERAL LINEAR MODEL

#1. A multivariable linear regression

my.model<-lm(my.y~my.x1+my.x2, data=mydata.df)

#1b. Get summary (R-sq and coeffcients) and ANOVA table

summary(my.model)

6
R cheat sheet: C.Hurst

anova(my.model)

2. A general linear model (using a temporary factors)

my.model<-lm(my.y~my.x1+as.factor(my.a), data=mydata.df)

2b. ...or defining factor (permanantly) first

mya.fac<-factor(my.a, labels=c("low", "med", "high")
my.model<-lm(my.y~my.x1+mya.fac, data=mydata.df)

2c. Now get summary and ANOVA table

summary(my.model)
anova(my.model)

4.2 Logistic regression and other GLMs

The Generalized Linear Model (GLM) in R is a simple extension of the command used for the standard Linear Model
(Linear Regression, ANOVA and the General Linear Model). Note that the only REAL difference (as far as coding
goes), is the addition of the family argument. Here we will use Binary Logistic regression as an example, but we can
also fit any of the other GLMs (e.g. Possion regression) using the same command (just change the ’family’ argument).
Assume that the outcome variable, my.a is a binary outcome variable:

#1. A binary logistic regression

my.model<-lm(my.a~my.x1+my.x2, data=mydata.df, family=binomial)

#1b. Get summary (R-sq and coeffcients) and ANOVA table

summary(my.model)
anova(my.model)

#Note I wrote the below function myself-get the R code from me

print.ORCIs.glmm.wald(my.model)

4.3 Longitudinal and other correlated outcomes

Longitudinal data (and correlated data) is where we have several observations associated with each observational unit.
For example:
• Longitudianl data: We have several observations (over time) for each patient
• For clustered data (e.g. a multicentre study), we might have many patients associated with each clinic or hospital
(and to not account for this in our analysis would render our results invalid).

Here, I will just give an example of a longitudunal analysis (using the patient identifier my.pat.id), but the
approach used for clustered data (e.g. Hospital ID) is very similar.

7
R cheat sheet: C.Hurst

4.3.1 Continuous longitudinal or otherwise correlated outcomes

For a continuous outcome variable, we would use a Linear Mixed Model (LMM) to analysis our data. Typically we
will have a ’within-subject effect’ (e.g. Time), and one or more ’between level effect(s)’ (e.g. Treatment, Gender).
Between-subject effects are the ones we are used to.

Note that there are several R libraries for LMMs. My prefered library is called lme4. Again, the first time you
use this, you may have to download from CRAN.

library(lme4)

#1. A linear mixed model (Random intercept model)

# Note: As long as you specify the patient ID, R can workout
# the between- and within-subject effects itself
my.model<-lmer(my.y~my.x1+my.c.fac.within + (1|my.pat.id), data=mydata.df)

#1b. Summaries model and coeffcients

summary(my.model)
anova(my.model)
#Note I wrote the below function myself-get the R code from me
print.BetaCI.lmm(my.model)

#2. A linear mixed model (Random coeffcients model)

my.model<-lmer(my.y~my.x1+my.c.fac.within + (my.c.fac.within|my.pat.id), data=mydata.df)

#2b. Summaries model and coeffcients

summary(my.model)
anova(my.model)
print.BetaCI.lmm(my.model)

4.3.2 Categorical longitudinal or otherwise correlated outcomes

Like a standard linear regession (Genral Linear Model) can be extended to a Generalized Linear model, a LMM can
also be extended to a Generalized Linear Mixed Model. In terms of R, the only real difference is the addition of the
family argument.

library(lme4)

#1. A Generalized linear mixed model (Random intercept model)

# Note: This is a binary logistic mixed effect regression
# Assume that my.a is a binary avriable
my.model<-glmer(my.a~my.x1+my.c.fac.within+(1|my.pat.id), data=mydata.df, family=binomial)

#1b. Summaries model and coeffcients

8
R cheat sheet: C.Hurst

summary(my.model)
anova(my.model)
#Note I wrote the below function myself-get the R code from me
print.ORCIs.glmm.wald(my.model)

#Note: As with the LMM we can also fit a Random coeffcients

#version of the GLMM

LAB 3 Triangle of Forces
67% (3)
LAB 3 Triangle of Forces
3 pages
Extra Practice - Listening
No ratings yet
Extra Practice - Listening
7 pages
Krijnen IntroBioInfStatistics
No ratings yet
Krijnen IntroBioInfStatistics
278 pages
Basic R Commands For Data Analysis
No ratings yet
Basic R Commands For Data Analysis
7 pages
Essential R
No ratings yet
Essential R
261 pages
Visual Statistics Use R!
50% (2)
Visual Statistics Use R!
388 pages
Visual Statistics Use R PDF
No ratings yet
Visual Statistics Use R PDF
388 pages
Shipunov Visual Statistics
No ratings yet
Shipunov Visual Statistics
429 pages
Visual Statistics Use R
No ratings yet
Visual Statistics Use R
451 pages
Rintro
No ratings yet
Rintro
42 pages
Module - 4 (R Training) - Basic Stats & Modeling
No ratings yet
Module - 4 (R Training) - Basic Stats & Modeling
15 pages
Mendenhall R
No ratings yet
Mendenhall R
14 pages
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
No ratings yet
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
58 pages
Lab0 R Tutorial EHS
No ratings yet
Lab0 R Tutorial EHS
9 pages
R语言学习笔记
No ratings yet
R语言学习笔记
78 pages
Useful R Functions-1
No ratings yet
Useful R Functions-1
4 pages
Lecture 1
No ratings yet
Lecture 1
167 pages
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
No ratings yet
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
50 pages
BM-1, Applied Statistics, Lesson 2: Comparing Two Groups (And One Group)
No ratings yet
BM-1, Applied Statistics, Lesson 2: Comparing Two Groups (And One Group)
39 pages
Commands For Data Analysis Using R
No ratings yet
Commands For Data Analysis Using R
11 pages
Basic Statistics
No ratings yet
Basic Statistics
66 pages
R Course
No ratings yet
R Course
7 pages
Statistics With R
No ratings yet
Statistics With R
20 pages
R Manual PDF
No ratings yet
R Manual PDF
78 pages
Statistical Modelling
No ratings yet
Statistical Modelling
39 pages
R For Health Data Science 1st Edition Complete Volume Download
No ratings yet
R For Health Data Science 1st Edition Complete Volume Download
15 pages
R Commands
No ratings yet
R Commands
18 pages
R Programming Slides
No ratings yet
R Programming Slides
73 pages
R Studio Notes
No ratings yet
R Studio Notes
10 pages
Applied Statistics For Bioinformatics PDF
No ratings yet
Applied Statistics For Bioinformatics PDF
278 pages
DA Lab Week-1
No ratings yet
DA Lab Week-1
7 pages
Lucero R Tutorial 2016
No ratings yet
Lucero R Tutorial 2016
135 pages
Unit 3
No ratings yet
Unit 3
36 pages
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
No ratings yet
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
15 pages
R Programming Cheat Sheet
No ratings yet
R Programming Cheat Sheet
7 pages
Module2 BDA
No ratings yet
Module2 BDA
44 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Notes
No ratings yet
Notes
6 pages
01 IntroSlides
No ratings yet
01 IntroSlides
43 pages
Boulder Handout 2019
No ratings yet
Boulder Handout 2019
187 pages
Module 3 R Data Science
No ratings yet
Module 3 R Data Science
158 pages
R With RCMDR: Basic Instructions: 1 Running & Installation R Under Windows
No ratings yet
R With RCMDR: Basic Instructions: 1 Running & Installation R Under Windows
29 pages
Workshop Activity: X Seq y Length
No ratings yet
Workshop Activity: X Seq y Length
3 pages
Final Cost Practical
No ratings yet
Final Cost Practical
29 pages
Simple Tutorial in R
No ratings yet
Simple Tutorial in R
15 pages
Intro To R Software
No ratings yet
Intro To R Software
7 pages
Business Analytics - L2
No ratings yet
Business Analytics - L2
41 pages
R Tutorial #1: Applied Econometrics (Econ3005)
No ratings yet
R Tutorial #1: Applied Econometrics (Econ3005)
21 pages
R File Code
No ratings yet
R File Code
16 pages
R For Data Exploration
No ratings yet
R For Data Exploration
52 pages
R Programming
No ratings yet
R Programming
47 pages
R With RCMDR: Basic Instructions: 1 Running & Installation R Under Windows
No ratings yet
R With RCMDR: Basic Instructions: 1 Running & Installation R Under Windows
23 pages
R1 Uptovisualisation
No ratings yet
R1 Uptovisualisation
122 pages
Modeling and Visulizing Data Using R: A Practical Introduction
No ratings yet
Modeling and Visulizing Data Using R: A Practical Introduction
106 pages
R Commands
No ratings yet
R Commands
5 pages
Presentation of R
No ratings yet
Presentation of R
109 pages
UL2
No ratings yet
UL2
2 pages
Variables & Chart
No ratings yet
Variables & Chart
60 pages
DSR 2879
No ratings yet
DSR 2879
25 pages
Unlocking Statistics for the Social Sciences
From Everand
Unlocking Statistics for the Social Sciences
Norma Sinclair
No ratings yet
A Discourse Analysis of 1 Peter
From Everand
A Discourse Analysis of 1 Peter
Ervin Ray Starwalt
No ratings yet
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
From Everand
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
Vladimir Kiselev
No ratings yet
Dyslexia Powerpoint
No ratings yet
Dyslexia Powerpoint
13 pages
Consumer Behavior and Utility Maximization: Ap Economics - Chapter 5
100% (1)
Consumer Behavior and Utility Maximization: Ap Economics - Chapter 5
35 pages
05.contraction of Skeletal Muscle
No ratings yet
05.contraction of Skeletal Muscle
87 pages
Racism Essay
67% (3)
Racism Essay
3 pages
Process Diagrams
100% (1)
Process Diagrams
139 pages
Milgram Questions
No ratings yet
Milgram Questions
2 pages
Promot Electric Purchase Order: Bill To
No ratings yet
Promot Electric Purchase Order: Bill To
2 pages
IGEH Cuadernillo 1ro
No ratings yet
IGEH Cuadernillo 1ro
61 pages
Matrix Theory and Linear Algebra
100% (1)
Matrix Theory and Linear Algebra
4 pages
Selling Skills MMS I NOTES
No ratings yet
Selling Skills MMS I NOTES
48 pages
AMSOIL Synthetic Motor Oils For OE Oil Change Interval. 3000 Mile Oil Change
No ratings yet
AMSOIL Synthetic Motor Oils For OE Oil Change Interval. 3000 Mile Oil Change
2 pages
DGR 61st Edition Checklist For A Radioactive Shipment 11
No ratings yet
DGR 61st Edition Checklist For A Radioactive Shipment 11
1 page
Final Examination: (Oral Communication)
100% (1)
Final Examination: (Oral Communication)
4 pages
КТЖ 10 Action 68 Сағат Жаңа
No ratings yet
КТЖ 10 Action 68 Сағат Жаңа
9 pages
TS EAPCET Top To Bottom Colleges List
No ratings yet
TS EAPCET Top To Bottom Colleges List
4 pages
BCC-351 - Notice of Mini Project Presentation Along With Group List and Title.
No ratings yet
BCC-351 - Notice of Mini Project Presentation Along With Group List and Title.
2 pages
Faq Eng
No ratings yet
Faq Eng
2 pages
Caffeine Experiment
No ratings yet
Caffeine Experiment
6 pages
6c. Beam Deflection: Deflect V (X)
No ratings yet
6c. Beam Deflection: Deflect V (X)
2 pages
Q3 - ENGLISH - MOD 2 - Create or Expand Word Clines
No ratings yet
Q3 - ENGLISH - MOD 2 - Create or Expand Word Clines
24 pages
Space and Culture Using Space Syntax For The Tenganan Pageringsingan Housing of Bali, Indonesia
No ratings yet
Space and Culture Using Space Syntax For The Tenganan Pageringsingan Housing of Bali, Indonesia
5 pages
Hall and Wayman 1990
No ratings yet
Hall and Wayman 1990
25 pages
Top Ranked of Gtu List
No ratings yet
Top Ranked of Gtu List
6 pages
Limit, Fit and Tolerance
100% (1)
Limit, Fit and Tolerance
14 pages
Customer Service Officer Robinsons Place Cebu: Profile
No ratings yet
Customer Service Officer Robinsons Place Cebu: Profile
1 page
Cse Department - ANNA UNIVERSITY Important Question and Answers - Regulation 2013,2017 - STUDY MATERIAL, Notes
No ratings yet
Cse Department - ANNA UNIVERSITY Important Question and Answers - Regulation 2013,2017 - STUDY MATERIAL, Notes
5 pages
(21st Century Skills Library - Cool Military Careers) Josh Gregory-Avionics Technician-Cherry Lake Publishing (2012)
No ratings yet
(21st Century Skills Library - Cool Military Careers) Josh Gregory-Avionics Technician-Cherry Lake Publishing (2012)
36 pages
Writing Successful Undergraduate Dissertations in Social Sciences A Student S Handbook 2nd Edition Francis Jegede Download
No ratings yet
Writing Successful Undergraduate Dissertations in Social Sciences A Student S Handbook 2nd Edition Francis Jegede Download
46 pages

R Cheat Sheet

Uploaded by

R Cheat Sheet

Uploaded by

R cheat sheet: C.

3 Classical bivariate tests 5

• my.pat.id is a variable that idenitfies the patient (for longitudinal data)

1 Data I/O and manipulation

##Read in data from a comma delimited file##

#1. Set working directory (note forwardslash)

#2. Read data into mydata.df (dataframe)

1.2 (very) Basic manipulation of a data frame

#1a. New data frame with subset of variables (cols 1 to 3 only)

#1b. New df with subset of variables (excluding cols 1 to 3)

#1c. Alternatively, use the variable names

#2a. Now a subset of row (e.g. males only)

#2b. Or only patients less than 60 years old

1.3 Creating factors

#1. Create a factor for gender

#2. Now see if it works

#1. Generate feq table for A factor

To cross tabulate two categorical variable

#1. Generate X-tab of A by B factor variables

2.1.2 Desriptive statistics for a continuous variable

#Load psych library into memory

#Generate descriptive stats for the continuous variable ’myx’

#Generate descriptive stats for ’myx’ by groups (e.g.by gender)

2.2 Basic graphics

UNIVARIATE GRAPHICS FOR A CONTINUOUS VARIABLE

#1. Histogram of continuous variable

#2. Boxplot of continuous variable

BIVARIATE CONTINUOUS Y AND CATEGORICAL X

#1. Side-by-side boxplot

Now for two continuous variables.

BIVARIATE CONTINUOUS Y AND CONTINUOUS X

#1. Scatter plot

3 Classical bivariate tests

2. t-tests (both independant and paired)

3.1 χ2 test of independance

#1. Generate X-tab of A by B factor variables

#2. Input table into the chi.sq test

3.2 Independant and paired t-tests

#1. Indpendant t-test

#2. Paired t-test

#1. Pearson correlation

#1b. Above only generates coeffeicent, need to tests using

y = eβ0 +β1 x1 +β2 x2

4.1 Linear regression and the general linear model

LINEAR REGRESSION AND THE GENERAL LINEAR MODEL

#1. A multivariable linear regression

#1b. Get summary (R-sq and coeffcients) and ANOVA table

2. A general linear model (using a temporary factors)

2b. ...or defining factor (permanantly) first

2c. Now get summary and ANOVA table

4.2 Logistic regression and other GLMs

#1. A binary logistic regression

#1b. Get summary (R-sq and coeffcients) and ANOVA table

#Note I wrote the below function myself-get the R code from me

4.3 Longitudinal and other correlated outcomes

4.3.1 Continuous longitudinal or otherwise correlated outcomes

#1. A linear mixed model (Random intercept model)

#1b. Summaries model and coeffcients

#2. A linear mixed model (Random coeffcients model)

#2b. Summaries model and coeffcients

4.3.2 Categorical longitudinal or otherwise correlated outcomes

#1. A Generalized linear mixed model (Random intercept model)

#1b. Summaries model and coeffcients

#Note: As with the LMM we can also fit a Random coeffcients

You might also like