0% found this document useful (0 votes)

28 views7 pages

R Programming Cheat Sheet

Uploaded by

Mohiuddin Ahmed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views7 pages

R Programming Cheat Sheet

Uploaded by

Mohiuddin Ahmed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

This cheat sheet provides a quick reference for essential R programming Basics
Statistics
commands, helping you perform data manipulation, visualization, and install.packages, library,
mean, median, sd, cor, lm
assignment (<-), print, class
statistical analysis with confidence. It covers foundational topics like installing
packages and understanding R's data structures, alongside advanced tasks
Data Structures
such as building models and applying machine learning techniques.
  Programming
c, list, matrix, data.frame,
if, for, while, function, apply
df$a or df

Each section includes concise syntax and practical examples to illustrate how R
commands are used in real-world scenarios. You'll find guidance on working Data Manipulation Machine Learning
with vectors, lists, matrices, and data frames, performing common data filter, select, mutate, Matrices, Linear Model, Visualize

summarize, arrange Residuals

wrangling tasks like filtering and summarizing, and creating visualizations
such as histograms, bar plots, and boxplots. The cheat sheet also highlights R's
File I/O
capabilities for statistical analysis with commands like mean, lm, and cor.
  Data Visualization
read.csv, write.csv, readRDS,
plot, barplot, hist, boxplot
saveRDS, list.files

Designed for clarity and accessibility, this resource is ideal for data analysts,
statisticians, and programmers seeking to enhance their workflows in R.
Whether you're exploring data, developing algorithms, or building
reproducible reports, this cheat sheet ensures you can quickly apply R's
powerful tools to your projects.

R Cheat Sheet
Basics Data Structures
Syntax for How to use Explained Syntax for How to use Explained

Install

install.packages("dplyr") Installs the dplyr package. Create Vector c(1, 2, 3) Combines elements into a vector.
Package

Load Package library(dplyr) Loads the dplyr package into the Create List list(a=1, b="two") Creates a list with named elements.
current R session.

Assignment <- 5 Create Matrix

Creates a matrix with 2 rows and 3
x Assigns value 5 to the variable x. matrix(1:6, nrow=2)
columns.

Create Data
Creates a data frame with columns a
Print Output print(x) Prints the value of x to the console. Frame
data.frame(a=1:3, b=4:6)
and b .

Examples of logical, integer, numeric, Access df$a | df[1, 1] Performs a logical OR operation between
125, 12.5, "Hello"
Literals and
TRUE, Element
Data Types and character literals in R. a column and a specific element.

Loading stringr Loads the stringr library to work

Extracting library(readr)
Uses parse_number to extract package
library(stringr)
with strings in R.
Numbers from data_frame <- mutate(data_frame, numeric values from string columns.
Strings column = parse_number(column)) Opening a f <- fromJSON('filename.json') Loads a JSON file into an R dataframe
JSON File using the jsonlite package.

Basic String str_sub("Dataquest is awesome", Extracts “Dataquest”

Indexing
as a substring
Creating a List new_list <- list("data scientist", Defines a list containing diverse data
9) by specifying start and end indices. types.
1, c(50000,40000), "programming
experience")

R Cheat Sheet
Data Manipulation Data Visualization

Syntax for How to use Explained Syntax for How to use Explained

Filter Rows filter(df, a > 2)

Filters rows where column a is greater Creating a
data %>% plot()
gg
Initialize a basic ggplot2 chart

than 2. Basic Plot without specifying any aesthetics.

Select

select(df, a, b) Selects specific columns by name.

Creating
data %>% plot(aes(x = variable_1,
gg
Plots subsets of data in separate
Columns Sub plots
y = variable_2)) + geom_line() + facets.

facet_wrap(~variable_3)
Mutate Adds a new column c as sum of a
mutate(df, c = a + b)
Columns and b .

Creating Bar Create a bar chart using gg plot2 ,

_frame %>% ggplot(aes(x =
variables to x and y axes.
data
Summarize C hart mapping
summarize(df, avg=mean(a)) Calculates mean of column a and variable_1, y = variable_2)) +
Data
returns as avg . geom_col()

Sorts rows by column a in

Arrange Rows arrange(df, desc(a))
Plotting Plots multiple columns on the same
descending order. data %>% ggplot(aes(x =
multiple
variable_1)) + geom_line(aes(y = axes using ggplot2 .
Importing
data <- Imports dataset into R using the columns
variable_2)) + geom_line(aes(y =

Data read_csv function from readr . variable_3))

read_csv("name_of_file_with_data.cs

v")

Scatter plots Generate scatterplots to visualize

ggplot(data = uber_trips, aes(x =
Summing Values Sums specified columns for each row
bivariate relationships in plot2 .
df %>% mutate(new_column_name = distance, y = cost)) + geom_point() gg
Across Rows and adds as a new column.
rowSums(.[1:3]))

Summing Values Sums specified rows for each column

Scatter plots ggplot(data = df, aes(x = predictor, x
Create scatterplots with y-a is labels
df %>% bind_rows(tibble(total = h Labels
Across Columns and adds as a new row.
wit y = response)) + geom_point() + formatted using commas instead of
colSums(across(everything()))))
scale_y_continuous(labels = scientific notation.

scales::comma)

Importing CSV dataframe <- V files into R using readr's

Read CS

files re d_ s () for efficient data

a c v
read_csv("name_of_the_dataset.csv")
import.

R Cheat Sheet
Data Visualization Statistics & Probability
Syntax for How to use Explained Syntax for How to use Explained

Scatterplot Plots a scatterplot with y-axis labels in Mean

ggplot(data = df, aes(x = mean(x) Calculates the mean of vector x .
with Comma comma format.
predictor, y = response)) +
Labels
scale_y_continuous(labels =
Median median(x) Calculates the median of vector x .
scales::comma) + geom_point()

Weighted Mean mean <- weighted.mean(x = Computes the weighted mean of a

Scatterplot Creates scatterplots of response vs numerical vector using specific
ggplot(data = df, aes(x = distribution, w = weights)
with Groups predictor, grouped by a categorical weights.
predictor, y = response)) +
variable.
geom_point() + facet_wrap(~
Standard Calculates the standard deviation
categorical_variable, ncol = 2) sd(x)
Deviation of x .

Correlation cor(x, y) Calculates correlation between x

Scatterplot Creates scatterplots of response vs and y .
ggplot(data = df, aes(x =
with Groups predictor, grouped by a categorical
predictor, y = response)) +
variable. Linear
geom_point() + facet_wrap(~ lm(y ~ x, data=df) Fits a linear regression model.
Model
categorical_variable, ncol = 2)
Types of
# Example Variables: Age Classify variables as Quantitative
Variables
Vertical Bar Creates a vertical bar chart to visualize (Quantitative), Gender (numerical) or Qualitative
ggplot(data = df, aes(x = col)) + (categorical).
Chart counts of data. (Qualitative)
geom_bar()

P-Value
if (p_value < 0.05) { print('Reject Decide on hypothesis rejection using
Grouped Bar Creates a grouped bar plot to compare Decision
ggplot(data = df, aes(x = col_1, null hypothesis') } else { a common p-value threshold of 0.05.
Plot frequency distributions of categorical Threshold
fill = col_2)) + geom_bar(position print('Fail to reject null
variables. hypothesis') }
= "dodge")

R Cheat Sheet
Statistics & Probability
Syntax for How to use Explained Syntax for How to use Explained

Chi-Squared Calculates the cumulative probability Simulate Simulates a random coin toss using
pchisq(3.84, df = 1) set.seed(1)

Distribution for a chi-squared distribution with Coin Toss R's uniform random numbers.
coin_toss <- function() { if
specific degrees of freedom.
(runif(1) <= 0.5) 'HEADS' else
Chi-Squared Calculate cumulative probability for a 'TAILS' }
pchisq(q = 10, df = 5)
Test chi-squared statistic of 10 with 5
degrees of freedom. Addition Formula to calculate probabilities of
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
Rule for unions of events, adjusting for
Multi-category
data <- table(income$sex,
Performs a chi-squared test on the Probability overlap in non-exclusive cases.
Chi-squared given contingency table.
income$high_income)
Test
Independ Probability of independent events
P(A ∩ B) = P(A) * P(B)
Defines a function to calculate the ent Events occurs as product of individual
Computing compute_mode <- function(vector)
mode of a given vector using dplyr probabilities.
Mode in R {counts_df <- tibble(vector) %>%
functions.
group_by(vector) %>%
Product Calculate the total outcomes for two
summarise(frequency=n()) %>% total_outcomes <- a * b
Rule in independent experiments using the
arrange(desc(frequency)); Experiments product rule.
counts_df$vector[1]}

Calculate Z- This calculates the Z-score for a value Uniform # Assuming all outcomes have equal Demonstrates a uniform distribution
z_score <- function(value, vector)
relative to a vector's distribution. Distribution chance
for a dice roll, where outcomes
score { (value - mean(vector)) /
outcomes <- c(1, 2, 3, 4, 5, 6)
equally likely.
sd(vector) }
probabilities <- rep(1/6, 6)

paste('Outcome:', outcomes,
Chi-Squared Calculates the cumulative probability 'Probability:', probabilities)
pchisq(3.84, df = 1)
Distribution for a chi-squared distribution with
specific degrees of freedom.

R Cheat Sheet
Statistics & Probability Programming
Syntax for How to use Explained Syntax for How to use Explained

Conditional Compute P(A|B) given the probability

P_A_given_B <- P_A_and_B / P_B If Statement if (x > 0) print("positive") Executes code if condition is true.
Probability of A and B, and probability of B.
Calculation
For Loop for (i in 1:3) print(i) Iterates over a sequence.
Conditional Compute P(A B) using set
P_A_given_B <- length(intersect(A,
∣

Probability cardinalities.
B)) / length(B)
While Loop while (x < 5) x <- x + 1
Repeats code while the x < 5
condition is true.
Conditional Conditional probabilities are
P_A_given_B <- 1 - P_Ac_given_B Syntax for Defines a reusable function structure
Probability interrelated P(A|B) and its
; function_name <- function(input) {

functions in R.
Definition complement P(Ac|B) can be # Code to manipulate the input

calculated mutually. return(output)

}
Defines independent events joint
Independence P_A_and_B <- P_A P_B
:
*

probability equals product of

Define Defines a function with two
individual probabilities. f <- function(a, b) a + b
Function arguments.

Apply apply(m, 1, sum)

Applies a function over rows/columns
Function of a matrix.

Exponentiation 3^5 Calculates 3 raised to the power of 5.

Converts a string into a Date object

Creating Dates ymd('20/04/21')
using 'year-month-day'.

Creating Dates Converts a string to a date object using

ymd("20/04/21")
from Strings the specified format.

Define Defines a window frame including one

ROWS BETWEEN 1 PRECEDING AND 1
Window row before and after the current row
FOLLOWING
Frame for computations.

R Cheat Sheet
Machine Learning File I/O
Syntax for How to use Explained Syntax for How to use Explained

Fitting a Fit a linear regression model with a Read CSV read.csv("file.csv") Reads a CSV file into a data frame.
lm_fit <- lm(response ~ predictor,
Linear response and a predictor variable.
data = df)
Model
Write CSV write.csv(df, "file.csv") Writes a data frame to a CSV file.
Visualize library(ggplot2)
Visualize the distribution of residuals
Residuals ggplot(data.frame(residuals = to check the linear model's fit.
lm_fit$residuals), aes(x = Read RDS readRDS("file.rds") Reads an RDS file into R.
residuals)) + geom_histogram()

Write RDS saveRDS(df, "file.rds") Saves an object as an RDS file.

Hyperparam
knn_grid <- expand.grid(k = 1:20)
Performs grid search to optimize k
eter Grid
knn_model <- train(tidy_price ~ for k-NN model and visualizes results.
Search List Files list.files() Lists files in the current directory.
accommodates + bathrooms +
bedrooms, data = training_data,
method = "knn", trControl =
train_control, preProcess =
c("center", "scale"), tuneGrid =
knn_grid)

plot(knn_model)

Naive Bayes P(Spam|w1,...,wn) ∝ P(Spam) * Classifies messages as spam using

Algorithm ΠiP(wi|Spam) conditional probabilities.

R Cheat Sheet

R Cheat Sheet PDF
100% (1)
R Cheat Sheet PDF
38 pages
Geographical Data Science and Spatial Data Analysis An Introduction in R (Spatial Analytics and GIS) 1st Edition
100% (1)
Geographical Data Science and Spatial Data Analysis An Introduction in R (Spatial Analytics and GIS) 1st Edition
384 pages
F-Number Reference Chart
No ratings yet
F-Number Reference Chart
2 pages
楊睿中統計學合併版
No ratings yet
楊睿中統計學合併版
557 pages
Lab1 411 Eman Yahya 7773225
No ratings yet
Lab1 411 Eman Yahya 7773225
16 pages
RSTUDIO
No ratings yet
RSTUDIO
44 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
UL2
No ratings yet
UL2
2 pages
Importing The Files
No ratings yet
Importing The Files
14 pages
Unit3 R
No ratings yet
Unit3 R
19 pages
R Program Cheat Sheet 1
No ratings yet
R Program Cheat Sheet 1
2 pages
R Cheat Sheet 3 PDF
No ratings yet
R Cheat Sheet 3 PDF
2 pages
Cheat R Sheet
No ratings yet
Cheat R Sheet
5 pages
R File Code
No ratings yet
R File Code
16 pages
FDP Indoglobal Group of Colleges: 27 April To 1 May R Programming Language Assignment Submission
No ratings yet
FDP Indoglobal Group of Colleges: 27 April To 1 May R Programming Language Assignment Submission
12 pages
DR - Pierpaolo-Delser - Introduction R
No ratings yet
DR - Pierpaolo-Delser - Introduction R
83 pages
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
No ratings yet
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
50 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
R Cheatsheet Base R
No ratings yet
R Cheatsheet Base R
2 pages
Lab01 Note R
No ratings yet
Lab01 Note R
7 pages
Basic R Commands For Data Analysis
No ratings yet
Basic R Commands For Data Analysis
7 pages
Lab0 R Tutorial EHS
No ratings yet
Lab0 R Tutorial EHS
9 pages
CRM Cheat Sheet
No ratings yet
CRM Cheat Sheet
7 pages
Unit3-Data Science
No ratings yet
Unit3-Data Science
37 pages
R Intro STAT5000
No ratings yet
R Intro STAT5000
17 pages
STATA - Subject Table of Contents
No ratings yet
STATA - Subject Table of Contents
15 pages
S24 Stats10 Lab1-1
No ratings yet
S24 Stats10 Lab1-1
8 pages
R
No ratings yet
R
13 pages
Unit3 R
No ratings yet
Unit3 R
30 pages
R Complete
No ratings yet
R Complete
24 pages
Module IV
No ratings yet
Module IV
43 pages
R Workshop Material 18-19, Oct-2023
No ratings yet
R Workshop Material 18-19, Oct-2023
67 pages
Problem Set 1: Introduction To R - Solutions With R Output: 1 Install Packages
No ratings yet
Problem Set 1: Introduction To R - Solutions With R Output: 1 Install Packages
24 pages
Data analytic R
No ratings yet
Data analytic R
28 pages
R Tutorial #1: Applied Econometrics (Econ3005)
No ratings yet
R Tutorial #1: Applied Econometrics (Econ3005)
21 pages
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
No ratings yet
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
15 pages
P6ADBMS
No ratings yet
P6ADBMS
34 pages
DS Lab
No ratings yet
DS Lab
31 pages
R Studio Lab Summary Sheet
No ratings yet
R Studio Lab Summary Sheet
3 pages
R Studio Commands
No ratings yet
R Studio Commands
19 pages
CH 3
No ratings yet
CH 3
33 pages
Getting Started With R
No ratings yet
Getting Started With R
155 pages
R Course Own English HS
No ratings yet
R Course Own English HS
70 pages
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
No ratings yet
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
58 pages
R - Lecture #2
No ratings yet
R - Lecture #2
21 pages
STAT 1000 - Worksheet 2
No ratings yet
STAT 1000 - Worksheet 2
14 pages
R Program
No ratings yet
R Program
22 pages
Practical 1 - Data Frame Manipulation - 072502
No ratings yet
Practical 1 - Data Frame Manipulation - 072502
16 pages
Chapter 03 Wrangling
No ratings yet
Chapter 03 Wrangling
40 pages
Advance R Prog.-1
No ratings yet
Advance R Prog.-1
24 pages
Data - Analysis - With - R - 24
No ratings yet
Data - Analysis - With - R - 24
47 pages
Pushpendra Lab File
No ratings yet
Pushpendra Lab File
51 pages
Da Lab File
No ratings yet
Da Lab File
33 pages
STAT 1000 - Worksheet 2
No ratings yet
STAT 1000 - Worksheet 2
14 pages
STAT 1000 - Worksheet 2
No ratings yet
STAT 1000 - Worksheet 2
14 pages
Lecture 10 R
No ratings yet
Lecture 10 R
117 pages
Textile Design Economics Tex0: Dr. Tamer F. Khalifa 2018-2019
No ratings yet
Textile Design Economics Tex0: Dr. Tamer F. Khalifa 2018-2019
13 pages
People of The Philippines vs. Camilo Camenforte, G.R. No. 220916, June 14, 2021
100% (2)
People of The Philippines vs. Camilo Camenforte, G.R. No. 220916, June 14, 2021
2 pages
Roth Hydraulics TI Blasenspeicher en
No ratings yet
Roth Hydraulics TI Blasenspeicher en
20 pages
Manuale D'uso Silos
No ratings yet
Manuale D'uso Silos
28 pages
LOI Letter 16
No ratings yet
LOI Letter 16
2 pages
Member-Resolution LLC
No ratings yet
Member-Resolution LLC
2 pages
Manwah
No ratings yet
Manwah
7 pages
Riks Method
100% (2)
Riks Method
7 pages
Microeconomics: Demand and Supply
No ratings yet
Microeconomics: Demand and Supply
56 pages
Manual de Usuario Baño GD 100
No ratings yet
Manual de Usuario Baño GD 100
132 pages
Not 0162024 5162024
No ratings yet
Not 0162024 5162024
4 pages
BP11 Container Safety Convention 2011 PDF
100% (1)
BP11 Container Safety Convention 2011 PDF
44 pages
FI Configuration-S - 4 Hana 1809
100% (1)
FI Configuration-S - 4 Hana 1809
43 pages
hm01 c2727 TACC 607 Managerial Accounting
No ratings yet
hm01 c2727 TACC 607 Managerial Accounting
9 pages
Visionmaster FT Radar Out Line For New Update
No ratings yet
Visionmaster FT Radar Out Line For New Update
4 pages
Saltzman Et Al-2024-Clinical Proteomics
No ratings yet
Saltzman Et Al-2024-Clinical Proteomics
17 pages
Santhosh Res
No ratings yet
Santhosh Res
3 pages
Sta. Elena Elementary School: Enhanced School Improvement Plan
No ratings yet
Sta. Elena Elementary School: Enhanced School Improvement Plan
19 pages
Communication and Society Course Outline
No ratings yet
Communication and Society Course Outline
7 pages
National Landscape Policy: Malaysia Beautiful Garden Nation
No ratings yet
National Landscape Policy: Malaysia Beautiful Garden Nation
50 pages
GreenForge Presentation
No ratings yet
GreenForge Presentation
22 pages
Wingo - PS5C0I
No ratings yet
Wingo - PS5C0I
1 page
ABB Squirrel Cage Motor AMI 710-1000 Manual
No ratings yet
ABB Squirrel Cage Motor AMI 710-1000 Manual
84 pages
Preliminary
No ratings yet
Preliminary
10 pages
Ryanair Confidential Report Form
No ratings yet
Ryanair Confidential Report Form
2 pages
Lesson 1 - The Basics: Maple's Constants
No ratings yet
Lesson 1 - The Basics: Maple's Constants
2 pages
Cardinal Utility Approach
100% (1)
Cardinal Utility Approach
25 pages

R Programming Cheat Sheet

Uploaded by

R Programming Cheat Sheet

Uploaded by

Table of Contents

summarize, arrange Residuals

Assignment <- 5 Create Matrix

Loading stringr Loads the stringr library to work

Basic String str_sub("Dataquest is awesome", Extracts “Dataquest”

Filter Rows filter(df, a > 2)

than 2. Basic Plot without specifying any aesthetics.

select(df, a, b) Selects specific columns by name.

Creating Bar Create a bar chart using gg plot2 ,

Sorts rows by column a in

Data read_csv function from readr . variable_3))

Scatter plots Generate scatterplots to visualize

Summing Values Sums specified rows for each column

Importing CSV dataframe <- V files into R using readr's

files re d_ s () for efficient data

Scatterplot Plots a scatterplot with y-axis labels in Mean

Weighted Mean mean <- weighted.mean(x = Computes the weighted mean of a

Correlation cor(x, y) Calculates correlation between x

Conditional Compute P(A|B) given the probability

calculated mutually. return(output)

probability equals product of

Apply apply(m, 1, sum)

Exponentiation 3^5 Calculates 3 raised to the power of 5.

Converts a string into a Date object

Creating Dates Converts a string to a date object using

Define Defines a window frame including one

Write RDS saveRDS(df, "file.rds") Saves an object as an RDS file.

Naive Bayes P(Spam|w1,...,wn) ∝ P(Spam) * Classifies messages as spam using

You might also like