0% found this document useful (0 votes)

20 views7 pages

Introduction To R For Business Analytics

Uploaded by

Todd Wang

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views7 pages

Introduction To R For Business Analytics

Uploaded by

Todd Wang

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Introduction to R for Business Analytics

Professor Stephan Onggo

12 October 2021

Vector

R is a vector-based language. A vector is a collection of data items. In the example below, we create
a vector x and assign values from 1 to 10. We can create a vector using c() function or an operator
such as sequence operator (:)
x <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)

or
x <- 1:10

To see the content of x:

To see the content of a specific element of x:

x[3]

or
x[3:5]

We can apply standard operations to the vector such as:

x <- x * 2

We can apply some functions to the vector

z <- sum(x)

In the following example, we create two vectors of 50 normally distributed random numbers and
plot them.
x <- rnorm(50)
x <- rnorm(50)
plot(x, y)

Object

You can list the available R objects using ls()function.

ls()

You can remove an object using rm()function.

ls(object_name)

1
NOTE: Object names must start with a letter or a dot. The names should contain letters, numbers,
underscore (_) or dots (.) only. The names cannot be the same as R keywords such as if, else and
for.

Packages

To install a package, use install.packages() function.

install.packages("package_name")

To load a package, use library() function.

library("package_name")

Arithmetic

In R, we can apply the following arithmetic operators: +, -, *, /, ^ and %% (modulo). For example
x <- 1:10
x <- x + 1

Functions

R comes with a lot of mathematical functions such as: abs(), exp(), sqrt(),min(x), and
sum(). For example
x <- 1:10
z <- sum(x)

Matrix

We can define a matrix using matrix() function. Compare the two commands below.
matrix_A <- matrix(1:12, ncol = 4)
matrix_B <- matrix(1:12, ncol = 4, byrow = TRUE)

We can find the dimension of a matrix using dim() function, the number of rows using nrow()
function, and the number of columns using ncol() function.
dim(matrix_A)
nrow(matrix_A)
ncol(matrix_A)

To see the content of a specific element of a matrix:

matrix_A[1, 2]
matrix_A[1:3, 2]

2
matrix_A[1:3, ]

We can apply standard matrix operations such as:

matrix_C <- matrix_A + matrix_B
t(matrix_C)
matrix_C <- matrix_A * matrix_B
matrix_C <- matrix_A %*% t(matrix_B)

Data frame

We often store our data in a data frame before we do some analysis. The following example shows a
data frame (in practice, the data is usually read from a file).
x <- rnorm(10)
y <- rnorm(10)
df = data.frame(x=x, y=y)
df

We can access the content using the following commands

df$x
df$y
df[1,1]

We can filter the data using an expression. For example:

df[df$x < 0,]

We can sort the data using order() function. For example:

df[order(df$x),]
df[order(df$x, decreasing = TRUE),]

We can use rbind() function to insert a row into a data frame

z <- rbind(df, c(5, 5))

We can delete a row using the following command

z <- z[-c(11),]

We can apply some functions to a column.

z <- sum(df$x)

Reading csv files

3
It is easier if we change the working directory to where the csv file is located. If you do not have any
csv file to play with, you can use revenue.csv from the blackboard. To change the working directory,
select session -> load workspace -> Choose Directory.

We can read a csv file using one of the following commands. The read.csv() function works if
the file uses comma as the separator symbol. The read.csv2() function works if the file uses
semicolon as the separator symbol. The read.table() function is the most flexible as we can
specify the separation symbol. The header argument is set to TRUE if the first line of the file being
read contains the header with the variable names. Please note that the data will be stored as data
frame df in the example below (hence, you can apply what you have learned from the earlier
section on data frame).
df <- read.csv("mydata.csv", header = TRUE)
df <- read.csv2("mydata.csv", header= TRUE)
df <- read.table("mydata.csv", header = TRUE, sep = ",")

To check that you are in the right working directory, you can use getwd() function. You can also
check if the data file is in the directory by using list.files() function.

Packages

To install a package, use install.packages("package_name") function.

install.packages("moments")

To load a package, use library("package_name") function.

library("moments")

Packages

To define a function, use this template

function(parameters){
do_your_calculation here
return(a_value)

4
}

Descriptive Statistics

Measures for centrality:

mean(df$NorthAm)
median(df$NorthAm)
midrange <-function(v) {
return((min(v) + max(v))/2)
}
getmode <- function(v) {
unique_val <- unique(v)
return(unique_val [which.max(tabulate(match(v, unique_val)))])
}

Measures for dispersion:

range(df$NorthAm)
quantile(df$NorthAm)
var(df$NorthAm)
sd(df$NorthAm)
scale(df$NorthAm)

Measures for shape:

library("moments")
skewness(df$NorthAm)
kurtosis(df$NorthAm)

Plot the histogram

hist(df$NorthAm)
hist(df$NorthAm, breaks = c(500000, 1000000, 1500000, 2000000))

Scatter plot
plot(df$NorthAm, df$SouthAm)

Measures of association:
cov(x=df$NorthAm, y=df$SouthAm)
cor(x=df$NorthAm, y=df$SouthAm)

5
Nominal data

R uses a special data structure called factors for nominal data. To create a factor, we use factor()
function that requires a vector that we want to turn into a factor. We can also include an optional
parameter called levels (in case we want the levels to be different than the one in the vector).
Alternatively, we can use as.factor()function. For example:
directions <- c("North", "East", "South", "West")
dir_cat <- factor(directions, labels = c("N", "E", "S", "W"))
dir_cat2 <- as.factor(directions)

Ordinal data

We can also use factor() function for ordinal data by setting ordered=TRUE. For example:
scale <- c("Low", "Medium", "High")
scale_cat <- factor(scale, ordered = TRUE)

Is there anything wrong with the content of scale_cat? Now try the following:
scale_ord <- factor(scale, ordered = TRUE, levels=c("Low",
"Medium", "High"))

Note that setting the correct data type will allow R to conduct correct analysis. To demonstrate this,
please load college.csv from the blackboard and check the state field. It is defined as characters;
hence, when you obtain the summary statistics using summary() function, it does not show
anything useful (unless you are interested in the total number of characters). Compare the output of
the summary() function after you convert the state field into a nominal or categorical data.
college <- read.csv("college.csv")
college$state
summary(college$state)
my_states <- as.factor(college$state)
summary(my_state)

Dates

We can use as.Date() function to create a date. For example:

a_date <- as.Date("2021-10-14")

R provides several functions to work on date variables. For example:

weekdays(a_date)
what_month <- months(a_date)

6
which_q <- quarters(a_date)

You can also apply some arithmetic operators. For example:

next_week <- a_date + 7
last_week <- a_date - 7

The end

Cloud Storage and Local Storage
No ratings yet
Cloud Storage and Local Storage
15 pages
R - A Practical Course
No ratings yet
R - A Practical Course
42 pages
Unit 2
No ratings yet
Unit 2
32 pages
Getting Started With Target For Arcgis
No ratings yet
Getting Started With Target For Arcgis
10 pages
MP Assignment 1
No ratings yet
MP Assignment 1
9 pages
IT Project Quality Management Plan
No ratings yet
IT Project Quality Management Plan
4 pages
Practical 1 - Data Frame Manipulation - 072502
No ratings yet
Practical 1 - Data Frame Manipulation - 072502
16 pages
Apunts BLOC 1 Estadística
No ratings yet
Apunts BLOC 1 Estadística
15 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
Dar Lecture 7
No ratings yet
Dar Lecture 7
24 pages
Da Session 4
No ratings yet
Da Session 4
75 pages
R Study Material I
No ratings yet
R Study Material I
8 pages
Lecture 1
No ratings yet
Lecture 1
42 pages
R Programming: © 2016 SMART Training Resources Pvt. LTD
No ratings yet
R Programming: © 2016 SMART Training Resources Pvt. LTD
28 pages
Mod1 R Programming
No ratings yet
Mod1 R Programming
49 pages
Capital Gains
No ratings yet
Capital Gains
8 pages
Data in R
No ratings yet
Data in R
7 pages
R Prog
No ratings yet
R Prog
27 pages
N2 Data in R
No ratings yet
N2 Data in R
7 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
R Lab
No ratings yet
R Lab
114 pages
Data Science Using R - Lab Manual-Complete Ver 2.0 - Nov 2024
No ratings yet
Data Science Using R - Lab Manual-Complete Ver 2.0 - Nov 2024
36 pages
#02 R Basics
No ratings yet
#02 R Basics
30 pages
01 IntroSlides
No ratings yet
01 IntroSlides
43 pages
Module 5-6
No ratings yet
Module 5-6
12 pages
Programming With R: Lecture #4
No ratings yet
Programming With R: Lecture #4
34 pages
Module 1: Unit - 1.1: Introduction To Analytics or R Programming
No ratings yet
Module 1: Unit - 1.1: Introduction To Analytics or R Programming
26 pages
ProgrammingForDS14 Rbasics
No ratings yet
ProgrammingForDS14 Rbasics
32 pages
R Data Types 8
No ratings yet
R Data Types 8
7 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
Introduction To R
No ratings yet
Introduction To R
20 pages
R Programming For NGS Data Analysis
No ratings yet
R Programming For NGS Data Analysis
5 pages
R Pres
No ratings yet
R Pres
53 pages
Introduction To R
No ratings yet
Introduction To R
52 pages
Getting Started With R
No ratings yet
Getting Started With R
155 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
DA Lab Week-2
No ratings yet
DA Lab Week-2
22 pages
Lecture 10 R
No ratings yet
Lecture 10 R
117 pages
Lesson 7 - The Data Frame
No ratings yet
Lesson 7 - The Data Frame
7 pages
Beginner Guide To R and R Studio V1
No ratings yet
Beginner Guide To R and R Studio V1
27 pages
MultivariateRGGobi PDF
No ratings yet
MultivariateRGGobi PDF
60 pages
Lab 1
No ratings yet
Lab 1
26 pages
Empirical Software Engineering (Swe504) : Practical File
No ratings yet
Empirical Software Engineering (Swe504) : Practical File
27 pages
R Language Lab Manual Lab 1
100% (1)
R Language Lab Manual Lab 1
33 pages
S24 Stats10 Lab1-1
No ratings yet
S24 Stats10 Lab1-1
8 pages
Broomspatial
No ratings yet
Broomspatial
31 pages
Bdo Co1 Session 4
No ratings yet
Bdo Co1 Session 4
43 pages
Unit 4
No ratings yet
Unit 4
27 pages
Tutorial 1
No ratings yet
Tutorial 1
29 pages
Data - Analysis - With - R - 24
No ratings yet
Data - Analysis - With - R - 24
47 pages
R Statistical Package
No ratings yet
R Statistical Package
63 pages
6 Working With Data Frames in R
No ratings yet
6 Working With Data Frames in R
8 pages
R Software - Notes
No ratings yet
R Software - Notes
18 pages
R - Lecture 4
No ratings yet
R - Lecture 4
37 pages
MTech R Notes
No ratings yet
MTech R Notes
14 pages
R Tutorial
No ratings yet
R Tutorial
15 pages
R Cheatsheet Base R
No ratings yet
R Cheatsheet Base R
2 pages
Data Analytics Using R
No ratings yet
Data Analytics Using R
37 pages
Introduction To R
No ratings yet
Introduction To R
21 pages
DSA1101 2019 Week1 Part2
No ratings yet
DSA1101 2019 Week1 Part2
38 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
GaussianSurfaceEField Stu
No ratings yet
GaussianSurfaceEField Stu
3 pages
Capacitor e Storage
No ratings yet
Capacitor e Storage
1 page
ElastPE Calc
No ratings yet
ElastPE Calc
1 page
MANG6513 2023 Lecture 1
No ratings yet
MANG6513 2023 Lecture 1
31 pages
PHP My Admin Intro
No ratings yet
PHP My Admin Intro
11 pages
User'S Manual: Revision 1.0a
No ratings yet
User'S Manual: Revision 1.0a
126 pages
CommonCore Gateway
No ratings yet
CommonCore Gateway
26 pages
Mesh Warping
No ratings yet
Mesh Warping
6 pages
Single Line
No ratings yet
Single Line
54 pages
NetVu Observer 1.18.11
No ratings yet
NetVu Observer 1.18.11
15 pages
L13 - Business Process Management Perspective
100% (2)
L13 - Business Process Management Perspective
76 pages
Step Broucher
No ratings yet
Step Broucher
16 pages
Chapter 1
No ratings yet
Chapter 1
31 pages
Birla Institute of Technology Welfare Society: Mess Fee Deposit Procedure
No ratings yet
Birla Institute of Technology Welfare Society: Mess Fee Deposit Procedure
9 pages
Fuzzing With AFL Fuzz A Practical Example (AFL Vs Binutils)
No ratings yet
Fuzzing With AFL Fuzz A Practical Example (AFL Vs Binutils)
5 pages
Labview - Programming - Reference - Manual - 7 30 2024 6001 9000 0001 1500
No ratings yet
Labview - Programming - Reference - Manual - 7 30 2024 6001 9000 0001 1500
1,500 pages
Tutorial How To Write Bangla Using LaTeX
No ratings yet
Tutorial How To Write Bangla Using LaTeX
3 pages
Ravinder Reddy Velma: Objective
No ratings yet
Ravinder Reddy Velma: Objective
3 pages
Section 11 I&C
No ratings yet
Section 11 I&C
31 pages
SDS2 7.0-Assorted Tools
No ratings yet
SDS2 7.0-Assorted Tools
96 pages
12 Substitutes To Showbox App
No ratings yet
12 Substitutes To Showbox App
3 pages
Group Assignment 1 PDF
No ratings yet
Group Assignment 1 PDF
2 pages
Inter RAT Mobility Management
No ratings yet
Inter RAT Mobility Management
6 pages
Authorization Management: at The Customer Site
No ratings yet
Authorization Management: at The Customer Site
20 pages
MSTD - BillingSoftware - User Manual Ver 1.01
No ratings yet
MSTD - BillingSoftware - User Manual Ver 1.01
52 pages
Dotw
No ratings yet
Dotw
2 pages
Session Cookie Authentication in Golang (With Complete Examples)
No ratings yet
Session Cookie Authentication in Golang (With Complete Examples)
12 pages
MODULE 1 - UNDERSTANDING DIGITAL MEDIA - Digital Media Terminologies (DM 1.1.1) (1) - Mod
No ratings yet
MODULE 1 - UNDERSTANDING DIGITAL MEDIA - Digital Media Terminologies (DM 1.1.1) (1) - Mod
20 pages
Introduction To Von Neumann Architecture
No ratings yet
Introduction To Von Neumann Architecture
8 pages
Digi Go User Manual
No ratings yet
Digi Go User Manual
23 pages

Introduction To R For Business Analytics

Uploaded by

Introduction To R For Business Analytics

Uploaded by

Introduction to R for Business Analytics

Professor Stephan Onggo

To see the content of x:

To see the content of a specific element of x:

We can apply standard operations to the vector such as:

We can apply some functions to the vector

You can list the available R objects using ls()function.

You can remove an object using rm()function.

To install a package, use install.packages() function.

To load a package, use library() function.

To see the content of a specific element of a matrix:

We can apply standard matrix operations such as:

We can access the content using the following commands

We can filter the data using an expression. For example:

We can sort the data using order() function. For example:

We can use rbind() function to insert a row into a data frame

We can delete a row using the following command

We can apply some functions to a column.

Reading csv files

To install a package, use install.packages("package_name") function.

To load a package, use library("package_name") function.

To define a function, use this template

Measures for centrality:

Measures for dispersion:

Measures for shape:

Plot the histogram

We can use as.Date() function to create a date. For example:

R provides several functions to work on date variables. For example:

You can also apply some arithmetic operators. For example:

You might also like