MDPN460 Lecture05

Uploaded by

mohamedggharib02

MDPN460 – Industrial Engineering Lab
Lecture 5

ANOVA Using R

Today’s Lecture
● Basic R programming (Continued)
– Logical vectors and relational operators.
– Data frames and lists.
– Data input and output.
● ANOVA using R
● Applying ANOVA using R for Example 1 of Lecture 4

Logical Vectors
● We have used the c() function to build numeric vectors as well as character vectors.
● R also supports logical vectors. These can contain two different values:
– TRUE and
– FALSE,
– as well as NA for missing values.

> l <- c(TRUE, FALSE, FALSE, TRUE, TRUE, NA, TRUE)
> l
[1] TRUE FALSE FALSE TRUE TRUE NA TRUE
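
Because NA marks a missing value, it propagates through most operations. A minimal sketch, reusing the vector l from this slide, of how is.na() and the na.rm argument deal with it:

```r
# Logical vector from the slide, including one missing value
l <- c(TRUE, FALSE, FALSE, TRUE, TRUE, NA, TRUE)

# is.na() flags the missing entries
missing <- is.na(l)

# sum() returns NA unless missing values are dropped with na.rm = TRUE
total_with_na <- sum(l)
total_true    <- sum(l, na.rm = TRUE)

print(missing)
print(total_with_na)
print(total_true)
```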

Boolean Algebra
● The idea of Boolean algebra is to formalize a mathematical approach to logic.
● Boolean algebra tells us how to evaluate the truth of compound statements.
– A ← “sky is clear”
– B ← “it is raining”
– “A and B” is the statement that it is both clear and raining.

Boolean Algebra – Truth Table
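
The table on this slide did not survive extraction. As a sketch, the standard truth table for NOT, AND, OR and XOR can be generated in R with expand.grid(); the column names below are illustrative choices, not taken from the slide.

```r
# All four combinations of two logical inputs
tt <- expand.grid(A = c(TRUE, FALSE), B = c(TRUE, FALSE))

# Add one column per logical operation
tt$not_A   <- !tt$A
tt$A_and_B <- tt$A & tt$B
tt$A_or_B  <- tt$A | tt$B
tt$A_xor_B <- xor(tt$A, tt$B)

print(tt)
```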

Logical Operations in R
● You can use a Boolean vector to access selected elements in any vector.
> a <- c(TRUE, FALSE, FALSE, TRUE, FALSE, TRUE, TRUE, FALSE)
> v <- 1:8
> v[a]
[1] 1 4 6 7

● We can do some arithmetic operations on Boolean vectors:
> sum(a)
[1] 4
> mean(a)
[1] 0.5
> v + a
[1] 2 2 3 5 5 7 8 8

Boolean Algebra in R
> a
[1] TRUE FALSE FALSE TRUE FALSE TRUE TRUE FALSE
> b <- sample(rep(c(TRUE, FALSE), 4), size = 8)
> b
[1] TRUE TRUE FALSE FALSE FALSE TRUE FALSE TRUE
> !a
[1] FALSE TRUE TRUE FALSE TRUE FALSE FALSE TRUE
> a | b
[1] TRUE TRUE FALSE TRUE FALSE TRUE TRUE TRUE
> a & b
[1] TRUE FALSE FALSE FALSE FALSE TRUE FALSE FALSE
> !(a | b)
[1] FALSE FALSE TRUE FALSE TRUE FALSE FALSE FALSE
> !a | !b
[1] FALSE TRUE TRUE TRUE TRUE FALSE TRUE TRUE
> !(a & b)
[1] FALSE TRUE TRUE TRUE TRUE FALSE TRUE TRUE
> xor(a, b)
[1] FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE

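
These results illustrate De Morgan's laws: !(a & b) matches !a | !b, while !(a | b) matches !a & !b rather than !a | !b. A short check, using the a and b values printed on this slide (b is typed in directly here because sample() would give a different draw):

```r
a <- c(TRUE, FALSE, FALSE, TRUE, FALSE, TRUE, TRUE, FALSE)
b <- c(TRUE, TRUE, FALSE, FALSE, FALSE, TRUE, FALSE, TRUE)

# De Morgan's laws hold element by element
law1 <- identical(!(a & b), !a | !b)
law2 <- identical(!(a | b), !a & !b)

# xor() is TRUE where exactly one of a, b is TRUE
xor_check <- identical(xor(a, b), (a | b) & !(a & b))

print(c(law1, law2, xor_check))
```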
Relational Operators
● It is often necessary to test relations when programming. R allows for equality and inequality relations to be tested using the relational operators:
< , > , == , >= , <= , !=
● Some simple examples follow.
> x <- sample(1:10, size = 5, replace = TRUE)
> x
[1] 4 10 9 8 4
> x == 4
[1] TRUE FALSE FALSE FALSE TRUE
> x != 8
[1] TRUE TRUE TRUE FALSE TRUE
> x / 2 <= 4
[1] TRUE FALSE FALSE TRUE TRUE
> x[x / 3 >= 3]
[1] 10 9

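
Relational tests return logical vectors, so they can be combined with & and | and used for indexing. The sketch below fixes x to the values drawn on this slide; the floating-point comparison at the end is an added caution, not part of the lecture.

```r
x <- c(4, 10, 9, 8, 4)

# Elements strictly between 4 and 10
middle <- x[x > 4 & x < 10]

# Positions (not values) where the test holds
pos <- which(x == 4)

# == on non-integers can fail because of rounding error;
# all.equal() compares within a small tolerance
exact  <- (0.1 + 0.2) == 0.3
approx <- isTRUE(all.equal(0.1 + 0.2, 0.3))

print(middle)
print(pos)
print(c(exact, approx))
```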
Try it Yourself!

Data Frames, Tibbles, and Lists
● Data sets frequently consist of more than one column of data, where each column represents measurements of a single variable. Each row usually represents a single observation.
● Most data sets are stored in R as data frames or tibbles. Tibbles are very similar to data frames.
● Both are like matrices, but with the columns having their own names.
● Several data frames come with R.
– Examples are women and mtcars.

Summary Content of Data frames
● In R, the head() function shows the first few rows of a data frame.
● Summary statistics and plots of data frames have their own functions, such as summary() and plot().
> head(women)
  height weight
1     58    115
2     59    117
3     60    120
4     61    123
5     62    126
6     63    129
> summary(women)
     height         weight
 Min.   :58.0   Min.   :115.0
 1st Qu.:61.5   1st Qu.:124.5
 Median :65.0   Median :135.0
 Mean   :65.0   Mean   :136.7
 3rd Qu.:68.5   3rd Qu.:148.0
 Max.   :72.0   Max.   :164.0

Summary Content of Data frames
● You can also display the structure of a data frame using the str() function:

> str(mtcars)
'data.frame': 32 obs. of 11 variables:
 $ mpg : num 21 21 22.8 21.4 18.7 18.1 14.3 24.4 22.8 19.2 ...
 $ cyl : num 6 6 4 6 8 6 8 4 4 6 ...
 $ disp: num 160 160 108 258 360 ...
 $ hp  : num 110 110 93 110 175 105 245 62 95 123 ...
 $ drat: num 3.9 3.9 3.85 3.08 3.15 2.76 3.21 3.69 3.92 3.92 ...
 $ wt  : num 2.62 2.88 2.32 3.21 3.44 ...
 $ qsec: num 16.5 17 18.6 19.4 17 ...
 $ vs  : num 0 0 1 1 0 1 0 1 1 1 ...
 $ am  : num 1 1 1 0 0 0 0 0 0 0 ...
 $ gear: num 4 4 4 3 3 3 3 4 4 4 ...
 $ carb: num 4 4 1 1 2 1 4 2 2 4 ...

Dimensions of Data Frames

● The number of rows and number of columns can be determined for any data frame using the following functions:

> nrow(mtcars)
[1] 32
> ncol(mtcars)
[1] 11
> dim(mtcars)
[1] 32 11

Simple Plots for Data in Data frames
● Simple plots of data in data frames are easy to produce.

> plot(wt ~ hp, data = mtcars)

Extracting data frame elements and subsets
● We can extract elements from data frames using similar syntax to what was used with matrices. Consider the following examples:
> mtcars[3, 5]
[1] 3.85
> mtcars[2, ]
              mpg cyl disp  hp drat    wt  qsec vs am gear carb
Mazda RX4 Wag  21   6  160 110  3.9 2.875 17.02  0  1    4    4
> mtcars[, 4]
 [1] 110 110  93 110 175 105 245  62  95 123 123 180 180 180 205 215 230  66  52  65  97 150
[23] 150 245 175  66  91 113 264 175 335 109

Extracting data frame columns

● Data frame columns can also be addressed by name using the $ operator. For example, the weight column can be extracted as follows:
> mtcars$wt
 [1] 2.620 2.875 2.320 3.215 3.440 3.460 3.570 3.190 3.150 3.440 3.440 4.070 3.730 3.780 5.250
[16] 5.424 5.345 2.200 1.615 1.835 2.465 3.520 3.435 3.840 3.845 1.935 2.140 1.513 3.170 2.770
[31] 3.570 2.780
> women$weight
[1] 115 117 120 123 126 129 132 135 139 142 146 150 154 159 164
> women$height[women$weight > 130]
[1] 64 65 66 67 68 69 70 71 72
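
Columns and rows can also be selected by name inside the brackets, which tends to be easier to read than numeric positions. A sketch using the built-in mtcars and women data:

```r
# Same element as mtcars[3, 5], addressed by row and column name
drat_datsun <- mtcars["Datsun 710", "drat"]

# [[ ]] extracts one column as a vector, like $
wt_vec <- mtcars[["wt"]]

# Rows that satisfy a condition, keeping only selected columns
heavy <- women[women$weight > 150, c("height", "weight")]

print(drat_datsun)
print(head(wt_vec, 3))
print(heavy)
```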

Using “with”

● The with() function allows us to access columns of a data frame directly without using the $ operator. For example, we can divide the weights by the heights in the women data frame using

> with(women, weight/height)

 [1] 1.982759 1.983051 2.000000 2.016393 2.032258 2.047619 2.062500 2.076923 2.106061 2.119403
[11] 2.147059 2.173913 2.200000 2.239437 2.277778

Taking Random Samples
● The sample() function can be used to take samples (with or without replacement) from larger finite populations whose data are stored in data frames.
> s <- sample(1:nrow(mtcars), size = 8, replace = FALSE)
> mtcars[s, ]
                     mpg cyl  disp  hp drat    wt  qsec vs am gear carb
Duster 360          14.3   8 360.0 245 3.21 3.570 15.84  0  0    3    4
Lotus Europa        30.4   4  95.1 113 3.77 1.513 16.90  1  1    5    2
Honda Civic         30.4   4  75.7  52 4.93 1.615 18.52  1  1    4    2
Datsun 710          22.8   4 108.0  93 3.85 2.320 18.61  1  1    4    1
Lincoln Continental 10.4   8 460.0 215 3.00 5.424 17.82  0  0    3    4
Hornet Sportabout   18.7   8 360.0 175 3.15 3.440 17.02  0  0    3    2
Mazda RX4           21.0   6 160.0 110 3.90 2.620 16.46  0  1    4    4
Merc 280C           17.8   6 167.6 123 3.92 3.440 18.90  1  0    4    4
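
Each call to sample() gives a different draw, so the rows shown on this slide will not be reproduced exactly. Calling set.seed() first makes a sample repeatable; the seed value 460 below is an arbitrary choice for illustration.

```r
# Fix the random number generator so the draw can be repeated
set.seed(460)
s1 <- sample(1:nrow(mtcars), size = 8, replace = FALSE)

# Resetting the same seed reproduces the same row indices
set.seed(460)
s2 <- sample(1:nrow(mtcars), size = 8, replace = FALSE)

sampled <- mtcars[s1, ]
print(identical(s1, s2))
print(dim(sampled))
```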

Constructing Data Frames
● Use the data.frame() function to construct data frames from vectors that already exist in your workspace:
> x <- 2 ^ seq(1, 15)
> y <- seq(1, 15) ^ 2
> z <- x > y
> f <- data.frame(x, y, z)
> head(f)
   x  y     z
1  2  1  TRUE
2  4  4 FALSE
3  8  9 FALSE
4 16 16 FALSE
5 32 25  TRUE
6 64 36  TRUE

Non-numeric columns in data frames
● Columns of data frames can be of different types. For example, the built-in data frame chickwts has a numeric column and a factor. Again, the summary() function provides a quick peek at this data set.

> summary(chickwts)
     weight             feed
 Min.   :108.0   casein   :12
 1st Qu.:204.5   horsebean:10
 Median :258.0   linseed  :12
 Mean   :261.3   meatmeal :11
 3rd Qu.:323.5   soybean  :14
 Max.   :423.0   sunflower:12

> nrow(chickwts)
[1] 71
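
A factor column stores a categorical variable with a fixed set of levels, which is the form ANOVA expects for the treatment variable. A sketch of inspecting the feed factor and computing group means with tapply():

```r
# Levels of the factor and number of observations per level
feed_levels <- levels(chickwts$feed)
feed_counts <- table(chickwts$feed)

# Mean weight within each feed group
group_means <- tapply(chickwts$weight, chickwts$feed, mean)

print(feed_levels)
print(feed_counts)
print(round(group_means, 1))
```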
Lists in R
● Data frames are actually a special kind of list, or structure.
● Lists in R can contain any other objects. You won’t often construct these yourself, but many functions return complicated results as lists.
● The list() function is one way of organizing multiple pieces of output from functions. For example,
> x <- c(2, 4, 6)
> y <- c(8, 9)
> z <- list(x = x, y = y)
> z
$x
[1] 2 4 6

$y
[1] 8 9

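
Elements of a list can be pulled out by name with $ or [[ ]], while single brackets return a shorter list. A sketch using the list z from this slide:

```r
x <- c(2, 4, 6)
y <- c(8, 9)
z <- list(x = x, y = y)

# $ and [[ ]] return the element itself (here a numeric vector)
by_dollar  <- z$x
by_bracket <- z[["y"]]

# Single brackets return a list containing the selected element
sub_list <- z["x"]

print(by_dollar)
print(by_bracket)
print(class(sub_list))

# A data frame is a list of equal-length columns
print(is.list(women))
```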
Working with lists
● There are several functions which make working with lists easy. Two of them are lapply() and vapply().
● The lapply() function “applies” another function to every element of a list and returns the results in a new list; for example,
> z
$x
[1] 2 4 6

$y
[1] 8 9

> lapply(z, mean)
$x
[1] 4

$y
[1] 8.5
Working with lists
● In a case like this, it might be more convenient to have the results in a vector; the vapply() function does that. It takes a third argument to tell R what kind of result to expect from the function. In this case each result of mean() should be a single number, so we could use the following call, where the 1 just serves as an example of the type of output expected.

> vapply(z, mean, 1)
  x   y
4.0 8.5
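
The third argument of vapply() is named FUN.VALUE, and a common way to write it is numeric(1), meaning "one number per element". A sketch comparing it with sapply(), which guesses the output shape instead of checking it:

```r
z <- list(x = c(2, 4, 6), y = c(8, 9))

# Same call as on the slide, with the template spelled out
m1 <- vapply(z, mean, FUN.VALUE = numeric(1))

# sapply() simplifies the result without a declared template
m2 <- sapply(z, mean)

# vapply() stops with an error if a result does not match the template;
# range() returns two numbers, so numeric(1) is rejected
bad <- tryCatch(vapply(z, range, FUN.VALUE = numeric(1)),
                error = function(e) "error")

print(m1)
print(identical(m1, m2))
print(bad)
```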

Data Input and Output
● When in an R session, it is possible to read and write data to files outside of R, for example on your computer’s hard drive.
● You can get and set the working directory in which files are stored.

> getwd()
[1] "/home/tamer"

> setwd("/home/tamer/work")
Error in setwd("/home/tamer/work") : cannot change working
directory
> setwd("/home/tamer/Work")
dump() and source()
● To write your data to a file in the current working directory, use the dump() function. To read data stored that way back into R, use the source() function.

> w <- women

> dim(w)
[1] 15 2
> age <- sample(40:70, size = nrow(w), replace = TRUE)
> wa <- data.frame(w, age)
> head(wa)
  height weight age
1     58    115  45
2     59    117  45
3     60    120  63
4     61    123  42
5     62    126  69
6     63    129  44
> dump("wa", "wa.R")

dump() and source()
● To retrieve the data frame in a future session, type
> source("wa.R")

● This reads and executes the commands in wa.R, resulting in the creation of the wa object in your global environment.
● If there was an object of the same name there before, it will be replaced.
● To save all of the objects that you have created during a session, type:
> dump(list = objects(), "all.R")

● This produces a file called all.R on your computer’s hard drive. Using source("all.R") at a later time will allow you to retrieve all of these objects.
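
A round trip through dump() and source() can be sketched as follows. The file is written to a temporary location (via tempfile(), an assumption made here so the example does not touch the working directory), and the ages are fixed rather than sampled.

```r
# Small data frame standing in for wa from the slides
wa <- data.frame(women[1:3, ], age = c(45, 45, 63))

# Write an R-code representation of wa to a file
f <- tempfile(fileext = ".R")
dump("wa", file = f)

# Remove the object, then recreate it by executing the file
original <- wa
rm(wa)
source(f)

print(all.equal(wa, original))
```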
Redirecting R Output
● By default, R directs the output of most of its functions to the screen. Output can be directed to a file with the sink() function.
● Consider the greenhouse data in solar.radiation. The command mean(solar.radiation) prints the mean of the data to the screen. To print this output to a file called solarmean.txt instead, run
sink("solarmean.txt") # Create a file solarmean.txt for output
mean(solar.radiation) # Write mean value to solarmean.txt
● All subsequent output will be printed to the file solarmean.txt until output is restored to the screen with the command
sink()
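
Since output stays redirected until sink() is called again, it is easy to lose output by forgetting the second call. A sketch of the full cycle; the temporary file and the use of women$height in place of solar.radiation (which is not a built-in data set) are assumptions made for illustration.

```r
out_file <- tempfile(fileext = ".txt")

sink(out_file)             # start redirecting output to the file
print(mean(women$height))  # written to the file, not the screen
sink()                     # restore output to the screen

# Read the captured text back in
captured <- readLines(out_file)
print(captured)
```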
The read.table() function
● Consider the following text data table:
  x  y  z
 61 13  4
175 21 18
111 24 14
124 23 18

● If such a data set is stored in a file called pretend.dat in the folder /home/tamer/Work, then it can be read into an R data frame. This can be accomplished by typing
> pretend.df <- read.table("/home/tamer/Work/pretend.dat", header = TRUE)
> pretend.df
    x  y  z
1  61 13  4
2 175 21 18
3 111 24 14
4 124 23 18

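
The same table can be read without creating a file by passing the text directly through the text argument of read.table(), which is handy for trying out the function. A sketch:

```r
txt <- "x y z
61 13 4
175 21 18
111 24 14
124 23 18"

# header = TRUE takes the column names from the first line
pretend.df <- read.table(text = txt, header = TRUE)

print(pretend.df)
str(pretend.df)
```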
Reading csv files
● Comma separated value (csv) files are text files that can be exported from spreadsheet applications.
● You can load csv files into R using the function read.table() with the argument sep = ",".
● Now, load the “MDPN460-Lecture04.csv” file that contains the formatted data found in Example 1 in Lecture 4.
● First find the directory in the server’s shared folder in which the file is stored.
> papfl1 <- read.table("????/MDPN460-Lecture04.csv", header = TRUE, sep = ",")
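
read.csv() is a wrapper around read.table() with header = TRUE and sep = "," as defaults. The sketch below writes a small csv file to a temporary location and reads it back both ways; the column names and values are made up for illustration and are not the Lecture 4 data.

```r
# Hypothetical csv content, not the course data
f <- tempfile(fileext = ".csv")
writeLines(c("Applicator,DryingTime",
             "Brush,39.1",
             "Pad,36.7",
             "Roller,41.0"), f)

d1 <- read.table(f, header = TRUE, sep = ",")
d2 <- read.csv(f)

print(d1)
print(all.equal(d1, d2))
```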

ANOVA using R
> dryingData <- read.table("/home/tamer/Work/MDPN460-Lecture04.csv",
header = TRUE, sep = ",")
> dryingData$Applicator <- as.factor(dryingData$Applicator)
> levels(dryingData$Applicator)
[1] "Brush" "Pad" "Roller"
> boxplot(DryingTime ~ Applicator, data = dryingData)
ANOVA using R

> anova(lm(dryingData$DryingTime ~ dryingData$Applicator, dryingData))

Analysis of Variance Table

Response: dryingData$DryingTime
                      Df Sum Sq Mean Sq F value  Pr(>F)
dryingData$Applicator  2 108.97  54.483  4.2748 0.03255 *
Residuals             16 203.92  12.745
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
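
The entries of the ANOVA table are linked by simple arithmetic: each mean square is the sum of squares divided by its degrees of freedom, F is the ratio of the two mean squares, and the p-value is the upper tail of the F distribution. A sketch that recomputes them from the rounded values printed in the table, so small rounding differences are expected:

```r
# Values copied from the ANOVA table
ss_treat <- 108.97; df_treat <- 2
ss_resid <- 203.92; df_resid <- 16

ms_treat <- ss_treat / df_treat   # mean square for Applicator
ms_resid <- ss_resid / df_resid   # mean square for Residuals
f_value  <- ms_treat / ms_resid   # F statistic

# Upper-tail probability of F with (2, 16) degrees of freedom
p_value <- pf(f_value, df_treat, df_resid, lower.tail = FALSE)

print(round(c(ms_treat, ms_resid, f_value, p_value), 4))
```

Because the p-value is below 0.05, the hypothesis of equal mean drying times for the three applicators is rejected at the 5% significance level.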
Lab Assignment 4

Use the results of last week's paper airplane flight experiments to test the hypothesis that the two teams achieved the same mean flight length.

-- to be done this Thursday.
