Split

The document discusses the R split function which splits a vector or data frame into groups based on factors. It provides examples of splitting a numeric vector and the airquality data frame by the Month factor. It also demonstrates splitting a vector on more than one factor level and the use of drop = TRUE to remove empty factor levels.

Uploaded by

Augusto Ferrari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views9 pages

Split

Uploaded by

Augusto Ferrari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

split

split takes a vector or other objects and splits it into groups determined by a factor or list of

factors.
> str(split)
function (x, f, drop = FALSE, ...)

x is a vector (or list) or data frame

f is a factor (or coerced to one) or a list of factors
drop indicates whether empty factors levels should be dropped

6/14

split
> x <- c(rnorm(10), runif(10), rnorm(10, 1))
> f <- gl(3, 10)
> split(x, f)
$1
[1] -0.8493038 -0.5699717 -0.8385255 -0.8842019
[5] 0.2849881 0.9383361 -1.0973089 2.6949703
[9] 1.5976789 -0.1321970
$2
[1] 0.09479023 0.79107293 0.45857419 0.74849293
[5] 0.34936491 0.35842084 0.78541705 0.57732081
[9] 0.46817559 0.53183823
$3
[1] 0.6795651 0.9293171 1.0318103 0.4717443
[5] 2.5887025 1.5975774 1.3246333 1.4372701

7/14

split
A common idiom is split followed by an lapply.
> lapply(split(x, f), mean)
$1
[1] 0.1144464
$2
[1] 0.5163468
$3
[1] 1.246368

8/14

Splitting a Data Frame

> library(datasets)
> head(airquality)
Ozone Solar.R Wind Temp Month Day
1
41
190 7.4
67
5
1
2
36
118 8.0
72
5
2
3
12
149 12.6
74
5
3
4
18
313 11.5
62
5
4
5
NA
NA 14.3
56
5
5
6
28
NA 14.9
66
5
6

9/14

Splitting a Data Frame

> s <- split(airquality, airquality$Month)
> lapply(s, function(x) colMeans(x[, c("Ozone", "Solar.R", "Wind")]))
$5
Ozone Solar.R
Wind
NA
NA 11.62258
$6
Ozone
Solar.R
Wind
NA 190.16667 10.26667
$7
Ozone
Solar.R
Wind
NA 216.483871
8.941935

10/14

Splitting a Data Frame

> sapply(s, function(x) colMeans(x[, c("Ozone", "Solar.R", "Wind")]))
5
6
7
8
9
Ozone
NA
NA
NA
NA
NA
Solar.R
NA 190.16667 216.483871
NA 167.4333
Wind
11.62258 10.26667
8.941935 8.793548 10.1800
> sapply(s, function(x) colMeans(x[, c("Ozone", "Solar.R", "Wind")],
na.rm = TRUE))
5
6
7
8
9
Ozone
23.61538
29.44444
59.115385
59.961538
31.44828
Solar.R
181.29630
190.16667
216.483871
171.857143 167.43333
Wind
11.62258
10.26667
8.941935
8.793548
10.18000

11/14

Splitting on More than One Level

> x <- rnorm(10)
> f1 <- gl(2, 5)
> f2 <- gl(5, 2)
> f1
[1] 1 1 1 1 1 2 2 2 2 2
Levels: 1 2
> f2
[1] 1 1 2 2 3 3 4 4 5 5
Levels: 1 2 3 4 5
> interaction(f1, f2)
[1] 1.1 1.1 1.2 1.2 1.3 2.3 2.4 2.4 2.5 2.5
10 Levels: 1.1 2.1 1.2 2.2 1.3 2.3 1.4 ... 2.5

12/14

Splitting on More than One Level

Interactions can create empty levels.
> str(split(x, list(f1, f2)))
List of 10
$ 1.1: num [1:2] -0.378 0.445
$ 2.1: num(0)
$ 1.2: num [1:2] 1.4066 0.0166
$ 2.2: num(0)
$ 1.3: num -0.355
$ 2.3: num 0.315
$ 1.4: num(0)
$ 2.4: num [1:2] -0.907 0.723
$ 1.5: num(0)
$ 2.5: num [1:2] 0.732 0.360

13/14

split
Empty levels can be dropped.
> str(split(x, list(f1, f2), drop = TRUE))
List of 6
$ 1.1: num [1:2] -0.378 0.445
$ 1.2: num [1:2] 1.4066 0.0166
$ 1.3: num -0.355
$ 2.3: num 0.315
$ 2.4: num [1:2] -0.907 0.723
$ 2.5: num [1:2] 0.732 0.360

14/14

Exploratory Graphs
No ratings yet
Exploratory Graphs
23 pages
Python Data Cleaning
100% (1)
Python Data Cleaning
20 pages
Data Wrangling Cheatsheet PDF
No ratings yet
Data Wrangling Cheatsheet PDF
2 pages
R-Cheat Sheet
100% (1)
R-Cheat Sheet
4 pages
Dplyr Cheatsheet PDF
100% (1)
Dplyr Cheatsheet PDF
2 pages
OpenAir Manual PDF
No ratings yet
OpenAir Manual PDF
287 pages
The Manual: Openair
No ratings yet
The Manual: Openair
224 pages
1 - 4 Subsetting
No ratings yet
1 - 4 Subsetting
14 pages
DV Lab
No ratings yet
DV Lab
52 pages
Comparison With R / R Libraries
No ratings yet
Comparison With R / R Libraries
12 pages
Summarizing Data
No ratings yet
Summarizing Data
13 pages
HW1 Xinjie Hu
No ratings yet
HW1 Xinjie Hu
7 pages
OpenAir Manual
No ratings yet
OpenAir Manual
287 pages
高杨驰 17420202201427 HW1
No ratings yet
高杨驰 17420202201427 HW1
9 pages
R Program3
No ratings yet
R Program3
21 pages
Curso Básico de Iniciación A La Programación Con R Álvaro Mauricio Bustamante Lozano
No ratings yet
Curso Básico de Iniciación A La Programación Con R Álvaro Mauricio Bustamante Lozano
9 pages
HW1 For R-Lizhi Fu
No ratings yet
HW1 For R-Lizhi Fu
5 pages
R Subnetting
No ratings yet
R Subnetting
16 pages
Theil-Sen No R
No ratings yet
Theil-Sen No R
5 pages
Reshape2 - R - Flexibly Reshape Data - A Reboot of The Reshape Package
No ratings yet
Reshape2 - R - Flexibly Reshape Data - A Reboot of The Reshape Package
14 pages
Basic R Dplyr Session 4 Demonstration
No ratings yet
Basic R Dplyr Session 4 Demonstration
18 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
R Functions
No ratings yet
R Functions
8 pages
Data Wrangling Cheatsheet PDF
No ratings yet
Data Wrangling Cheatsheet PDF
2 pages
CSE315:Introduction To Data Science: WEEK-8
No ratings yet
CSE315:Introduction To Data Science: WEEK-8
27 pages
Intro To Data Science Lecture 4
No ratings yet
Intro To Data Science Lecture 4
13 pages
R Programs
No ratings yet
R Programs
30 pages
MIT 302 - Statistical Computing II - Tutorial 02
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 02
5 pages
OpenAir Manual
No ratings yet
OpenAir Manual
230 pages
2.data Frame Selection and Indexing
No ratings yet
2.data Frame Selection and Indexing
4 pages
7 DS Assignment 1
No ratings yet
7 DS Assignment 1
9 pages
EDA With R Lab Manual
No ratings yet
EDA With R Lab Manual
110 pages
R Examples
No ratings yet
R Examples
56 pages
R Command Cheatsheet2551545
No ratings yet
R Command Cheatsheet2551545
2 pages
EX No-3
No ratings yet
EX No-3
3 pages
22 Subsetting Removing Missing Values Subsetting NAs
No ratings yet
22 Subsetting Removing Missing Values Subsetting NAs
3 pages
As 2
No ratings yet
As 2
10 pages
Data Cleaning Using R
No ratings yet
Data Cleaning Using R
26 pages
Deber R
No ratings yet
Deber R
4 pages
Advanced R Programming Tidyverse Packages Notes
No ratings yet
Advanced R Programming Tidyverse Packages Notes
12 pages
DSCI 100 Cheat Sheet
No ratings yet
DSCI 100 Cheat Sheet
3 pages
DATAMINING
No ratings yet
DATAMINING
24 pages
2 Trends Seasonality and Residuals Explained!-Copy1
No ratings yet
2 Trends Seasonality and Residuals Explained!-Copy1
14 pages
Data Cleaning Using R
No ratings yet
Data Cleaning Using R
26 pages
Varela+ +2013+ +Macroecology+and+Species+Distribution+Models
No ratings yet
Varela+ +2013+ +Macroecology+and+Species+Distribution+Models
59 pages
Data Types
No ratings yet
Data Types
27 pages
Debugging
No ratings yet
Debugging
15 pages
Simulation Simulation
No ratings yet
Simulation Simulation
15 pages
Introduction To The R Language Introduction To The R Language
No ratings yet
Introduction To The R Language Introduction To The R Language
12 pages
Introduction To The R Language Introduction To The R Language
No ratings yet
Introduction To The R Language Introduction To The R Language
7 pages
Introduction To The R Language Introduction To The R Language
No ratings yet
Introduction To The R Language Introduction To The R Language
5 pages
The Elf
No ratings yet
The Elf
2 pages
The Elements of Quantitative Investing
From Everand
The Elements of Quantitative Investing
Giuseppe A. Paleologo
No ratings yet
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
From Everand
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
Stuart A. Klugman
4/5 (1)
Generalized Fermat Equation
From Everand
Generalized Fermat Equation
Ran Van Vo
No ratings yet
Math Crossword Puzzles
From Everand
Math Crossword Puzzles
Anna B. Napolitano
No ratings yet
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
Computer Solved Differential Equations
From Everand
Computer Solved Differential Equations
Joe J.
No ratings yet
Solving Math Problems
From Everand
Solving Math Problems
George N. Frempong
No ratings yet
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
2.5/5 (2)
Computer Solved: Nonlinear Differential Equations
From Everand
Computer Solved: Nonlinear Differential Equations
Joe J. Ettl
No ratings yet
Heat Transfer II Essentials
From Everand
Heat Transfer II Essentials
The Editors of REA
3.5/5 (3)
Speed Mathamatics
From Everand
Speed Mathamatics
Naila Hina
1/5 (1)
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
Analytic Geometry: Graphic Solutions Using Matlab Language
From Everand
Analytic Geometry: Graphic Solutions Using Matlab Language
Ing. Mario Castillo
No ratings yet
Mindful Maths 1: Use Your Algebra to Solve These Puzzling Pictures
From Everand
Mindful Maths 1: Use Your Algebra to Solve These Puzzling Pictures
Ann McNair
No ratings yet
Basic Mathematics. Explained Easy | For Beginners
From Everand
Basic Mathematics. Explained Easy | For Beginners
ExaGrecation
No ratings yet