Data Table Syntax and Usage Guide

The document discusses the syntax of the data.table package in R. It notes that the general syntax is DT[i, j, by] where i represents rows to select, j represents columns to select or transform, and by represents grouping columns. It provides examples of using i to subset and order rows, using j to select, compute on, and name columns, and using by to group by one or more columns and columns or expressions. The flexibility of combining i, j, and by allows for powerful operations like applying functions to grouped data or subsetting groups.

Uploaded by

Johana Coen Janssen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views1 page

Data Table Syntax and Usage Guide

Uploaded by

Johana Coen Janssen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Notes on Data Table

Makar Pravosud
28 08 2019

The general form of [Link] syntax is:

DT[i, j, by]

Using i:

We can also sort a [Link] using order(), which internally uses [Link]’s fast order for performance.
We can do much more in i by keying a [Link], which allows blazing fast subsets and joins.

Using j:

Select columns the [Link] way: DT[, .(colA, colB)].

Select columns the [Link] way: DT[, c(“colA”, “colB”)].
Compute on columns: DT[, .(sum(colA), mean(colB))].
Provide names if necessary: DT[, .(sA =sum(colA), mB = mean(colB))].
Combine with i: DT[colA > value, sum(colB)].

Using by:

Using by, we can group by columns by specifying a list of columns or a character vector of column names or
even expressions. The flexibility of j, combined with by and i makes for a very powerful syntax.
by can handle multiple columns and also expressions.
We can keyby grouping columns to automatically sort the grouped result.
We can use .SD and .SDcols in j to operate on multiple columns using already familiar base functions. Here
are some examples:
DT[, lapply(.SD, fun), by = . . . , .SDcols = . . . ] - applies fun to all columns specified in .SDcols while
grouping by the columns specified in by.
DT[, head(.SD, 2), by = . . . ] - return the first two rows for each group.
DT[col > val, head(.SD, 1), by = . . . ] - combine i along with j and by.
And remember the tip: As long as j returns a list, each element of the list will become a column in the
resulting [Link].

Datatable
No ratings yet
Datatable
2 pages
Introduction To The Data - Table Package in R: Revised: September 18, 2015 (A Later Revision May Be Available On The)
No ratings yet
Introduction To The Data - Table Package in R: Revised: September 18, 2015 (A Later Revision May Be Available On The)
8 pages
Enhanced Data
No ratings yet
Enhanced Data
12 pages
Introduction To The Data - Table Package in R: Revised: October 2, 2014 (A Later Revision May Be Available On The)
No ratings yet
Introduction To The Data - Table Package in R: Revised: October 2, 2014 (A Later Revision May Be Available On The)
8 pages
Data Transformation With Data - Table: Cheat Sheet
No ratings yet
Data Transformation With Data - Table: Cheat Sheet
2 pages
Data Transformation With Data - Table: Cheat Sheet
No ratings yet
Data Transformation With Data - Table: Cheat Sheet
2 pages
Data Transformation With Data - Table: Cheat Sheet
No ratings yet
Data Transformation With Data - Table: Cheat Sheet
2 pages
Data Table
No ratings yet
Data Table
2 pages
Datatable Intro
No ratings yet
Datatable Intro
9 pages
Data.table Transformation Cheat Sheet
No ratings yet
Data.table Transformation Cheat Sheet
2 pages
Faqs About The Data - Table Package in R: Revised: October 2, 2014 (A Later Revision May Be Available On The)
No ratings yet
Faqs About The Data - Table Package in R: Revised: October 2, 2014 (A Later Revision May Be Available On The)
21 pages
R Programming Cheat Sheet: Data Structures
No ratings yet
R Programming Cheat Sheet: Data Structures
2 pages
R Programming Cheatsheet
100% (2)
R Programming Cheatsheet
6 pages
R data.table Guide: 50 Examples
No ratings yet
R data.table Guide: 50 Examples
13 pages
R Basics for Biology Students
No ratings yet
R Basics for Biology Students
17 pages
Basic Data Objects in R
No ratings yet
Basic Data Objects in R
18 pages
Presentation 1
No ratings yet
Presentation 1
34 pages
R data.table Cheat Sheet Guide
No ratings yet
R data.table Cheat Sheet Guide
1 page
Datatable Cheat Sheet R
No ratings yet
Datatable Cheat Sheet R
1 page
Tidyr & Dplyr Functions Guide
No ratings yet
Tidyr & Dplyr Functions Guide
3 pages
R Matrix and Factor Basics
No ratings yet
R Matrix and Factor Basics
2 pages
R Study Material I
No ratings yet
R Study Material I
8 pages
Descriptive Statistics in R Programming
No ratings yet
Descriptive Statistics in R Programming
3 pages
R
No ratings yet
R
15 pages
Obejcts in R A13
No ratings yet
Obejcts in R A13
8 pages
Mydata - Read - CSV ("Nameofthedatafile - CSV") : Sorting A Data Frame
No ratings yet
Mydata - Read - CSV ("Nameofthedatafile - CSV") : Sorting A Data Frame
2 pages
M2 Dar
No ratings yet
M2 Dar
46 pages
Harneet 23-R Presentation
No ratings yet
Harneet 23-R Presentation
22 pages
ISYS3447 - Week 3 Notes
No ratings yet
ISYS3447 - Week 3 Notes
3 pages
R Data Subsetting & Manipulation Guide
No ratings yet
R Data Subsetting & Manipulation Guide
44 pages
R Basics Part2
No ratings yet
R Basics Part2
15 pages
MATLAB Tutorial by Oren Shriki
No ratings yet
MATLAB Tutorial by Oren Shriki
15 pages
R Practicals
No ratings yet
R Practicals
53 pages
Week 1-B. Data in R
No ratings yet
Week 1-B. Data in R
5 pages
R Programming for Data Analysis Guide
No ratings yet
R Programming for Data Analysis Guide
27 pages
MLlab 5 TH
No ratings yet
MLlab 5 TH
17 pages
Unit 1.3
No ratings yet
Unit 1.3
36 pages
R Programming: Vectors and Matrices Guide
No ratings yet
R Programming: Vectors and Matrices Guide
109 pages
Example of Descriptive Statistics Table
No ratings yet
Example of Descriptive Statistics Table
25 pages
Base R
No ratings yet
Base R
9 pages
Data Analysis Using R - 3
No ratings yet
Data Analysis Using R - 3
32 pages
DSF 9-10
No ratings yet
DSF 9-10
25 pages
Data Manipulation and Visualization in R
No ratings yet
Data Manipulation and Visualization in R
58 pages
21Ai51T - Programming Language For Ai: Innovative Assignment - III
No ratings yet
21Ai51T - Programming Language For Ai: Innovative Assignment - III
13 pages
R Statistical Package
No ratings yet
R Statistical Package
63 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
Unit 3 Chatgpt
No ratings yet
Unit 3 Chatgpt
6 pages
Introduction to R Basics and Data Types
No ratings yet
Introduction to R Basics and Data Types
33 pages
R Programming: © 2016 SMART Training Resources Pvt. LTD
No ratings yet
R Programming: © 2016 SMART Training Resources Pvt. LTD
28 pages
Dar Lecture 7
No ratings yet
Dar Lecture 7
24 pages
R Programming Basics: Vectors, Matrices, Dataframes
No ratings yet
R Programming Basics: Vectors, Matrices, Dataframes
13 pages
Lab Week2-3
No ratings yet
Lab Week2-3
26 pages

Data Table Syntax and Usage Guide

Uploaded by

Data Table Syntax and Usage Guide

Uploaded by

Notes on Data Table

The general form of [Link] syntax is:

Select columns the [Link] way: DT[, .(colA, colB)].

You might also like