0% found this document useful (0 votes)

12 views

R Programming Cont..

MBA(BA) R introduction ppts

Uploaded by

Dr Shweta RAI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

R Programming Cont..

MBA(BA) R introduction ppts

Uploaded by

Dr Shweta RAI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

R programming

Data Frames
• Data Frames in R Language are generic data
objects of R that are used to store tabular
data. Data frames can also be interpreted as
matrices where each column of a matrix can
be of different data types. R Data Frame is
made up of three principal components, the
data, rows, and columns.
Create a data frame
• x <-
data.frame(GENDAR=c("M","F","M","F"),AGE=
c(25,18,2,8),
WEIGHT=c(2,5,2,8),HEIGHT=c(6,3,2,4))
• print(x)
#output
???????
# R program to create dataframe

# creating a data frame

friend.data <- data.frame(
friend_id = c(1:5),
friend_name = c("Sachin",
"Sourav",
"Dravid",
"Sehwag",
"Dhoni"),
stringsAsFactors = FALSE
)
# print the data frame
print(friend.data)
# Create the data frame
emp.data <- data.frame( emp_id = c (1:5),
emp_name = c("Rick","Dan","Michelle","Ryan","Gary"),
salary = c(623.3,515.2,611.0,729.0,843.25),
start_date = as.Date(c("2012-01-01", "2013-09-23", "2014-
11-15", "2014-05-11", "2015-03-27")),
stringsAsFactors = TRUE)
# Print the data frame.
print(emp.data)
dplyr and tidyr
• dplyr is a package that provides a grammar of data
manipulation, and provides a most used set of verbs that
helps data science analysts to solve the most common
data manipulation. All dplyr verbs take input as
data.frame and return data.frame object.
• tidyr' contains tools for changing the shape (pivoting) and
hierarchy (nesting and 'unnesting') of a dataset, turning
deeply nested lists into rectangular data frames
('rectangling'), and extracting values out of string
columns. It also includes tools for working with missing
values (both implicit and explicit).
# import dplyr package
install.packages('dplyr')
library(dplyr)
# create a data frame
stats <- data.frame(player=c('A', 'B', 'C', 'D'),
runs=c(100, 200, 408, 19),
wickets=c(17, 20, NA, 5))
# fetch players who scored more
# than 100 runs
filter(stats, runs>100)
# Create DataFrame
df <- data.frame( id = c(10,11,12,13,14,15,16,17),
name =
c('sai','ram','deepika','sahithi','kumar','scott','Don','Lin'),
gender = c('M','M','F','F','M','M','M','F'),
dob = as.Date(c('1990-10-02','1981-3-24','1987-6-
14','1985-8-16', '1995-03-02','1991-6-21','1986-3-
24','1990-8-26')),
state = c('CA','NY',NA,NA,'DC','DW','AZ','PH'),
row.names=c('r1','r2','r3','r4','r5','r6','r7','r8') )
df
By using dplyr filter() function you can filter the R data frame rows by name, filter
dataframe by column value, by multiple conditions e.t.c. Here, %>% is an infix
operator which acts as a pipe, it passes the left-hand side of the operator to the first
argument of the right-hand side of the operator.

# Load dplyr library

library('dplyr')
# filter() by row name
df %>% filter(rownames(df) == 'r3')
# filter() by column Value
df %>% filter(gender == 'M')
# filter() by list of values
df %>% filter(state %in% c("CA", "AZ", "PH"))
# filter() by multiple conditions
df %>% filter(gender == 'M' & id > 15)
dplyr::select() Examples
dplyr select() function is used to select the columns or variables from the data frame.
This takes the first argument as the data frame and the second argument is the
variable name or vector of variable names. For more examples refer to
select columns by name and select columns by index position.

• # select() single column

• df %>% select('id')
• # select() multiple columns
• df %>% select(c('id','name'))
• # Select multiple columns by id
• df %>% select(c(1,2))
• # Select rows 2 and 3
• df %>% slice(2,3)
• # Select rows from list
• df %>% slice(c(2,3,5,6))
• # select rows by range
• df %>% slice(2:6)
• # Drop rows using slice()
• df %>% slice(-2,-3,-4,-5,-6)
• # Drop by range
• df %>% slice(-2:-6)
dplyr::mutate() Examples
Use mutate() function and its other
verbs mutate_all(), mutate_if() and mutate_at() from
dplyr package to replace/update the values of the
column (string, integer, or any type) in R DataFrame
(data.frame).

# REPLACE ON SELECTED COLUMN

DF %>%
MUTATE(NAME = STR_REPLACE(NAME,
"SAI", "SAIRAM"))
dplyr::rename() Examples
The rename() function of dplyr is used to change the column name present in the data frame. The first example from the
following renames the column from the old name id to the new name c1. Similarly use dplyr to rename multiple columns.

• #Change the column name - c1 to id

my_dataframe %>% rename("c1" = "id") #
Rename multiple columns by name
my_dataframe <- my_dataframe %>%
rename("c1" = "id", "c2" = "pages", "c3" =
"name") # Rename multiple columns by index
my_dataframe <- my_dataframe %>%
rename(col1 = 1, col2 = 2)
dplyr::distinct() Examples
distinct() function of dplyr is used to select the unique/distinct rows from the input data frame. Not using any column/variable names as

arguments, this function returns unique rows by checking values on all columns .

• # Create dataframe
df=data.frame(id=c(11,11,33,44,44),
pages=c(32,32,33,22,22),
name=c("spark","spark","R","java","jsp"),
chapters=c(76,76,11,15,15),
price=c(144,144,321,567,567)) df # Load library
dplyr library(dplyr) # Distinct rows df2 <- df %>%
distinct() df2
• # Distinct on selected columns df2 <- df %>%
distinct(id,pages) df2
dplyr::arrange() Examples
dplyr arrange() function is used to sort the R dataframe rows by ascending or descending order based on column
values.

• # Create Data Frame

df=data.frame(id=c(11,22,33,44,55),
name=c("spark","python","R","jsp","java"),
price=c(144,NA,321,567,567), publish_date=
as.Date( c("2007-06-22", "2004-02-13", "2006-
05-18", "2010-09-02","2007-07-20")) ) # Load
dplyr library library(dplyr)
# Using arrange in ascending order df2 <- df
%>% arrange(price) df2
dplyr::group_by()
group_by() function in R is used to group the rows in a DataFrame by single or multiple columns and perform the aggregations.

• # Create Data Frame df =

read.csv('/Users/admin/apps/github/r-
examples/resources/emp.csv') df # Load dplyr
library(dplyr) # group_by() on department
grp_tbl <- df %>% group_by(department)
grp_tbl # summarise on groupped data.
agg_tbl <- grp_tbl %>%
summarise(sum(salary)) agg_tbl
One can get the structure of the R data frame using str() function in R. It can display
even the internal structure of large lists which are nested. It provides one-liner output
for the basic R objects letting the user know about the object and its constituents

• # R program to get the

• # structure of the data frame
•
• # creating a data frame
• friend.data <- data.frame(
• friend_id = c(1:5),
• friend_name = c("Sachin", "Sourav",
• "Dravid", "Sehwag",
• "Dhoni"),
• stringsAsFactors = FALSE
• )
• # using str()
• print(str(friend.data))
In the R data frame, the statistical summary and nature of the data can be obtained by applying
summary() function. It is a generic function used to produce result summaries of the results of
various model fitting functions. The function invokes particular methods which depend on the
class of the first argument.

• # R program to get the

• # summary of the data frame
•
• # creating a data frame
• friend.data <- data.frame(
• friend_id = c(1:5),
• friend_name = c("Sachin", "Sourav",
• "Dravid", "Sehwag",
• "Dhoni"),
• stringsAsFactors = FALSE
• )
• # using summary()
• print(summary(friend.data))
Extracting data from an R data frame means that to access its rows or
columns. One can extract a specific column from an R data frame using its
column name.

• # R program to extract
• # data from the data frame
•
• # creating a data frame
• friend.data <- data.frame(
• friend_id = c(1:5),
• friend_name = c("Sachin", "Sourav",
• "Dravid", "Sehwag",
• "Dhoni"),
• stringsAsFactors = FALSE
• )
•
• # Extracting friend_name column
• result <- data.frame(friend.data$friend_name)
• print(result)
A data frame in R can be expanded by adding new
columns and rows to the already existing R data frame .
• # R program to expand
• # the data frame
•
• # creating a data frame
• friend.data <- data.frame(
• friend_id = c(1:5),
• friend_name = c("Sachin", "Sourav",
• "Dravid", "Sehwag",
• "Dhoni"),
• stringsAsFactors = FALSE
• )
•
• # Expanding data frame
• friend.data$location <- c("Kolkata", "Delhi",
• "Bangalore", "Hyderabad",
• "Chennai")
• resultant <- friend.data
• # print the modified data frame
• print(resultant)
A data frame in R removes columns and rows from
the already existing R data frame.
• library(dplyr)
• # Create a data frame
• data <- data.frame(
• friend_id = c(1, 2, 3, 4, 5),
• friend_name = c("Sachin", "Sourav", "Dravid", "Sehwag", "Dhoni"),
• location = c("Kolkata", "Delhi", "Bangalore", "Hyderabad", "Chennai")
• )
•
• # Remove a row with friend_id = 3
• data <- subset(data, friend_id != 3)
•
• # Remove the 'location' column
• data <- select(data, -location)
• Print(data)

Subsetting Data in R
No ratings yet
Subsetting Data in R
44 pages
Normet Spraymec Spare - Part - Manual PDF
100% (2)
Normet Spraymec Spare - Part - Manual PDF
483 pages
R Programming Cheatsheet
100% (1)
R Programming Cheatsheet
6 pages
Manual de Bop Anular
No ratings yet
Manual de Bop Anular
22 pages
6 Working With Data Frames in R
No ratings yet
6 Working With Data Frames in R
8 pages
Data Handling in R Programming notes
No ratings yet
Data Handling in R Programming notes
41 pages
Basic R Dplyr Session 4 Demonstration
No ratings yet
Basic R Dplyr Session 4 Demonstration
18 pages
RSTUDIO
No ratings yet
RSTUDIO
44 pages
3 Scalar, Dataframe
No ratings yet
3 Scalar, Dataframe
13 pages
What Is A Data Frame in R?
No ratings yet
What Is A Data Frame in R?
5 pages
Dar lecture 7
No ratings yet
Dar lecture 7
24 pages
Machine Learning - Unit IV Notes
No ratings yet
Machine Learning - Unit IV Notes
18 pages
BMR Assignment: Tidyr
No ratings yet
BMR Assignment: Tidyr
3 pages
8 R Basics 3
No ratings yet
8 R Basics 3
27 pages
Module IV
No ratings yet
Module IV
43 pages
R Data Frame - Javatpoint
No ratings yet
R Data Frame - Javatpoint
14 pages
R Packages Dplyr Sem-III 2021
No ratings yet
R Packages Dplyr Sem-III 2021
13 pages
Basic_Data_Objects_in_R
No ratings yet
Basic_Data_Objects_in_R
18 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
MIT 302 - Statistical Computing II - Tutorial 02
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 02
5 pages
Dataframes
No ratings yet
Dataframes
13 pages
DSF 11-12
No ratings yet
DSF 11-12
21 pages
BS730 Class 12
No ratings yet
BS730 Class 12
36 pages
Lecture_5_(Managing_and_Understanding_Data)
No ratings yet
Lecture_5_(Managing_and_Understanding_Data)
9 pages
R
No ratings yet
R
15 pages
fancyDPLYR Funcs
No ratings yet
fancyDPLYR Funcs
31 pages
Kids C ("Jack", "Jill") : 5.1 Creating Data Frames
No ratings yet
Kids C ("Jack", "Jill") : 5.1 Creating Data Frames
11 pages
R Course Own English HS
No ratings yet
R Course Own English HS
70 pages
Tutorial-Introduction To Dplyr
No ratings yet
Tutorial-Introduction To Dplyr
54 pages
Base-R
No ratings yet
Base-R
9 pages
Lab6a (1)
No ratings yet
Lab6a (1)
3 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
R Programming: © 2016 SMART Training Resources Pvt. LTD
No ratings yet
R Programming: © 2016 SMART Training Resources Pvt. LTD
28 pages
R Prog
No ratings yet
R Prog
27 pages
All Codes
No ratings yet
All Codes
10 pages
data-frames-in-R
No ratings yet
data-frames-in-R
7 pages
R study material I
No ratings yet
R study material I
8 pages
Lab11
No ratings yet
Lab11
2 pages
R Reference Card
100% (4)
R Reference Card
4 pages
Daur Unit 2
No ratings yet
Daur Unit 2
28 pages
Data Cleaning Using R
No ratings yet
Data Cleaning Using R
26 pages
R Programming Cheat Sheet: Ata Tructures
No ratings yet
R Programming Cheat Sheet: Ata Tructures
2 pages
MTech R Notes
No ratings yet
MTech R Notes
14 pages
Important R Codes and Notes
No ratings yet
Important R Codes and Notes
13 pages
R Command Cheatsheet2551545
No ratings yet
R Command Cheatsheet2551545
2 pages
R-Cheat Sheet
100% (1)
R-Cheat Sheet
4 pages
M2_DAR_
No ratings yet
M2_DAR_
46 pages
DA_Lab_Week-2
No ratings yet
DA_Lab_Week-2
22 pages
Introduction to R for Business Analytics(1)
No ratings yet
Introduction to R for Business Analytics(1)
7 pages
Learn R_ Learn R_ Data Cleaning Cheatsheet _ Codecademy
No ratings yet
Learn R_ Learn R_ Data Cleaning Cheatsheet _ Codecademy
4 pages
Unit 1.3
No ratings yet
Unit 1.3
36 pages
BDA Section 4
No ratings yet
BDA Section 4
19 pages
What Is Dplyr
No ratings yet
What Is Dplyr
23 pages
BigData_BCom-Unit-4
No ratings yet
BigData_BCom-Unit-4
9 pages
Obejcts in R A13
No ratings yet
Obejcts in R A13
8 pages
Plyr Package in R Programming
No ratings yet
Plyr Package in R Programming
9 pages
Frs Unit - 2
No ratings yet
Frs Unit - 2
27 pages
CH 03
No ratings yet
CH 03
42 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Tidyr
No ratings yet
Tidyr
1 page
Unit 3 PPT (BA)
No ratings yet
Unit 3 PPT (BA)
19 pages
Unit 2 PPT (BA)
No ratings yet
Unit 2 PPT (BA)
33 pages
Introduction To Business Analytics & Data Science 22-23
No ratings yet
Introduction To Business Analytics & Data Science 22-23
1 page
Cadcam Iat - 1 Question Paper
No ratings yet
Cadcam Iat - 1 Question Paper
2 pages
Gate Solved Paper - In: Q.1 - 30 Carry One Mark Each
No ratings yet
Gate Solved Paper - In: Q.1 - 30 Carry One Mark Each
20 pages
NMDC Presentation For BENEFICIATION PLANT - 19.07.2016-1
No ratings yet
NMDC Presentation For BENEFICIATION PLANT - 19.07.2016-1
18 pages
TDSSWL
No ratings yet
TDSSWL
1 page
A Project Report Automatic Traffic Light Prepared in Partial Fulfillment
No ratings yet
A Project Report Automatic Traffic Light Prepared in Partial Fulfillment
6 pages
Method Statement For Soft Landscape - Full - B
No ratings yet
Method Statement For Soft Landscape - Full - B
41 pages
ADP000536
No ratings yet
ADP000536
106 pages
Compresor SullAir VCC200 - 200S
No ratings yet
Compresor SullAir VCC200 - 200S
10 pages
Design Fabrication and Performance Test of Melon Shelling Machines
No ratings yet
Design Fabrication and Performance Test of Melon Shelling Machines
19 pages
Exercises - Ii 13-12-2022
No ratings yet
Exercises - Ii 13-12-2022
5 pages
Forane Refrigerants Properties
No ratings yet
Forane Refrigerants Properties
20 pages
Renu Mathur
No ratings yet
Renu Mathur
4 pages
KDK Me Catalogue
100% (1)
KDK Me Catalogue
16 pages
Profile Snapshot: Recruiters
No ratings yet
Profile Snapshot: Recruiters
6 pages
Acitivity Guidess
100% (1)
Acitivity Guidess
2 pages
CPDS Lab Files
No ratings yet
CPDS Lab Files
3 pages
A. Lun Move Start
No ratings yet
A. Lun Move Start
22 pages
Test1 342 PracticeV1
No ratings yet
Test1 342 PracticeV1
5 pages
Data Sheet Am 22
No ratings yet
Data Sheet Am 22
4 pages
HCU200 Users Guide PDF
No ratings yet
HCU200 Users Guide PDF
20 pages
Desmodur L75: Characterization
No ratings yet
Desmodur L75: Characterization
4 pages
Chapter 4
No ratings yet
Chapter 4
34 pages
Ingot Manufacturing
No ratings yet
Ingot Manufacturing
7 pages
Design of Inspection and Cleaning Robot
No ratings yet
Design of Inspection and Cleaning Robot
6 pages
Eng. Taher Galal
No ratings yet
Eng. Taher Galal
2 pages
Catalogo CGM.3
No ratings yet
Catalogo CGM.3
44 pages
Paich - 25.5.16
No ratings yet
Paich - 25.5.16
12 pages
Paints Industry Raw Materials Unit Operations Equipment Manufacturing Quality Tests
No ratings yet
Paints Industry Raw Materials Unit Operations Equipment Manufacturing Quality Tests
46 pages

R Programming Cont..

Uploaded by

R Programming Cont..

Uploaded by

R programming

# creating a data frame

# Load dplyr library

• # select() single column

# REPLACE ON SELECTED COLUMN

• #Change the column name - c1 to id

• # Create Data Frame

• # Create Data Frame df =

• # R program to get the

• # R program to get the

You might also like