0% found this document useful (0 votes)

15 views3 pages

Class 3

The document reads in employee data from a CSV file, examines the data frame structure and variables, and performs several aggregations and calculations on the data. Key points: - Employee data is read from a CSV file with variables like name, gender, salary, ratings, etc. - Summary statistics are calculated for variables like salary, experience, ratings to understand distributions. - Aggregations are performed to calculate average salary by gender, designation, and their combination. - Median experience is aggregated by gender and gender-designation to understand differences. - Selected variables are aggregated and averaged based on gender, designation, and minority to a new CSV file.

Uploaded by

Jaan Mukherjee

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views3 pages

Class 3

Uploaded by

Jaan Mukherjee

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

class3.

R
apple

2023-09-05
emp <- read.csv("~/Desktop/Cares/RIMS/RDA/emp.csv", stringsAsFactors=TRUE)
head(emp)

## id Names gender DOB educ Designation Level salary

## 1 1 Dr.Liam Johnson Male 1986-11-24 PG MLM III 57000
## 2 2 Mr.Noah Smith Male 1963-06-23 PG MLM I 40200
## 3 3 Mr.William Williams Male 1991-12-23 PG ELM II 32100
## 4 4 Dr.James Brown Male 1998-04-07 PG ELM III 36000
## 5 7 Mr.Oliver Jones Male 1996-03-28 UG ELM I 27300
## 6 11 Dr.Benjamin Davis Male 1987-01-06 PG ELM II 31050
## Last.drawn.salary PRE..EXP minority RATINGS2.BY.INTERVIEWER
## 1 27000 10 No 5
## 2 18750 20 No 5
## 3 13500 10 No 4
## 4 18750 3 No 5
## 5 13500 3 No 8
## 6 12600 10 No 4
## RATINGS1.BY.INTERVIEWER RATINGS3.BY.INTERVIEWER RATINGS4.BY.INTERVIEWER
## 1 5 7 9
## 2 3 4 9
## 3 4 10 7
## 4 5 9 1
## 5 7 1 8
## 6 7 1 2

names(emp)

## [1] "id" "Names"

## [3] "gender" "DOB"
## [5] "educ" "Designation"
## [7] "Level" "salary"
## [9] "Last.drawn.salary" "PRE..EXP"
## [11] "minority" "RATINGS2.BY.INTERVIEWER"
## [13] "RATINGS1.BY.INTERVIEWER" "RATINGS3.BY.INTERVIEWER"
## [15] "RATINGS4.BY.INTERVIEWER"

summary(emp)

## id Names gender DOB educ

## Min. : 1.00 Dr. Abdul Rahman: 1 Female:53 1963-01-13: 1

HS:16
## 1st Qu.: 35.00 Dr. Amir Khan : 1 Male :80 1963-02-05: 1
PG:69
## Median : 68.00 Dr. Anusha Patel: 1 1963-06-23: 1
UG:48
## Mean : 67.95 Dr. Arjun Patel : 1 1964-03-02: 1

## 3rd Qu.:101.00 Dr. Cheng Wang : 1 1964-08-23: 1

## Max. :134.00 Dr. Faisal Ahmed: 1 1964-11-20: 1

## (Other) :127 (Other) :127

## Designation Level salary Last.drawn.salary PRE..EXP

## ELAM:18 I :46 Min. : 20400 Min. :10950 Min. : 2.00
## ELM :75 II :48 1st Qu.: 25950 1st Qu.:13500 1st Qu.: 4.00
## MLM :21 III:39 Median : 32550 Median :15750 Median : 8.00
## TLM :19 Mean : 41405 Mean :19582 Mean :11.01
## 3rd Qu.: 55750 3rd Qu.:21750 3rd Qu.:18.00
## Max. :135000 Max. :79980 Max. :34.50
##
## minority RATINGS2.BY.INTERVIEWER RATINGS1.BY.INTERVIEWER
## No :111 Min. : 1.00 Min. : 1.000
## Yes: 22 1st Qu.: 3.00 1st Qu.: 3.000
## Median : 5.00 Median : 5.000
## Mean : 5.15 Mean : 5.624
## 3rd Qu.: 7.00 3rd Qu.: 8.000
## Max. :10.00 Max. :10.000
##
## RATINGS3.BY.INTERVIEWER RATINGS4.BY.INTERVIEWER
## Min. : 1.000 Min. : 1.000
## 1st Qu.: 3.000 1st Qu.: 3.000
## Median : 6.000 Median : 5.000
## Mean : 5.729 Mean : 5.353
## 3rd Qu.: 8.000 3rd Qu.: 8.000
## Max. :10.000 Max. :10.000
##

### aggregate

aggregate(salary ~ gender,mean,data=emp)

## gender salary
## 1 Female 28141.98
## 2 Male 50191.25

aggregate(salary ~ Designation,mean,data=emp)

## Designation salary
## 1 ELAM 21633.33
## 2 ELM 30310.00
## 3 MLM 56419.05
## 4 TLM 87335.53

aggregate(salary ~ gender+Designation,mean,data=emp)

## gender Designation salary

## 1 Female ELAM 21460.00
## 2 Male ELAM 22500.00
## 3 Female ELM 28542.86
## 4 Male ELM 31856.25
## 5 Female MLM 56875.00
## 6 Male MLM 56343.06
## 7 Male TLM 87335.53

aggregate(PRE..EXP ~ gender,median,data=emp)

## gender PRE..EXP
## 1 Female 4
## 2 Male 10

aggregate(PRE..EXP ~ gender+Designation,median,data=emp)

## gender Designation PRE..EXP

## 1 Female ELAM 3
## 2 Male ELAM 4
## 3 Female ELM 4
## 4 Male ELM 8
## 5 Female MLM 16
## 6 Male MLM 17
## 7 Male TLM 25

emp_cont = emp[,c(3,6,8:15)]
names(emp_cont)

## [1] "gender" "Designation"

## [3] "salary" "Last.drawn.salary"
## [5] "PRE..EXP" "minority"
## [7] "RATINGS2.BY.INTERVIEWER" "RATINGS1.BY.INTERVIEWER"
## [9] "RATINGS3.BY.INTERVIEWER" "RATINGS4.BY.INTERVIEWER"

ag1 = aggregate(. ~ gender+Designation+minority,mean,data=emp_cont)

write.csv(ag1,"output_ag1.csv")

Presentation On Financial Scam - Enron Scandal
No ratings yet
Presentation On Financial Scam - Enron Scandal
11 pages
Base Sas Certification Exercise
No ratings yet
Base Sas Certification Exercise
47 pages
R Working Manuals Students
No ratings yet
R Working Manuals Students
11 pages
Experiment 2
No ratings yet
Experiment 2
7 pages
BDA Assignment Aman 19019
No ratings yet
BDA Assignment Aman 19019
38 pages
R Working Materials Prep
No ratings yet
R Working Materials Prep
43 pages
Experiment 2
No ratings yet
Experiment 2
7 pages
R For Machine Learning Lab Practical Work: Master of Business Administration in Business Analytics
0% (1)
R For Machine Learning Lab Practical Work: Master of Business Administration in Business Analytics
9 pages
Practical 2024
No ratings yet
Practical 2024
10 pages
Rinku Mitra MLID241017
No ratings yet
Rinku Mitra MLID241017
18 pages
Bank Rpubs
No ratings yet
Bank Rpubs
24 pages
L3 Notes-1
No ratings yet
L3 Notes-1
8 pages
Project 5 PDF
100% (1)
Project 5 PDF
48 pages
Assignment Lab 1
No ratings yet
Assignment Lab 1
3 pages
Experiment 5
No ratings yet
Experiment 5
13 pages
R Copy 4
No ratings yet
R Copy 4
14 pages
Assignment Submitted By-Srishti Bhateja 19021141116: STR (Crew - Data)
No ratings yet
Assignment Submitted By-Srishti Bhateja 19021141116: STR (Crew - Data)
11 pages
Assignment On Classification Tree Model Development: Submitted By-Gaurav Khokhani
No ratings yet
Assignment On Classification Tree Model Development: Submitted By-Gaurav Khokhani
19 pages
Assignment 2
No ratings yet
Assignment 2
5 pages
Stat 412 - M - 2022
No ratings yet
Stat 412 - M - 2022
21 pages
IntroR 2
No ratings yet
IntroR 2
18 pages
Mastering Pandas With 103 Practical Questions and Solution 1731584558
No ratings yet
Mastering Pandas With 103 Practical Questions and Solution 1731584558
48 pages
Data Wrangling
No ratings yet
Data Wrangling
12 pages
Base Sas Certification Exercise
No ratings yet
Base Sas Certification Exercise
47 pages
Sunil Test
No ratings yet
Sunil Test
15 pages
Week 7
No ratings yet
Week 7
10 pages
RCommander Resultados
No ratings yet
RCommander Resultados
2 pages
TITLE: Bank Marketing Classification: Submitted To: Dr. Supriya Kumar de Professor XLRI, Jamshedpur
No ratings yet
TITLE: Bank Marketing Classification: Submitted To: Dr. Supriya Kumar de Professor XLRI, Jamshedpur
18 pages
Student Notebook HR Analysis
No ratings yet
Student Notebook HR Analysis
11 pages
Assignment3: 1) Identify Percentage of Missing Values in Each Column and Display The Same
No ratings yet
Assignment3: 1) Identify Percentage of Missing Values in Each Column and Display The Same
30 pages
Basic Commands
No ratings yet
Basic Commands
10 pages
07 HR
No ratings yet
07 HR
15 pages
Unit 5 Fully
No ratings yet
Unit 5 Fully
29 pages
SB Assignment 1 (Group 68)
No ratings yet
SB Assignment 1 (Group 68)
21 pages
Predictive+Modelling+-+Logistic+Regression+-+Student+Version-New2.3.ipynb - Colaboratory
No ratings yet
Predictive+Modelling+-+Logistic+Regression+-+Student+Version-New2.3.ipynb - Colaboratory
12 pages
Frequencies
No ratings yet
Frequencies
14 pages
R Studio Notes
No ratings yet
R Studio Notes
10 pages
Logistic Regression Assignment
No ratings yet
Logistic Regression Assignment
20 pages
Practical Test 1222678
No ratings yet
Practical Test 1222678
5 pages
Diagrama de Cajas: Arlethe Arones Rondon 26 de Junio de 2019
No ratings yet
Diagrama de Cajas: Arlethe Arones Rondon 26 de Junio de 2019
2 pages
Interview Qs - Batch 34
No ratings yet
Interview Qs - Batch 34
5 pages
R Sharing
No ratings yet
R Sharing
16 pages
R - Tutorial: Matrices Are Vectors
No ratings yet
R - Tutorial: Matrices Are Vectors
13 pages
Project3: Loading Library
No ratings yet
Project3: Loading Library
17 pages
Netcourse 101: Answers To Exercises in Lesson 3
No ratings yet
Netcourse 101: Answers To Exercises in Lesson 3
7 pages
Quiz 3
No ratings yet
Quiz 3
56 pages
Experiment Lab-II
No ratings yet
Experiment Lab-II
9 pages
Data Management II
No ratings yet
Data Management II
15 pages
XIIInfo Pract S E 435
0% (1)
XIIInfo Pract S E 435
5 pages
Lookup Functions With Practical Business Case Study
No ratings yet
Lookup Functions With Practical Business Case Study
17 pages
Experiment Lab-II
No ratings yet
Experiment Lab-II
9 pages
早年自敲代码
No ratings yet
早年自敲代码
96 pages
MBA SectionD MBA20235 PranayGupta Assignment R
No ratings yet
MBA SectionD MBA20235 PranayGupta Assignment R
16 pages
Foxpro Practical MLDC
No ratings yet
Foxpro Practical MLDC
18 pages
Data Frames Python
No ratings yet
Data Frames Python
16 pages
Thera Bank - Project
100% (4)
Thera Bank - Project
34 pages
MCPT
No ratings yet
MCPT
34 pages
Holistic Exercises - 96 holistic workout inspirations
From Everand
Holistic Exercises - 96 holistic workout inspirations
Theresia Eggers
No ratings yet
Tom Corbett: Space Cadet #4
From Everand
Tom Corbett: Space Cadet #4
Bill Spangler
No ratings yet
Customer Churn
No ratings yet
Customer Churn
296 pages
Summer - Internship - Project - Report
No ratings yet
Summer - Internship - Project - Report
47 pages
Course Code: 21PGDBA301 Course: R For Data Analytics
No ratings yet
Course Code: 21PGDBA301 Course: R For Data Analytics
2 pages
Module 4 - Data Mining
No ratings yet
Module 4 - Data Mining
13 pages
Module 3 - Data Warehousing
No ratings yet
Module 3 - Data Warehousing
6 pages
Dessler ch16
No ratings yet
Dessler ch16
50 pages
Bill of Materials For Baby Cot
No ratings yet
Bill of Materials For Baby Cot
2 pages
Apple Inc
No ratings yet
Apple Inc
8 pages

Class 3

Uploaded by

Class 3

Uploaded by

class3.

## id Names gender DOB educ Designation Level salary

## [1] "id" "Names"

## id Names gender DOB educ

## Min. : 1.00 Dr. Abdul Rahman: 1 Female:53 1963-01-13: 1

## 3rd Qu.:101.00 Dr. Cheng Wang : 1 1964-08-23: 1

## Max. :134.00 Dr. Faisal Ahmed: 1 1964-11-20: 1

## (Other) :127 (Other) :127

## Designation Level salary Last.drawn.salary PRE..EXP

## gender Designation salary

## gender Designation PRE..EXP

## [1] "gender" "Designation"

ag1 = aggregate(. ~ gender+Designation+minority,mean,data=emp_cont)

You might also like