0% found this document useful (0 votes)

20 views4 pages

Assignment - 1: Data Analytics and R

The document discusses analyzing a health record dataset of 30 people using R. It includes importing the Excel dataset, descriptive analyses like calculating ranges, means, modes and other statistics on variables like miles per day, calories burned, and more. It also analyzes distributions by age, gender, state, and vaccination status.

Uploaded by

Lakshita Saini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views4 pages

Assignment - 1: Data Analytics and R

Uploaded by

Lakshita Saini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

DATA ANALYTICS AND R

ASSIGNMENT -1
SUBMITTED BY : LAKSHITA SAINI

TABLE
Dataset of Health record of 30 people distributed.

IMPORTING DATA

DATA TYPE - EXCEL SHEET

#import Excel file into R

#Creation of table and importing the data

data <- read_excel('C:\\Users\\Admin\\Desktop\\NIFT\\Year - 3\\Sem 6\\Data

Analytics & R\\HealthRecord.xlsx')
DESCRIPTIVE ANALYSIS OF PROBLEM STATEMENTS

#1 - What is the range (maximum and minimum),mean and median of the

miles covered per day?

To study what is the maximum and minimum miles covered by the people who
participated in the study and to also find out the average miles covered by
them and the central value, representing the situation of the majority of the
group.

summary_mpd = summary(data$miles_per_day)

print(summary_mpd)

r = range(data$miles_per_day)

print(r)

#2 - What is the mode of calories burned?

The mode is calculated to understand around which value ‘calories burned’ are
the most frequent.

install.packages("modeest")

library(modeest)

mode = mfv(data$calories_burned)

print(mode)

#three - What is the variance of workout duration?

Variance is calculated to understand the variability of the data of workout

duration from the mean value of workout duration.

var(data$workout_duration)
#four - What is the standard deviation of health score?

The value of standard deviation will help us to understand the data values of
the health score dispersed around the mean, telling us about the health of the
group.

std = sd(data$health_score)

print(std)

#five - How many people participated under each age?

The count function will help us study the distribution of our group across
various age groups.

library(dplyr)

data %>% count(age)

#six - How much percentage of people are vaccinated and how many are not?

Prop.table function helps us to study a very important factor about the

vaccination status as it contributes majorly to the health status of an
individual.

v1 <- prop.table(table(data$vaccination_status))

print(v1*100)
#seven - What is the distribution of candidates who participated among states?

The count function will help us study the distribution of our group across
various states.

data %>% count(state)

#eight - Who has a better health score, males or females?

The mean score calculation on the basis of the gender which helps us to
understand which age group has a better health status.

install.packages("xlsx")

library("xlsx")

excel_path <- "C:\\Users\\Admin\\Desktop\\NIFT\\Year - 3\\Sem 6\\Data

Analytics & R\\HealthRecord.xlsx"

gender <- read.xlsx(excel_path, sheetName = "Example", colIndex = 4)

score <- read.xlsx(excel_path, sheetName = "Example", colIndex = 8)

df <- data.frame(gender, score)

df %>%

group_by(gender) %>%

summarise_at(vars(health_score), list(mean_score = mean))

Computer Hardware With Images
97% (32)
Computer Hardware With Images
41 pages
2022 PUMA Handbook-OHS
No ratings yet
2022 PUMA Handbook-OHS
74 pages
Draft Petition For Legal Beneficiary
67% (3)
Draft Petition For Legal Beneficiary
4 pages
Concrete Shear Wall With Complete Details Ram Concept
100% (1)
Concrete Shear Wall With Complete Details Ram Concept
108 pages
Introduction To Animal Science An Sci 1
No ratings yet
Introduction To Animal Science An Sci 1
70 pages
Blake and Mouton Leadership Grid
No ratings yet
Blake and Mouton Leadership Grid
2 pages
Scorecard
No ratings yet
Scorecard
32 pages
Medical Fitness
No ratings yet
Medical Fitness
1 page
Bookshops in Hay-on-Wye
No ratings yet
Bookshops in Hay-on-Wye
4 pages
Class Ii Annual Syllabus 2024-2025
No ratings yet
Class Ii Annual Syllabus 2024-2025
11 pages
Accounting For Share Issuance
No ratings yet
Accounting For Share Issuance
19 pages
Lab 5
0% (1)
Lab 5
5 pages
Advanced R Data Analysis Training PDF
No ratings yet
Advanced R Data Analysis Training PDF
72 pages
WQD2012 Presentaton QualityQuizQuestionsAnswers
No ratings yet
WQD2012 Presentaton QualityQuizQuestionsAnswers
78 pages
CHEM2 Long Quiz 2
No ratings yet
CHEM2 Long Quiz 2
4 pages
List of Programs in R 2 Sem
No ratings yet
List of Programs in R 2 Sem
48 pages
The Puma Forever Better Sustainability Handbooks: Social Standards
No ratings yet
The Puma Forever Better Sustainability Handbooks: Social Standards
55 pages
FBR & IT Applications: Compiled and Presented by DR - Deepak Joshi For Academic Use Only
No ratings yet
FBR & IT Applications: Compiled and Presented by DR - Deepak Joshi For Academic Use Only
77 pages
CS 3362 FDS
No ratings yet
CS 3362 FDS
53 pages
ML Proj Diabetes
No ratings yet
ML Proj Diabetes
51 pages
Diliman Preparatory School 3 Quarter Exam. (Reviewer) Language 6 Name: - Date
No ratings yet
Diliman Preparatory School 3 Quarter Exam. (Reviewer) Language 6 Name: - Date
2 pages
CS ELEC 4 Midterm Module
No ratings yet
CS ELEC 4 Midterm Module
59 pages
AMDA Practical - A048
No ratings yet
AMDA Practical - A048
35 pages
BCCM - Session 21 - Crisis of Confrontation, Malevolence
No ratings yet
BCCM - Session 21 - Crisis of Confrontation, Malevolence
46 pages
BE184
No ratings yet
BE184
47 pages
08 Numerical Summary Measures in R
No ratings yet
08 Numerical Summary Measures in R
34 pages
614 Descriptive Statistcs
No ratings yet
614 Descriptive Statistcs
56 pages
Introduction of EpiInfo
No ratings yet
Introduction of EpiInfo
38 pages
Project Cardio Good Fitness
No ratings yet
Project Cardio Good Fitness
29 pages
Healthcare Analytics
No ratings yet
Healthcare Analytics
72 pages
Contructivism Art
No ratings yet
Contructivism Art
18 pages
R Programming
No ratings yet
R Programming
47 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
26 pages
Cost Practical
No ratings yet
Cost Practical
13 pages
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
No ratings yet
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
15 pages
R1 Uptovisualisation
No ratings yet
R1 Uptovisualisation
122 pages
R Programmimg Practical Journal All-1
No ratings yet
R Programmimg Practical Journal All-1
25 pages
R Practicals
No ratings yet
R Practicals
32 pages
Midterm Project Group 6
No ratings yet
Midterm Project Group 6
41 pages
Necromancer Reference Card
No ratings yet
Necromancer Reference Card
1 page
Ap 2025
No ratings yet
Ap 2025
90 pages
ADS Phase4
No ratings yet
ADS Phase4
19 pages
AQM2 - Assign 1 - Jaishree, Lakshita, Smriti
No ratings yet
AQM2 - Assign 1 - Jaishree, Lakshita, Smriti
12 pages
Data Science Fundamentals
No ratings yet
Data Science Fundamentals
22 pages
Q3 - Stat2100 Dupol Melkiancaesar
No ratings yet
Q3 - Stat2100 Dupol Melkiancaesar
12 pages
United States v. Darrin Joseph Hoffman, 11th Cir. (2013)
No ratings yet
United States v. Darrin Joseph Hoffman, 11th Cir. (2013)
12 pages
IntroR 2
No ratings yet
IntroR 2
18 pages
q3 Stat2100 Bautista-Lhuriely
No ratings yet
q3 Stat2100 Bautista-Lhuriely
11 pages
Group 5 - Applied Statistics and Experimental 152611
No ratings yet
Group 5 - Applied Statistics and Experimental 152611
28 pages
1.biostatistics Introduction
No ratings yet
1.biostatistics Introduction
72 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
11 pages
Lab 2
No ratings yet
Lab 2
22 pages
Propaganda Techniques - A Long List: An0320X Language and Power A Mcmichael
No ratings yet
Propaganda Techniques - A Long List: An0320X Language and Power A Mcmichael
7 pages
The Ultimate Guide: How To Pitch Magazines and Blogs: Before You Pitch: Build A Media List
No ratings yet
The Ultimate Guide: How To Pitch Magazines and Blogs: Before You Pitch: Build A Media List
6 pages
Report MSA Practice02
No ratings yet
Report MSA Practice02
29 pages
NUR 301 Biostattistics DL CAT
No ratings yet
NUR 301 Biostattistics DL CAT
8 pages
Elements of Poetry
No ratings yet
Elements of Poetry
19 pages
Understanding Contemporary India Sumit Ganguly Editor Neil Devotta Editor PDF Download
No ratings yet
Understanding Contemporary India Sumit Ganguly Editor Neil Devotta Editor PDF Download
88 pages
STAT501 Online - HW2R - Spring2024
No ratings yet
STAT501 Online - HW2R - Spring2024
7 pages
Exercises
No ratings yet
Exercises
11 pages
Phonemic Orthography
No ratings yet
Phonemic Orthography
9 pages
FDS Lab Question Bank
No ratings yet
FDS Lab Question Bank
11 pages
Tutorial 5 - Calculating Mean, Standard Deviation, Frequencies
No ratings yet
Tutorial 5 - Calculating Mean, Standard Deviation, Frequencies
6 pages
Phase 2
No ratings yet
Phase 2
6 pages
R Course
No ratings yet
R Course
7 pages
Notes
No ratings yet
Notes
6 pages
Experiment Lab-II
No ratings yet
Experiment Lab-II
9 pages
CFA L-2 Performance Tracker '23
No ratings yet
CFA L-2 Performance Tracker '23
11 pages
STATISTICS
No ratings yet
STATISTICS
6 pages
Programming For Data Analytics
No ratings yet
Programming For Data Analytics
27 pages
Healthcare-Project-Simplilearn - Week2
No ratings yet
Healthcare-Project-Simplilearn - Week2
8 pages
Pima Tutorial
No ratings yet
Pima Tutorial
8 pages
Animal Farm Analysis 3
No ratings yet
Animal Farm Analysis 3
5 pages
Lab 1 Introduction To Data
No ratings yet
Lab 1 Introduction To Data
11 pages
Indonesia and Malay Archipelago
No ratings yet
Indonesia and Malay Archipelago
5 pages
Explanationdocx
No ratings yet
Explanationdocx
9 pages
Statistic and R Programming Lab Exercise
No ratings yet
Statistic and R Programming Lab Exercise
8 pages
Question No1
No ratings yet
Question No1
6 pages
R Studio Question and Answers
No ratings yet
R Studio Question and Answers
6 pages
Dermatome Maps
No ratings yet
Dermatome Maps
8 pages
F24 Lab-01
No ratings yet
F24 Lab-01
4 pages
L3 Notes-1
No ratings yet
L3 Notes-1
8 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Simboluri Flowchart
No ratings yet
Simboluri Flowchart
6 pages
L 2what Is Inclusive History
No ratings yet
L 2what Is Inclusive History
2 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
PUMA - Response - Aspi - ENG
No ratings yet
PUMA - Response - Aspi - ENG
2 pages
Data Scientist /data Analyst - Fresher Resume
No ratings yet
Data Scientist /data Analyst - Fresher Resume
2 pages
The United Theological College, Bengaluru: TH TH RD
No ratings yet
The United Theological College, Bengaluru: TH TH RD
2 pages
Global Marketing
No ratings yet
Global Marketing
1 page
Assignment1 6
No ratings yet
Assignment1 6
5 pages
Grade - 7
No ratings yet
Grade - 7
1 page
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet

Assignment - 1: Data Analytics and R

Uploaded by

Assignment - 1: Data Analytics and R

Uploaded by

DATA ANALYTICS AND R

DATA TYPE - EXCEL SHEET

#import Excel file into R

#Creation of table and importing the data

data <- read_excel('C:\\Users\\Admin\\Desktop\\NIFT\\Year - 3\\Sem 6\\Data

#1 - What is the range (maximum and minimum),mean and median of the

#2 - What is the mode of calories burned?

#three - What is the variance of workout duration?

Variance is calculated to understand the variability of the data of workout

#five - How many people participated under each age?

data %>% count(age)

Prop.table function helps us to study a very important factor about the

data %>% count(state)

#eight - Who has a better health score, males or females?

excel_path <- "C:\\Users\\Admin\\Desktop\\NIFT\\Year - 3\\Sem 6\\Data

gender <- read.xlsx(excel_path, sheetName = "Example", colIndex = 4)

score <- read.xlsx(excel_path, sheetName = "Example", colIndex = 8)

df <- data.frame(gender, score)

summarise_at(vars(health_score), list(mean_score = mean))

You might also like