Data Analytics Assignment 1

- The document is an assignment submission for a data analytics course containing an R code solution. - The code collects COVID-19 case data from various countries and applies time series analysis and forecasting methods like linear regression, growth rates, SIR modeling. - Visualizations created include time series plots, maps, comparing cases between countries. The code analyzes COVID-19 data from India and US in detail.

Uploaded by

RADHIKA CHANDAK

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

114 views11 pages

Data Analytics Assignment 1

Uploaded by

RADHIKA CHANDAK

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Assignment – I

BFT - 6
(Deepening Specialisation 2: Apparel Production Management)

Name of the Subject : Data Analytics & R Name : Radhika Chandak

Subject Code : BFT603DS2 Roll No. : BFT/19/21
Subject Id : 15250 Date of Submission : 27.04.2022

Assignment:
Using data collection methods and applying principles of statistics carry out the following:
Identify problems faced in industrial engineering and collect appropriate data.
Use any one of the following methods in Principle of Forecasting:
Time Series
Solution:

We start by installing a package already available for covid cases , ie covid19.analytics .

To begin , we take out the time series of the confirmed cases and then death cases .
The code will be as follows :

ag<-covid19.data(case='aggregated')

tsc<-covid19.data(case = 'ts-confirmed')

#summary
report.summary(Nentries=10 , graphical.output = F)

- We will be able to see graphs and charts on the right side under plots , upon
zooming we observe :
● We see that the range of dates is from : january 2020 to april 2022 , it is for top 10
countries .
● The pie chart and bar graph show the countries with the confirmed cases and death
cases respectively.
● While Us has the highest no. of confirmed cases , Turkey has the least .
● For death cases , the US is again the highest but France is the lowest .

TIME SERIES - CONFIRMED CASES

TIME SERIES - DEATH CASES

Time Series Worldwide TOTS ****

ts-confirmed ts-deaths ts-recovered
511748975 6228621 0
1.22% 0%
**** Time Series Worldwide AVGS ****
ts-confirmed ts-deaths ts-recovered
1801933.01 21931.76 0
1.22% 0%
**** Time Series Worldwide SDS ****
ts-confirmed ts-deaths ts-recovered
6617130.29 86526.01 0
1.31% 0%
- Then we take out the total per location for our country India and the country with
most cases , ie. , Us .

#total per location

tots.per.location(tsc, geo.loc = c('us' ,'india'))

So under running model we get the linear regression model .

● On the top we can see no. of cases in the log scale and x axis represent no. of days
. Each line of the plot represents the linear regression model . The plot has the
cumulative values and we can see the concave pattern , that is the increasing trend
and then the small concave pattern showing decrease in trend .
● At the bottom we have a bar chart and the values are in the log scale for y axis .
Similarly , we also get it for Us .

LINEAR REGRESSION MODEL - India and Us

- Now to see the Growth Rate of specific countries we can type (For India here )

#growth rate
growth.rate(tsc, geo.loc = 'india')

We can see that we get 2 plots , on the top , y has 2 axis ,one in regular and other in log
scale , what we can observe from here is that during the second lockdown the cases were
increasing more rapidly than before the first lockdown .
At the bottom we have the growth rate as a part of log scale .
- Now let us extract one more time series data , for all the cases and we save it into
tsa - the name of dataframe.

tsa<-covid19.data(case = 'ts-ALL')

And then using

#TOTALS PLOT
totals.plt(tsa)

We can create interactive data for time series cases .

In the linear graph and log graph , we can see that there are around 511.79 million confirmed
cases and 505.520 million active cases ,and so on .
- To see the different Covid cases across the globe we can use the function of live.map
with the dataframe tsa .

#live map
live.map(tsa)

By clicking on the viewer and scrolling on the particular countries we can see the no. of
cases .
- One of the model that is popular among the researchers working on covid 19 data is
called as SIR model . This groups the people into 3 categories , in the first category
we have

● S-people who are healthy but susceptible to the disease .

● I- people who are infected
● R- people who are recovered

We use the function called generate sir model :

#sir model
generate.SIR.model(tsc, 'india',tot.population = 1383000000)

So on the top we have two plots ,

● On the left we have yn axis which represents no. of infected people in the regular
scale and x axis represents no. of days for the first 25 days and the plot is created .
● On the right , the y axis represents no. Of infected people in the log scale and x axis
represents no. of days for the first 25 days .
● In the bottom we have no. of subjects in the log scale . The 3 different lines are
different linear models. Blue shows people susceptible , red shows infected and
green shows recovered people .
● We can observe that from 0 to day 90 approx the no. of people getting infected
reaches to peak and no. of people recovered also reaches to peak .
This is a screenshot of the coding .

Analysing Covid-19 Data Using R
No ratings yet
Analysing Covid-19 Data Using R
13 pages
Week12 Slides
No ratings yet
Week12 Slides
46 pages
DataQuest---Project
No ratings yet
DataQuest---Project
4 pages
Jupyter Notebook2
No ratings yet
Jupyter Notebook2
15 pages
Pyr Agossou FR
No ratings yet
Pyr Agossou FR
12 pages
A Data-Driven Hybrid Ensemble AI Model For COVID-19 Infection Forecast Using Multiple Neural Networks and Reinforced Learning
No ratings yet
A Data-Driven Hybrid Ensemble AI Model For COVID-19 Infection Forecast Using Multiple Neural Networks and Reinforced Learning
10 pages
3 Solution Modelling An Infected Cohort
No ratings yet
3 Solution Modelling An Infected Cohort
9 pages
Introduction R for DS
No ratings yet
Introduction R for DS
9 pages
Week 10
No ratings yet
Week 10
15 pages
R_training_AM
No ratings yet
R_training_AM
6 pages
My P Report
No ratings yet
My P Report
14 pages
Covid19 Visualization
No ratings yet
Covid19 Visualization
2 pages
Python Pandas Project
No ratings yet
Python Pandas Project
17 pages
IP Project Covid-19 Impact
No ratings yet
IP Project Covid-19 Impact
22 pages
Chapter 6 - Build Exponential Functions and Models PDF
No ratings yet
Chapter 6 - Build Exponential Functions and Models PDF
6 pages
COVID-19-Data-Analysis-Using-Python
No ratings yet
COVID-19-Data-Analysis-Using-Python
10 pages
A Bayesian - Deep Learning Model For Estimating Covid-19 Evolution in Spain
No ratings yet
A Bayesian - Deep Learning Model For Estimating Covid-19 Evolution in Spain
22 pages
Analysis and Prediction of COVID-19 For Different Regions and Countries Methods
No ratings yet
Analysis and Prediction of COVID-19 For Different Regions and Countries Methods
6 pages
Modelling COVID-19 Spatio-Temporal Spread Using Bayesian Nonparametric Covariance Regresssion
No ratings yet
Modelling COVID-19 Spatio-Temporal Spread Using Bayesian Nonparametric Covariance Regresssion
15 pages
COVID
No ratings yet
COVID
19 pages
Eda 21524785
No ratings yet
Eda 21524785
32 pages
report_MSA_Practice02
No ratings yet
report_MSA_Practice02
29 pages
ip project file for class 12
No ratings yet
ip project file for class 12
25 pages
Data Analysis Report Team 5
No ratings yet
Data Analysis Report Team 5
15 pages
Project File -A
No ratings yet
Project File -A
20 pages
Assignment Sujith S
No ratings yet
Assignment Sujith S
13 pages
Análisis de Propagación Del Coronavirus: Angel Villamizar
No ratings yet
Análisis de Propagación Del Coronavirus: Angel Villamizar
16 pages
Corona Virus in India
No ratings yet
Corona Virus in India
29 pages
Sample
No ratings yet
Sample
16 pages
CV0003
No ratings yet
CV0003
43 pages
Report - Data Visualization and Exploration
No ratings yet
Report - Data Visualization and Exploration
14 pages
Data Analytics_Activity 1
No ratings yet
Data Analytics_Activity 1
2 pages
COVID-19 Data Analysis With Pandas and NumPy
No ratings yet
COVID-19 Data Analysis With Pandas and NumPy
5 pages
COMP2501 - Assignment - 1 - Questions - RMD 2
No ratings yet
COMP2501 - Assignment - 1 - Questions - RMD 2
7 pages
r.jeevitha
No ratings yet
r.jeevitha
16 pages
Corona Virus Analysis
No ratings yet
Corona Virus Analysis
27 pages
I.p Project
No ratings yet
I.p Project
24 pages
Informatics Practices Project 12 New
No ratings yet
Informatics Practices Project 12 New
31 pages
Spatial Disparities in COVID-19 Vaccination Coverage in Bangladesh 8july21
No ratings yet
Spatial Disparities in COVID-19 Vaccination Coverage in Bangladesh 8july21
34 pages
Co Vids QL Present N 0710
No ratings yet
Co Vids QL Present N 0710
27 pages
covid data report
No ratings yet
covid data report
21 pages
Package COVID19': January 6, 2021
No ratings yet
Package COVID19': January 6, 2021
6 pages
Covid-19 in Germany: A Case Study: Abrar Ahmed
No ratings yet
Covid-19 in Germany: A Case Study: Abrar Ahmed
44 pages
Covid Data For Pbi Dashboard
No ratings yet
Covid Data For Pbi Dashboard
2 pages
Worksheet 2.5 HW 11 - Descriptive Stat Practice
No ratings yet
Worksheet 2.5 HW 11 - Descriptive Stat Practice
1 page
Computer Science Ip
No ratings yet
Computer Science Ip
16 pages
COVID 19 Some Challenges Some Data 1
No ratings yet
COVID 19 Some Challenges Some Data 1
26 pages
IP Project Covid-19 Impact
No ratings yet
IP Project Covid-19 Impact
25 pages
Regression Analys
No ratings yet
Regression Analys
7 pages
Ashutosh Project
No ratings yet
Ashutosh Project
19 pages
Machine Learning and OLAP On Big COVID-19 Data
No ratings yet
Machine Learning and OLAP On Big COVID-19 Data
10 pages
Covid Report PDF
No ratings yet
Covid Report PDF
17 pages
Maheswari Public School Kalwar Road: Project File Session 2023-24
No ratings yet
Maheswari Public School Kalwar Road: Project File Session 2023-24
28 pages
COVID-19 India Data Analysis
No ratings yet
COVID-19 India Data Analysis
23 pages
Covid 19 India Dashboard Using Python and Voila
No ratings yet
Covid 19 India Dashboard Using Python and Voila
6 pages
Name
No ratings yet
Name
23 pages
A Comparison of Time Series Models To Predict COVID-19 Cases
No ratings yet
A Comparison of Time Series Models To Predict COVID-19 Cases
31 pages
Name
No ratings yet
Name
23 pages
Syadatajveez
No ratings yet
Syadatajveez
21 pages
Visualizing COVID-19 Data Beautifully in Python (In 5 Minutes or Less!!) - by Nik Piepenbreier - Towards Data Science
No ratings yet
Visualizing COVID-19 Data Beautifully in Python (In 5 Minutes or Less!!) - by Nik Piepenbreier - Towards Data Science
8 pages
(Ebook) Real Stats: Using Econometrics for Political Science and Public Policy by Bailey, Michael A. ISBN 9780199981946, 0199981949 pdf download
No ratings yet
(Ebook) Real Stats: Using Econometrics for Political Science and Public Policy by Bailey, Michael A. ISBN 9780199981946, 0199981949 pdf download
48 pages
Start Predicting In A World Of Data Science And Predictive Analysis
From Everand
Start Predicting In A World Of Data Science And Predictive Analysis
Matthew Abbitt
No ratings yet
Student Solutions Manual to Introductory Econometrics 2nd edition Edition Jeffrey M. Wooldridge instant download
No ratings yet
Student Solutions Manual to Introductory Econometrics 2nd edition Edition Jeffrey M. Wooldridge instant download
53 pages
Unit-I (Ensemble Learning)
No ratings yet
Unit-I (Ensemble Learning)
67 pages
ES031 M3 HypothesisTestingSingleSample
No ratings yet
ES031 M3 HypothesisTestingSingleSample
55 pages
Rose & Bliemer 2009 - Constructing Efficient Stated Choice Experimental Designs
No ratings yet
Rose & Bliemer 2009 - Constructing Efficient Stated Choice Experimental Designs
32 pages
Simu Final Note 2
No ratings yet
Simu Final Note 2
17 pages
HASTS211_W8Y23
No ratings yet
HASTS211_W8Y23
2 pages
Module 4
No ratings yet
Module 4
35 pages
Stats Tools Package OLD
No ratings yet
Stats Tools Package OLD
39 pages
MRE Proposal
No ratings yet
MRE Proposal
16 pages
Adoption of Improved Soybean Seeds in GH
No ratings yet
Adoption of Improved Soybean Seeds in GH
6 pages
Confidence Interval
100% (3)
Confidence Interval
27 pages
Econometrics Board Questions
No ratings yet
Econometrics Board Questions
13 pages
Review For Final Exam
No ratings yet
Review For Final Exam
4 pages
Interval Estimation in The Classical Normal Linear Regression Model
No ratings yet
Interval Estimation in The Classical Normal Linear Regression Model
16 pages
Chapter 05 - Multicollinearity
100% (1)
Chapter 05 - Multicollinearity
26 pages
Analisis Pengaruh Kualitas Pelayanan Terhadap Kepuasan Pelanggan Pengguna Kartu
No ratings yet
Analisis Pengaruh Kualitas Pelayanan Terhadap Kepuasan Pelanggan Pengguna Kartu
15 pages
Naive Bayes Project
No ratings yet
Naive Bayes Project
5 pages
Cara Membaca Hasil Regresi
No ratings yet
Cara Membaca Hasil Regresi
17 pages
Manecon Module 3 Notes
No ratings yet
Manecon Module 3 Notes
5 pages
Foundations of Data Science
No ratings yet
Foundations of Data Science
4 pages
2nd Quarter Loa Lumampongschool-Form-jhs
No ratings yet
2nd Quarter Loa Lumampongschool-Form-jhs
13 pages
UMUR Kekurangan Energi Kronis Crosstabulation
No ratings yet
UMUR Kekurangan Energi Kronis Crosstabulation
9 pages
Part 2 Project-Basic Inferential Stat
No ratings yet
Part 2 Project-Basic Inferential Stat
6 pages
Univariate and Bivariate Data Analysis + Probability
100% (1)
Univariate and Bivariate Data Analysis + Probability
5 pages
Finding SDs
No ratings yet
Finding SDs
5 pages
Uji Univariat
No ratings yet
Uji Univariat
2 pages
Stat&Proba Midterm Exam
No ratings yet
Stat&Proba Midterm Exam
4 pages
Week3 Assignment
No ratings yet
Week3 Assignment
6 pages