Time Series Analysis in R A Beginner's Guide
SRINIVAS. S
The data used in this tutorial is from the article “Statistical methods for predicting tuberculosis
incidence based on data from Guangxi, China” [3]. It is an open access dataset containing
monthly TB incidence in Guangxi from January 2012 to June 2019.
The first step is “Description”. When you are presented with time series data, the first step is to
plot the data and observe the components of the time series, such as trend, seasonality and cycles.
library(readxl)
Data = read_xlsx("C:\\Users\\WELCOME\\Downloads\\TB time series data.xlsx")
head(Data)
## # A tibble: 6 × 2
## Time TB
## <dttm> <dbl>
## 1 2012-01-31 00:00:00 13.0
## 2 2012-02-29 00:00:00 17.1
## 3 2012-03-31 00:00:00 20.2
## 4 2012-04-30 00:00:00 18.2
## 5 2012-05-31 00:00:00 18.3
## 6 2012-06-30 00:00:00 17.4
We have to convert the data into a time series object for further analysis.
min_date = min(Data$Time) #Start date of the time series
min_date
## [1] "2012-01-31 UTC"
Data.ts = ts(Data$TB, start = c(2012, 1), frequency = 12)
class(Data.ts)
## [1] "ts"
The frequency is set to “12” because the observations (TB incidence) are taken monthly; if the
data were taken yearly, the frequency would be fixed at “1”.
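As a self-contained illustration of the conversion, here is a minimal sketch using a plain numeric vector (the first six values match the TB data shown above; the rest are made up for the example):

```r
# Simulated monthly values standing in for the TB incidence column
values <- c(13.0, 17.1, 20.2, 18.2, 18.3, 17.4,
            16.0, 15.2, 14.8, 14.1, 13.5, 13.9)

# start = c(2012, 1) means January 2012; frequency = 12 means monthly data
x <- ts(values, start = c(2012, 1), frequency = 12)

frequency(x)  # 12
start(x)      # 2012 1
class(x)      # "ts"
```

The `start` argument is a (year, period) pair, so quarterly data starting in Q3 2012 would use `start = c(2012, 3), frequency = 4`.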
Now the data is saved as a time series object, and we can decompose and plot it to observe its
components.
library(tseries)
library(forecast)
Decomposition = decompose(Data.ts)
plot(Decomposition)
The above figure depicts what is referred to as classical decomposition, in which a time series is
conceived of as comprising three components: a trend-cycle, a seasonal pattern and a random
component (here the trend and cycle are combined because the duration of the cycle is unknown).
From the trend panel we can see that there is a downward trend, and from the seasonal panel we
can see recurring seasonal patterns (which indicate seasonality). This decomposition of a time
series leads to the second step, called “Explanation”.
Stationarity is one of the primary assumptions of time series analysis. Most time series models,
like ARIMA, SARIMA, etc., assume that the time series is stationary. Broadly speaking, a time
series is said to be stationary if there is no change in mean (no trend), no systematic change in
variance, and its behaviour is independent of time (no seasonality).
Let’s see how to check whether the time series is stationary or not.
• ACF refers to “Auto Correlation Function” and PACF refers to “Partial Auto Correlation
Function”. The horizontal blue lines in the plot mark the 95% Confidence Interval (CI).
• The vertical line at lag 0 indicates the correlation of the present value with itself, so we
can see that the correlation is equal to 1. With respect to stationarity, the time series is
said to be stationary if all the vertical lines fall inside the CI; if vertical lines fall
outside the CI, then the series is not stationary.
• The ACF and PACF are further used to find the order of the ARIMA model, as we will see later.
#Plot the ACF plot
acf(Data.ts)
pacf(Data.ts)
From the ACF plot we can see that the vertical lines cross the blue horizontal line, which
indicates high correlation between the present value and its lagged versions. Therefore, the
time series is not stationary.
The second way is to conduct the Augmented Dickey-Fuller (ADF) test or the Kwiatkowski-
Phillips-Schmidt-Shin (KPSS) test.
ADF test:
Null Hypothesis (H0): The time series has a unit root (i.e., it is non-stationary).
Alternative Hypothesis (H1): The time series does not have a unit root (i.e., it is stationary).
If the p-value is less than 0.05, then the time series is stationary.
KPSS test:
Null Hypothesis (H0): The time series is stationary (either around a level or a trend).
Alternative Hypothesis (H1): The time series is not stationary.
If the p-value is less than 0.05, then the time series is not stationary.
#ADF test
adf.test(Data.ts, k=15)
## data: Data.ts
## Dickey-Fuller = -2.5906, Lag order = 15, p-value = 0.333
## alternative hypothesis: stationary
#KPSS test
kpss.test(Data.ts)
## Warning in kpss.test(Data.ts): p-value smaller than printed p-value
##
## KPSS Test for Level Stationarity
##
## data: Data.ts
## KPSS Level = 1.5344, Truncation lag parameter = 3, p-value = 0.01
From the above results of both the ADF test and KPSS test we can confirm that the time series is
not stationary and it requires transformation (differencing). These tests are available in the
package “tseries” in R [4].
Differencing is the most important method of stationarizing the mean of a time series. It can
remove any trend in the series which is not of interest. There are two types of differencing: trend
differencing and seasonal differencing. As the names suggest, if the time series shows a
significant trend we apply trend differencing; if the time series shows seasonal patterns, we
apply seasonal differencing; and if the time series shows both trend and seasonality, we apply
both kinds of differencing [2]. First-order differencing refers to subtracting the previous
observation from the current observation, and second-order differencing refers to differencing
the already differenced series (applying first-order differencing twice), and so on.
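A minimal sketch of both kinds of differencing using base R's `diff()` (the toy values are made up; the `lag = 12` call assumes monthly data with annual seasonality):

```r
# A short toy series with an upward trend
x <- c(10, 12, 15, 19, 24, 30)

# First-order (trend) differencing: x[t] - x[t-1]
d1 <- diff(x)
d1  # 2 3 4 5 6

# Second-order differencing: first-order differencing applied twice
d2 <- diff(x, differences = 2)
d2  # 1 1 1 1

# Seasonal differencing for monthly data: x[t] - x[t-12]
# ds <- diff(x, lag = 12)  # needs more than 12 observations
```

Note that each round of differencing shortens the series by one (or by `lag`) observation.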
The third step in time series analysis is “Prediction”. The primary aim of any time series
analysis is to predict future values based on the observed values. For this purpose,
we are going to fit an ARIMA or SARIMA model to the time series object.
An ARIMA (p, d, q) model predicts the future values of a time series as a linear combination of its
past values and a series of errors. This method is suitable for forecasting when the data is
univariate, whether stationary or non-stationary. The parameter “p” in the model refers to the order of
the auto-regressive part, “d” refers to the order of differencing and “q” refers to the order of the
moving average part [5].
The SARIMA (Seasonal Auto-Regressive Integrated Moving Average) model, also known as
the Seasonal ARIMA model, extends the ARIMA model to handle seasonality in time series
data. SARIMA models combine non-seasonal and seasonal components. It is denoted as
SARIMA (p, d, q)(P, D, Q)s. The parameters “p”, “d” and “q” are the non-seasonal orders as in
ARIMA. The parameter “P” refers to the order of the auto-regressive part with respect to the
seasonal period, “D” refers to the order of differencing applied with respect to the seasonal
period and “Q” refers to the order of the moving average part with respect to the seasonal period.
Lastly, “s” refers to the length of the seasonal cycle (e.g. 12 for monthly data with annual
seasonality) [6].
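To make the notation concrete, here is a hedged sketch fitting a SARIMA(0,1,1)(0,1,1)[12] model with base R's `stats::arima()` on the built-in monthly `AirPassengers` series; the orders chosen here are purely illustrative, not tuned for the TB data:

```r
# AirPassengers: built-in monthly series with trend and annual seasonality.
# Log-transform to stabilise the growing seasonal variance.
fit <- arima(log(AirPassengers),
             order    = c(0, 1, 1),                            # (p, d, q)
             seasonal = list(order = c(0, 1, 1), period = 12)) # (P, D, Q)[s]

coef(fit)  # estimated ma1 and sma1 coefficients
AIC(fit)   # information criterion for model comparison (lower is better)
```

Here d = 1 and D = 1 mean the series is differenced once at lag 1 (trend) and once at lag 12 (season) before the MA terms are fitted.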
The package “forecast” in R provides a useful function called “auto.arima” which
automatically fits an ARIMA or SARIMA model to a time series [7]. The function selects the best
ARIMA model based on information criteria like AIC (Akaike Information Criterion) or BIC
(Bayesian Information Criterion).
#Fitting the suitable ARIMA model for the observed time series
Prediction_model = auto.arima(Data.ts, trace = TRUE)
##
## ARIMA(2,0,2)(1,1,1)[12] with drift : Inf
## ARIMA(0,0,0)(0,1,0)[12] with drift : 242.998
## ARIMA(1,0,0)(1,1,0)[12] with drift : 230.2584
## ARIMA(0,0,1)(0,1,1)[12] with drift : 221.2812
## ARIMA(0,0,0)(0,1,0)[12] : 271.8778
## ARIMA(0,0,1)(0,1,0)[12] with drift : 238.6041
## ARIMA(0,0,1)(1,1,1)[12] with drift : Inf
## ARIMA(0,0,1)(0,1,2)[12] with drift : Inf
## ARIMA(0,0,1)(1,1,0)[12] with drift : 230.2804
## ARIMA(0,0,1)(1,1,2)[12] with drift : Inf
## ARIMA(0,0,0)(0,1,1)[12] with drift : 222.7067
## ARIMA(1,0,1)(0,1,1)[12] with drift : Inf
## ARIMA(0,0,2)(0,1,1)[12] with drift : Inf
## ARIMA(1,0,0)(0,1,1)[12] with drift : 221.4776
## ARIMA(1,0,2)(0,1,1)[12] with drift : Inf
## ARIMA(0,0,1)(0,1,1)[12] : 256.612
##
## Best model: ARIMA(0,0,1)(0,1,1)[12] with drift
We have fitted ARIMA models with different combinations of parameters and found the best
model based on the Akaike Information Criterion (AIC); the lower the value, the better the
model. The ARIMA model (actually a SARIMA) with the combination (0,0,1)(0,1,1)[12] is the
best-fit model for the given time series.
#Summary of the best fit model
summary(Prediction_model)
## Series: Data.ts
## ARIMA(0,0,1)(0,1,1)[12] with drift
##
## Coefficients:
## ma1 sma1 drift
## 0.2482 -0.8834 -0.0600
## s.e. 0.1443 0.3411 0.0043
##
## sigma^2 = 0.7619: log likelihood = -106.64
## AIC=221.28 AICc=221.83 BIC=230.71
##
## Training set error measures:
## ME RMSE MAE MPE MAPE MASE
## Training set -0.0327287 0.7968 0.5878878 -0.3110781 4.415815 0.5231631
## ACF1
## Training set 3.835598e-05
Before using this model for prediction, we have to check whether its residuals behave like
white noise (no remaining autocorrelation) using the ACF and PACF plots.
acf(ts(Prediction_model$residuals))
pacf(ts(Prediction_model$residuals))
From the above ACF and PACF plots we can infer that the vertical lines at all lags fall
inside the horizontal boundary, which indicates that no autocorrelation remains in the
residuals: the seasonal differencing within the SARIMA model has made the series stationary.
So now we can predict future values based on the observed values.
"level" indicates the confidence interval required (95% in this case) and "h" indicates for how
many times points you need predictions (here 10 refers to next 10 months)
From the results we can infer the prediction of next 10 months for the TB incidence in Guangxi.
plot(Forecast_future)
The Box-Jenkins Q test (the Box-Pierce test, or its refined form, the Ljung-Box test) is used
to assess the goodness-of-fit of a time series model by testing whether the residuals from the
model resemble white noise [8].
Essentially, it checks whether there is any significant autocorrelation left in the residuals after
fitting the model, which would suggest that the model might not fully capture the underlying
structure of the data.
Null Hypothesis (H0): The residuals are white noise (i.e., they have no autocorrelation).
Alternative Hypothesis (H1): The residuals are not white noise (i.e., they exhibit
autocorrelation).
#Ljung-Box test for the goodness of fit of the model
Box.test(Forecast_future$residuals, type="Ljung-Box")
##
## Box-Ljung test
##
## data: Forecast_future$residuals
## X-squared = 1.3687e-07, df = 1, p-value = 0.9997
From the results of the Ljung-Box test, we can infer that the p-value is greater than 0.05.
Therefore, the given model is a good fit.
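One caveat: `Box.test()` defaults to `lag = 1`, which tests only the first residual autocorrelation. For monthly models it is common to test a full seasonal cycle of lags and to subtract the number of estimated ARMA coefficients via `fitdf`. A small sketch on simulated residuals (the `lag` and `fitdf` values here are illustrative, not taken from the tutorial):

```r
set.seed(42)
resid_sim <- rnorm(90)  # simulated residuals standing in for model residuals

# Jointly test the first 12 residual autocorrelations; fitdf is the number of
# estimated ARMA coefficients (e.g. 2 for a model with ma1 + sma1 terms)
result <- Box.test(resid_sim, lag = 12, fitdf = 2, type = "Ljung-Box")
result$p.value    # a large p-value means residuals are consistent with white noise
result$parameter  # degrees of freedom = lag - fitdf = 10
```

As before, a p-value above 0.05 means we do not reject the white-noise hypothesis.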
Conclusion:
• The results suggest that TB incidence will experience a slight decrease, and its changing
trend will be similar to before.
• The prediction results can help in reallocating resources for better control and
prevention of TB in Guangxi, China.
References:
1. Jebb AT, Tay L, Wang W, Huang Q. Time series analysis for psychological research:
examining and forecasting change. Front Psychol. 2015 Jun 9;6:727.
3. Statistical methods for predicting tuberculosis incidence based on data from Guangxi, China |
BMC Infectious Diseases | Full Text [Internet]. [cited 2024 Sep 16]. Available from:
https://bmcinfectdis.biomedcentral.com/articles/10.1186/s12879-020-05033-3
4. Trapletti A, Hornik K. tseries: Time Series Analysis and Computational Finance [Internet].
1999 [cited 2024 Sep 16]. p. 0.10-57. Available from:
https://CRAN.R-project.org/package=tseries
8. Bobbitt Z. Ljung-Box Test: Definition + Example [Internet]. Statology. 2020 [cited 2024 Sep
16]. Available from: https://www.statology.org/ljung-box-test/