Lin Regr and Arima

This document compares an ARIMA time series model to a linear regression model fitted through machine learning on foreign exchange return data. It fits an ARIMA(0,1,1) model to EUR/USD exchange rate data and shows the results. It then builds a Bayesian linear regression model on the same data set and performs variational inference to estimate the posterior distribution over the model parameters. The goal is to compare the ARIMA approach commonly used in finance to a supervised learning regression model.

Uploaded by

api-223061586

Lin_Regr_Finance-inverseX&Y-text 6/14/17, 5:51 PM

ARIMA vs. Linear Model Fitted through Machine Learning

Linear time series analysis is among the most common techniques for analyzing data in finance,
and in other industries, wherever a variable at time t is assumed to depend linearly on its
values at previous times t-1, t-2, and so on. Treating an asset return as a collection of random
variables over time, and capturing the linear relationship between the return at time t and the
information available before time t, provides a natural framework for studying the dynamic
structure of a time series.
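
The linear dependence on past values described above is usually summarized by the sample autocorrelation at each lag. The following is a minimal sketch, not part of the original notebook: the helper name `sample_acf` and the simulated series are assumptions chosen for illustration. It contrasts white noise, which has no linear dependence on its past, with an AR(1) series that depends on its value at t-1.

```python
import numpy as np


def sample_acf(series, max_lag):
    """Sample autocorrelations r_0 .. r_max_lag of a 1-D series (hypothetical helper)."""
    x = np.asarray(series, dtype=float)
    x = x - x.mean()                      # centre the series
    denom = np.dot(x, x)                  # lag-0 sum of squares
    return np.array(
        [np.dot(x[k:], x[: len(x) - k]) / denom for k in range(max_lag + 1)]
    )


rng = np.random.default_rng(0)
n = 5000
noise = rng.normal(size=n)                # white noise: no linear dependence on the past

# AR(1) series: the value at time t depends linearly on the value at t-1.
phi = 0.8                                 # illustrative coefficient, not from the document
ar1 = np.zeros(n)
for t in range(1, n):
    ar1[t] = phi * ar1[t - 1] + noise[t]

acf_noise = sample_acf(noise, 3)
acf_ar1 = sample_acf(ar1, 3)
print("white noise ACF:", np.round(acf_noise, 3))
print("AR(1) ACF:      ", np.round(acf_ar1, 3))
```

For the white-noise series the autocorrelations beyond lag 0 should be close to zero, while for the AR(1) series the lag-1 value should be close to `phi`.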

Since we are doing the analysis in Python, we need to import a few modules:

In [4]: import numpy as np

import tensorflow as tf
import edward as ed
from edward.models import Normal
%matplotlib inline
import pandas as pd
from pandas import DataFrame, Series
import statsmodels
import statsmodels.api as sm
import matplotlib.pyplot as plt

Correlations between the variable of interest and its past values differ by type of variable,
whether those are monthly stock returns, value-weighted index returns or foreign exchange
returns. This determines the type of model that is likely to fit best. In the finance literature, a
version of the Capital Asset Pricing Model (CAPM) theory is that the return of an asset is not
predictable and should have no autocorrelations. For demonstration purposes in this example,
let's look at a series of foreign exchange data for the EUR/USD pair for some time period in
2012:

Import FOREX data file

In [3]: forex = pd.read_csv('/Users/DrC-GStefanita/Desktop/FOREX.csv',index_col=[

parse_dates=['date'])


In [3]: y = pd.DataFrame(forex.price,index = forex.index)

print(y.head())

price
date
2012-09-30 23:12:00 1.281598
2012-09-30 23:13:00 1.281041
2012-09-30 23:14:00 1.281705
2012-09-30 23:16:00 1.280685
2012-10-01 00:00:00 1.280717
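
The `read_csv` call above is cut off in the source, so its exact `index_col` argument cannot be recovered. As a hedged sketch only, the snippet below assumes the index column is `date` (consistent with the printed output) and rebuilds a tiny in-memory CSV from the five rows shown, instead of reading the original file. It also computes the first difference of the price, which is the `D.price` series an ARIMA model with d=1 works on.

```python
import io

import pandas as pd

# The five rows printed by y.head() above, reproduced as an in-memory CSV.
csv_text = """date,price
2012-09-30 23:12:00,1.281598
2012-09-30 23:13:00,1.281041
2012-09-30 23:14:00,1.281705
2012-09-30 23:16:00,1.280685
2012-10-01 00:00:00,1.280717
"""

# Assumption: the truncated argument was an index on the 'date' column.
forex = pd.read_csv(io.StringIO(csv_text), index_col="date", parse_dates=["date"])

y = forex[["price"]]                      # one-column DataFrame, as in the notebook
d_price = y["price"].diff().dropna()      # first difference of the price

print(y.head())
print(d_price)
```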

It is common to fit an Autoregressive Integrated Moving Average (ARIMA) model of the simplest
form, ARIMA(0,1,1), which is a basic exponential smoothing model, either to better understand
the data or to predict future points in the series (forecasting). ARIMA models are preferred for
foreign exchange data when the series is assumed to be non-stationary, because the integrated
(differencing) part of the model can remove that non-stationarity. We fit such a model, as shown
below:

In [4]: # We fit an ARIMA(0,1,1) model via maximum likelihood.

mod_arima = statsmodels.tsa.api.ARIMA(y, order=(0,1,1))
res_arima = mod_arima.fit()

And print the results:


In [5]: # Show the summary of results

print(res_arima.summary())

                             ARIMA Model Results
==============================================================================
Dep. Variable:                D.price   No. Observations:                  202
Model:                 ARIMA(0, 1, 1)   Log Likelihood                1266.011
Method:                       css-mle   S.D. of innovations              0.000
Date:                Wed, 14 Jun 2017   AIC                          -2526.022
Time:                        07:54:03   BIC                          -2516.097
Sample:                    09-30-2012   HQIC                         -2522.006
                         - 10-01-2012
=================================================================================
                    coef    std err          z      P>|z|      [0.025      0.975]
---------------------------------------------------------------------------------
const          3.635e-05   3.43e-05      1.061      0.290   -3.08e-05       0.000
ma.L1.D.price     0.0607      0.070      0.874      0.383      -0.075       0.197
                                    Roots
=============================================================================
                 Real           Imaginary           Modulus         Frequency
-----------------------------------------------------------------------------
MA.1          -16.4632           +0.0000j           16.4632            0.5000
-----------------------------------------------------------------------------
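
The description of ARIMA(0,1,1) as an exponential smoothing model can be checked numerically. The sketch below is an illustration under stated assumptions, not the notebook's code: it drops the fitted constant, writes the MA term with a plus sign (the convention under which the MA root is -1/theta, as in the Roots table), and initializes both recursions with the first observation. Under those assumptions the one-step ARIMA(0,1,1) forecasts match simple exponential smoothing with smoothing weight alpha = 1 + theta. The function names are hypothetical.

```python
import numpy as np


def arima011_forecasts(y, theta):
    """One-step forecasts for y_t - y_{t-1} = e_t + theta * e_{t-1}, no constant."""
    f = np.empty(len(y))
    f[0] = y[0]                               # assumption: start at the first observation
    for t in range(1, len(y)):
        e_prev = y[t - 1] - f[t - 1]          # previous one-step forecast error
        f[t] = y[t - 1] + theta * e_prev
    return f


def ses_forecasts(y, alpha):
    """Simple exponential smoothing: f_t = alpha * y_{t-1} + (1 - alpha) * f_{t-1}."""
    f = np.empty(len(y))
    f[0] = y[0]
    for t in range(1, len(y)):
        f[t] = alpha * y[t - 1] + (1 - alpha) * f[t - 1]
    return f


# The five prices printed earlier and the MA coefficient from the summary table.
prices = np.array([1.281598, 1.281041, 1.281705, 1.280685, 1.280717])
theta = 0.0607

f_arima = arima011_forecasts(prices, theta)
f_ses = ses_forecasts(prices, 1.0 + theta)

print(np.allclose(f_arima, f_ses))
```

Texts that write the MA polynomial with a minus sign state the same equivalence as alpha = 1 - theta.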

Assuming we accept this model, we now want to see how a Supervised Learning Regression
Model in Python would compare to this particular choice. In conventional estimation, as in the
ARIMA fit above, we choose the parameter values that maximize the likelihood function, so the
parameters are treated as fixed constants. The Bayesian approach to regression used in Machine
Learning turns these concepts upside down: shouldn't we instead maximize the probability of the
parameters given the data set? If so, the data is considered fixed, a constant, and the
parameters are now random variables that have a probability density function (pdf). We want
to maximize the probability of a random variable given the data we just observed. You would
then update the probability of the parameters as new data are observed.
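
To make the contrast concrete, here is a small sketch on simulated data (none of the numbers come from the notebook). It assumes a one-parameter model y = b*x + noise with unit noise variance and a standard normal prior on b, a case where the posterior is available in closed form. The maximum likelihood estimate is a single number, whereas the Bayesian answer is a distribution whose mean is pulled toward the prior mean and whose spread is narrower than the prior's.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated data; b_true is an illustrative value, not from the document.
n = 20
b_true = 1.5
x = rng.normal(size=n)
y = b_true * x + rng.normal(size=n)        # unit-variance Gaussian noise

# Maximum likelihood: b is a fixed unknown constant.
b_mle = (x @ y) / (x @ x)

# Bayesian: b ~ Normal(0, 1) a priori; the posterior is Normal(post_mean, post_sd**2).
post_precision = 1.0 + x @ x               # prior precision + data precision
post_mean = (x @ y) / post_precision
post_sd = post_precision ** -0.5

print("MLE:           ", b_mle)
print("posterior mean:", post_mean)
print("posterior sd:  ", post_sd)
```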

For comparison to the ARIMA model, we build a Supervised Learning Regression Model.

Our Data Set:

In [6]: n = len(y)

In [7]: y = np.array(y)

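In [8]: # Draw one random scaling factor from a standard normal distribution.

b_true = np.random.randn(1)

In [9]: # Draw n standard-normal inputs and reshape them into an (n, 1) column.

x = np.random.randn(n)
x = x.reshape(n,1)

In [36]: # Training targets: prices divided by b_true, plus unit-variance Gaussian noise.

y_train = y / b_true + np.random.randn(n,1)

In [37]: y_train = y_train.reshape(n,1)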

In [38]: # The Bayesian Linear Regression Model

X = tf.placeholder(tf.float32,[n,1])
b = Normal(loc=tf.zeros(1), scale=tf.ones(1))
alpha = Normal(loc=tf.zeros(1), scale=tf.ones(1))
yt = Normal(loc=((X-alpha)/b), scale=tf.ones(1))

The Bayesian Linear Regression Model assumes a linear relationship between inputs x and
outputs y with normally distributed noise. Our task is to infer hidden structure from labeled data
comprised of training examples. The latent variables are the linear model's weight b and
intercept alpha, also known as the bias. We define a placeholder X, and during inference we pass
in the value for this placeholder according to the data.

The next step is to infer the posterior using variational inference:

In [39]: # Prepare Inference

qb = Normal(loc=tf.Variable(tf.random_normal([1])),
scale=tf.nn.softplus(tf.Variable(tf.random_normal([1]))))
qalpha = Normal(loc=tf.Variable(tf.random_normal([1])),
scale=tf.nn.softplus(tf.Variable(tf.random_normal([1]))))
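
In the variational family above, each scale is wrapped in a softplus so that an unconstrained variable is mapped to a strictly positive standard deviation. The following NumPy sketch illustrates that mapping on its own; it is not the TensorFlow implementation, and the helper name and input values are chosen only for illustration.

```python
import numpy as np


def softplus(z):
    """softplus(z) = log(1 + exp(z)), computed in a numerically stable way."""
    return np.logaddexp(0.0, z)


# Unconstrained values, as a freely trained variable might take.
raw = np.array([-5.0, -1.0, 0.0, 1.0, 5.0])
scale = softplus(raw)

print(scale)             # every entry is strictly positive
print(np.log(2.0))       # softplus(0) equals log(2)
```

Because the mapping is smooth and increasing, gradient-based optimization can move the raw variable freely without the scale ever becoming zero or negative.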


In [40]: sess = tf.Session()

init = tf.global_variables_initializer()
sess.run(init)

Then run variational inference with the Kullback-Leibler divergence, using 250 iterations and 5
latent variable samples per iteration:

In [41]: inference = ed.KLqp({b: qb, alpha: qalpha}, data={X: x, yt: y_train})

inference.run(n_samples=5, n_iter=250)

250/250 [100%] Elapsed: 5s | Loss: 755.810
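
The objective behind this kind of inference can be sketched without Edward. Minimizing the Kullback-Leibler divergence from the approximation q to the posterior is equivalent to maximizing the evidence lower bound (ELBO), which is estimated by averaging log p(y, b) - log q(b) over a few draws from q. The sketch below is an illustration on simulated data, assuming a one-parameter model y = b*x + noise with a standard normal prior. It is not Edward's internal code, and the function names are hypothetical. It uses 5 draws, mirroring `n_samples=5`, and compares the ELBO at the prior with the ELBO at the exact posterior, where every draw gives the same value.

```python
import numpy as np


def log_normal_pdf(value, mean, sd):
    """Log density of Normal(mean, sd**2) evaluated at value."""
    return -0.5 * np.log(2 * np.pi * sd**2) - 0.5 * ((value - mean) / sd) ** 2


def elbo_terms(x, y, q_mean, q_sd, n_samples, rng):
    """Per-draw values of log p(y, b) - log q(b), with b drawn from q."""
    b = q_mean + q_sd * rng.normal(size=n_samples)      # reparameterized draws
    log_lik = np.array([log_normal_pdf(y, bi * x, 1.0).sum() for bi in b])
    log_prior = log_normal_pdf(b, 0.0, 1.0)
    log_q = log_normal_pdf(b, q_mean, q_sd)
    return log_lik + log_prior - log_q


rng = np.random.default_rng(1)
x = rng.normal(size=50)
y = 2.0 * x + rng.normal(size=50)          # simulated data, illustrative slope of 2

# Exact posterior for this conjugate model.
post_prec = 1.0 + x @ x
post_mean = (x @ y) / post_prec
post_sd = post_prec ** -0.5

elbo_at_prior = elbo_terms(x, y, 0.0, 1.0, n_samples=5, rng=rng).mean()
terms_at_post = elbo_terms(x, y, post_mean, post_sd, n_samples=5, rng=rng)

print("ELBO with q = prior:    ", elbo_at_prior)
print("ELBO with q = posterior:", terms_at_post.mean())
```

The closer q is to the posterior, the higher the ELBO, so a falling loss during the run indicates that the approximation is improving.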

Criticism

We then evaluate the regression by comparing prediction accuracy on testing data, which in our
case we construct from the fitted ARIMA model parameters:

In [42]: yt_post = ed.copy(yt, {b: qb, alpha: qalpha})

In [43]: yt_test = (y - 0.0607) / 3.635e-05

In [44]: yt_test = yt_test.reshape(n,1)
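
One common way to turn "prediction accuracy" into a number is to average the model's mean function over posterior draws and compare the result with held-out targets. The sketch below does this with NumPy on simulated data. The reference values, the test set and the "posterior draws" are all hypothetical stand-ins, and the mean function (x - alpha) / b is taken from the model cell above.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical reference parameters and a simulated test set.
b_ref, alpha_ref = 2.0, 0.5
x_test = rng.normal(size=200)
y_test = (x_test - alpha_ref) / b_ref + 0.1 * rng.normal(size=200)

# Stand-ins for draws from a fitted posterior (assumed tightly concentrated).
b_draws = rng.normal(2.0, 0.01, size=100)
alpha_draws = rng.normal(0.5, 0.01, size=100)

# Posterior predictive mean: average the mean function over all draws.
pred = ((x_test[None, :] - alpha_draws[:, None]) / b_draws[:, None]).mean(axis=0)

mse = np.mean((pred - y_test) ** 2)        # mean squared error
mae = np.mean(np.abs(pred - y_test))       # mean absolute error

print("MSE:", mse)
print("MAE:", mae)
```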

We can now visualize the fit by comparing data generated with the prior to data generated with
the posterior:

In [45]: def visualise(X_data, Y_data, b, alpha, n_samples=10):
             # Draw n_samples values of the weight and the intercept.
             b_samples = b.sample(n_samples)[:, 0].eval()
             alpha_samples = alpha.sample(n_samples).eval()
             # Scatter the data, then overlay one line per sampled (b, alpha) pair.
             plt.scatter(X_data[:, 0], Y_data)
             inputs = np.linspace(-8, 8, num=1000)
             for ns in range(n_samples):
                 output = inputs * b_samples[ns] + alpha_samples[ns]
                 plt.plot(inputs, output)


In [46]: # Visualize samples from the prior.

visualise(x, y_train, b, alpha)

In [47]: # Visualize samples from the posterior.

visualise(x, y_train, qb, qalpha)

The Bayesian Linear Regression Model has learned a linear relationship between the exchange
rate and the return outputs.
