0% found this document useful (0 votes)

15 views19 pages

Midterm Asm

Uploaded by

hono.stepstudy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views19 pages

Midterm Asm

Uploaded by

hono.stepstudy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

midterm-asm

June 4, 2023

1.Import data
[72]: import pandas as pd
import matplotlib.pyplot as plt

[73]: # Monthly Index

monthly_data = pd.read_csv(r'C:\Users\Admin\Downloads\Monthly Index.csv')
monthly_data['Date'] = pd.to_datetime(monthly_data['Date'])
monthly_data.set_index('Date', inplace=True)

# Annual Index
annual_data = pd.read_csv(r'C:\Users\Admin\Downloads\Annual Index.csv')
annual_data['Date'] = pd.to_datetime(annual_data['Date'])
annual_data.set_index('Date', inplace=True)

# Rename Columns
index = ['Germany', 'Canada', 'USA', 'United Kingdom', 'France', 'Japan']
mapper = {"GERMANY Standard (Large+Mid Cap)": "Germany", "CANADA Standard␣
↪(Large+Mid Cap)": "Canada",

"USA Standard (Large+Mid Cap)": "USA", "UNITED KINGDOM Standard (Large+Mid␣

↪Cap)": "United Kingdom",

"FRANCE Standard (Large+Mid Cap)": "France", "JAPAN Standard (Large+Mid␣

↪Cap)": "Japan"}

monthly_data.rename(columns=mapper, inplace=True)
annual_data.rename(columns=mapper, inplace=True)

2.Clean data by removing NA values. Calculate returns for each series in the data
set. Use tidyverse package to produce plot of the above data set in two frequencies of
monthly and annual levels as well as at both price and return levels. Provide summary
statistics of the data.
2.1 Clean data by removing NA values
[74]: # Clean data by removing NA values
monthly_data_cleaned = monthly_data.apply(pd.to_numeric, errors='coerce').
↪dropna()

annual_data_cleaned = annual_data.apply(pd.to_numeric, errors='coerce').dropna()

1
# Calculate returns for each series
monthly_returns = monthly_data_cleaned.pct_change()
annual_returns = annual_data_cleaned.pct_change()
monthly_returns.dropna(inplace=True)
annual_returns.dropna(inplace=True)

[64]: monthly_returns

[64]: France Canada Germany Japan USA United Kingdom

Date
1970-01-30 0.036700 0.022000 -0.050300 -0.018600 -0.074100 -0.009300
1970-02-27 -0.029227 0.041879 -0.031273 0.009374 0.053353 -0.055617
1970-03-31 -0.013017 0.007701 0.006848 0.037654 0.003076 0.022339
1970-04-30 -0.061210 -0.116589 -0.048257 -0.123845 -0.089339 -0.104339
1970-05-29 -0.028847 -0.097795 -0.096075 -0.029758 -0.059490 -0.052761
… … … … … … …
2022-12-30 -0.003903 -0.051535 0.000540 0.001295 -0.060150 -0.005092
2023-01-31 0.111436 0.087691 0.124068 0.062079 0.064760 0.064521
2023-02-28 -0.001168 -0.044987 -0.021266 -0.038682 -0.025555 -0.001897
2023-03-31 0.030498 -0.002168 0.039903 0.029498 0.033869 -0.012735
2023-04-28 0.042315 0.027756 0.029756 0.003622 0.011752 0.050339

[640 rows x 6 columns]

[65]: annual_returns

[65]: France Canada Germany Japan USA United Kingdom

Date
1970-12-31 -0.087020 0.115710 -0.267280 -0.155780 0.009270 -0.098290
1971-12-31 -0.026529 0.102329 0.195982 0.485584 0.100072 0.426390
1972-12-29 0.199604 0.292996 0.146316 1.211648 0.133022 0.005613
1973-12-31 -0.002092 -0.061299 -0.080505 -0.218119 -0.187359 -0.289050
1974-12-31 -0.270751 -0.299595 0.118939 -0.178365 -0.308617 -0.543222
1975-12-31 0.368193 0.101843 0.250007 0.168487 0.305017 1.034545
1976-12-31 -0.247018 0.052222 0.030567 0.227073 0.186977 -0.175515
1977-12-30 -0.012373 -0.065123 0.210056 0.132022 -0.122057 0.502001
1978-12-29 0.631012 0.152546 0.210952 0.500871 0.004047 0.085592
1979-12-31 0.217332 0.461236 -0.073723 -0.136524 0.081599 0.155563
1980-12-31 -0.076534 0.181229 -0.145735 0.277376 0.230718 0.323642
1981-12-31 -0.344408 -0.143888 -0.134543 0.138433 -0.092990 -0.159295
1982-12-31 -0.096972 -0.025834 0.061689 -0.022806 0.151304 0.031029
1983-12-30 0.281710 0.291220 0.208979 0.230521 0.166098 0.117762
1984-12-31 0.019976 -0.108926 -0.075206 0.157269 0.010062 0.006297

2.2 Use tidyverse package to produce plot of the above data set in two frequencies of monthly and
annual levels as well as at both price and return levels.

2
[75]: import matplotlib.pyplot as plt
import matplotlib.dates as mdates

# Convert index to a column for plotting

monthly_data_cleaned = monthly_data_cleaned.reset_index()
annual_data_cleaned = annual_data_cleaned.reset_index()
monthly_returns = monthly_returns.reset_index()
annual_returns = annual_returns.reset_index()

# Plotting the monthly data

plt.figure(figsize=(18, 5))
for column in monthly_data_cleaned.columns[1:]:
plt.plot(monthly_data_cleaned['Date'], monthly_data_cleaned[column],␣
↪label=column)

plt.title('Monthly Index - Price Level', color='blue')

plt.ylabel('Price')
plt.legend()

# Format x-axis labels

date_format = mdates.DateFormatter('%b %Y') # Format: Month Year
plt.gca().xaxis.set_major_formatter(date_format)

plt.show()

# Plotting the monthly returns

plt.figure(figsize=(18, 5))
for column in monthly_returns.columns[1:]:
plt.plot(monthly_returns['Date'], monthly_returns[column], label=column)
plt.title('Monthly Index - Returns Level', color='blue')
plt.xlabel('Date')
plt.ylabel('Return')
plt.legend()

# Format x-axis labels

plt.gca().xaxis.set_major_formatter(date_format)

plt.show()

3
[76]: # Plotting the annual data
plt.figure(figsize=(18,5))
for column in annual_data_cleaned.columns[1:]:
plt.plot(annual_data_cleaned['Date'], annual_data_cleaned[column],␣
↪label=column)

plt.title('Annual Index - Price Level', color='red')

plt.xlabel('Date')
plt.ylabel('Price')
plt.legend()
plt.show()

# Plotting the annual returns

plt.figure(figsize=(18, 5))
for column in annual_returns.columns[1:]:
plt.plot(annual_returns['Date'], annual_returns[column], label=column)
plt.title('Annual Index - Returns Level', color='red')
plt.xlabel('Date')
plt.ylabel('Return')
plt.legend()
plt.show()

4
2.3 Provide summary statistics of the data.
[78]: import pandas as pd

# Monthly Index
# Compute summary statistics
summary_stats = monthly_returns.describe()

# Add additional statistics if needed

summary_stats.loc['median'] = monthly_returns.median(numeric_only=True)
summary_stats.loc['75%'] = monthly_returns.quantile(0.75, numeric_only=True)
summary_stats.loc['25%'] = monthly_returns.quantile(0.25, numeric_only=True)

# Display the summary statistics

print(summary_stats)

France Canada Germany Japan USA \

count 640.000000 640.000000 640.000000 640.000000 640.000000
mean 0.007075 0.006352 0.006790 0.007177 0.006759
std 0.063838 0.056106 0.062781 0.058543 0.044501
min -0.237799 -0.271553 -0.243511 -0.194234 -0.214612
25% -0.030122 -0.023771 -0.028699 -0.027148 -0.018992
50% 0.008664 0.007768 0.007471 0.005699 0.010179
75% 0.045426 0.039915 0.046030 0.041943 0.035590
max 0.262018 0.210088 0.223898 0.241833 0.172982
median 0.008664 0.007768 0.007471 0.005699 0.010179

United Kingdom
count 640.000000
mean 0.005680
std 0.061128
min -0.217360

5
25% -0.026872
50% 0.006536
75% 0.038066
max 0.554762
median 0.006536
• Count: The count indicates the number of monthly return data points available for each
country, which is 640 in this case. This suggests that there are no missing values in the
dataset.
• Mean: The mean represents the average monthly return for each country. On average, all
countries have positive returns, ranging from 0.005680 (United Kingdom) to 0.007177 (Japan).
• Standard Deviation: The standard deviation measures the dispersion or variability of the
monthly returns around the mean. Countries like Germany and France have relatively higher
standard deviations, indicating greater volatility in their returns compared to other countries.
• Minimum and Maximum: These values represent the lowest and highest monthly returns
observed across all countries. For example, the minimum return is -0.271553 (Canada), and
the maximum return is 0.262018 (France).
• Quartiles: The quartiles (25%, 50%, and 75%) provide information about the distribution
of returns. The median (50%) represents the middle value, separating the data into two equal
halves. The interquartile range (75% - 25%) gives an indication of the spread of the data.
For example, the 25th percentile (Q1) for France is -0.030122, while the 75th percentile (Q3)
is 0.045426.
[79]: # Annual Index
# Compute summary statistics
summary_stats = annual_returns.describe()

# Add additional statistics if needed

summary_stats.loc['median'] = annual_returns.median(numeric_only=True)
summary_stats.loc['75%'] = annual_returns.quantile(0.75, numeric_only=True)
summary_stats.loc['25%'] = annual_returns.quantile(0.25, numeric_only=True)

# Display the summary statistics

print(summary_stats)

France Canada Germany Japan USA United Kingdom

count 15.000000 15.000000 15.000000 15.000000 15.000000 15.000000
mean 0.036942 0.069778 0.043766 0.187846 0.044478 0.094871
std 0.261417 0.195380 0.163253 0.360519 0.167011 0.374618
min -0.344408 -0.299595 -0.267280 -0.218119 -0.308617 -0.543222
25% -0.091996 -0.063211 -0.077855 -0.079665 -0.044471 -0.128793
50% -0.012373 0.101843 0.061689 0.157269 0.081599 0.031029
75% 0.208468 0.166888 0.202480 0.253949 0.158701 0.239602
max 0.631012 0.461236 0.250007 1.211648 0.305017 1.034545
median -0.012373 0.101843 0.061689 0.157269 0.081599 0.031029
• Count: The count indicates the number of annual data points available for each country,
which is 15 in this case. This suggests that there are no missing values in the dataset.
• Mean: The mean represents the average annual return for each country. On average, all

6
countries have positive returns, ranging from 0.036942 (France) to 0.187846 (Japan).
• Standard Deviation: The standard deviation measures the dispersion or variability of
the annual returns around the mean. Countries like France and Japan have relatively higher
standard deviations, indicating greater volatility in their returns compared to other countries.
• Minimum and Maximum: These values represent the lowest and highest annual returns
observed across all countries. For example, the minimum return is -0.344408 (France), and
the maximum return is 1.211648 (Japan).
• Quartiles: The quartiles (25%, 50%, and 75%) provide information about the distribution
of returns. The median (50%) represents the middle value, separating the data into two equal
halves. The interquartile range (75% - 25%) gives an indication of the spread of the data.
For example, the 25th percentile (Q1) for France is -0.091996, while the 75th percentile (Q3)
is 0.208468.
3. Perform Cointegration Test:
a. Using the Engle – Grangle 2 – step method to detect for any possible cointegration between the
US and each of the other 5 countries. For the other data sets, you can decide pairs of each two
series for your study.
[94]: # Checking Order of Intergration
import statsmodels.api as sm

# Perform Augmented Dickey-Fuller test for each series

adf_test_result = {}

for country in ['France', 'Canada', 'Germany', 'Japan', 'USA', 'United␣

↪Kingdom']:

adf_test = sm.tsa.adfuller(monthly_data_cleaned[country])
adf_test_result[country] = {'ADF Statistic': adf_test[0],
'p-value': adf_test[1]}
print(f"Augmented Dickey-Fuller Test for {country}:")
print(f"ADF Statistic: {adf_test[0]}")
print(f"p-value: {adf_test[1]}\n")

Augmented Dickey-Fuller Test for France:

ADF Statistic: -0.5928043291126145
p-value: 0.8725996779949432

Augmented Dickey-Fuller Test for Canada:

ADF Statistic: -0.049139645100995735
p-value: 0.9542746431528027

Augmented Dickey-Fuller Test for Germany:

ADF Statistic: -1.135742885348771
p-value: 0.7006096892538

Augmented Dickey-Fuller Test for Japan:

ADF Statistic: -1.7696935496863893
p-value: 0.3956083836961319

7
Augmented Dickey-Fuller Test for USA:
ADF Statistic: 2.999255142701915
p-value: 1.0

Augmented Dickey-Fuller Test for United Kingdom:

ADF Statistic: -1.193324819185349
p-value: 0.6764605942425147

=> Based on the conclusions above, the USA series is the only one that is stationary.
Proceed with the Engle-Granger two-step method to detect cointegration between the US and each
of the other five countries
US and France
[120]: import statsmodels.api as sm
from statsmodels.tsa.api import VAR
from statsmodels.tsa.vector_ar.vecm import VECM

# Assuming monthly_data_cleaned is a DataFrame containing the monthly data for␣

↪the US and France

# Step 1: VAR Lag Length Selection

model = VAR(monthly_data_cleaned[['USA', 'France']])
lag_order = model.select_order()

# Extract the selected lag order from the results

selected_lag_order = lag_order.selected_orders['aic']

# Step 2: Estimation of Long-Run Equation using VAR

results = model.fit(maxlags=selected_lag_order)
long_run_coefficients = results.params

print("Long-Run Coefficients:")
print(long_run_coefficients)

# Step 3: Estimation of Short-Run VECM

vecm_model = VECM(monthly_data_cleaned[['USA', 'France']],␣
↪k_ar_diff=selected_lag_order)

vecm_results = vecm_model.fit()

cointegration_rank = vecm_results.coint_rank
short_run_coefficients = vecm_results.alpha

print("Cointegration Rank:", cointegration_rank)

print("Short-Run Coefficients:")
print(short_run_coefficients)

8
Long-Run Coefficients:
USA France
const 3.908657 6.610449
L1.USA 0.687983 -0.158048
L1.France 0.200517 1.159797
L2.USA 0.296998 0.225846
L2.France -0.231164 -0.257889
L3.USA 0.099102 -0.079512
L3.France 0.028698 0.187625
L4.USA 0.080207 0.073887
L4.France -0.129421 -0.123000
L5.USA -0.043345 -0.018558
L5.France 0.079933 0.034507
L6.USA -0.237960 -0.119492
L6.France 0.134274 0.072083
L7.USA 0.330034 0.201609
L7.France -0.099939 -0.145689
L8.USA -0.023283 -0.097676
L8.France -0.070396 0.038186
L9.USA -0.193279 -0.074251
L9.France -0.010026 -0.055345
L10.USA -0.066086 -0.041431
L10.France 0.166274 0.147870
L11.USA 0.054160 0.133417
L11.France -0.106036 -0.189471
L12.USA -0.141833 -0.188946
L12.France 0.112326 0.196146
L13.USA 0.117282 0.217650
L13.France -0.080625 -0.117716
L14.USA -0.235244 -0.266335
L14.France 0.024814 0.080239
L15.USA 0.292260 0.208215
L15.France -0.032086 -0.045259
Cointegration Rank: 1
Short-Run Coefficients:
[[0.01260729]
[0.00860016]]
=> The long-run coefficients show the relationship between the USA and France variables over time.
Positive or negative coefficients indicate the direction of the relationship. The cointegration rank of
1 suggests a stable long-term relationship between the variables. In the short run, the coefficients
indicate how the variables adjust towards the long-run equilibrium. The results provide insights into
the long-run and short-run dynamics between the USA and France variables.
US and Japan
[122]: # Assuming monthly_data_cleaned is a DataFrame containing the monthly data for␣
↪the US and Japan

9
# Step 1: VAR Lag Length Selection
model = VAR(monthly_data_cleaned[['USA', 'Japan']])
lag_order = model.select_order()

# Extract the selected lag order from the results

selected_lag_order = lag_order.selected_orders['aic']

# Step 2: Estimation of Long-Run Equation using VAR

results = model.fit(maxlags=selected_lag_order)
long_run_coefficients = results.params

print("Long-Run Coefficients:")
print(long_run_coefficients)

# Step 3: Estimation of Short-Run VECM

vecm_model = VECM(monthly_data_cleaned[['USA', 'Japan']],␣
↪k_ar_diff=selected_lag_order)

vecm_results = vecm_model.fit()

cointegration_rank = vecm_results.coint_rank
short_run_coefficients = vecm_results.alpha

print("Cointegration Rank:", cointegration_rank)

print("Short-Run Coefficients:")
print(short_run_coefficients)

Long-Run Coefficients:
USA Japan
const 0.437536 21.078081
L1.USA 0.835417 -0.029675
L1.Japan 0.023783 1.045266
L2.USA 0.146366 0.008814
L2.Japan -0.030743 -0.083121
L3.USA 0.081900 0.046576
L3.Japan 0.023560 0.089672
L4.USA 0.027683 0.145836
L4.Japan -0.043590 -0.110146
L5.USA -0.016210 -0.094526
L5.Japan 0.038912 0.099939
L6.USA -0.144610 -0.106945
L6.Japan -0.004949 -0.114778
L7.USA 0.280993 0.095119
L7.Japan -0.013329 0.062392
L8.USA -0.136280 -0.160680
L8.Japan 0.017069 0.112720
L9.USA -0.171701 -0.096308

10
L9.Japan -0.010307 0.007940
L10.USA 0.062338 0.189190
L10.Japan 0.010391 -0.088132
L11.USA -0.022810 -0.278016
L11.Japan -0.000344 0.033928
L12.USA -0.009760 0.191722
L12.Japan -0.018134 -0.068157
L13.USA 0.042276 0.195088
L13.Japan 0.001919 -0.087733
L14.USA -0.246197 -0.193043
L14.Japan 0.046028 0.147078
L15.USA 0.356702 0.089102
L15.Japan -0.073751 -0.015872
L16.USA -0.020649 0.154549
L16.Japan 0.031127 -0.193474
L17.USA -0.058097 -0.150804
L17.Japan 0.002356 0.152533
Cointegration Rank: 1
Short-Run Coefficients:
[[0.00684774]
[0.0019781 ]]
=> The long-run coefficients show the relationship between the USA and Japan variables. Positive
coefficients indicate a positive impact, while negative coefficients indicate a negative impact. The
cointegration rank of 1 suggests a stable long-term relationship. The short-run coefficients represent
how the variables adjust towards the long-run equilibrium. In summary, the results indicate the
long-run and short-run dynamics between the USA and Japan variables.
US and Canada
[123]: # Assuming monthly_data_cleaned is a DataFrame containing the monthly data for␣
↪the US and Canada

# Step 1: VAR Lag Length Selection

model = VAR(monthly_data_cleaned[['USA', 'Canada']])
lag_order = model.select_order()

# Extract the selected lag order from the results

selected_lag_order = lag_order.selected_orders['aic']

# Step 2: Estimation of Long-Run Equation using VAR

results = model.fit(maxlags=selected_lag_order)
long_run_coefficients = results.params

print("Long-Run Coefficients:")
print(long_run_coefficients)

# Step 3: Estimation of Short-Run VECM

11
vecm_model = VECM(monthly_data_cleaned[['USA', 'Canada']],␣
↪k_ar_diff=selected_lag_order)

vecm_results = vecm_model.fit()

cointegration_rank = vecm_results.coint_rank
short_run_coefficients = vecm_results.alpha

print("Cointegration Rank:", cointegration_rank)

print("Short-Run Coefficients:")
print(short_run_coefficients)

Long-Run Coefficients:
USA Canada
const 2.103576 4.265292
L1.USA 0.800352 -0.137374
L1.Canada 0.026192 1.061635
L2.USA 0.080777 -0.034169
L2.Canada 0.111629 0.159191
L3.USA 0.243946 0.238537
L3.Canada -0.206150 -0.265254
L4.USA 0.022779 -0.050546
L4.Canada -0.070137 0.063188
L5.USA -0.068556 0.053757
L5.Canada 0.124801 -0.096839
L6.USA -0.093869 0.039233
L6.Canada -0.045115 -0.080220
L7.USA 0.286016 0.053710
L7.Canada -0.026954 0.057376
L8.USA -0.073615 -0.088764
L8.Canada -0.064524 0.026346
L9.USA -0.166921 -0.003251
L9.Canada -0.017373 -0.067218
L10.USA -0.165870 -0.129777
L10.Canada 0.310808 0.254638
L11.USA 0.021461 -0.138703
L11.Canada -0.050913 0.087514
L12.USA 0.057059 0.172041
L12.Canada -0.127966 -0.279571
L13.USA 0.167872 0.228251
L13.Canada -0.149208 -0.182770
L14.USA -0.312103 -0.304856
L14.Canada 0.143470 0.178440
L15.USA 0.209442 0.107526
L15.Canada 0.038053 0.076652
Cointegration Rank: 1
Short-Run Coefficients:
[[0.00668387]

12
[0.00225761]]
• The long-run coefficients indicate the relationship between the USA and Canada variables.
Positive coefficients suggest a positive impact, while negative coefficients indicate a negative
impact. The cointegration rank of 1 implies a stable long-term relationship between the
variables.
• The short-run coefficients represent how the variables adjust towards the long-run equilibrium.
The USA variable, on average, adjusts by 0.0067 units in response to a one-unit deviation
from the long-run equilibrium, while the Canada variable adjusts by 0.0023 units.
Overall, these results provide insights into the long-run and short-run dynamics between the USA
and Canada variables, indicating the nature and magnitude of their relationship over time.
US and Germany
[124]: # Assuming monthly_data_cleaned is a DataFrame containing the monthly data for␣
↪the US and Germany

# Step 1: VAR Lag Length Selection

model = VAR(monthly_data_cleaned[['USA', 'Germany']])
lag_order = model.select_order()

# Extract the selected lag order from the results

selected_lag_order = lag_order.selected_orders['aic']

# Step 2: Estimation of Long-Run Equation using VAR

results = model.fit(maxlags=selected_lag_order)
long_run_coefficients = results.params

print("Long-Run Coefficients:")
print(long_run_coefficients)

# Step 3: Estimation of Short-Run VECM

vecm_model = VECM(monthly_data_cleaned[['USA', 'Germany']],␣
↪k_ar_diff=selected_lag_order)

vecm_results = vecm_model.fit()

cointegration_rank = vecm_results.coint_rank
short_run_coefficients = vecm_results.alpha

print("Cointegration Rank:", cointegration_rank)

print("Short-Run Coefficients:")
print(short_run_coefficients)

Long-Run Coefficients:
USA Germany
const 2.063929 7.654177
L1.USA 0.726840 -0.087282
L1.Germany 0.136558 1.113275

13
L2.USA 0.230449 0.122849
L2.Germany -0.134446 -0.161848
L3.USA 0.126409 -0.071666
L3.Germany -0.014933 0.105742
L4.USA 0.030641 -0.002708
L4.Germany -0.033123 0.012719
L5.USA 0.012816 0.109731
L5.Germany -0.001405 -0.115922
L6.USA -0.275762 -0.185353
L6.Germany 0.147663 0.095693
L7.USA 0.339938 0.273059
L7.Germany -0.084155 -0.095933
L8.USA -0.036124 -0.063231
L8.Germany -0.079233 -0.026895
L9.USA -0.235343 -0.231054
L9.Germany 0.065403 0.116426
L10.USA 0.050010 0.089520
L10.Germany 0.005692 -0.060252
L11.USA 0.001921 0.064126
L11.Germany -0.010941 -0.080004
L12.USA -0.132395 -0.250843
L12.Germany 0.084883 0.219011
L13.USA 0.179588 0.354981
L13.Germany -0.137836 -0.213253
L14.USA -0.275636 -0.216802
L14.Germany 0.064414 0.060699
L15.USA 0.271408 0.103653
L15.Germany -0.016097 0.019402
Cointegration Rank: 1
Short-Run Coefficients:
[[0.01044952]
[0.00366845]]
• The long-run coefficients reveal the relationship between the USA and Germany variables.
Positive coefficients suggest a positive impact, while negative coefficients indicate a negative
impact. The cointegration rank of 1 implies a stable long-term relationship between the
variables.
• The short-run coefficients represent the adjustment process towards the long-run equilibrium.
On average, the USA variable adjusts by 0.0104 units in response to a one-unit deviation from
the long-run equilibrium, while the Germany variable adjusts by 0.0037 units.
Overall, these results provide insights into the long-run and short-run dynamics between the USA
and Germany variables, indicating the nature and magnitude of their relationship over time.
US and United Kingdom
[126]: # Assuming monthly_data_cleaned is a DataFrame containing the monthly data for␣
↪the US and United Kingdom

14
# Step 1: VAR Lag Length Selection
model = VAR(monthly_data_cleaned[['USA', 'United Kingdom']])
lag_order = model.select_order()

# Extract the selected lag order from the results

selected_lag_order = lag_order.selected_orders['aic']

# Step 2: Estimation of Long-Run Equation using VAR

results = model.fit(maxlags=selected_lag_order)
long_run_coefficients = results.params

print("Long-Run Coefficients:")
print(long_run_coefficients)

# Step 3: Estimation of Short-Run VECM

vecm_model = VECM(monthly_data_cleaned[['USA', 'United Kingdom']],␣
↪k_ar_diff=selected_lag_order)

vecm_results = vecm_model.fit()

cointegration_rank = vecm_results.coint_rank
short_run_coefficients = vecm_results.alpha

print("Cointegration Rank:", cointegration_rank)

print("Short-Run Coefficients:")
print(short_run_coefficients)

Long-Run Coefficients:
USA United Kingdom
const 2.995636 4.710506
L1.USA 0.716072 -0.073310
L1.United Kingdom 0.311737 1.126344
L2.USA 0.293435 0.050389
L2.United Kingdom -0.398068 -0.110440
L3.USA 0.104088 0.008351
L3.United Kingdom 0.026379 0.049105
L4.USA 0.022567 0.025779
L4.United Kingdom -0.112983 -0.022658
L5.USA 0.001960 0.081420
L5.United Kingdom 0.041938 -0.152335
L6.USA -0.241266 -0.081354
L6.United Kingdom 0.253374 0.118916
L7.USA 0.328047 0.040925
L7.United Kingdom -0.180078 -0.062693
L8.USA -0.061390 -0.022641
L8.United Kingdom -0.070519 0.005272
L9.USA -0.270352 -0.089684

15
L9.United Kingdom 0.184072 0.147493
L10.USA 0.000562 0.004111
L10.United Kingdom 0.100389 -0.030379
L11.USA 0.076528 0.056845
L11.United Kingdom -0.168634 -0.125460
L12.USA -0.045012 -0.084944
L12.United Kingdom -0.021101 0.125920
L13.USA 0.096879 0.144180
L13.United Kingdom -0.059099 -0.181658
L14.USA -0.339230 -0.131805
L14.United Kingdom 0.237577 0.173212
L15.USA 0.328576 0.074093
L15.United Kingdom -0.152993 -0.067265
Cointegration Rank: 1
Short-Run Coefficients:
[[0.01006416]
[0.00181731]]
• The long-run coefficients represent the relationship between the USA and United Kingdom
variables. Positive coefficients indicate a positive impact, while negative coefficients indicate a
negative impact. The cointegration rank of 1 suggests a stable long-term relationship between
the variables.
• The short-run coefficients indicate the adjustment process towards the long-run equilibrium.
On average, the USA variable adjusts by 0.0101 units in response to a one-unit deviation
from the long-run equilibrium, while the United Kingdom variable adjusts by 0.0018 units.
These results provide insights into the long-run and short-run dynamics between the USA and
United Kingdom variables, revealing the nature and magnitude of their relationship over time.
b. Using the Johansen technique to detect for any possible cointegration among the 6 countries.
[138]: # Step1: Checking Order of Intergration
import statsmodels.api as sm

# Perform Augmented Dickey-Fuller test for each series

adf_test_result = {}

for country in ['France', 'Canada', 'Germany', 'Japan', 'USA', 'United␣

↪Kingdom']:

Augmented Dickey-Fuller Test for France:

ADF Statistic: -0.5928043291126145
p-value: 0.8725996779949432

16
Augmented Dickey-Fuller Test for Canada:
ADF Statistic: -0.049139645100995735
p-value: 0.9542746431528027

Augmented Dickey-Fuller Test for Germany:

ADF Statistic: -1.135742885348771
p-value: 0.7006096892538

Augmented Dickey-Fuller Test for Japan:

ADF Statistic: -1.7696935496863893
p-value: 0.3956083836961319

Augmented Dickey-Fuller Test for USA:

ADF Statistic: 2.999255142701915
p-value: 1.0

Augmented Dickey-Fuller Test for United Kingdom:

ADF Statistic: -1.193324819185349
p-value: 0.6764605942425147

=> Based on the conclusions above, the USA series is the only one that is stationary.
[134]: #Step 2: VAR Lag Length Selection
from statsmodels.tsa.api import VAR

# Subset the data to include only the stationary variable (USA) and␣
↪non-stationary variables

variables = ['USA', 'United Kingdom', 'France', 'Japan', 'Canada', 'Germany']

df_subset = monthly_data_cleaned[variables]

# Identify the stationary and non-stationary variables

stationary_var = 'USA'
non_stationary_vars = [var for var in variables if var != stationary_var]

# Perform VAR lag length selection using AIC

model = VAR(df_subset)
results = model.fit(maxlags=10)
lag_order = results.k_ar

# Estimate the VAR model using the selected lag order

var_model = VAR(df_subset)
var_results = var_model.fit(maxlags=lag_order)

# Print the selected lag order

print(f"Selected Lag Order: {lag_order}")

17
Selected Lag Order: 10

[135]: #Step 3: Estimation of Long-Run Equation using VAR

var_model = VAR(df_subset)
var_results = var_model.fit(maxlags=10)

# Extract the long-run coefficients for the first lag

long_run_coefs = var_results.coefs[10 - 1]

[136]: #Step 4: Johansen Cointegration Test

from statsmodels.tsa.vector_ar.vecm import coint_johansen

johansen_results = coint_johansen(df_subset, det_order=0, k_ar_diff=10-1)

# Print the results

print("Johansen cointegration test results:")
print("Eigenvalues:")
print(johansen_results.eig)
print("Trace statistics:")
print(johansen_results.lr1)
print("Max-Eigen statistics:")
print(johansen_results.lr2)
print("Critical values at 95% confidence:")
print(johansen_results.cvm)

Johansen cointegration test results:

Eigenvalues:
[0.06814584 0.0638826 0.03889604 0.01388539 0.00927712 0.00124822]
Trace statistics:
[126.71625284 82.18093249 40.52585862 15.49238738 6.66930622
0.78812185]
Max-Eigen statistics:
[44.53532035 41.65507387 25.03347123 8.82308117 5.88118436 0.78812185]
Critical values at 95% confidence:
[[37.2786 40.0763 45.8662]
[31.2379 33.8777 39.3693]
[25.1236 27.5858 32.7172]
[18.8928 21.1314 25.865 ]
[12.2971 14.2639 18.52 ]
[ 2.7055 3.8415 6.6349]]
1. Eigenvalues: - The eigenvalues indicate the presence and strength of cointegration rela-
tionships. - In this case, the eigenvalues are [0.06814584, 0.0638826, 0.03889604, 0.01388539,
0.00927712, 0.00124822]. - Generally, larger eigenvalues suggest stronger evidence of cointegra-
tion.
2. Trace statistics and Max-Eigen statistics: - The trace statistics and max-eigen statistics
help determine the number of cointegration relationships present. - The trace statistics values are
[126.71625284, 82.18093249, 40.52585862, 15.49238738, 6.66930622, 0.78812185]. - The max-eigen

18
statistics values are [44.53532035, 41.65507387, 25.03347123, 8.82308117, 5.88118436, 0.78812185].
- These statistics are compared with the critical values to assess the presence of cointegration.
3. Critical values at 95% confidence: - The critical values indicate the threshold values for
accepting or rejecting the null hypothesis of no cointegration. - The critical values provided are for
a 95% confidence level.
[ ]: # We compare the trace statistics and max-eigen statistics with the␣
↪corresponding critical values.

# If the test statistics exceed the critical values, it suggests the presence␣
↪of cointegration.

The trace statistics and max-eigen statistics for all six eigenvalues are above the critical values.
This indicates that there is evidence of cointegration among the variables.
=> Therefore, we can conclude that there is a possibility of cointegration among the six countries
in the dataset (US, UK, France, Japan, Canada, and Germany).

[ ]:

Code - Cap 3
No ratings yet
Code - Cap 3
5 pages
Walmart - Sales: Pandas PD Seaborn Sns Numpy NP Matplotlib - Pyplot PLT Matplotlib Datetime
100% (1)
Walmart - Sales: Pandas PD Seaborn Sns Numpy NP Matplotlib - Pyplot PLT Matplotlib Datetime
26 pages
Regression and Eda
No ratings yet
Regression and Eda
47 pages
BS en Iso 17994-2014
100% (1)
BS en Iso 17994-2014
34 pages
Gold Price Forecasting Using Time Series
100% (2)
Gold Price Forecasting Using Time Series
15 pages
Unit 2b TS Decomposition
No ratings yet
Unit 2b TS Decomposition
44 pages
Machine Learning Stock Time Series 1700932258
No ratings yet
Machine Learning Stock Time Series 1700932258
21 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
23 pages
Importing Libraries: Import As Import As Import As From Import As From Import From Import Import
100% (1)
Importing Libraries: Import As Import As Import As From Import As From Import From Import Import
11 pages
Final Ai M225187154i
No ratings yet
Final Ai M225187154i
25 pages
Financial Analytics With Python
100% (1)
Financial Analytics With Python
40 pages
Machine Learning - Multi Linear Regression Analysis
No ratings yet
Machine Learning - Multi Linear Regression Analysis
29 pages
Anomaly Detection
No ratings yet
Anomaly Detection
14 pages
Saikat Dey Data Science Project
No ratings yet
Saikat Dey Data Science Project
14 pages
What Is Time Series Decomposition and How Does It Work?
No ratings yet
What Is Time Series Decomposition and How Does It Work?
22 pages
Numpy
No ratings yet
Numpy
9 pages
Assignment: Master in Business Administration
No ratings yet
Assignment: Master in Business Administration
18 pages
Bitcoine Data Analysis
No ratings yet
Bitcoine Data Analysis
7 pages
Mean Reversion
No ratings yet
Mean Reversion
10 pages
Unit 1 Pandas - Charts
No ratings yet
Unit 1 Pandas - Charts
18 pages
From Arrays From Tuples From Product From Levels and Codes
No ratings yet
From Arrays From Tuples From Product From Levels and Codes
22 pages
Admin Chart
No ratings yet
Admin Chart
7 pages
Multi Index
No ratings yet
Multi Index
5 pages
Python Code Longterm
No ratings yet
Python Code Longterm
5 pages
13-9-23 Data Pre-Processing - Jupyter Notebook
No ratings yet
13-9-23 Data Pre-Processing - Jupyter Notebook
6 pages
Basic Series Analysis - Gold - Monthly
No ratings yet
Basic Series Analysis - Gold - Monthly
2 pages
ARIMA
No ratings yet
ARIMA
11 pages
Time Series Analysis1a
No ratings yet
Time Series Analysis1a
31 pages
Freda Song Drechsler - Fama-French
No ratings yet
Freda Song Drechsler - Fama-French
7 pages
Cap 793
No ratings yet
Cap 793
17 pages
Unit 5 - Time Series Analysis and Predictive Modeling
No ratings yet
Unit 5 - Time Series Analysis and Predictive Modeling
21 pages
DVT Exp - 7
No ratings yet
DVT Exp - 7
11 pages
Edp 3
No ratings yet
Edp 3
16 pages
Data Curr
No ratings yet
Data Curr
9 pages
Matplotlib Pandas Guide
No ratings yet
Matplotlib Pandas Guide
7 pages
Anagh-Desai BigDataAssignments NYSE Airlines Using DF
No ratings yet
Anagh-Desai BigDataAssignments NYSE Airlines Using DF
9 pages
Lab Manual 4
No ratings yet
Lab Manual 4
23 pages
Ireland Leaving-Cert-Maths-Syllabus-Simplified
No ratings yet
Ireland Leaving-Cert-Maths-Syllabus-Simplified
37 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
66 pages
JPMC - Task 1
No ratings yet
JPMC - Task 1
4 pages
FUll Code
No ratings yet
FUll Code
43 pages
Satistical Analysis of Time Series Notes
No ratings yet
Satistical Analysis of Time Series Notes
2 pages
UNIT I Introduction PPT Instrumentation
No ratings yet
UNIT I Introduction PPT Instrumentation
55 pages
7 Visualizing Financial Time Series
No ratings yet
7 Visualizing Financial Time Series
26 pages
10 - Jayesh - Prakash - Rane
No ratings yet
10 - Jayesh - Prakash - Rane
26 pages
Retail Analysis Walmart
No ratings yet
Retail Analysis Walmart
18 pages
Economic Data Analysis (Finance Analyst)
No ratings yet
Economic Data Analysis (Finance Analyst)
38 pages
Exercises Part2
No ratings yet
Exercises Part2
7 pages
1902TaniyaDubey TSF Sparkling 2
No ratings yet
1902TaniyaDubey TSF Sparkling 2
36 pages
Special Discrete Distributions Notes
No ratings yet
Special Discrete Distributions Notes
11 pages
TIME - ChatGPT Manual 001
No ratings yet
TIME - ChatGPT Manual 001
7 pages
Lab Record Dev
No ratings yet
Lab Record Dev
20 pages
Trade Backtest
No ratings yet
Trade Backtest
23 pages
Time Series Analysis 1718649022
No ratings yet
Time Series Analysis 1718649022
5 pages
Pandas Syntax Revision For ML
No ratings yet
Pandas Syntax Revision For ML
10 pages
MAS202Group1 Group-Assignment
No ratings yet
MAS202Group1 Group-Assignment
20 pages
Five Commonly Used Trading Strategies: First, Let's Import The Necessary Libraries and Load The Data
No ratings yet
Five Commonly Used Trading Strategies: First, Let's Import The Necessary Libraries and Load The Data
21 pages
DF PD - Read - Excel ('Sample - Superstore - XLS') : Anjaliassignmnet - Ipy NB
No ratings yet
DF PD - Read - Excel ('Sample - Superstore - XLS') : Anjaliassignmnet - Ipy NB
23 pages
Real Estate
No ratings yet
Real Estate
10 pages
4 PythonPandas
No ratings yet
4 PythonPandas
8 pages
Assignment 4 On Visualization On Graph With Solution
No ratings yet
Assignment 4 On Visualization On Graph With Solution
14 pages
Wa0002.
No ratings yet
Wa0002.
4 pages
PCS 7 Control Performance Monitoring PCS 7 V100 DOC V3 0 en
No ratings yet
PCS 7 Control Performance Monitoring PCS 7 V100 DOC V3 0 en
31 pages
Math 9 1ST Quarter
No ratings yet
Math 9 1ST Quarter
56 pages
Complete Notes of BA
100% (1)
Complete Notes of BA
22 pages
Data Manipulation With Pandas - Yulei's Sandbox
No ratings yet
Data Manipulation With Pandas - Yulei's Sandbox
18 pages
Moving Average Cross Strategy
No ratings yet
Moving Average Cross Strategy
1 page
UOP 987-15 Low Trace Sulfur in Liquid Hydrocarbons by Oxidative Combustion With Ultraviolet Fluo
No ratings yet
UOP 987-15 Low Trace Sulfur in Liquid Hydrocarbons by Oxidative Combustion With Ultraviolet Fluo
13 pages
Chapter 1 1
No ratings yet
Chapter 1 1
9 pages
Topic 1
No ratings yet
Topic 1
58 pages
Eco452 Applied Statistics
No ratings yet
Eco452 Applied Statistics
137 pages
FINAL-Nguyễn Quỳnh Chi-2013316663
No ratings yet
FINAL-Nguyễn Quỳnh Chi-2013316663
1 page
MBA 1st Sem Unit-4 Business Statistics
No ratings yet
MBA 1st Sem Unit-4 Business Statistics
13 pages
Neral Mathematics M3
No ratings yet
Neral Mathematics M3
145 pages
Project Assignment 1 2
No ratings yet
Project Assignment 1 2
4 pages
Module I
No ratings yet
Module I
68 pages
7636
No ratings yet
7636
132 pages
Ib A&i 3.1
No ratings yet
Ib A&i 3.1
38 pages
Iba Unit - Ii
No ratings yet
Iba Unit - Ii
31 pages
Fundamental Statistics For The Behavioral Sciences 8ed. Edition Howell D.C. Instant Download
No ratings yet
Fundamental Statistics For The Behavioral Sciences 8ed. Edition Howell D.C. Instant Download
65 pages
Normal Distribution Review
No ratings yet
Normal Distribution Review
22 pages
Controlling Process and Types
No ratings yet
Controlling Process and Types
11 pages
Inbound 6635600360198039413
No ratings yet
Inbound 6635600360198039413
23 pages
Topic2 - 2024 - Descriptive Statistics - STD - Revised
No ratings yet
Topic2 - 2024 - Descriptive Statistics - STD - Revised
20 pages
Assessment Student Learning
No ratings yet
Assessment Student Learning
23 pages
Expt 1 The Bunsen Burner and Laboratory Measurements Instructions
No ratings yet
Expt 1 The Bunsen Burner and Laboratory Measurements Instructions
11 pages
Board Age and Value Diversity Evidence From A Collectivistic and Paternalistic Culture
No ratings yet
Board Age and Value Diversity Evidence From A Collectivistic and Paternalistic Culture
18 pages
Note
No ratings yet
Note
21 pages
Test 1.2
No ratings yet
Test 1.2
2 pages
Kozloski 2014 Faecal Nitrogen As An Approach To Estimate Forage Intake of Wethers
No ratings yet
Kozloski 2014 Faecal Nitrogen As An Approach To Estimate Forage Intake of Wethers
8 pages
2025 Module 1 Development of Practical Skills in Biology
No ratings yet
2025 Module 1 Development of Practical Skills in Biology
9 pages
Precision in ASTM Test Mehods, What Precision Means, ASTM Data Points, January-February 2016
No ratings yet
Precision in ASTM Test Mehods, What Precision Means, ASTM Data Points, January-February 2016
2 pages
Module 05 Estimation of Parameters
No ratings yet
Module 05 Estimation of Parameters
3 pages
SPX Seasonality Statistics from 1980 to 2024
From Everand
SPX Seasonality Statistics from 1980 to 2024
AUSTIN NG
No ratings yet

Midterm Asm

Uploaded by

Midterm Asm

Uploaded by

midterm-asm

[73]: # Monthly Index

"USA Standard (Large+Mid Cap)": "USA", "UNITED KINGDOM Standard (Large+Mid␣

"FRANCE Standard (Large+Mid Cap)": "France", "JAPAN Standard (Large+Mid␣

annual_data_cleaned = annual_data.apply(pd.to_numeric, errors='coerce').dropna()

[64]: France Canada Germany Japan USA United Kingdom

[640 rows x 6 columns]

[65]: France Canada Germany Japan USA United Kingdom

# Convert index to a column for plotting

# Plotting the monthly data

plt.title('Monthly Index - Price Level', color='blue')

# Format x-axis labels

# Plotting the monthly returns

# Format x-axis labels

plt.title('Annual Index - Price Level', color='red')

# Plotting the annual returns

# Add additional statistics if needed

# Display the summary statistics

France Canada Germany Japan USA \

# Add additional statistics if needed

# Display the summary statistics

France Canada Germany Japan USA United Kingdom

# Perform Augmented Dickey-Fuller test for each series

for country in ['France', 'Canada', 'Germany', 'Japan', 'USA', 'United␣

Augmented Dickey-Fuller Test for France:

Augmented Dickey-Fuller Test for Canada:

Augmented Dickey-Fuller Test for Germany:

Augmented Dickey-Fuller Test for Japan:

Augmented Dickey-Fuller Test for United Kingdom:

# Assuming monthly_data_cleaned is a DataFrame containing the monthly data for␣

# Step 1: VAR Lag Length Selection

# Extract the selected lag order from the results

# Step 2: Estimation of Long-Run Equation using VAR

# Step 3: Estimation of Short-Run VECM

print("Cointegration Rank:", cointegration_rank)

# Extract the selected lag order from the results

# Step 2: Estimation of Long-Run Equation using VAR

# Step 3: Estimation of Short-Run VECM

print("Cointegration Rank:", cointegration_rank)

# Step 1: VAR Lag Length Selection

# Extract the selected lag order from the results

# Step 2: Estimation of Long-Run Equation using VAR

# Step 3: Estimation of Short-Run VECM

print("Cointegration Rank:", cointegration_rank)

# Step 1: VAR Lag Length Selection

# Extract the selected lag order from the results

# Step 2: Estimation of Long-Run Equation using VAR

# Step 3: Estimation of Short-Run VECM

print("Cointegration Rank:", cointegration_rank)

# Extract the selected lag order from the results

# Step 2: Estimation of Long-Run Equation using VAR

# Step 3: Estimation of Short-Run VECM

print("Cointegration Rank:", cointegration_rank)

# Perform Augmented Dickey-Fuller test for each series

for country in ['France', 'Canada', 'Germany', 'Japan', 'USA', 'United␣

Augmented Dickey-Fuller Test for France:

Augmented Dickey-Fuller Test for Germany:

Augmented Dickey-Fuller Test for Japan:

Augmented Dickey-Fuller Test for USA:

Augmented Dickey-Fuller Test for United Kingdom:

variables = ['USA', 'United Kingdom', 'France', 'Japan', 'Canada', 'Germany']

# Identify the stationary and non-stationary variables

# Perform VAR lag length selection using AIC

# Estimate the VAR model using the selected lag order

# Print the selected lag order

[135]: #Step 3: Estimation of Long-Run Equation using VAR

# Extract the long-run coefficients for the first lag

[136]: #Step 4: Johansen Cointegration Test

johansen_results = coint_johansen(df_subset, det_order=0, k_ar_diff=10-1)

# Print the results

Johansen cointegration test results:

You might also like