CFM_Programming Task
July 6, 2022
1 Task 1
1.1
There are some remarkable outliers in the quarterly inflation series, such as 256 and -35, which are obviously implausible.
I replace the outliers (defined as the 1% tails of the distribution of inflation across all periods) with national-level inflation extracted from the FRED database.
The data on inflation is missing between the years 1986 and 1989. I leave these observations as they are, because replacing these dates with national-level inflation could result in biased estimations in sections 4 and 5.
1.2
The graph below depicts the median and the 25th and 75th percentiles of state-level inflation in each quarter. The shaded areas represent US recessions.
Figure 1. State-level inflation, USA (y-axis: Quarterly Inflation %)
From the figure above, I conclude that the dispersion of inflation (the distance between the 75th and 25th percentiles) stayed roughly constant during the majority of the recessions. The financial crisis of 2008-2009 is an exception, during which this dispersion decreased considerably. In the other recessions, however, inflation dispersion across the states shows no specific pattern: it increased at some times (the first recession in the plot) and in most cases remained constant.
1.3
In the attached STATA code I generate a dummy which equals one if the absolute value of the difference between a state's inflation and the median inflation in that period is greater than 1 (1 percentage point equals 100 basis points).
I then count the number of observations with that dummy equal to one and divide it by the number of all observations in that period. This gives the share of states that had inflation more than 100 basis points away from the median, which is equal to 33%.
1.4
I think the question can be best answered by using time and state fixed effects. The R2 of a regression with inflation as the dependent variable and time fixed effects as the only regressors tells us what percentage of the variation in inflation can be explained by factors that are common to all states in each period (i.e. national factors).
On the other hand, the R2 of a regression with inflation as the dependent variable and state fixed effects as the only regressors shows the fraction of the variation in inflation which can be explained by state-specific factors that are constant over time.
The table below shows the results of these two regressions:
Table 3. Time and State Fixed Effect R-squared

(1) Time Fixed Effect: R2 ≈ 0.69
(2) State Fixed Effect: R2 < 0.01
So around 69% of the variation in inflation can be explained by common changes in inflation across all states (time fixed effects), and less than 1% is due to constant differences between states. This is reasonable: inflation (at least in the long run) is determined by the fiscal and monetary policies of the government, and these do not differ across states!
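The time-fixed-effects R2 can be computed by hand: with time dummies as the only regressors, the fitted value of each observation is its period mean. A minimal sketch with made-up numbers:

```python
# R^2 of a regression of inflation on time fixed effects only.
# With time dummies as the sole regressors, the fitted value for each
# observation is the mean inflation of its period, so R^2 = 1 - SSR/SST
# can be computed without a regression package.
quarters = {
    "q1": [2.0, 2.2, 1.8],
    "q2": [0.5, 0.7, 0.3],
}

all_obs = [x for obs in quarters.values() for x in obs]
grand_mean = sum(all_obs) / len(all_obs)
sst = sum((x - grand_mean) ** 2 for x in all_obs)

ssr = 0.0
for obs in quarters.values():
    period_mean = sum(obs) / len(obs)
    # Residual = deviation from the period mean (the time-FE fitted value).
    ssr += sum((x - period_mean) ** 2 for x in obs)

r2 = 1 - ssr / sst
print(round(r2, 3))
```

Here most variation is between quarters, so the time-FE R2 is high, mirroring the 69% found in the data.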
1.5
To check the persistence of inflation, I regress inflation on its lag:

πt = α0 + α1 πt−1 + εt

The more persistent inflation is, the larger the coefficient on its lag. The null hypothesis is:

H0 : α1 = 0
(1) No Fixed Effect    (2) Time Fixed Effect
Although controlling for national factors reduces the magnitude of the coefficient on πt−1, it remains statistically significant. So we reject the hypothesis that persistence in inflation is only due to national factors.
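The no-fixed-effect regression of πt on πt−1 is a simple OLS whose slope is cov(πt, πt−1)/var(πt−1). A sketch with a synthetic series:

```python
# OLS slope of pi_t on pi_{t-1} for a single series (no fixed effects):
# alpha_1 = cov(pi_t, pi_{t-1}) / var(pi_{t-1}).
pi = [2.0, 1.8, 1.7, 1.9, 1.6, 1.5, 1.6, 1.4]  # synthetic inflation series

y = pi[1:]   # pi_t
x = pi[:-1]  # pi_{t-1}
mx = sum(x) / len(x)
my = sum(y) / len(y)

cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
var = sum((a - mx) ** 2 for a in x)
alpha1 = cov / var
print(round(alpha1, 3))
```

A positive, sizeable alpha1 indicates persistence; in the actual exercise the standard error (and the time fixed effects) come from the regression package.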
2 Task 2
2.1
A short definition of terms:
• Extracted state/city: what I have simply extracted from the inventorlocation variable as the state/city of the inventor
• True states/cities: complete and correctly spelled names of cities/states in the US
I first split the location of the inventor by comma. After doing this, each location is a list with a few items. We extract the state and city of the inventor by choosing the second and third items (from the end of the list), respectively. Although this is not a universal pattern in the data set, it's a good start and works for many observations! (This can be done automatically; I will discuss that.)
Now I have to match these incomplete (and often misspelled) names with the true names of US states and cities.
Ideally we need a complete list of US cities, since it is possible for the inventor to live in locations that do not appear in "PlantLocations.csv". Here I only use the states and cities which are in the PlantLocations file, but the extension to a more comprehensive set of cities is straightforward.
I match the names of the extracted cities and states to their true names by two different methods.
The first method uses the Levenshtein distance, as implemented in the FuzzyWuzzy library for Python. This allows us to check the equality of two strings in a fuzzy (rather than binary) way.
I compare the extracted state and city of the inventor with each state and city in the PlantLocations file. The city and state that receive the highest score (lowest Levenshtein distance) are assigned to the inventor.
The second method uses another Python library, abydos.phonetic. I use the Russell Index algorithm, which encodes words as numbers; similar-sounding words receive similar codes. In a process analogous to the previous method, I encode the names of the cities and states and pick the most similar one (i.e. the smallest difference between the encoded extracted state and the encoded true states).
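To illustrate the phonetic idea without the abydos dependency, here is a simplified Soundex (a descendant of Russell's index; this is a sketch, not abydos's implementation):

```python
def soundex(word: str) -> str:
    """Simplified American Soundex: keep the first letter, map consonants
    to digits, drop vowels, collapse adjacent duplicate digits, pad to 4."""
    codes = {c: d for d, letters in
             {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
              "4": "l", "5": "mn", "6": "r"}.items() for c in letters}
    word = word.lower()
    out = word[0].upper()
    prev = codes.get(word[0], "")
    for ch in word[1:]:
        code = codes.get(ch, "")
        if code and code != prev:
            out += code
        prev = code
    return (out + "000")[:4]

# Similar-sounding spellings map to the same code.
print(soundex("Pittsburgh"), soundex("Pittsburg"))
```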
Below is the pseudo-code for the first method (see the Appendix for the second method). I write it for finding states, but it is exactly the same for finding cities:
first method:
for each extracted state
    for each true state
        compute the fuzzy-match score between the extracted state and the true state
    end
    assign the true state with the highest score to the inventor
end
Note that this process can be done even without extracting the city and state from the inventorlocation variable: we can simply calculate the Levenshtein distance between inventorlocation and the states/cities and pick the state/city with the smallest distance.
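The matching loop can be sketched with the standard-library difflib as a stand-in for FuzzyWuzzy (the state list and misspellings below are made up):

```python
import difflib

# Hypothetical reference list (in practice, the states in the PlantLocations file).
true_states = ["California", "Texas", "New York", "Illinois"]

def match_state(extracted: str) -> str:
    """Return the true state whose name is most similar to the
    extracted (possibly misspelled) name."""
    scores = {s: difflib.SequenceMatcher(None, extracted.lower(), s.lower()).ratio()
              for s in true_states}
    return max(scores, key=scores.get)

print(match_state("Californa"))  # misspelled input
print(match_state("new yrok"))
```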
I have implemented these algorithms in Python. The figure below shows the results for some sample observations. On average we are doing OK, but there are also some errors. The Inventor_city/Inventor_state columns are the result of the method; the extracted state/extracted city columns are from the raw data.
2.2
Since we have no other information, we assume that the inventor works at the plant nearest to his/her home.
We can have a matrix (or function) which calculates the distance between any two given cities. We then calculate (in a loop) the distance between the inventor's home city and each plant of his company, and choose the plant at the minimum distance as the plant where the inventor works.
Below is the pseudo-code for this procedure:
for each inventor
    for each plant of the inventor's company
        compute the distance between the inventor's city and the plant's city
    end
    assign the plant with the minimum distance to the inventor
end
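A sketch of this nearest-plant assignment using great-circle (haversine) distances; the plant list and coordinates are invented for illustration:

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points, in km."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * 6371 * math.asin(math.sqrt(a))

# Hypothetical plants of the inventor's company: name -> (lat, lon).
plants = {
    "Chicago": (41.88, -87.63),
    "Houston": (29.76, -95.37),
    "Boston": (42.36, -71.06),
}

inventor_home = (40.71, -74.01)  # roughly New York City

# Assign the plant at the minimum distance from the inventor's home city.
nearest = min(plants, key=lambda p: haversine_km(*inventor_home, *plants[p]))
print(nearest)
```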
This method will be reliable if the different plants of a company are located far from each other. If each plant is reasonably near the others, then distance may not be the most important factor. We would probably have to gather data on plant and inventor attributes (likely from some other data set) and then estimate a discrete-choice model (which inventors choose which plant). We could then predict the plant at which each inventor is most likely to work.
3 Task 3
3.1
First let's write the problem with its budget constraint:

max : V = E0 [ Σ_{t=0}^∞ β^t log(ct) ]

s.t. : kt+1 = kt − ct − δt kt

where kt is the stock of cake remaining at period t and δt is a stochastic variable (shock). We assume that the initial size of the cake is equal to one.
Now we can write the Bellman equation:

V(kt, δt) = max over ct of { log(ct) + β E[ V(kt+1, δt+1) | δt ] }

s.t. : kt+1 = kt − ct − δt kt
Since our shock is state-dependent, we have to condition our expectation on the past realization of the shock.
Given the information in the question, good/bad days follow a Markov process with transition matrix P:
P = [ 0.8  0.2
      0.2  0.8 ]
3.2
The value of log(ct) is a function only of how much cake is eaten today (ct); the other two variables are irrelevant.
In the code, I build a 3-dimensional matrix (100 × 100 × 2) which stores the value of log(ct) for each combination of the variables (kt, ct, δt). Since this is a function of ct alone, the only variation is across the rows of the matrix. I then interpolate the function.
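A sketch of this utility grid and a one-dimensional linear interpolation in ct, with the grid sizes shrunk for readability (pure standard library):

```python
import math

# Grid over c_t. The 3-D array u[k][c][delta] (here 5 x 5 x 2 instead of
# 100 x 100 x 2) stores log(c_t), which varies along the c dimension only.
c_grid = [0.1, 0.2, 0.4, 0.6, 0.8]
u = [[[math.log(c) for _ in range(2)] for c in c_grid] for _ in range(5)]

def interp_u(c: float) -> float:
    """Piecewise-linear interpolation of log(c) on the c grid."""
    if c <= c_grid[0]:
        return math.log(c_grid[0])
    for lo, hi in zip(c_grid, c_grid[1:]):
        if c <= hi:
            w = (c - lo) / (hi - lo)
            return (1 - w) * math.log(lo) + w * math.log(hi)
    return math.log(c_grid[-1])

print(round(interp_u(0.3), 4))  # between log(0.2) and log(0.4)
```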
3.3
Because of the log function's behavior near zero, I have changed the utility function to another similar and standard one, CRRA:

U(c) = c^(1−γ) / (1 − γ)

with γ = 0.8.
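A compact sketch of the value function iteration under these assumptions; the discount factor and the good/bad shock sizes below are assumed for illustration, while P is the transition matrix from the question:

```python
# Value function iteration for the cake-eating problem:
#   V(k, s) = max_c { u(c) + beta * E[ V(k', s') | s ] },  k' = k - c - delta[s] * k
# CRRA utility with gamma = 0.8; s in {good, bad}.
beta, gamma = 0.95, 0.8          # beta is an assumed value
delta = [0.05, 0.15]             # assumed shock sizes for good/bad days
P = [[0.8, 0.2], [0.2, 0.8]]     # transition matrix from the question

n = 50
k_grid = [0.01 + i * (1.0 - 0.01) / (n - 1) for i in range(n)]
u = lambda c: c ** (1 - gamma) / (1 - gamma)

V = [[0.0] * 2 for _ in range(n)]
for _ in range(300):                  # fixed number of sweeps, enough for beta = 0.95
    V_new = [[0.0] * 2 for _ in range(n)]
    for i, k in enumerate(k_grid):
        for s in range(2):
            best = float("-inf")
            for j in range(i + 1):    # candidate next-period cake k' on the grid
                c = k - k_grid[j] - delta[s] * k
                if c <= 0:
                    continue
                ev = P[s][0] * V[j][0] + P[s][1] * V[j][1]
                best = max(best, u(c) + beta * ev)
            # If no feasible grid point remains, eat all remaining cake.
            V_new[i][s] = best if best > float("-inf") else u(k * (1 - delta[s]))
    V = V_new

print(round(V[-1][0], 3))  # value of a full cake on a good day
```

The policy (argmax over j) can be stored alongside V and then interpolated to simulate consumption paths.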
After solving the problem by value function iteration, I save the policy function and interpolate it. This allows me to simulate the model. I also draw a Markov process with matrix P for 100 periods.
The graph below shows the results of a simulation starting from k = 1 (a complete cake) and a good day:
Figure 3. Cake eating model simulation
I have also plotted the value functions. Good days increase the value function a little, which is reasonable.

Figure 4. Cake eating model value functions
4 Appendix
Pseudo-code for the second method:
for each true state
    encode the name of the state with the Russell Index
end

for each extracted state
    for each true state
        compute the difference between the two encoded names
    end
    assign the true state with the smallest difference to the inventor
end
And the results: