Lecture 3 Economics Dissertation - 2025
Lecture 3 Economics Dissertation - 2025
[ECO00045H]
Dr Nahid Farnaz
[email protected]
1
Lecture 3 - outline
2
Types of data
• Time series data: observing a phenomenon (for e.g. macro indicator) for a
country (or countries) over a period of time This means that data collected
over time for a single entity (e.g. UK inflation rate from 2000-2023)
3
Types of variables
Numeric variables: can be measured numerically (e.g. income, height,
temperature)
• Discrete: Values can only take on specific, distinct values (usually whole numbers), often
representing counts. Example: number of students in a class, number of cars in a parking lot,
number of siblings.
• Continuous: Values can take on any value within a given range (e.g., height, weight, temperature).
4
Slido poll #
Which type of data are you working with for your dissertation?
a) Cross sectional
b) Time series
c) Panel
5
Getting started with STATA
6
Installing STATA
7
Stata interface
● Open STATA
● Results window (centre)
● Review window (left)
● Menus (top)
● Command window (bottom)
8
Using STATA: useful resources
9
Data compilation and cleaning
Importing and cleaning data:
• Loading and browsing data (opening a data file and browsing its contents), changing
variable names
• Check for missing data: identify missing values and handle missing data (misstable
summarize, drop if, replace)
• Merge datasets: merging two datasets together, or append (adding new data)
Top tip: use the “help” command for more information on a STATA code (simply type
“help” followed by the code in the command window on STATA)
10
Using STATA: do Files
● Using do-files: Type your commands and comments in a .do file.
● They are a record of what you have done (There would be no record if you use
the menus instead of commands.)
● To open a .do file either:
○ Click pencil and pad icon for a new file or
○ File > Open and navigate to a previously saved .do file
● Remember to save your .do file - this is not done automatically
11
Stata demonstration (practical session)
• Panel data
12
Common data problems
• Heteroscedasticity
• Multicollinearity
• Autocorrelation
• Endogeneity
13