Stata Task

1. The document provides instructions to clean data from three datasets, perform analyses, and draw random samples. It involves merging datasets, cleaning variables, creating new variables, describing data, regression analysis, and sampling.
2. Regression analysis is conducted to identify wage determinants using the Mincer earnings function, with log wage as the dependent variable and education, experience and other individual characteristics as independent variables.
3. Further regressions examine the effects of categorical variables such as marital status and occupation on wages. Standard errors are clustered at the district level to account for within-district correlation.

Uploaded by

Saad Raja

Instructions:

1. We would like you to save the log file and the do-file for each section separately. Kindly send all your
log and do-files in a zipped folder.

Data Cleaning:

1. Make one data set out of three.

Hint: Merge the 200910 data first, then append the 201011 dataset.
2. Clean “DistrictName” variable.
Hint: Should have same name for the same district.
3. Make a new variable “Dist_code” having a unique code for each district.
4. Define and assign labels of districts created in question 2.
5. Make a new variable “age_brac” from “q406” having three brackets: 1 (14-29), 2 (30-45) and 3
(46-64). This variable should be missing (.) if age is greater than 64.
6. Create a new variable “Pcode_new” having the last four digits of “P_Code”.
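
A minimal sketch of questions 2 to 6 above, under stated assumptions: the four toy rows are invented for illustration, “P_Code” is assumed to be stored as a non-negative integer, and the district spellings are assumed to differ only in letter case and surrounding blanks. Real data will probably need further manual corrections of district names.

```stata
* Toy data, invented for illustration only; the real files are not shown here.
clear
input str12 DistrictName q406 long P_Code
"lahore" 25 12345678
"LAHORE" 40 12345679
"Multan" 50 22340012
"multan" 70 22340013
end

* Q2: harmonise spelling so each district has exactly one name.
replace DistrictName = proper(trim(DistrictName))

* Q3-Q4: one numeric code per district, value-labelled with the district name.
encode DistrictName, generate(Dist_code)

* Q5: age brackets; any age outside 14-64 (including > 64) stays missing (.).
generate byte age_brac = .
replace age_brac = 1 if inrange(q406, 14, 29)
replace age_brac = 2 if inrange(q406, 30, 45)
replace age_brac = 3 if inrange(q406, 46, 64)

* Q6: last four digits of P_Code. Leading zeros are dropped (0012 becomes 12).
* If P_Code were a string, substr(P_Code, -4, 4) would be the analogue.
generate int Pcode_new = mod(P_Code, 10000)
```

`encode` assigns codes in alphabetical order of the cleaned names and attaches the names as value labels, which is one way to cover questions 3 and 4 in a single step.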

Data Structure:

1. Describe the level of the data and report the number of missing values in “q406”.
2. Create a new variable “age_district” showing average age per district from “q406”. For
calculating the average age per district, ignore values of greater than 64 in “q406”.
3. Make a bar chart showing distribution of age bracket by gender aggregated at district level.
4. Create a new variable “age_brac_district” showing the percentages of individuals falling in each
age bracket using the “age_brac” variable created in question 5 of the Data Cleaning section.
5. Reshape data in such a manner that one row should represent one district and should have only
four columns: DistrictName, % falling in 14-29, 30-45 and 46-64 age brackets.
6. Make a table in Excel/Word using a Stata command having District Names and Average Age per
district.
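
The district-level steps above (questions 1, 2, 4 and 5) could be sketched as follows. The five toy rows and the helper names `age_valid`, `n`, `total` and `pct` are hypothetical; only “DistrictName”, “q406”, “age_brac” and “age_district” come from the task.

```stata
* Toy data, invented for illustration only.
clear
input str10 DistrictName q406 byte age_brac
"Lahore" 20 1
"Lahore" 35 2
"Lahore" 70 .
"Multan" 50 3
"Multan" 60 3
end

* Q1: number of missing values in q406.
count if missing(q406)

* Q2: district mean age, leaving ages above 64 out of the average.
* Missing q406 also fails the q406 <= 64 condition, so it is excluded too.
generate age_valid = q406 if q406 <= 64
egen age_district = mean(age_valid), by(DistrictName)

* Q4-Q5: percentage in each bracket per district, then one row per district.
* contract replaces the data in memory, so save or preserve the file first.
drop if missing(age_brac)
contract DistrictName age_brac, freq(n)
egen total = total(n), by(DistrictName)
generate pct = 100 * n / total
keep DistrictName age_brac pct
reshape wide pct, i(DistrictName) j(age_brac)
```

After the reshape, the columns are `DistrictName`, `pct1`, `pct2` and `pct3`. A bracket with no observations in a district shows up as missing rather than 0, which may need replacing before the table is reported.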

Regression Analysis:

The Mincer earnings function is a single-equation model that explains wage income as a function of
schooling and experience, named after Jacob Mincer. The equation has been examined on many datasets
and Thomas Lemieux argues it is "one of the most widely used models in empirical economics".
Typically the logarithm of earnings is modelled as the sum of determinants of wage.

The dataset you prepared and merged is the Labor Force Survey for the years 200910 and 201011. It contains
wage earned in the last year and other determinants such as sex of the member; age of the
member; years of education; marital status; number of years living in the district; whether the person has
obtained any professional training; principal professional activities in the last year; and any subsidiary
occupation.
We want to see which of these variables determine the wage of individuals.

Regression Specification

1) Please write the regression specification for the variables above, keeping the log of wage earned in the
previous year as the dependent variable and the other determinants as the independent variables.
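
One possible way to write such a specification is sketched here. The variable names are hypothetical, and the quadratic age term is an assumption: the classic Mincer form uses schooling plus a quadratic in experience, and age stands in for experience because the listed variables include age rather than experience.

```latex
\ln(\text{wage}_i) = \beta_0 + \beta_1\,\text{educ}_i + \beta_2\,\text{age}_i + \beta_3\,\text{age}_i^{2}
  + \beta_4\,\text{male}_i + \beta_5\,\text{training}_i + \beta_6\,\text{subsidiary}_i
  + \boldsymbol{\gamma}'\mathbf{Z}_i + \varepsilon_i
```

Here $\mathbf{Z}_i$ collects the remaining determinants (marital status, number of years living in the district, principal activity in the last year) and $\varepsilon_i$ is the error term.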

Data Setup

2a) Generate log of wage earned in the last year

2b) Recode the variables for gender, training attended and additional subsidiary work so that female
gender, training not attended and no additional subsidiary work are coded as 0 instead of 2.
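
A minimal sketch of steps 2a and 2b. The variable names `wage`, `sex`, `training` and `subsidiary` are hypothetical (the task does not give the actual names), and the three rows are invented for illustration.

```stata
* Toy data, invented for illustration; 1 = yes/male, 2 = no/female as in the task.
clear
input wage sex training subsidiary
12000 1 1 2
 8000 2 2 2
    0 1 2 1
end

* 2a: log of last year's wage; zero (and missing) wages give a missing log.
generate ln_wage = ln(wage) if wage > 0

* 2b: turn the 1/2 coding into 1/0 dummies (2 becomes 0, 1 is left as it is).
recode sex training subsidiary (2 = 0)
```

Because observations with zero wages drop out of a log-wage regression, it is worth counting them before running the model.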

Regression Analysis

3a) Run the regression in Stata and export the regression results in Word format.

3b) Interpret the regression results in 3a and explain which variables are important for determining wage.
Are there any counterintuitive results?

3c) The variables for marital status, number of years living in the district and principal activities in the last 12
months are categorical variables. We want to see the effect on wage of each category of these
variables. Rerun the regression with a dummy for each category, taking never married as the base for marital
status, more than 10 years as the base for number of years, and not in labor force as the base for activities
in the last 12 months.

Hint: use xi command in STATA and check its help manual for how to use categorical variable in
regression

3di) Rerun specification 3c with robust standard errors clustered at the district level.

3dii) What is the purpose of clustering at the district level?
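
A hedged sketch of 3c and 3di on simulated data. It uses factor-variable notation (available from Stata 11) in place of the `xi` prefix named in the hint. All variable names other than “Dist_code”, the coding 1 = never married, and the simulated values are assumptions made for illustration.

```stata
* Simulated data, invented for illustration only.
clear
set seed 12345
set obs 400
generate Dist_code = ceil(_n / 20)              // 20 hypothetical districts
generate educ      = floor(17 * runiform())     // years of education, 0-16
generate marital   = 1 + floor(3 * runiform())  // assumed: 1 = never married
generate ln_wage   = 8 + 0.08 * educ + rnormal()

* 3c: ib1. makes category 1 (assumed here to be never married) the base.
regress ln_wage educ ib1.marital

* 3d-i: same model, standard errors robust and clustered by district.
regress ln_wage educ ib1.marital, vce(cluster Dist_code)
```

Clustering changes only the standard errors, not the coefficients. It allows the errors of individuals in the same district to be correlated with one another.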

Sampling

Draw a random sample of 1000 observations

Draw a random sample of 15% of the data

Draw a sample of 3000 observations such that the number of observations drawn from each district is
proportional to that district's share of observations in the original sample
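
The three draws could be sketched as follows, on simulated data with four hypothetical districts. Drawing the same percentage within each district is one way to keep district shares proportional. It yields approximately 3,000 observations and not necessarily exactly that number, because each district's count is rounded.

```stata
* Simulated data, invented for illustration only.
clear
set seed 2024                          // makes the draws reproducible
set obs 10000
generate Dist_code = 1 + mod(_n, 4)    // four hypothetical districts

* Random sample of 1,000 observations.
preserve
sample 1000, count
restore

* Random sample of 15% of the data.
preserve
sample 15
restore

* Proportional draw: the same percentage from every district.
local pct = 100 * 3000 / _N
sample `pct', by(Dist_code)
```

`preserve` and `restore` bring back the full dataset after each draw, so every sample is taken from the original data rather than from an earlier sample.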
