0% found this document useful (0 votes)
57 views2 pages

Sessions 1-6 - Lab Assignment

This summary provides an overview of the key steps in a multi-session Stata lab assignment on testing the impact of improved toilet facilities on child health outcomes using DHS data. The steps include: opening and preparing the DHS datasets; creating relevant variables; merging datasets; reviewing the data; running regressions with and without clustering standard errors; adding controls; and considering an alternative research design using propensity score matching.

Uploaded by

Hatim Rashid
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views2 pages

Sessions 1-6 - Lab Assignment

This summary provides an overview of the key steps in a multi-session Stata lab assignment on testing the impact of improved toilet facilities on child health outcomes using DHS data. The steps include: opening and preparing the DHS datasets; creating relevant variables; merging datasets; reviewing the data; running regressions with and without clustering standard errors; adding controls; and considering an alternative research design using propensity score matching.

Uploaded by

Hatim Rashid
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 2

Sessions 1-6 - Lab assignment This assignment will be done over two sessions, and you will continue

building on this dofile in subsequent sessions, so save your dofile (on a USB, or email it to yourself). Email your wor so far on the dofile to the T! before leaving lab at the end of the session. "a e sure your first and last names are clearly indicated. #ur ob$ective is to test whether im%roved toilet facilities affect child health as measured by the weight for age &'score, using the "()S *++,'- and *+.. datasets. /e are %re%aring data to run the following regression0 1i 2 + 3 . T#(4ETi 3 * /E!4T5 3 ei /here0 1 is the weight'for'age 1'score of the child T#(4ET is a dummy for having im%roved toilet facilities /E!4T5 is a household wealth inde6 (already calculated by 7un$ab Bureau of Statistics) 8o all the following ste%s in a dofile. /here you need to comment on the res%onses to the questions, annotate the dofile by %receding the relevant lines with an asteris (9). .) #%en "()S *+.. household data file. *) :ind the geogra%hic and 55 (8s that uniquely identify the household. ;) (dentify and summari&e the ey variables we will need to run our regression. <) Tabulate the toilet ty%e variable, with and without labels. 8ecide how you want to define =im%roved toilet>. )reate a new variable, toilet, which is . for an im%roved toilet, and + otherwise. 8o you need to re%lace any values as missing? /hy @ why not? A) 5istogram the weight'for'age 1'score variable. 8o you need to re%lace any values as missing? /hy @ why not? B) )reate a variable, round, to denote the round of the survey. ,) "erge household datafile with child datafile. /hich of the two has unique values? -) Ceview the data using summari&e and browse commands and chec that everything has wor ed %ro%erly. 7ay s%ecial attention to the Dmerge variable. /hat does each value re%resent? /hy are there some values of . in this variable? E) #%en "()S *++,'- child'household data. !re all the variables %resent that are required for the regression? )hange any variable names as needed so they match the "()S *+.. dataset. .+) )reate a round variable in both rounds of the survey so that when you a%%end the two it is clear which observations are from which round. ..) !%%end the * survey rounds. .*) Ceview the data using summari&e and browse commands and chec that everything has wor ed %ro%erly. .;) Cun the regression s%ecification above (this is a %ooled cross section regression).

.<) "a e a table to dis%lay the results using the outreg command. .A) !t what level do you thin the standard errors should be clustered? /hy? Cun the regression again with the a%%ro%riate clustering, using the synta6 reg y 6, cluster(cluster(8) .B) !dd the results of the regression with clustered SE to the table ne6t to the original results. 5ow do the results differ? .,) /hat inds of #FB could occur in this regression? /hat variables in the dataset could you include as controls to hel% address this? Browse variable labels and@or the questionnaire (on the 8ro%bo6) to identify a%%ro%riate variables. .-) 4oo at the descri%tives and distribution of these new control variables and assess whether you need to do any cleaning of these variables, as you did with the main study variables above. 7erform any needed cleaning. .E) Cun the regression again with clustering and the additional controls, and %ut it in the table ne6t to the first two columns. 5ow do the results differ? *+) Gow consider that you are designing a new study on 7un$ab. Hou want to test whether two biradaris have differences in wealth, as measured by the number of cattle owned. Gote that the "()S survey does not include information on biradari, so you need to design a new sam%le survey which does as for this information. Use the information from the "()S and StataIs sam%si command to calculate the a%%ro%riate sam%le si&e. S%ecify what effect si&e, al%ha and beta you are using and what each of them means. *.) Use the %smatch* command to carry out a %ro%ensity'score matching a%%roach to the same research question0 how much does? /hat variables will you %ut in the %ro%ensity score equation? !dd these results to the table you made earlier using #4S. 5ow and why do these results differ from the results of the #4S models?

You might also like