0% found this document useful (0 votes)
13 views3 pages

ProjectInstructions GradeRubric

The document provides a grading rubric and recommendations for a research project involving statistical analysis. It outlines specific requirements and points allocated for different sections of the paper, including properly formulating a hypothesis, describing statistical models and control variables, conducting exploratory data analysis on variables, and reporting on regression results, diagnostics, and conclusions. It emphasizes writing in an academic style, justifying choices, and acknowledging limitations rather than focusing solely on statistical significance. The goal is for students to conduct a justified analysis and identify models with significant predictors rather than perfect diagnostics.

Uploaded by

nasrullah.khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views3 pages

ProjectInstructions GradeRubric

The document provides a grading rubric and recommendations for a research project involving statistical analysis. It outlines specific requirements and points allocated for different sections of the paper, including properly formulating a hypothesis, describing statistical models and control variables, conducting exploratory data analysis on variables, and reporting on regression results, diagnostics, and conclusions. It emphasizes writing in an academic style, justifying choices, and acknowledging limitations rather than focusing solely on statistical significance. The goal is for students to conduct a justified analysis and identify models with significant predictors rather than perfect diagnostics.

Uploaded by

nasrullah.khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

I.

Grade rubric

Below provides a specific grading rubric for your research project. If you fail to fulfill a
bullet point below, you will lose one or more points depending on how far away you are from
fulfilling it. Your project’s full credit is 100 points.

1. Basic Requirements [10 points]

• Submission of research question by the deadline (Oct 6) [4 points]


• A research paper with at least 10 full pages (excluding a cover page, tables, figures,
and references); 12 points, double space, and 1-inch margins all around. [2 points]
• R code. Please make your R code cleaned and succinct so that I can replicate your
analysis by simply running your code. [2 points]
• Your paper must consist of four sections: 1) Hypothesis; 2) Statistical Model; 3)
Analysis 1 – EDA; 4) Analysis 2 – Hypothesis Test Results. [2 points]

2. What to include in the Hypothesis section? [10 points]

• Clarify your hypothesis: “A (your explanatory variable) affects B (your dependent


variable.” [1 point]
• Provide strong reasons why you believe that your hypothesis should be right. In
other words, you should convince the reader that your hypothesis substantively makes
sense. To this end, use examples (anecdotal evidence), cite at least 5 existing
academic articles, and construct a logical mechanism. [9 points]

3. What to include in the Statistical Model section? [10 points]

• Introduce your control variables. You must have at least 3 control variables. [1 point]
• Provide strong reasons why you should include the control variables in your model.
For each of the control variables, provide the reasons. [8 points]
• Provide your regression model in this way: Y = beta_0 + beta_1X_1 + beta_2X_2 +
… Insert your variables for Y and X’s. [1 point]

4. What to include in the Analysis 1 -- EDA section? [30 points]

• For each of the dependent, primary independent, and control variables, provide
numerical and graphical EDA. You should clarify what type the variable is (i.e.,
continuous/categorical), along with appropriate numerical (e.g., frequency table,
mean/median/standard deviation, etc) and graphical summaries (barplot, histogram,
etc.) [10 points]
• Provide numerical and graphical EDA for bivariate relationships (two-variable
relationships): Dependent – Primary Independent and Dependent – Each of the
Control Variables. [10 points]
• Discussion of EDA results: 1) What implications do you extract from your EDA for
your hypothesis? [5 points] 2) What implications do you extract from your EDA for
regression diagnostics? (e.g., do you see any evidence for possible violation of the
regression assumptions, or do you not see any evidence?) [5 points]

5. What to include in the Analysis 2 – Hypothesis Test Results section? [40 points]
• Regression Diagnostics [15 points]
o Draw a plot for Residuals-versus-Fitted Values and discuss the plot regarding
the assumptions for linearity, random disturbances, and constant variances.
o Draw a Normal QQ plot and discuss the normality assumption.
o Test for multicollnearity and discuss the results.
o According to the regression diagnostics, come up with a solution if necessary.
Discuss what improvements your solution makes. (You will need to run
regression diagnostics for your new model if your solution leads to a new
model.)
• Interpretations [20 points]
o What's the meaning of the coefficient for each of the independent variables
(primary and control)?
o Is the coefficient statistically significant at 0.05 or 0.01 level?
o Are the effect sizes substantively large?
o Interpret the overall significance test result and Adjusted R^2.
• Conclusion [5 points]
o What do the analysis results suggest for your hypothesis?
o Your study’s weaknesses or limitations that lead to your future research plan.

II. Important recommendations/remarks

• Your regression diagnostics could be surprisingly bad compared to some of the “nice”
examples you’ve seen elsewhere in the course. In many cases, even after improving
diagnostics [for instance with transformations], they may still show various problems
or artifacts due to the nature of the data. If so, say so; and also say what other models
you tried and what those diagnostics looked like (and hence justify your final choice
comparatively; why was it best on balance?)
• Diagnostics will likely be improved by transformations.
• On the other hand, too many transformations make the interpretations more
complicated; you will need to make your own decisions about balancing and
articulating these tradeoffs and justifying whatever final decisions you make.
• If your primary independent variable is not significant in a model you try, it’s fine for
the project. It will indicate that there is no evidence that supports your hypothesis.
You might want to try another model, though. You don’t have to stick to one research
question/hypothesis. Change your hypothesis and model if you want or need.
• If none of the variables, including primary and control, is significant in a model you
try, then the model is no better than merely randomly guessing and then the model
should not be used in practice. Although it's still fine for this project, I would
recommend you should try another model that might be more interesting.
• For most models you might fit to most of these datasets, the Adjusted R^2 will be
surprisingly low compared to some of the “nice” examples you’ve seen elsewhere in
the course. That might simply be the nature of the relationship, and it’s not a sign of a
mistake or of an unjustified model. The primary goal is a model as justified as
possible, on balance, and with significant terms.
III. Useful advice for your writing

• Write as if it is an academic journal article. Use sentences – paragraphs to write your


paper rather than bullet points.
• Your title should be something interesting and meaningful that captures the reader’s
attention and has something to do with the topic and the work (not “my project” or
something similar that you might title a piece of course work).
• If you want to emphasize the importance of your research question, you should
suggest why the topic is of interest in the larger world and to the reader, not just to
you personally.
• It is common in academic writing to use third person (“we”, not “I”). The “we”
typically implies the author and the readers (as if you are leading the readers together
with you through the discovery process).
• Be professional in your writing; don’t use emoticons, memes, online shorthand, or
colloquialisms. This is practice for professional writing. [On the other hand,
“professional” doesn’t mean “artificially and unnecessarily complicated.” Be clear
and concise.]
• Don’t copy verbatim sentences, phrases, or title from the posted data prompts. You
should be creating your own words/phrases/transitions/paragraphs, etc.
• Don’t copy/paste from R directly. Instead, create new tables to provide R results.

You might also like