0% found this document useful (0 votes)
762 views1 page

Regression Analysis Sheet

This document provides an overview of regression analysis, including definitions of key terms, the graphical and mathematical representations of regression models, how to interpret regression results by examining values like the R-squared and p-values, and checks that should be made of key assumptions in building regression models to best predict relationships between dependent and independent variables. Contact information is also provided for the Graduate Resource Center at the University of New Mexico for students seeking additional help with regression analysis techniques and selecting optimal models.

Uploaded by

Hanane Atif
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
762 views1 page

Regression Analysis Sheet

This document provides an overview of regression analysis, including definitions of key terms, the graphical and mathematical representations of regression models, how to interpret regression results by examining values like the R-squared and p-values, and checks that should be made of key assumptions in building regression models to best predict relationships between dependent and independent variables. Contact information is also provided for the Graduate Resource Center at the University of New Mexico for students seeking additional help with regression analysis techniques and selecting optimal models.

Uploaded by

Hanane Atif
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

R E G R EGRADUATE

SSION ANALYSIS CHEAT SHEET


R ESOURCE C ENTER , U NIVERSITY OF N EW M EXICO

1 R EGRESSION BASICS 2 R EGRESSION 3 R EGRESSION RESULT


Definitions and Terms Graphical Representation (univeriate linear model) After Regression, what to look for
Variable: Measurable characteristics that varies (by groups, indivi- 1. R-square (R2 ): It shows the amount of variance of DV explained
duals or time) by IV (described in percent)

Dependent/Outcome Variable (DV): Presumed effect in an analysis 2


2. p-value of the model: It tests whether R is different from
0. A value less than 0.05 shows statistically significant relationship
Independent/Explanatory Variable (IV): The presumed cause between IV and DV
in an analysis
3. p-value of the coefficient: p-value of the hypothesis testing
Control Variable/Covariate: Variables that are not studied but coefficient is different from 0 (H0 ).
included in the model/analysis
4. Coefficient (βi ): For each one-point increase in IV, the DV
Parameter: Unknown population characteristics is expected to increases by βi (or decreases if βi is negative), holding
all the other independent variables constant
Best Fitting Line: When plotting data, the most appropriate Note: Coefficient of logistic regression is interpreted differently
line showing the relationship between dependent and independent
variables 5. Regression results also provides:
t-statistics: Coefficient divided by standard error
Residual: Deviations from the fitted line (estimated value) to Standard error: Standard deviation of the coefficient
Mathematical Representation
the observed values (data point) Confidence Interval of the coefficient - Usually 95%
1. Linear regression (DV is a continuous variable)
Root mean squared error: Standard deviation of the regression
Error: Difference between the observed value and the true va- Constant Coefficient IV
DV Error
lue (often unobserved) z}|{ z}|{ z}|{ z}|{ z}|{ Check for Key Assumptions
yi = β0 + β0 xi + i
1. Linearity: The relationship between IV and the mean of DV is
Regression Coefficient: Describes the relationship between a 2. Logistic Regression (DV is a binary variable) linear.
DV and IV
1 2. Homoscedasticity: The variance of residual is the same for
Understanding Regression Pr(yi = 1 | xi ) = 1+exp−(β0 +βi xi ) any of IV.
Can we explain how much, on average, DV changes because of IV?
Steps to Regression Analysis 3. Normality: For any fixed value of IV, DV is normally dis-
Can we predict, on average, what DV values might be for a 1. Choose a suitable model tributed.
value of IV? 2. Choose a suitable predictors
3. Select a technique for fitting the model 4. Multicollinearity: The independent variables are not not
Can we determine if the amount of change in one variable is 4. Fit the model perfectly multicolinear (one IV should not be a linear function of
related to, on average, the amount of change in another? 5. Assesses goodness of fit of the model another).
Model Building 6. Interpret and Conclude - make causal inference, Note: Assumptions 1, 2 and 3 do not apply to
Model is a simplified representation of reality issue prediction logistic regression.

When building a model, think: Selecting the "BEST" Model


1. What are the variables that interact? N EED H ELP ? Goal is to minimize the residual mean square (which maximizes
2. How do those variables interact?
Contact Us R2 ) - by comparing regression models
3. Apply underlying theories
Graduate Resource Center
Mesa Vista Hall, Suite 1057 Use informtion criterion statistics -
Phone: 505-277-1407 Akaike’s Information Criterion (AIC)
Email: [email protected] Bayesian Information Criterion (BIC)
Website: https://unmgrc.unm.edu/

You might also like