Linear Regression Analysis For Survey Data
Linear Regression Analysis For Survey Data
Regression in Surveys
Useful for modeling responses to survey
questions as function of (external)
sample data and/or other survey data
Sometimes easier/more efficient then highdimensional multi-way tables
Useful for summarizing how changes in the
Xs affect Y
yi = 0 + 1 xi + i
0 + 1 xi
4
Linear Model
Can think of it as modeling the expected value
of y,
E ( y | x ) = 0 + 1 x
E ( y | x ) y = 0 + 1 x
1 =
n
1 n
xi yi yi xi
n i =1 i =1
i =1
x xi
n i =1
i =1
n
and 0 = y 1 x
2
i
Strongly agree
Agree
Neutral
Disagree
Strongly disagree
2-1=1
3-2=1
4-3=1
5-4=1
My Take
Generally, Im okay with assumption for 5-point Likert
scale
Boils down to assuming Agree is halfway between
Neutral and Strongly agree
Strongly
agree
Strongly
disagree
Correlation Matrices
Useful Place to Start
JMP: Analyze > Multivariate Methods > Multivariate
Coding Examples
0/1 coding
Compares calc_grade
to a baseline group
Regression equation:
females:
calc_grade=80.41 - 0.48 0
males:
calc_grade=80.41 0.48 1
-1/1 coding
Compares each group
to overall average
Regression equation:
females: calc_grade=80.18 + 0.24 1
males: calc_grade=80.18 + 0.24 (-1)
13
14
15
If Assumptions Met
...can use regression to do the usual
inference
Hypothesis tests on the slope and intercept
R-squared (fraction in the variation of y
explained by x)
Confidence and prediction intervals, etc.
A Note on Weights
and Weighted Least Squares
Weighted least squares often discussed in
statistics textbooks as a remedy for unequal
variances
Weights used are not the same as sampling
weights previously discussed
Regression in JMP
In JMP, use Analyze > Fit Model to do
multiple regression
Fill in Y with (continuous) dependent
variable
Put Xs in model by highlighting and then
clicking Add
Use Remove to take out Xs
Click Run Model when done
23
10 15 20 25 30 35 40 45 50
24
15
.01
.05 .10
.25
.50
.75
.90 .95
.99
10
5
0
-5
-10
-15
-20
-25
-3
-2
-1
27