Assignment of Multiple Linear Regressions
Checking Assumptions
Assumption 1
Plot dependent variable against each explanatory variable
Assumption 2
Plot histogram and normal probability plot of residuals
Assumption 3
Plot residuals against each explanatory variable.
NO pattern should be seen
Variability should be constant across range
[Figure: scatter plots against SBP. Panel C: height vs SBP. Panel D: DBP vs SBP.]
Residuals should have CONSTANT VARIATION against range of fitted values for y.
N.B.
The normal P-P plot of the regression standardized residuals and the histogram are used to check normality.
The scatter plot of the standardized residuals against the dependent variable is used to check that the
spread of the dependent variable is constant over the range of x (independent) values.
The scatter plot of the dependent variable vs each independent variable is used to check linearity.
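The constant-spread check can also be approximated numerically. The following is a minimal Python sketch, assuming a single predictor and synthetic data generated only for illustration (not the assignment's data set); the helper names fit_line and spread_ratio are hypothetical. It fits a line, sorts the residuals by fitted value, and compares the residual standard deviation in the lower and upper halves. A ratio near 1 is consistent with constant variation; it supplements the residual plot and does not replace it.

```python
import random
import statistics


def fit_line(x, y):
    """Least-squares slope and intercept for one predictor."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def spread_ratio(x, y):
    """Residual SD in the upper half of fitted values divided by the lower half."""
    slope, intercept = fit_line(x, y)
    fitted = [intercept + slope * xi for xi in x]
    residuals = [yi - fi for yi, fi in zip(y, fitted)]
    ordered = [r for _, r in sorted(zip(fitted, residuals))]
    half = len(ordered) // 2
    return statistics.pstdev(ordered[half:]) / statistics.pstdev(ordered[:half])


# Synthetic illustration only: a DBP-like predictor and an SBP-like outcome
# with the same noise level everywhere (values are made up).
random.seed(1)
x = [random.uniform(60, 110) for _ in range(200)]
y = [40 + xi + random.gauss(0, 10) for xi in x]
ratio = spread_ratio(x, y)
print(f"upper/lower residual SD ratio: {ratio:.2f}")
```

A ratio far from 1 (for example, residuals that fan out as the fitted values grow) would suggest that the constant-variation assumption is violated.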
Regression analysis for possible significant predictors, using systolic BP as the outcome variable
a. Predictors: (Constant), high blood pressure, body weight in pound, race of the subject, serum
cholesterol level in mg per 100ml, does respondent smoke now?, sex of the subject, diastolic blood
pressure, age of the subject in years, height in inches
R² = 77.2%
77.2% of the variation in systolic BP (the dependent variable) is explained by the model.
With many variables in the model, R² tends to be an overestimate.
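The usual correction for this overestimate is the adjusted R², which penalizes the number of predictors. The sketch below applies the standard formula 1 - (1 - R²)(n - 1)/(n - p - 1) to the reported R² of 0.772 with the 9 predictors listed in footnote a. The sample sizes are hypothetical, because the actual n is not stated in this section, and the function name adjusted_r2 is chosen for illustration.

```python
def adjusted_r2(r2, n, p):
    """Adjusted R-squared for p predictors fitted to n observations."""
    if n - p - 1 <= 0:
        raise ValueError("n must exceed the number of predictors plus one")
    return 1.0 - (1.0 - r2) * (n - 1) / (n - p - 1)


r2 = 0.772  # reported in the text
p = 9       # predictors listed in footnote a (constant excluded)

# Hypothetical sample sizes; the real n is not given in this section.
for n in (50, 200, 1000):
    print(n, round(adjusted_r2(r2, n, p), 3))
```

The adjustment shrinks as n grows relative to p, so with a large sample the adjusted value stays close to 77.2%.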
Table 2: ANOVA
b. Predictors: (Constant), high blood pressure, body weight in pound, race of the subject,
serum cholesterol level in mg per 100ml, does respondent smoke now?, sex of the subject,
diastolic blood pressure, age of the subject in years, height in inches
Table 3: Coefficients
                                                                           95% CI for B
Predictor                    B        Std. Error   Beta    t        Sig.   Lower Bound   Upper Bound
sex of the subject           5.058    1.300        .124    3.891    .000   2.502         7.614
race of the subject          -.211    .866         -.006   -.244    .808   -1.914        1.492
body weight in pound         .031     .014         .065    2.253    .025   .004          .058
diastolic blood pressure     .505     .052         .285    9.698    .000   .402          .607
does respondent smoke now?   .022     1.086        .001    .021     .984   -2.113        2.157
high blood pressure          26.465   1.416        .566    18.692   .000   23.681        29.249
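The numbers in Table 3 can be cross-checked by hand. Reading the first two values in each row as the coefficient B and its standard error, and the fourth as the t statistic, t should equal B divided by the standard error, and the last two values should be close to B plus or minus a critical value times the standard error. The sketch below does this for three rows. The critical value 1.96 is an assumption (a normal approximation); SPSS uses the t distribution, so the bounds differ slightly from the table.

```python
def t_statistic(b, se):
    """t statistic for testing whether a coefficient is zero."""
    return b / se


def approx_ci(b, se, crit=1.96):
    """Approximate 95% confidence interval; crit=1.96 is a normal approximation."""
    return b - crit * se, b + crit * se


# (B, Std. Error, reported t) copied from Table 3.
rows = {
    "sex of the subject": (5.058, 1.300, 3.891),
    "diastolic blood pressure": (0.505, 0.052, 9.698),
    "high blood pressure": (26.465, 1.416, 18.692),
}

for name, (b, se, t_reported) in rows.items():
    low, high = approx_ci(b, se)
    print(f"{name}: t = {t_statistic(b, se):.3f} (reported {t_reported}), "
          f"approx. 95% CI = ({low:.3f}, {high:.3f})")
```

Small mismatches in the third decimal are expected, because the table shows B and the standard error rounded to three places.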
Using backward elimination, an automated modeling approach, on the candidate independent variables
(predictors) for the dependent variable systolic BP, the following predictors are significant: age, sex,
body weight, height, diastolic blood pressure, and high blood pressure.
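The backward procedure can be sketched as a loop: fit the model, drop the predictor with the largest p-value if it is at or above a removal threshold, refit, and repeat. The code below is a minimal illustration, not the SPSS implementation. The threshold of 0.10 is an assumption, the function names are hypothetical, and the demonstration uses a stand-in that returns the Sig. values from Table 3 unchanged instead of refitting (a real run would refit the regression at every step). Only the predictors whose rows appear in Table 3 are included.

```python
def backward_eliminate(predictors, fit_p_values, threshold=0.10):
    """Drop the least significant predictor until every p-value is below threshold.

    fit_p_values(predictors) should refit the model on the given predictors
    and return a dict mapping each predictor name to its p-value.
    """
    kept = list(predictors)
    while kept:
        p_values = fit_p_values(kept)
        worst = max(kept, key=lambda name: p_values[name])
        if p_values[worst] < threshold:
            break
        kept.remove(worst)
    return kept


# Sig. values copied from Table 3 (short names are illustrative).
TABLE3_SIG = {
    "sex": 0.000,
    "race": 0.808,
    "body weight": 0.025,
    "diastolic BP": 0.000,
    "smokes now": 0.984,
    "high BP": 0.000,
}


def lookup_p_values(predictors):
    """Stand-in for a real refit: returns the fixed Table 3 values."""
    return {name: TABLE3_SIG[name] for name in predictors}


selected = backward_eliminate(TABLE3_SIG, lookup_p_values)
print(selected)
```

With these fixed values the loop removes smoking status and then race, which agrees with their absence from the list of significant predictors.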
Table: Backward modeling approach
The multiple regression model of SBP obtained by backward elimination is as follows:
Interpretation
Interpretation of unstandardized coefficients
1. The predicted value of SBP increases by 0.233 mmHg for each 1-year increase in age, after
controlling for the other independent predictors.
2. The predicted value of SBP differs by 0.031 mmHg between the sexes, after controlling for
the other independent predictors (this item needs revision: 0.031 is the body weight coefficient in
Table 3, where the sex coefficient is 5.058).
3. The predicted value of SBP increases by 0.811 mmHg for each 1-inch increase in height, after
controlling for the other independent predictors.
4. The predicted value of SBP increases by 0.499 mmHg for each 1 mmHg increase in DBP, after
controlling for the other independent predictors.
5. The predicted value of SBP is 26.490 mmHg higher for subjects with high BP, after controlling
for the other independent predictors.
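The interpretations above translate directly into differences in predicted SBP. The sketch below multiplies each quoted coefficient by a change in the corresponding predictor and sums the results. It covers only the coefficients for age, height, DBP, and high BP; the intercept is not given in this section, so only differences between two profiles can be computed, not absolute predictions. The dictionary keys, the function name, and the 0/1 coding of high BP are assumptions.

```python
# Coefficients quoted in the interpretation list (mmHg per unit of each predictor).
COEFFICIENTS = {
    "age": 0.233,       # per year
    "height": 0.811,    # per unit of height as recorded in the data set
    "dbp": 0.499,       # per mmHg of diastolic BP
    "high_bp": 26.490,  # high BP indicator (assumed coded 1 = yes, 0 = no)
}


def predicted_sbp_difference(changes):
    """Difference in predicted SBP (mmHg) for the given predictor changes,
    with all other predictors held fixed."""
    return sum(COEFFICIENTS[name] * delta for name, delta in changes.items())


# Two subjects who differ only by 10 years of age.
print(round(predicted_sbp_difference({"age": 10}), 2))

# Two subjects who differ by 10 years of age and 5 mmHg of DBP.
print(round(predicted_sbp_difference({"age": 10, "dbp": 5}), 3))
```

For instance, a 10-year age difference corresponds to 10 × 0.233 = 2.33 mmHg of predicted SBP when everything else is equal.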
Summary
The objective of the analysis was to identify factors significantly associated with increased systolic
blood pressure. Accordingly, age, sex, body weight, height, diastolic blood pressure, and high blood
pressure are significantly associated with systolic BP.