Midterm
observed freq 18 30 40 22 10
expected freq 12 36 48 18 6
chi-square 8.88888889
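The statistic above can be reproduced from the two frequency rows. This is a minimal sketch, assuming the usual goodness-of-fit formula sum((f - e)^2 / e); the function name `chi_square_stat` is illustrative and not part of the worksheet.

```python
def chi_square_stat(observed, expected):
    """Return sum((f_i - e_i)^2 / e_i) over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((f - e) ** 2 / e for f, e in zip(observed, expected))


# Frequencies copied from the worksheet rows.
observed = [18, 30, 40, 22, 10]
expected = [12, 36, 48, 18, 6]

chi2 = chi_square_stat(observed, expected)
print(round(chi2, 8))
```

Both rows sum to 120, which is a quick sanity check that the expected frequencies were scaled to the sample size.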
18-30 31-44 45-58 over 58 total
Jessica Chastain 51 50 41 42 184
Jennifer Lawrence 63 55 37 50 205
Emmanuelle Riva 15 44 56 74 189
Quvenzhane Wallis 48 25 22 31 126
Naomi Watts 36 65 62 33 196
total 213 239 218 230 900
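The expected frequencies for this table can be derived from the row and column totals. The sketch below assumes the table is meant for a chi-square test of independence (actress by age group), with e_ij = (row total * column total) / grand total; the variable names are illustrative.

```python
# Observed counts copied from the worksheet (columns: 18-30, 31-44, 45-58, over 58).
counts = {
    "Jessica Chastain": [51, 50, 41, 42],
    "Jennifer Lawrence": [63, 55, 37, 50],
    "Emmanuelle Riva": [15, 44, 56, 74],
    "Quvenzhane Wallis": [48, 25, 22, 31],
    "Naomi Watts": [36, 65, 62, 33],
}

rows = list(counts.values())
row_totals = [sum(r) for r in rows]
col_totals = [sum(c) for c in zip(*rows)]
grand_total = sum(row_totals)

# Expected frequency under independence: e_ij = row_i * col_j / grand total.
expected = [[rt * ct / grand_total for ct in col_totals] for rt in row_totals]

# Chi-square statistic summed over every cell of the table.
chi2 = sum(
    (f - e) ** 2 / e
    for obs_row, exp_row in zip(rows, expected)
    for f, e in zip(obs_row, exp_row)
)

# Degrees of freedom: (rows - 1) * (columns - 1).
df = (len(rows) - 1) * (len(col_totals) - 1)
print(row_totals, col_totals, grand_total, df)
```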
expected freq
difference
p-value 0.3208841
7.7760998E-08 <0.05 reject hypotheses not reject
chi-square 1.875
degree of freedom
(6-1) = 5
5.85
not reject
18 20 22 27
25 22 27 25
26 23 20 24
27 25 19 21
26 25 31 29
25 28 26 28
mean 24.5
standard deviation 3.0143336
intervals
<21.6 21.6-23.2 23.2-24.5
observed freq 5 4 3
expected freq 5 5 5
chi-square 0 0.2 0.8
degrees of freedom 6-2-1 = 3 critical chi-square = 6.251
not reject
65 100 80 65 70 500
>0.05
<11.070
22
24
26
25
25
24
mean + stdev*(z-score)
24.5-25.8 25.8-27.4 >27.4 total
7 7 4 30
5 5 5 30
0.8 0.8 0.2 2.8 < 6.251
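The normal goodness-of-fit calculation can be checked end to end. This sketch assumes the 24 values in the main data block and the six stray values listed separately form one sample of 30 (together they give the worksheet's mean of 24.5), and that the six intervals are equal-probability intervals of the fitted normal distribution.

```python
from bisect import bisect_right
from statistics import NormalDist, mean, stdev

# 24 values from the main block plus the 6 values listed separately (assumed same sample).
data = [
    18, 20, 22, 27, 25, 22, 27, 25, 26, 23, 20, 24,
    27, 25, 19, 21, 26, 25, 31, 29, 25, 28, 26, 28,
    22, 24, 26, 25, 25, 24,
]

k = 6  # number of intervals
m, s = mean(data), stdev(data)  # sample mean and sample standard deviation

# Interval boundaries: mean + stdev * z for cumulative probabilities 1/6 ... 5/6.
fitted = NormalDist(m, s)
cuts = [fitted.inv_cdf(i / k) for i in range(1, k)]

# Count how many observations fall in each interval.
observed = [0] * k
for value in data:
    observed[bisect_right(cuts, value)] += 1

expected = len(data) / k  # 5 per interval
chi2 = sum((f - expected) ** 2 / expected for f in observed)
df = k - 2 - 1  # two estimated parameters (mean and standard deviation)

print([round(c, 1) for c in cuts], observed, round(chi2, 4), df)
```

The statistic is then compared with the worksheet's critical value of 6.251 for 3 degrees of freedom.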
Least Squares Method
restaurant xi yi xi-mean x
1 2 58 -12
2 6 105 -8
3 8 88 -6
4 8 118 -6
5 12 117 -2
6 16 137 2
7 20 157 6
8 20 169 6
9 22 149 8
10 26 202 12
total 140 1300
mean 14 130
b0 = mean y - b1*mean x
b0 = 60
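The intercept can be reproduced from the table. This is a minimal sketch of the least squares formulas b1 = sum((xi - mean x)(yi - mean y)) / sum((xi - mean x)^2) and b0 = mean y - b1 * mean x; the variable names are illustrative.

```python
# xi and yi columns copied from the restaurant table.
x = [2, 6, 8, 8, 12, 16, 20, 20, 22, 26]
y = [58, 105, 88, 118, 117, 137, 157, 169, 149, 202]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Sums of cross-products and squared deviations about the means.
sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
sxx = sum((xi - mean_x) ** 2 for xi in x)

b1 = sxy / sxx              # slope
b0 = mean_y - b1 * mean_x   # intercept

print(mean_x, mean_y, b1, b0)
```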
t Test
estimated standard deviation of b1
s(b1) = s/sqrt(sum((xi-mean x)^2)) = 0.580265238041082
F = MSR/MSE = 74.2483660130719
ANOVA
source of variation SS DF MS
regression SSR = 14200 number of independent variables = 1 MSR = SSR/1 = 14200
error SSE = 1530 n-2 = 8 MSE = SSE/(n-2) = 191.25
total SST = 15730 n-1 = 9
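The ANOVA entries, s(b1), and the F statistic can be recomputed from the restaurant data. The sketch below refits the line and then builds the sums of squares; it assumes simple linear regression with one independent variable, and the variable names are illustrative.

```python
from math import sqrt

x = [2, 6, 8, 8, 12, 16, 20, 20, 22, 26]
y = [58, 105, 88, 118, 117, 137, 157, 169, 149, 202]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n
sxx = sum((xi - mean_x) ** 2 for xi in x)

# Least squares estimates.
b1 = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sxx
b0 = mean_y - b1 * mean_x
y_hat = [b0 + b1 * xi for xi in x]

# Sums of squares.
sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))  # error
sst = sum((yi - mean_y) ** 2 for yi in y)              # total
ssr = sst - sse                                        # regression

# Mean squares and test statistics.
msr = ssr / 1          # one independent variable
mse = sse / (n - 2)
s = sqrt(mse)          # standard error of the estimate
s_b1 = s / sqrt(sxx)   # estimated standard deviation of b1
t_stat = b1 / s_b1
f_stat = msr / mse

print(sse, sst, ssr, mse, s_b1, t_stat, f_stat)
```

With a single independent variable the two tests agree, since the square of the t statistic equals F.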
(chart: x-axis is student population (1000s), xi)
coefficient of determination
correlation coefficient
, sales increases.
expected sales, which means quarterly sales are expected to increase by $5 per student.
Regression Statistics
Multiple R 0.9501229552
R Square 0.90273363001
Adjusted R Square 0.89057533376
Standard Error 13.8293166859
Observations 10
ANOVA
df SS MS F Significance F
Regression 1 14200 14200 74.248366 2.54887E-05
Residual 8 1530 191.25
Total 9 15730
error (yi-y^)   squared error (yi-y^)^2   squared deviation (yi-mean y)^2   squared deviation due to regression (y^-mean y)^2
-12 144 5184 3600
15 225 625 1600
-12 144 1764 900
18 324 144 900
-3 9 169 100
-3 9 49 100
-3 9 729 900
9 81 1521 900
-21 441 361 1600
12 144 5184 3600
SSE = 1530 SST = 15730 SSR = SST-SSE = 14200
1530 15730 14200
r^2 = SSR/SST
r^2 = 0.90273363001
90.27% of the total sum of squares can be explained by using the estimated regression equation to predict quarterly sales.
rxy = (sign of b1)sqrt(r^2)
rxy = 0.9501229552
A strong positive linear association exists between x and y.
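The coefficient of determination and the correlation coefficient follow directly from the sums of squares. This is a minimal sketch using the worksheet's SSR and SST; the only assumption is that the slope is positive (the value 5 used below is the slope implied by the restaurant data, and only its sign matters here).

```python
from math import copysign, sqrt

ssr = 14200   # sum of squares due to regression (from the worksheet)
sst = 15730   # total sum of squares (from the worksheet)
b1 = 5        # slope; assumed positive, only the sign is used

r2 = ssr / sst                 # coefficient of determination
rxy = copysign(sqrt(r2), b1)   # correlation coefficient takes the sign of b1

print(round(r2, 4), round(rxy, 4))
```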
R^2 in shoe sales prediction
d)
R^2 = SSR/SST = 0.75
e)
Ra^2 = 1-(1-R^2)*((n-1)/(n-p-1)) = 0.67857143
f)
Since R^2 = 0.75, it can be concluded that 75% of the variability in y is explained by
the estimated multiple regression equation with x1 and x2 as the independent
variables.
After adjusting for the number of independent variables in the model, only 67.86%
of the variability in y has been accounted for.
The estimated regression equation may not provide a good fit.
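The adjusted value can be checked with a short function. The worksheet does not state n for this problem; the sketch assumes n = 10 observations and p = 2 independent variables, which is the combination that reproduces 0.67857143 from R^2 = 0.75. The function name `adjusted_r2` is illustrative.

```python
def adjusted_r2(r2, n, p):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1)."""
    if n - p - 1 <= 0:
        raise ValueError("need n > p + 1 observations")
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


# n = 10 is an assumption; p = 2 corresponds to x1 and x2.
ra2 = adjusted_r2(0.75, n=10, p=2)
print(round(ra2, 8))
```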
SST = 15182.9
SSR = 14052.2
s(b1) = 0.2471
s(b2) = 0.9484
a)
MSR = SSR/p = 7026.1
SSE = SST-SSR = 1130.7
MSE = SSE/(n-p-1) = 161.528571
F = MSR/MSE = 43.4975679
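The F statistic in part a) can be recomputed from SST and SSR. This sketch assumes p = 2 independent variables and n = 10 observations, so that n - p - 1 = 7 (the degrees of freedom the worksheet uses for the t tests); the variable names are illustrative.

```python
sst = 15182.9   # total sum of squares (from the worksheet)
ssr = 14052.2   # sum of squares due to regression (from the worksheet)
p = 2           # independent variables
n = 10          # assumed, so that n - p - 1 = 7

sse = sst - ssr             # sum of squares due to error
msr = ssr / p               # mean square regression
mse = sse / (n - p - 1)     # mean square error
f_stat = msr / mse

print(round(msr, 1), round(sse, 1), round(mse, 6), round(f_stat, 7))
```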
b)
t = b1/s1 = 8.13435856
df = n-p-1 = 7
a = 0.05
t critical value: t = 2.365 < 8.134
The test statistic value is greater than the critical value, so reject the null hypothesis.
Therefore, it can be concluded that the parameter beta1 is statistically significant.
c)
t = b2/s2 = 4.99789119
df = 7
a = 0.05
t critical value: t = 2.365 < 4.998
The test statistic value is greater than the critical value, so reject the null hypothesis.
Therefore, the parameter beta2 is statistically significant.
Estimated regression equation: y^ = 20385.25 - 0.03739x1 - 686.34x2
b)
The R square is 0.951, and the correlation coefficient is 0.9752
The obtained correlation coefficient is greater than +0.7, so multicollinearity will be a problem.
c)
F test statistic value = 290.8454
a = 0.05
df = 2 and 30
F critical value = 3.32 < 290.8454
The test statistic value is greater than the critical value, so the null hypothesis is rejected.
Therefore, a significant relationship exists between y and the two independent variables.
d)
t = b1/s1 = #REF! < -2.042
t = b2/s2 = #REF! < -2.042
a/2 = 0.025
df = 30
t critical value = 2.042
The absolute value of each test statistic is greater than the critical value, so reject the null hypothesis.
Therefore, it can be concluded that the mileage and age variables are both statistically significant.
b)
F test at a = 0.05
F = 6.2519
df = 2 and 4
the critical value is 6.94 > 6.25
c)
t = b1/s1 = #DIV/0!
t = b2/s2 = #DIV/0!
a = 0.05
df = 4
The critical t value is 2.776.
The test statistic value is less than the critical value, so the null hypothesis is not rejected.
No significant relationship exists between y and the two independent variables.