0% found this document useful (0 votes)
47 views5 pages

TEAM 10 - Task

The regression model analyzed the relationship between extra weight (dependent variable) and age first smoked cigarettes and number of cigarettes smoked per day (independent variables) based on data from 30 smokers. The linear regression equation found a positive correlation between age first smoked and extra weight, and a negative correlation between cigarettes smoked per day and extra weight. However, the regression model was determined to not be statistically valid at the 90% confidence level, and the model parameters were not found to be statistically significant.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
47 views5 pages

TEAM 10 - Task

The regression model analyzed the relationship between extra weight (dependent variable) and age first smoked cigarettes and number of cigarettes smoked per day (independent variables) based on data from 30 smokers. The linear regression equation found a positive correlation between age first smoked and extra weight, and a negative correlation between cigarettes smoked per day and extra weight. However, the regression model was determined to not be statistically valid at the 90% confidence level, and the model parameters were not found to be statistically significant.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

TEAM 10

Aim of study: the behavior of extra weight, depending on the number of cigarettes smoked per
day (over past 30 days) and on the age when first smoked a cigarette.
Data recorded for 30 smokers randomly drawn:

Age when first smoked a Number of cigarettes smoked


Extra weight (kg)
cigarette (years) per day past 30 days
0.79 11 14
1.60 15 11
1.26 17 17
1.37 15 17
0.80 15 19
1.01 16 15
0.39 16 2
1.50 15 16
0.81 13 5
1.35 18 7
0.48 15 17
0.45 15 1
0.69 13 15
0.65 17 1
1.01 14 16
6.59 16 6
1.83 11 8
0.65 16 16
0.77 16 5
0.73 12 9
2.36 11 22
1.12 14 17
0.96 16 19
1.17 17 21
0.90 14 18
1.44 17 17
0.75 21 8
1.43 22 20
0.81 15 20
0.89 11 3
1.83 19 16
2.21 19 8
2.57 16 10
1.34 17 11
1.97 14 15
1.17 13 3
2.26 13 4
2.33 41 7
0.50 21 13
3.17 15 15

Process the data in Excel (Data/Data Analysis/Regression) and answer the following questions:

a. Identify the variables, the linear regression equation and interpret the partial regression
coefficients.
b. Is there enough evidence to conclude that the regression model is valid, at 90%
confidence level? (critical value: 2,45).
c. Test the significance of the model parameters (critical value: 1,687).
d. Find and interpret the confidence intervals for the model parameters.
e. Compute and interpret the coefficient of determination.
f. Analyze the direction and the strength of the relationship between the three variables,
using an appropriate statistical indicator. Test its significance.
g. Get the Correlation Matrix (use Data/Data Analysis/Correlation). Explain the values on
the main diagonal.
h. Predict a person’s extra weight, if he started to smoke when he was 15 years old and used
to smoke 3 cigarettes per days in the last 30 days.

Solution

a.Identify the variables, the linear regression equation and interpret the partial regression
coefficients.

The independent variable (x1) = Age when first smoked a cigarette (years)

The independent variable (x2) = Number of cigarettes smoked per day past 30 days

The dependent variable (y) = extra weight(kg)

SUMMARY
OUTPUT
Regression Statistics
Multiple R 0.129859528
R Square 0.016863497
Adjusted R Square -0.036279017
Standard Error 1.087275694
Observations 40

ANOVA
  df SS MS F Significance F
0.37513 0.31732
Regression 2 0.750265374 3 6 0.730055194
1.18216
Residual 37 43.74023213 8
Total 39 44.4904975      

Standard
  Coefficients Error t Stat P-value Lower 95% Upper 95%

1.55957 0.12737
Intercept 1.108225856 0.710593552 8 3 -0.331573443 2.548025155

0.48839
X Variable 1 0.025217755 0.036033095 0.69985 8 -0.04779223 0.09822774

X Variable 2 -0.009522382 0.028242804 -0.33716 0.7379 -0.06674774 0.047702975

Linear Regression Equation

 Population;
Y i = β 0 + β 1∗X 1i + β 2∗X 2i + ε i

 Sample:
y i= b 0 + b 1∗x 1i + b 2∗x 2i + e i

y i= 1,108 + 0,025∗x 1i - 0,009∗x 2i + e i

b 1=0,025 > 0 => positive correlation between Age when first smoked a cigarette (years) and
extra weight(kg)

b 2 = -0,009 < 0 => negative correlation between Number of cigarettes smoked per day past 30
days and extra weight(kg)
 If the extra Age when first smoked a cigarette (years) increases by 1year, then the extra
weight increase by 0,025 years.

 If the Number of cigarettes smoked per day past 30 days increases by 1m2 then extra
weight decrease by 0,009 kg.

b. Is there enough evidence to conclude that the regression model is valid, at 90%
confidence level? (critical value: 2,45).

H 0 : MSR=MSE ( the model is not valid)


H
. 1: MSR > MSE ( the model is valid)

MSR 0.375133
 F comp = = = 0.3173
MSE 1.182168

 F crit= 2,45

F comp < F crit => F comp ∈ Ra => reject H 1, accept . H 0=> the model is not valid ( significant )

Significance F = 0.730055194 > 0,1 (∝) => accept H 0=> the model is not valid

c. Test the significance of the model parameters (critical value 1,687).

Hypothesis:

H0 : β j = 0
H 1 : β j ≠ 0 ; j= 1 , n

b^ j−βj b^ j
t comp = =
sb^
j
s b^
j

 Testing the β 0 parameter:

H 0 : β0 = 0

H 1 : β0 ≠ 0

b^ 0− β 0 b^ 0 1.108225856
t comp = = = = 1.55957769
s ^b
0
s b^ 0.710593552
0
t ∝/2 , n−k−1 = t 0,025;37 = 1,687

t comp < t ∝/2 , n−k−1 => accept H 0 , the parameter β 0 is


.

You might also like