
Multiple Linear Regression

Content

• Testing for Overall Significance (Overall Fit of the Model)
• Testing for Individual Regression Coefficients
Important terms
› R²
› Hypothesis Testing
› Significance level
› Degrees of freedom
› One-tailed and Two-tailed Tests
R²
› It is the coefficient of determination.
› It measures how good your model is compared to a baseline model that
always predicts the mean of the dependent variable.
› R-squared is a statistical measure in a regression model that determines the
proportion of variance in the dependent variable that can be explained by
the independent variables. In other words, R-squared shows how well the data fit
the regression model (the goodness of fit).
› The most common interpretation of R-squared is how well the regression model
explains the observed data. For example, an R-squared of 60% indicates that 60% of
the variability observed in the target variable is explained by the regression
model. Generally, a higher R-squared means more of the variability is explained by
the model.
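As a minimal sketch, R-squared can be computed directly from observed values and model predictions; the NumPy arrays below are made-up toy numbers, not data from these slides:

import numpy as np

# Hypothetical observed values and model predictions (toy data).
y = np.array([3.0, 5.0, 7.0, 9.0, 11.0])
y_pred = np.array([2.8, 5.3, 6.9, 9.4, 10.6])

ss_res = np.sum((y - y_pred) ** 2)    # residual sum of squares
ss_tot = np.sum((y - y.mean()) ** 2)  # total sum of squares
r_squared = 1 - ss_res / ss_tot
print(f"R-squared: {r_squared:.3f}")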
R²
› The goodness of fit of regression models can be analyzed on the
basis of R-squared. The closer the value of R-squared is to 1, the
better the model.
› The value of R-squared can be negative when the fitted model is
worse than simply predicting the mean of the dependent variable.
R-Squared vs Adjusted R-Squared
› Adjusted R-squared is a modified version of R-squared that takes
the number of independent variables into account.
› The main problem with R-squared is that its value always
increases when an independent variable is added, regardless of
whether that variable actually contributes to the model or not.
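A small sketch of this correction, using the standard formula R²_adj = 1 − (1 − R²)(N − 1)/(N − k − 1); the numbers passed in are hypothetical:

def adjusted_r_squared(r_squared, n, k):
    """Adjusted R-squared for n observations and k independent variables."""
    return 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

# Adding variables can raise R-squared slightly yet lower adjusted R-squared.
print(adjusted_r_squared(0.60, n=50, k=2))  # ~0.583
print(adjusted_r_squared(0.61, n=50, k=5))  # ~0.566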
Hypothesis Testing
Hypothesis testing is done to confirm our observations
about the population using sample data, within the
desired error level. Through hypothesis testing, we can
determine whether we have enough statistical evidence
to conclude that the hypothesis about the population is
true or not.

When we fit a straight line through a linear regression
model, we get the slope and intercept of the line.
Hypothesis testing is used to confirm whether our beta
coefficients are significant in a linear regression model.

The key steps to perform a hypothesis test are as follows
(a worked sketch appears after this list):

• Formulate a hypothesis
• Determine the significance level
• Determine the type of test
• Calculate the test statistic and the p-value
• Make a decision
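These steps map naturally onto fitting a regression with, for example, statsmodels; the data below is simulated purely for illustration:

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))                    # two independent variables
y = 1.5 + 2.0 * X[:, 0] + rng.normal(size=100)   # only the first one matters

model = sm.OLS(y, sm.add_constant(X)).fit()      # fit with an intercept

print(model.tvalues)   # test statistic for each beta coefficient
print(model.pvalues)   # p-values, compared against the significance level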

Formulating Hypotheses
› One of the key steps is to formulate the following two hypotheses:
› The null hypothesis, represented as H₀, is the initial claim that is based on the
prevailing belief about the population.
› The alternative hypothesis, represented as H₁, is the challenge to the null
hypothesis. It is the claim we would like to prove true.

› In a regression setting, the null claim is that there is no relationship between
the dependent variable y and the independent variable xᵢ, i.e. the regression
coefficient βᵢ is zero. This is the null hypothesis in an individual regression
coefficient test: H₀: βᵢ = 0.
Significance level
› In regression analysis, the significance level (often denoted α) is a threshold
used to determine whether the coefficients of the independent variables in the
model are statistically significant. It tells you the probability of
incorrectly rejecting the null hypothesis when it is actually true, i.e. alpha
represents an acceptable probability of a Type I error.
› For example, if we choose a significance level of 0.05 (commonly used), it means
we are willing to accept a 5% chance of incorrectly rejecting the null hypothesis.
› So, if the absolute value of the t statistic is greater than the critical value
corresponding to α = 0.05, we reject the null hypothesis and conclude that the
coefficient is statistically significant (see the sketch below).
› In practice, the most commonly used alpha values are 0.01, 0.05, and 0.1, which
represent a 1%, 5%, and 10% chance of a Type I error, respectively.
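A minimal sketch of this decision rule with SciPy; the t statistic and degrees of freedom below are hypothetical values, not computed from real data:

from scipy import stats

alpha = 0.05
df = 8          # degrees of freedom of the model (hypothetical)
t_stat = 2.9    # t statistic for a coefficient (hypothetical)

# Two-tailed critical value: reject H0 if |t| exceeds it.
t_crit = stats.t.ppf(1 - alpha / 2, df)
print(f"critical value: {t_crit:.3f}")  # ~2.306 for df = 8
print("reject H0" if abs(t_stat) > t_crit else "fail to reject H0")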
P-Value
› A p-value is a metric that expresses the likelihood that a difference at least as
large as the one observed could have occurred by chance if the null hypothesis
were true. As the p-value decreases, the statistical significance of the observed
difference increases. If the p-value falls below the significance level, you
reject the null hypothesis.
› E.g. you are trying to test whether a new advertising campaign has increased
the product's sales. The null hypothesis states that there is no change in
sales due to the new advertising campaign.
› If the p-value is 0.30, then there is a 30% chance of observing a sales change
this large even if the campaign had no effect. If the p-value is 0.03, there is
only a 3% chance of observing such a change under the null hypothesis. As you
can see, the lower the p-value, the stronger the evidence against the null
hypothesis, i.e. the stronger the evidence that the new advertising campaign
changed sales.
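Continuing the hypothetical numbers from the previous sketch, the two-tailed p-value can be computed from the t distribution:

from scipy import stats

t_stat, df = 2.9, 8  # hypothetical values
# Two-tailed p-value: probability of a statistic at least this extreme under H0.
p_value = 2 * stats.t.sf(abs(t_stat), df)
print(f"p-value: {p_value:.4f}")  # ~0.02, below alpha = 0.05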
Degrees of Freedom
› Degrees of freedom refer to the maximum number of logically
independent values that may vary in a data sample.
› In regression analysis, the degrees of freedom (df) represent the number
of independent pieces of information available for estimating a
parameter.
› The degrees of freedom in regression analysis depend on the number of
observations (N), the number of independent variables in the model (k),
and any constraints imposed on the model.
Degrees of Freedom
› Suppose we have a simple linear regression model:
› Yᵢ = β₀ + β₁Xᵢ + εᵢ
• Yᵢ is the dependent variable.
• Xᵢ is the independent variable.
• β₀ and β₁ are the intercept and slope coefficients, respectively.
• εᵢ is the error term.
› In this example, there are two parameters to estimate: β₀ and β₁.
• The degrees of freedom in this case would be df = N − k − 1, where N is the
number of observations and k is the number of independent variables (excluding
the intercept).
Degrees of Freedom
› Suppose we have data on the heights (X) and weights (Y) of 10
individuals. We want to fit a simple linear regression model to predict
weight from height.
• Number of observations (N) = 10
• Number of independent variables (k) = 1 (height)
• Intercept (β₀) and slope (β₁) are the parameters to estimate.
› Therefore, the degrees of freedom would be df = 10 − 1 − 1 = 8.
› This means that in this regression model, there are 8 degrees of freedom
available for estimating the parameters. It is essentially the number of
data points that provide independent information for parameter
estimation after accounting for the constraints imposed by the model.
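The residual degrees of freedom can also be read off a fitted model; a quick check with statsmodels, where the heights and weights are simulated stand-ins for the example above:

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
height = rng.normal(170, 10, size=10)              # N = 10 observations
weight = 0.9 * height + rng.normal(0, 5, size=10)

model = sm.OLS(weight, sm.add_constant(height)).fit()  # k = 1 plus intercept
print(model.df_resid)  # 8.0 = N - k - 1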
One-tailed and Two-tailed Tests
› A one-tailed test and a two-tailed test are two types of hypothesis tests
used in statistical analysis to assess the significance of a relationship or
difference in a population parameter.
› A one-tailed test results from an alternative hypothesis that specifies
a direction, i.e. the alternative hypothesis states that the parameter
is either bigger or smaller than the value specified in the null
hypothesis.
› A two-tailed test results from an alternative hypothesis that does not
specify a direction.
One-tailed Tests
› A one-tailed test may be either left-tailed or right-tailed.
› A left-tailed test is used when the alternative hypothesis states that the
true value of the parameter is less than the null hypothesis claims.
› A right-tailed test is used when the alternative hypothesis states that the
true value of the parameter is greater than the null hypothesis claims.
› E.g. a light bulb manufacturer is only interested in whether the
mean lifetime of an energy-saving light bulb is less than 60 days.
– H₀: The mean lifetime of an energy-saving light bulb is 60 days.
– H₁: The mean lifetime of an energy-saving light bulb is less than
60 days.
› We have a "less than" in the alternative hypothesis. This means that we
will perform a left-tailed test (a code sketch follows below).
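As an illustration, this left-tailed test can be run as a one-sample t test with SciPy (the alternative argument requires SciPy 1.6 or newer); the lifetimes below are invented:

import numpy as np
from scipy import stats

lifetimes = np.array([57, 62, 55, 58, 60, 54, 59, 61, 56, 53])  # hypothetical

# Left-tailed test: H1 says the mean lifetime is less than 60 days.
t_stat, p_value = stats.ttest_1samp(lifetimes, popmean=60, alternative='less')
print(f"t = {t_stat:.3f}, p = {p_value:.4f}")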
Two-tailed Tests
› The main difference between one-tailed and two-tailed tests is that one-
tailed tests will only have one critical region whereas two-tailed tests will
have two critical regions.

› E.g. a light bulb manufacturer claims that its energy-saving light bulbs
last an average of 60 days. Set up a hypothesis test to check this claim
and comment on what sort of test we need to use.
› So we have:
– H₀: The mean lifetime of an energy-saving light bulb is 60 days.
– H₁: The mean lifetime of an energy-saving light bulb is not 60 days.
› Because of the "is not" in the alternative hypothesis, we have to consider
both the possibility that the lifetime of the energy-saving light bulb is
greater than 60 days and that it is less than 60 days. This means we have to
use a two-tailed test.
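The same invented lifetimes, tested two-tailed this time; extreme values in either direction now count as evidence against H₀:

import numpy as np
from scipy import stats

lifetimes = np.array([57, 62, 55, 58, 60, 54, 59, 61, 56, 53])  # hypothetical

# Two-tailed test: H1 says the mean lifetime is not 60 days.
t_stat, p_value = stats.ttest_1samp(lifetimes, popmean=60,
                                    alternative='two-sided')
print(f"t = {t_stat:.3f}, p = {p_value:.4f}")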
One-tailed Tests & Two-tailed Tests
Testing for Significance
› The F test is used to determine whether a significant
relationship exists between the dependent variable and the set
of all the independent variables; we will refer to the F test as the
test for overall significance.
› If the F test shows overall significance, the t test is then used to
determine whether each of the individual independent variables is
significant.
› A separate t test is conducted for each of the independent
variables in the model; we refer to each of these t tests as a test
for individual significance.
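A sketch of both tests on one fitted model with statsmodels; everything here is simulated for illustration:

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 3))
y = 4.0 + 1.2 * X[:, 0] - 0.7 * X[:, 1] + rng.normal(size=100)

model = sm.OLS(y, sm.add_constant(X)).fit()

# Overall significance: one F test on all coefficients jointly.
print(f"F = {model.fvalue:.2f}, p = {model.f_pvalue:.4g}")

# Individual significance: a separate t test per coefficient.
print(model.pvalues)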
Testing for Significance
F Statistic
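The standard definition of this statistic (written out here from the usual convention, not reproduced from the slide) is

F = MSR / MSE = (SSR / k) / (SSE / (n − k − 1))

where SSR is the regression (explained) sum of squares, SSE is the error (residual) sum of squares, k is the number of independent variables, and n is the number of observations. Under H₀: β₁ = β₂ = … = βₖ = 0, F follows an F distribution with k and n − k − 1 degrees of freedom, so a large F value is evidence of overall significance.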
