Part 8 Linear Regression & Anova
Part 8 Linear Regression & Anova
……………….. Equation 1
Where;
Yi = the dependant variable = the regressand = the explained variable.
Xi = the Independent variable = the regressors = the Explanatory variables.
There are many uses of regression models. They can be used in forecasting, identifying
relationships, optimization, etc.
………………. Equation 2
Where is the dependent variable and is the independent variable, is the y intercept or
value of when and is the gradient or the change in for a one-unit change in .
Page 1 of 6
Linear Regression and ANOVA Notes
………………. Equation 3
Where is the dependent variable and is the independent variable with being any integer
number from 1 to the total number of variables.
Since is any integer number from 1 to the total number of variables, if , it becomes a
simple linear regression model. If , then the model will be written as:
………………. Equation 4
Therefore, a multiple linear regression model with n variables can be expressed as:
………………. Equation 5
Page 2 of 6
Linear Regression and ANOVA Notes
8.4 Regressors
Regressors are variables that are used to predict or explain the value of the dependent
variable. They are also known as independent variables, predictor variables, or explanatory
variables. Regressors can be continuous or categorical variables.
8.5 Regressands
A regressand is the dependent variable or outcome variable in a regression model. It is the
variable that is being predicted or explained by the Regressors (independent variables).
Regressands are typically continuous variables, but can also be binary or categorical
variables. examples of regressands are yearly sales, stock prices of a company, customer
loyalty, GDP growth rate, etc.
Page 3 of 6
Linear Regression and ANOVA Notes
The Best Linear Unbiased Estimator (BLUE) plays a crucial role in statistical inference. Its
properties of linearity, unbiasedness, and minimum variance make it the most efficient and
accurate estimator among all unbiased linear estimators. BLUE has numerous applications in
statistics, econometrics, and time series analysis
A BLUE estimator is an estimator that is not only unbiased but also has the minimum
variance among all unbiased linear estimators. While both unbiased estimators and BLUE
estimators are used to estimate population parameters, BLUE estimators are more efficient
and have the minimum variance among all unbiased linear estimators. The Ordinary Least
Squares estimator is the BLUE for the linear regression model.
Page 4 of 6
Linear Regression and ANOVA Notes
Therefore, the correct regression model has to factor in the excluded errors. It can be written
as:
………………. Equation 6
Where is the error term with the variables and extracted from the sample. Therefore,
the value
………………. Equation 7
The Error Term is also known as the stochastic Disturbance Term or Residual Term. It
represents the random variation in the dependent variable (y) that cannot be explained by the
independent variables (x) in the regression model. It also captures omitted variables and
accounts for model misspecification.
Excel provides an easy-to-use interface for regression analysis. The software offers a built-in
regression tool that allows users to perform regression analysis with just a few clicks. This
makes it accessible to users who may not have extensive statistical knowledge or
programming skills. The regression analysis in your group assignment:
Page 5 of 6
Linear Regression and ANOVA Notes
8.9 ANOVA
ANOVA is the acronym for Analysis of Variance. It is a statistical technique used to compare
the means of two or more groups to determine if there is a significant difference between
them.
A large F-statistic indicates a significant difference between the means of the groups.
A small p-value indicates a significant difference between the means of the groups.
Situational Example:
A researcher wants to compare the average scores of students who received an online
teaching method (Group A) with those who received traditional teaching method (Group B).
The researcher has collected data on the scores of 20 students in each group.
Research Question:
Is there a significant difference in the average scores of students who received the new
teaching method (Group A) and those who received the traditional teaching method (Group
B)?
Page 6 of 6