Dummy Variable Regression

12/7/2023

REGRESSION WITH DUMMY, DICHOTOMOUS OR INDICATOR VARIABLES

Learning objectives
• Understand the role of dummy variables to represent qualitative explanatory variables and use them in regression.
• Test for differences between the categories of a qualitative variable.
• Calculate and interpret confidence intervals and prediction intervals, to allow inferences about the regression coefficients.
• Explain the role of the assumptions on the OLS estimators.
• Describe common violations of the assumptions and offer remedies.


Categorical Independent Variables

In many situations we must work with categorical independent variables such as gender (male, female), method of payment (cash, check, credit card), etc.
For example, x2 might represent gender, where x2 = 0 indicates male and x2 = 1 indicates female.
In this case, x2 is called a dummy or indicator variable.

Example: Programmer Salary Survey

A software firm collected data for a sample of 20 computer programmers. A suggestion was made that regression analysis could be used to determine if salary was related to the years of experience and the score on the firm's Programmer Aptitude Test.
The years of experience, the score on the aptitude test and the corresponding annual salary ($1000s) were recorded for the sample of 20 programmers.


Categorical Independent Variables

Example: Programmer Salary Survey (continued)
As an extension of the problem involving the computer programmer salary survey, suppose that management also believes that the annual salary is related to whether the individual has a graduate degree in computer science or a related field.
The years of experience, the score on the programmer aptitude test, whether the individual has a relevant graduate degree, and the annual salary ($000s) for each of the 20 sampled programmers are shown below.

Exper. (Yrs.)  Test Score  Degr.  Salary ($000s)
 4              78         No     24.0
 7             100         Yes    43.0
 1              86         No     23.7
 5              82         Yes    34.3
 8              86         Yes    35.8
10              84         Yes    38.0
 0              75         No     22.2
 1              80         No     23.1
 6              83         No     30.0
 6              91         Yes    33.0
 9              88         Yes    38.0
 2              73         No     26.6
10              75         Yes    36.2
 5              81         No     31.6
 6              74         No     29.0
 8              87         Yes    34.0
 4              79         No     30.1
 6              94         Yes    33.9
 3              70         No     28.2
 3              89         No     30.0


Estimated Regression Equation

ŷ = b0 + b1x1 + b2x2 + b3x3

where:
ŷ = annual salary ($1000s)
x1 = years of experience
x2 = score on programmer aptitude test
x3 = 1 if the individual has a graduate degree; 0 otherwise

x3 is a dummy variable.

ANOVA Output

Analysis of Variance
SOURCE          DF    SS        MS       F      P
Regression       3    507.8960  169.299  29.48  0.000
Residual Error  16     91.8895    5.743
Total           19    599.7855

Previously (without the dummy variable), R Square = .8342 and Adjusted R Square = .815.

With the dummy variable:
R² = 507.896/599.7855 = .8468
Adjusted R²: Ra² = 1 − (1 − .8468) × (20 − 1)/(20 − 3 − 1) = .8181
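The output above can be reproduced from the 20-observation data set. A minimal sketch using NumPy's least-squares solver (the original analysis was presumably run in a package such as Minitab or Excel):

```python
import numpy as np

# The 20-programmer sample: experience (yrs), test score, degree dummy, salary ($000s)
exper  = [4, 7, 1, 5, 8, 10, 0, 1, 6, 6, 9, 2, 10, 5, 6, 8, 4, 6, 3, 3]
score  = [78, 100, 86, 82, 86, 84, 75, 80, 83, 91, 88, 73, 75, 81, 74, 87, 79, 94, 70, 89]
degree = [0, 1, 0, 1, 1, 1, 0, 0, 0, 1, 1, 0, 1, 0, 0, 1, 0, 1, 0, 0]  # 1 = graduate degree
salary = [24.0, 43.0, 23.7, 34.3, 35.8, 38.0, 22.2, 23.1, 30.0, 33.0,
          38.0, 26.6, 36.2, 31.6, 29.0, 34.0, 30.1, 33.9, 28.2, 30.0]

X = np.column_stack([np.ones(20), exper, score, degree])
y = np.array(salary)

# OLS: b minimises the sum of squared residuals
b, *_ = np.linalg.lstsq(X, y, rcond=None)

sse = ((y - X @ b) ** 2).sum()
sst = ((y - y.mean()) ** 2).sum()
r2 = 1 - sse / sst
adj_r2 = 1 - (1 - r2) * (20 - 1) / (20 - 3 - 1)
print(b.round(3), round(r2, 4), round(adj_r2, 4))
```

The printed estimates should agree with the Regression Equation Output slide (roughly 7.945, 1.148, 0.197 and 2.280) and with R² = .8468, Adjusted R² = .8181.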


Categorical Independent Variables

Regression Equation Output

Predictor     Coef   SE Coef  T      p
Constant      7.945  7.382    1.076  0.298
Experience    1.148  0.298    3.856  0.001
Test Score    0.197  0.090    2.191  0.044
Grad. Degr.   2.280  1.987    1.148  0.268   ← not significant

Dummy, Dichotomous or Indicator Variables

• Qualitative explanatory variable with two categories
• Qualitative explanatory variable with multiple categories


More Complex Categorical Variables

If a categorical variable has k levels, k − 1 dummy variables are required, with each dummy variable coded as 0 or 1.
For example, a variable with levels A, B and C could be represented by x1 and x2 values of (0, 0) for A, (1, 0) for B, and (0, 1) for C.
Care must be taken in defining and interpreting the dummy variables.

For example, a variable indicating level of education could be represented by x1 and x2 values as follows:

Highest Degree   x1   x2
Bachelor's*       0    0
Master's          1    0
Ph.D.             0    1

*: baseline indicator
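The k − 1 coding can be generated automatically. A sketch with pandas (assuming it is available), where drop_first=True drops the alphabetically first category, making Bachelor's the baseline:

```python
import pandas as pd

degrees = pd.Series(["Bachelor's", "Master's", "Ph.D.", "Master's", "Bachelor's"])

# 3 levels -> 2 dummy variables; the dropped level (Bachelor's) is the baseline,
# represented by a row of zeros in both dummy columns
dummies = pd.get_dummies(degrees, drop_first=True)
print(dummies)
```

Each row of `dummies` has at most one nonzero entry, matching the table above.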



Example: Is there evidence of gender pay discrimination?

• Worldwide studies have documented gender differences in wages and found that female academics received lower pay than their male colleagues.
• Numerous studies have focused on salary differences between men and women, indigenous and non-indigenous, and young and old Australians.
• Joanna Smith works in human resources at a large university.
• After the release of the latest Australian Bureau of Statistics data, the university asked her to test for both gender and age discrimination in salaries.
• She gathers data on 42 professors, including the salary, experience, gender and age of each.
• Using this data set, Joanna hopes to:
– determine whether there is evidence of gender discrimination in salaries
– determine whether there is evidence of age discrimination in salaries.

Dummy variables

LO: Understand the role of dummy variables to represent qualitative explanatory variables and use them in regression.
• Previously, all the variables used in regression applications were quantitative.
• In empirical work it is common to have some variables that are qualitative: the values represent categories that may have no implied ordering.
• We can include these factors in a regression through the use of dummy variables.
• A dummy variable for a qualitative variable with two categories assigns a value of 1 for one of the categories and a value of 0 for the other.
• For example, suppose we are interested in teen behaviour. We might first define a dummy variable d with the following structure:
Let d = 1 if age is between 13 and 19
and d = 0 if age is anything else.
• This allows us to capture the role of being a teenager in a regression model and quantify its impact.

Dummy variables

• For the sake of simplicity, consider a model containing one quantitative explanatory variable and one dummy variable:
y = b0 + b1x1 + b2d + e
• Conducting a standard ordinary least squares (OLS) regression will yield an estimated equation of
ŷ = b0 + b1x1 + b2d.
• For a given x1 and d = 0, we compute ŷ as
ŷ = b0 + b1x1 + b2(0) = b0 + b1x1.
• Similarly, when d = 1,
ŷ = b0 + b1x1 + b2(1) = (b0 + b2) + b1x1.
• The dummy variable allows a shift in the intercept term, enabling us to use a single regression equation to represent both categories of the qualitative variable.
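A tiny numeric sketch (with made-up coefficients, not from any slide) makes the intercept shift concrete: at every value of x1, the two fitted lines differ by exactly b2.

```python
def y_hat(x1, d, b0=10.0, b1=2.0, b2=5.0):
    """Fitted value for one quantitative variable plus one dummy (illustrative coefficients)."""
    return b0 + b1 * x1 + b2 * d

# The gap between the d = 1 and d = 0 lines is b2 at any x1:
# two parallel lines with a shifted intercept
for x1 in (0.0, 3.0, 10.0):
    print(x1, y_hat(x1, 1) - y_hat(x1, 0))  # gap is always 5.0 (= b2)
```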


3
12/7/2023

Dummy variables

Graphically, we can see how the dummy variable shifts the intercept of the regression line.

• Example: Evidence of gender pay discrimination?
– The introductory case has two qualitative variables, gender and age group. To measure the impact of gender and age on salary, we need to create two dummy variables.
Let d1 = 1 if the professor is male; 0 if female.
Let d2 = 1 if the professor is 60 or over; 0 if under 60.


Dummy variables

• Example:
– The estimated equation is
ŷ = 54.011 + 1.503x + 18.541d1 + 5.772d2
– The difference in salary between a male and a female professor is captured in the coefficient of d1. A male professor, on average, makes $18,541 more than a female professor with comparable experience.
– The age coefficient, though statistically insignificant in this case, would have a similar interpretation.

Qualitative variables with two categories

LO: Test for differences between the categories of a qualitative variable.
• The statistical tests discussed earlier remain valid for dummy variables as well.
• We can perform a t test for individual significance, form a confidence interval using the parameter estimate and its standard error, and conduct a partial F test for joint significance.
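These tests use nothing beyond the reported coefficient, its standard error and the residual degrees of freedom. For instance, the graduate-degree dummy in the programmer-salary output earlier (coefficient 2.280, standard error 1.987, 16 df) can be checked with SciPy (assuming it is installed):

```python
from scipy import stats

coef, se, df = 2.280, 1.987, 16     # values from the programmer-salary output

t_stat = coef / se                  # t test for H0: the coefficient is zero
p_value = 2 * stats.t.sf(abs(t_stat), df)
print(round(t_stat, 3), round(p_value, 3))  # close to the reported p = 0.268
```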


Qualitative variables with two categories

• Example: Evidence of gender pay discrimination?
– Is there a gender effect in the salary study?
H0: b2 = 0 (males and females are paid the same)
HA: b2 ≠ 0 (there is a difference due to gender)
– Given a value of the tdf test statistic of 4.86 and a p-value of approximately 0.00, we reject the null hypothesis and conclude that the gender dummy variable is significant.
• For the age coefficient, tdf is 0.94 and the p-value is 0.36, so we do not reject the null hypothesis. The evidence suggests that professors over 60 do not have significantly different salaries compared with those under 60.

Qualitative variables with multiple categories

• Sometimes a qualitative variable may be described by more than two categories.
• In such cases we use multiple dummy variables to capture the effect of the variable.
– For example, suppose we divide the mode of transport used by commuters into three categories: public transport, driving and park-and-ride.
– We then define two dummy variables, d1 and d2, where d1 equals 1 to denote public transport and 0 otherwise, and d2 equals 1 to denote driving and 0 otherwise. Park-and-ride is captured when both d1 and d2 equal 0.


Qualitative variables with multiple categories

• Our regression model for the mode of transport example would then be
y = b0 + b1x + b2d1 + b3d2 + e
and the estimated equation would be
ŷ = b0 + b1x + b2d1 + b3d2.
• Given the intercept term, we exclude one of the dummy variables from the regression.
• The excluded variable represents the reference category (baseline indicator) against which the others are assessed.
• If we included as many dummy variables as there are categories, this would create perfect multicollinearity in the data, and such a model cannot be estimated.
• So, we include one fewer dummy variable than the number of categories of the qualitative variable.


Interval estimates for the response variable

LO: Calculate and interpret confidence intervals and prediction intervals, to allow inferences about the regression coefficients.
• Once we have developed a regression model, we often want to use it to make predictions.
• In the academic salary example, what salary would we predict for a male professor with 10 years of experience? Inserting these values into our estimated regression equation, we find:
Salary(predicted) = ŷ = 54.011 + 1.503(10) + 18.541(1) + 5.772(0) = 87.554, that is, $87,554.
• But this is only a point estimate and ignores sampling error. We can also provide interval estimates.
• We will develop two types of interval estimates regarding y:
– a confidence interval for the expected value of y
– a prediction interval for an individual value of y.
• It is common to refer to the first as a confidence interval and the second as a prediction interval.


Interval estimates for the response variable

• The point estimate of E(y0) is just the ŷ value:
ŷ0 = b0 + b1x10 + b2x20 + … + bkxk0
• The confidence interval, as always, includes the point estimate plus or minus the margin of error:
ŷ0 ± ta/2,df se(ŷ0)
• The term se(ŷ0) is the standard error of the prediction. Though difficult to compute by hand if there is more than one explanatory variable in the model, we will develop a procedure to compute it with a statistical package.
• Many statistics programs will compute confidence intervals, but Excel's data analysis tools do not.
• Here is a method you can use instead. Shift the value of each explanatory variable in your data set by the value of interest for that variable:
x1* = x1 − x10, x2* = x2 − x20, …, xk* = xk − xk0
• When we estimate this modified regression, the resulting estimate of the intercept and its standard error equal ŷ0 and se(ŷ0), respectively.
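Here is a sketch of that shifting trick with simulated data (any real data set would do): after shifting each explanatory variable by the value of interest, the intercept of the re-estimated regression equals the prediction ŷ0, and the intercept's standard error is se(ŷ0).

```python
import numpy as np

def ols(X, y):
    """OLS estimates and standard errors via the normal equations."""
    XtX_inv = np.linalg.inv(X.T @ X)
    b = XtX_inv @ X.T @ y
    resid = y - X @ b
    s2 = resid @ resid / (X.shape[0] - X.shape[1])  # estimated error variance
    se = np.sqrt(s2 * np.diag(XtX_inv))
    return b, se

rng = np.random.default_rng(0)
n = 50
x1 = rng.uniform(0, 10, n)
d = rng.integers(0, 2, n).astype(float)
y = 5 + 2 * x1 + 3 * d + rng.normal(0, 1, n)

# Point of interest: x1 = 4, d = 1
x10, d0 = 4.0, 1.0

# Shift each explanatory variable by the value of interest
Xs = np.column_stack([np.ones(n), x1 - x10, d - d0])
b_s, se_s = ols(Xs, y)

# Check: the shifted intercept equals the prediction from the original regression
X = np.column_stack([np.ones(n), x1, d])
b, _ = ols(X, y)
yhat0 = b @ np.array([1.0, x10, d0])
print(b_s[0], yhat0, se_s[0])
```

The slopes are unchanged by the shift; only the intercept (and its standard error) is reinterpreted.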


Interval estimates for the response variable

• Example: Evidence of gender pay discrimination?
– In the academic salary example, we first shift the data by our hypothesised values.
– Estimating the modified regression then reveals the confidence interval.


Interval estimates for the response variable

• Example:
– To summarise, after shifting the explanatory variables, the intercept row in the regression output gives us all the information we need. The 95% confidence interval is given in the same row.
– A 95% confidence interval for the salary of a man with 10 years of experience:
ŷ0 ± ta/2,df se(ŷ0) = 87.406 ± 2.023 × 2.869 = 87.406 ± 5.802.
– With 95% confidence, we can state that the mean salary of all male professors with 10 years of experience falls between $81,603 and $93,209.
• If we want to compute an interval with a different confidence level, we simply need to find the correct ta/2,df statistic and insert the intercept and the standard error of the intercept from the same regression, or alternatively, specify a different confidence level in Excel's Regression dialog box.
• The formula for the prediction interval is
ŷ0 ± ta/2,df √( se(ŷ0)² + se² )
where se is the standard error of the estimate.


Interval estimates for the response variable

• The point estimate and the standard error of the prediction are computed using the same technique as for the confidence interval.
• Now we need to include the standard error of the estimate in the margin of error calculation.
• Example: Prediction interval for salary
– For the introductory case, to compute the prediction interval for a man with 10 years of experience, we simply insert the appropriate values from the previous example, plus the standard error of the estimate, 9.133:
87.406 ± 2.023 × √(2.868² + 9.133²) = (68.044, 106.768)
– With 95% confidence, we can state that the salary of a male professor with 10 years of experience falls between $68,044 and $106,768.
– Remember that the prediction interval is an interval estimate for one man with this experience, while the confidence interval pertains to the average of all men with this much experience.
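The arithmetic of the prediction interval can be checked in a few lines; the numeric inputs below are taken from the slide (t ≈ 2.023 for 95% with df = 42 − 3 − 1 = 38), and the result agrees with the slide's interval to within rounding.

```python
import math

yhat0 = 87.406     # point estimate ($000s)
se_yhat0 = 2.868   # standard error of the prediction
se_est = 9.133     # standard error of the estimate
t_crit = 2.023     # t value used on the slide (df = 38)

# The prediction interval widens the margin with the standard error of the estimate
margin = t_crit * math.sqrt(se_yhat0**2 + se_est**2)
lower, upper = yhat0 - margin, yhat0 + margin
print(round(lower, 2), round(upper, 2))
```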


Model assumptions and common violations

LO: Explain the role of the assumptions on the OLS estimators.
• The statistical properties of the OLS estimator, as well as the validity of the testing procedures, depend on a number of assumptions.
1. The model given by y = b0 + b1x1 + … + bkxk + e is linear in the parameters b0, b1, …, bk.
2. Conditional on x1, …, xk, E(e) = 0, thus E(y) = b0 + b1x1 + … + bkxk.
3. There is no exact linear relationship among the explanatory variables (i.e. no perfect multicollinearity).
4. The variance of the error term e is the same for all x1, …, xk values. In other words, observations do not have a changing variability.
5. The error term e is uncorrelated across observations. In other words, observations are not correlated.
6. The error term e is not correlated with any of the predictors x1, …, xk. In other words, there are no relevant explanatory variables excluded.
7. The error term e is normally distributed. This assumption allows us to do hypothesis testing. If normality does not hold, our tests may not be valid.


Model assumptions and common violations

• The true error terms e cannot be observed because they exist only in the population. We can, however, look at the residuals, e = y − ŷ, where ŷ = b0 + b1x1 + b2x2 + … + bkxk, for each observation.
• It is common to plot the residuals on the vertical axis and an explanatory variable on the horizontal axis.
• When estimating a regression in Excel, the dialog box that opens when you select Data > Data Analysis > Regression allows you to choose the Residuals and Residual Plots options.

LO: Describe common violations of the assumptions and offer remedies.

Multicollinearity
• Perfect multicollinearity exists when two or more x variables exhibit an exact linear relationship.
• For example, suppose the x data include total cost, fixed cost and variable cost.
• Other data sets may have a great degree of multicollinearity that is not perfect but still strong.


Multicollinearity
• In these cases we may see a high R² but individually insignificant explanatory variables. Additional non-intuitive results may also be indicative.
• A sample correlation between explanatory variables greater than 0.80 or less than −0.80 suggests severe multicollinearity.
• A good remedy may be simply to drop one of the collinear variables if we can justify it as redundant.
• Alternatively, we could increase our sample size.
• Another option would be to try to transform our variables so that they are no longer collinear.
• Finally, especially if we are interested only in maintaining a high predictive power, it may make sense to do nothing.
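A quick way to screen for the problem is a correlation matrix of the explanatory variables. A sketch with simulated cost data (hypothetical numbers) in which total cost is, by construction, fixed plus variable cost:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100
fixed = rng.uniform(10, 20, n)
variable = rng.uniform(30, 60, n)
total = fixed + variable            # exact linear combination -> perfect multicollinearity

X = np.column_stack([fixed, variable, total])

# Pairwise correlations; |r| > 0.80 between explanatory variables flags severe multicollinearity
corr = np.corrcoef(X, rowvar=False)
print(corr.round(2))

# Perfect multicollinearity makes the design matrix rank-deficient,
# so the model with all three variables cannot be estimated
rank = np.linalg.matrix_rank(np.column_stack([np.ones(n), X]))
print(rank)  # 3, not 4: one column is redundant
```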


Changing variability
• The variance of the error term changes for different values of at least one explanatory variable.
• Informal residual plots can gauge heteroscedasticity. (The slide shows a residual plot for a model where none of the assumptions has been violated.)
• Heteroscedasticity results in inefficient estimators, and the hypothesis tests for significance are no longer valid.
• To get around the second problem, some researchers use OLS estimates along with corrected standard errors, called White's standard errors. Many statistical packages have this option available; unfortunately, the current version of Excel does not.


Correlated observations
• We assume that the error term is uncorrelated across observations when obtaining OLS estimates, but this often breaks down in time-series data.
• In this example, we predict sales at a sushi restaurant. (The slide shows a plot of the residuals against time.)

Excluded variables
• Endogeneity in the regression model refers to the error term being correlated with the explanatory variables. This commonly occurs due to an omitted explanatory variable.
• For example, a person's salary may be highly correlated with that person's innate ability. But since we cannot include it, ability gets incorporated in the error term. If we try to predict salary by years of education, which may also be correlated with innate ability, then we have an endogeneity problem.
• Remedies are not easily accessible using Excel.
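The omitted-variable problem can be illustrated with a small simulation (hypothetical data, not from the case study): ability affects both education and salary, so leaving it out biases the education coefficient upward.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 1000
ability = rng.normal(0, 1, n)                    # unobserved by the researcher
educ = 12 + ability + rng.normal(0, 1, n)        # education is correlated with ability
salary = 20 + 2 * educ + 3 * ability + rng.normal(0, 1, n)

# Regress salary on education alone: ability is absorbed into the error term,
# which is now correlated with educ (endogeneity)
X = np.column_stack([np.ones(n), educ])
b_short, *_ = np.linalg.lstsq(X, salary, rcond=None)

# Including ability removes the bias
X_full = np.column_stack([np.ones(n), educ, ability])
b_full, *_ = np.linalg.lstsq(X_full, salary, rcond=None)

print(b_short[1], b_full[1])  # short-regression slope sits well above the true value 2
```

With this design the short regression's slope converges to 2 + 3 × cov(educ, ability)/var(educ) = 3.5, not the true 2.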


Excluded variables
• Endogeneity will result in biased estimators, and so is quite a serious problem. Unfortunately, it is difficult to fix.
• Most commonly, we try to find an instrumental variable. Discussion of the instrumental variable approach is beyond the scope of the text.
