DSC4213 - Analytics Tools For Consulting
Session 1 Introduction
• WANG Tong
– Associate Professor, Dept. of Decision Sciences
– Office: Biz 1 (Mochtar Riady Building) 8-68
– Email: [email protected]
– Web: http://www.bschool.nus.edu.sg/staff/bizwt/
• Andrew LIM
– Professor, Dept. of Decision Sciences
– Office: Biz 1 (Mochtar Riady Building) 8-70
– Email: [email protected]
• Please be
– participative
– open-minded
– ready to criticize
– innovative
• Q?
• Linear Regression
– Simple and multiple linear regression model
– Least squares estimation
– Model assessment
– Model selection
• Other Considerations in Regression Model
– Qualitative predictors
– Introducing nonlinearity: interaction terms, polynomial terms, log
transformation
• Practical Issues
– Multicollinearity
– Heteroscedasticity
– Outliers and high leverage points
• Obtain the standard errors of the estimates (how much the estimates vary across different samples)
• Measure of variation
– TSS: total sum of squares (variation of y around its mean)
[Figure: scatter of y against X with the fitted line; for one observation (xi, yi), the total deviation yi – ȳ is split into yi – ŷi and ŷi – ȳ]
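The least squares estimates, their standard errors, and the measures of variation can be sketched in a few lines of Python. This is only an illustration: the data are made up, and the names RSS (residual sum of squares) and ESS (explained sum of squares) are the usual textbook terms, assumed here rather than taken from the slide.

```python
import math

# Hypothetical data, for illustration only
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Least squares estimates for y = b0 + b1 * x
sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar
y_hat = [b0 + b1 * xi for xi in x]

# Measures of variation: TSS = ESS + RSS
tss = sum((yi - y_bar) ** 2 for yi in y)                # variation of y around its mean
rss = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))   # variation left unexplained
ess = sum((fi - y_bar) ** 2 for fi in y_hat)            # variation explained by the line
r2 = 1 - rss / tss

# Standard errors of the estimates (residual variance uses n - 2 degrees of freedom)
sigma2 = rss / (n - 2)
se_b1 = math.sqrt(sigma2 / sxx)
se_b0 = math.sqrt(sigma2 * (1 / n + x_bar ** 2 / sxx))

print(f"b0 = {b0:.3f} (se {se_b0:.3f}), b1 = {b1:.3f} (se {se_b1:.3f})")
print(f"TSS = {tss:.3f}, ESS = {ess:.3f}, RSS = {rss:.3f}, R2 = {r2:.4f}")
```

A tool such as StatTools reports the same quantities; the point of the sketch is only to show where they come from.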
Aug 2016 DSC4213 Session 1 - Prof. WANG Tong 14
Assessing Overall Quality of the Model
[Figure: three scatter plots of Y against X, illustrating r2 = 1, 0 < r2 < 1, and r2 = 0]
Assessing Overall Quality of the Model
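[Figure: F distribution starting at 0, with a right-tail rejection region of area α = .05; do not reject H0 below the critical value, reject H0 above it]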
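Assuming the slide refers to the usual overall F-test (H0: all slope coefficients are zero), the test statistic can be computed from R2, the sample size n, and the number of predictors p. The function name and the example numbers below are hypothetical; the critical value at α = .05 still has to come from an F table or statistical software.

```python
def f_statistic(r2, n, p):
    """Overall F statistic for a regression with p predictors and n observations."""
    explained_per_df = r2 / p
    unexplained_per_df = (1 - r2) / (n - p - 1)
    return explained_per_df / unexplained_per_df

# Hypothetical example: R2 = 0.5 with n = 23 observations and p = 2 predictors
f_value = f_statistic(0.5, 23, 2)
print(f"F = {f_value:.2f} on (2, 20) degrees of freedom")
```

H0 is rejected when the computed F exceeds the critical value for (p, n - p - 1) degrees of freedom.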
Multiple Linear Regression
– βj is the slope for variable Xj, i.e., the average increase in Y when Xj is increased by one and all other X's are held constant
[Figure: fitted regression plane ŷ = b0 + b1x1 + b2x2 over the (x1, x2) axes, with an observation yi at (x1i, x2i)]
• Adjusted R2:
– used to correct for the fact that adding non-relevant variables will still reduce the
residual sum of squares
– Smaller than R2
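The standard adjusted R2 formula makes the correction concrete. A minimal sketch, with a hypothetical function name and made-up numbers: n is the number of observations and p the number of predictors.

```python
def adjusted_r2(r2, n, p):
    """Adjusted R2: penalizes R2 for the number of predictors p."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)

# Hypothetical example: the same R2 = 0.8 with 4 versus 5 predictors, n = 21
print(f"p = 4: adjusted R2 = {adjusted_r2(0.8, 21, 4):.4f}")
print(f"p = 5: adjusted R2 = {adjusted_r2(0.8, 21, 5):.4f}")
```

If an extra variable leaves R2 unchanged, adjusted R2 falls, which is why it is more useful than R2 for comparing models of different sizes.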
– Forward selection: start from none, iteratively add the variable that improves R2 most
– Backward selection: start from all, iteratively remove the least significant
variable
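Forward selection as described above can be sketched in plain Python. Everything here is illustrative: the data are made up, the helper names are hypothetical, and the stopping rule (a fixed `max_vars`) is an assumption, since the slide does not say when to stop adding variables.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * pv for v, pv in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def r_squared(columns, y):
    """R2 of the least squares fit of y on an intercept plus the given columns."""
    n = len(y)
    design = [[1.0] + [col[i] for col in columns] for i in range(n)]
    k = len(design[0])
    # Normal equations: (X'X) beta = X'y
    xtx = [[sum(row[a] * row[b] for row in design) for b in range(k)] for a in range(k)]
    xty = [sum(row[a] * yi for row, yi in zip(design, y)) for a in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(bj * v for bj, v in zip(beta, row)) for row in design]
    y_bar = sum(y) / n
    rss = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    tss = sum((yi - y_bar) ** 2 for yi in y)
    return 1 - rss / tss


def forward_selection(predictors, y, max_vars):
    """Start from no variables; repeatedly add the one that raises R2 the most."""
    chosen, path = [], []
    remaining = list(predictors)
    while remaining and len(chosen) < max_vars:
        scores = {name: r_squared([predictors[v] for v in chosen + [name]], y)
                  for name in remaining}
        best = max(scores, key=scores.get)
        chosen.append(best)
        remaining.remove(best)
        path.append((best, scores[best]))
    return path


# Hypothetical data: y is driven mainly by x1
predictors = {
    "x1": [1, 2, 3, 4, 5, 6, 7, 8],
    "x2": [2, 1, 4, 3, 6, 5, 8, 7],
    "x3": [5, 3, 8, 1, 9, 2, 7, 4],
}
y = [3.1, 5.9, 9.2, 11.8, 15.1, 18.0, 20.9, 24.2]

path = forward_selection(predictors, y, max_vars=3)
for name, r2 in path:
    print(f"add {name}: R2 = {r2:.4f}")
```

Because R2 never decreases when a variable is added, a practical version would stop on a criterion such as adjusted R2 or a significance test rather than on a fixed count.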
• Interpretation:
[Figure: Y against X, with fitted lines for Female and Male]
– The coefficient on the Male dummy variable is the expected difference in tips from a male customer as opposed to a female customer, after controlling for the effect of total_bill and size
• Is this the same as having two regressions for Male and Female separately?
[Figure: Y for Female vs. Male]
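One way to approach the question is to write out what a single regression with a dummy variable implies for each group. The notation below is an assumption for illustration (one continuous predictor X and a 0/1 variable Male); the tips example has further predictors, but the logic is the same.

```latex
\begin{align*}
Y &= \beta_0 + \beta_1 X + \beta_2\,\mathrm{Male} + \varepsilon \\
\text{Female } (\mathrm{Male}=0):\quad E[Y] &= \beta_0 + \beta_1 X \\
\text{Male } (\mathrm{Male}=1):\quad E[Y] &= (\beta_0 + \beta_2) + \beta_1 X
\end{align*}
```

Both groups share the slope β1 and differ only in the intercept, so the fitted lines are parallel. Two separate regressions would let the slopes differ as well, so the two approaches are generally not the same unless the single model also includes an interaction between Male and X.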
• Multi-level factors
– Which is equivalent to
• Without interaction
[Figure: tip against Total_bill, with fitted lines for smokerNo and smokerYes]
• With interaction
[Figure: tip against Total_bill, with fitted lines for smokerNo and smokerYes]
– β4 = -0.064 means that on average, smokers tip 6.4% (of the total bill) less than non-smokers, after controlling for total bill amount and table size
– Overall effect of being a smoker:
• β4: 6.4% (of the total bill) less
• β3: $1.16 more
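The two effects pull in opposite directions, so the overall effect of being a smoker depends on the size of the bill. The arithmetic below uses the two coefficients quoted above and assumes that β3 is the coefficient on the smoker dummy and β4 the coefficient on the smoker × total_bill interaction; the function name is hypothetical.

```python
# Coefficients quoted on the slide
beta3 = 1.16     # smoker main effect, in dollars
beta4 = -0.064   # smoker x total_bill interaction, in dollars of tip per dollar of bill

def smoker_effect(total_bill):
    """Expected tip difference (smoker minus non-smoker) at a given total bill."""
    return beta3 + beta4 * total_bill

# Bill at which the two effects cancel out
break_even = -beta3 / beta4

for bill in (10, 20, 40):
    print(f"bill ${bill}: smoker effect = {smoker_effect(bill):+.2f}")
print(f"break-even bill = ${break_even:.2f}")
```

Under these assumptions, smokers are expected to tip more than non-smokers on small bills and less on large ones, with the sign changing at the break-even bill.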
– Quadratic term
– Logarithm transformation
• Once the season begins, the only thing we can control is price
“Retailers hate markdowns. Discount an item too late, and stores are stuck with
truckloads of inventory. Too early, and they lose profits as people snap up items
thrown on the bargain table prematurely …”
• Demand
– Time of use: winter clothes, toys for Christmas, gifts for Valentine's Day
• Pricing
– Four allowable prices: $60 (full price), $54 (90%), $48 (80%), $36 (60%)
– Start with $60 in Week 1, then mark down over time (no price increase)
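The pricing rules above define a finite set of markdown paths, which can be enumerated directly. This is a sketch only: the length of the selling season is not stated here, so `weeks` is a hypothetical parameter, and the function name is illustrative.

```python
from itertools import combinations_with_replacement

FULL_PRICE = 60
PRICES = [60, 54, 48, 36]  # the four allowable prices, highest first

def markdown_paths(weeks):
    """All price paths that start at full price in Week 1 and never increase."""
    # With PRICES sorted high to low, combinations_with_replacement yields
    # exactly the non-increasing price sequences for the remaining weeks.
    return [(FULL_PRICE,) + rest
            for rest in combinations_with_replacement(PRICES, weeks - 1)]

# Discount levels implied by the price list
for price in PRICES:
    print(f"${price} = {price / FULL_PRICE:.0%} of full price")

# Hypothetical 3-week season
for path in markdown_paths(3):
    print(path)
```

The number of feasible paths grows quickly with the length of the season, which is part of what makes the markdown decision hard to make by intuition alone.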
• Homework
– Do self-study on how to use StatTools to do regression
• http://www.palisade.com/GuidedTour/EN/StatTools/
– Read the retailer case and understand the context of fashion retailing