Discriminant & Logit Analysis Using SAS Enterprise Guide
Discriminant & Logit Analysis Using SAS Enterprise Guide
Discriminant & Logit Analysis Using SAS Enterprise Guide
• Group means and group standard deviations. These are computed for each
predictor for each group.
[Continued]
Annual Attitude Importance Age of Amount Spent
Resort Family Toward Attached to Household Head of on Family
No. Visit Income ($000) Travel Family Vacation Size Household Vacation
16 2 32.1 5 4 3 58 L (1)
17 2 36.2 4 3 2 55 L (1)
18 2 43.2 2 5 2 57 M (2)
19 2 50.4 5 2 4 37 M (2)
20 2 44.1 6 6 3 42 M (2)
21 2 38.3 6 6 2 45 L (1)
22 2 55.0 1 2 2 57 M (2)
23 2 46.1 3 5 3 51 L (1)
24 2 35.0 6 4 5 64 L (1)
25 2 37.3 2 7 4 54 L (1)
26 2 41.8 5 1 3 56 M (2)
27 2 57.0 8 3 2 36 M (2)
28 2 33.4 6 8 2 50 L (1)
29 2 37.5 3 2 3 48 L (1)
30 2 41.3 3 3 2 42 L (1)
Information on Resort Visits: Holdout Sample
[Continued]
Wilks’ λ (U-statistic) and univariate F ratio with 1 and 28 degrees of freedom
Variable Wilks’ λ F Significance
INCOME 0.45310 33.80 0.0000
TRAVEL 0.92479 2.277 0.1425
VACATION 0.82377 5.990 0.0209
HSIZE 0.65672 14.64 0.0007
AGE 0.95441 1.338 0.2572
[Continued]
Standard Canonical Discriminant Function Coefficients
blank Func 1
INCOME 0.74301
TRAVEL 0.09611
VACATION 0.23329
HSIZE 0.46911
AGE 0.20922
Structure Matrix
Pooled within-groups correlations between discriminating variables and canonical
discriminant functions (variables ordered by size of correlation within function).
blank Func 1
INCOME 0.82202
HSIZE 0.54096
VACATION 0.34607
TRAVEL 0.21337
AGE 0.16354
Results of Two-Group Discriminant Analysis (4 of 6)
[Continued]
Unstandardized Canonical Discriminant Function Coefficients
blank Func 1
INCOME 0.8476710E-01
TRAVEL 0.4964455E-01
VACATION 0.1202813
HSIZE 0.4273893
AGE 0.2454380E-01
(constant) −7.975476
Group Func 1
1 1.29118
2 −1.29118
Results of Two-Group Discriminant Analysis (5 of 6)
[Continued]
Classification Results
Predicted Group Membership
blank blank Visit 1 2 Total
Original Count 1 12 3 15
blank blank 2 0 15 15
blank % 1 80.0 20.0 100.0
blank blank 2 0.0 100.0 100.0
Cross-validated Count 1 11 4 15
blank blank 2 2 13 15
blank % 1 73.3 26.7 100.0
blank blank 2 13.3 86.7 100.0
aCross-validation is done only for those cases in the analysis. In cross-validation, each
case is classified by the functions derived from all cases other than that case.
b90.0% of original grouped cases correctly classified.
c80.0% of cross-validated grouped cases correctly classified.
Results of Two-Group Discriminant Analysis (6 of 6)
[Continued]
Classification Results for Cases Not Selected for Use in the Analysis
(Holdout Sample)
Predicted Group Membership
blank Actual Group No. of Cases 1 2
Group 1 6 4 2
blank blank blank 66.7% 33.3%
Group 2 6 0 6
blank blank blank 0.0% 100.0%
[Continued]
Wilks’ λ (U-statistic) and univariate F ratio with 2 and 27 degrees of
freedom.
Variable Wilks’λ F Significance
INCOME 0.26215 38.000 0.0000
TRAVEL 0.78790 3.634 0.0400
VACATION 0.88060 1.830 0.1797
HSIZE 0.87411 1.944 0.1626
AGE 0.88214 1.804 0.1840
[Continued]
Standardized Canonical Discriminant Function Coefficients
blank Func 1 Func 2
INCOME 1.04740 −0.42076
TRAVEL 0.33991 0.76851
VACATION −0.14198 0.53354
HSIZE −0.16317 0.12932
AGE 0.49474 0.52447
Structure Matrix
Pooled within-groups correlations between discriminating variables and canonical
discriminant functions (variables ordered by size of correlation within function).
Results of Three-Group Discriminant Analysis (4 of 6)
[Continued]
blank Func 1 Func 2
INCOME 0.85556* −0.27833
HSIZE 0.19319* 0.07749
VACATION 0.21935 0.58829*
TRAVEL 0.14899 0.45362*
AGE 0.16576 0.34079*
[Continued]
Group Func 1 Func 2
1 −2.04100 0.41847
2 −0.40479 −0.65867
3 2.44578 0.24020
[Continued]
aCross-validation is done only for those cases in the analysis. In cross-validation, each
case is classified by the functions derived from all cases other than that case.
b86.7% of original grouped cases correctly classified.
c66.7% of cross-validated grouped cases correctly classified.
Classification Results for Cases Not Selected for Use in the Analysis
Predicted Group Membership
blank Actual Group No. of Cases 1 2 3
Group 1 4 3 1 0
blank blank blank 75.0% 25.0% 0.0%
Group 2 4 0 3 1
blank blank blank 0.0% 75.0% 25.0%
Group 3 4 1 0 3
blank blank blank 25.0% 0.0% 75.0%
𝑃
log 𝑒 = 𝑎0 + 𝑎1 𝑋1 + 𝑎2 𝑋2 + ⋯ + 𝑎𝑘 𝑋𝑘
1−𝑃
Or 𝑛
𝑃
log 𝑒 = 𝑎𝑖 𝑋𝑖
1−𝑃
𝑖=0
Model Formulation
exp σ𝑘𝑖=0 𝑎𝑖 𝑋𝑖
𝑃=
1 + exp σ𝑘𝑖=0 𝑎𝑖 𝑋𝑖
Where
P = Probability of success
Xi = Independent variable i
ai = parameter to be estimated.
Properties of the Logit Model
• Although Xi may vary from −∞ to +∞, P is constrained to lie between 0 and 1.
• The sign of ai will determine whether the probability increases (if the sign is
positive) or decreases (if the sign is negative) by this amount.
Explaining Brand Loyalty (1 of 2)
Explaining Brand Loyalty
No. Loyalty Brand Product Shopping
1 1 4 3 5
2 1 6 4 4
3 1 5 2 4
4 1 7 5 5
5 1 6 3 4
6 1 3 4 5
7 1 5 5 5
8 1 5 4 2
9 1 7 5 4
10 1 7 6 4
11 1 6 7 2
12 1 5 6 4
13 1 7 3 3
14 1 5 1 4
15 1 7 5 5
Explaining Brand Loyalty (2 of 2)
[Continued]
No. Loyalty Brand Product Shopping
16 0 3 1 3
17 0 4 6 2
18 0 2 5 2
19 0 5 2 4
20 0 4 1 3
21 0 3 3 4
22 0 3 4 5
23 0 3 6 3
24 0 4 4 2
25 0 6 3 6
26 0 3 6 3
27 0 4 3 2
28 0 3 5 2
29 0 5 5 3
30 0 1 3 2
Results of Logistic Regression (1 of 2)
Results of Binary Logit Model or Logistic Regression
Model Summary
Step −2 Log Likelihood Cox & Snell R Square Nagelkerke R Square
1 23.471a .453 .604
aEstimation terminated at iteration number 6 because parameter estimates changed by
less than .001.
Results of Logistic Regression (2 of 2)
[Continued]
Classification Tablea
blank blank blank blank Predicted blank
blank blank blank Loyalty to the Brand blank blank
blank Observed blank Not Loyal Loyal Percentage Correct
Step 1 Loyalty to the brand Not loyal 12 3 80.0
blank blank Loyal 3 12 80.0
blank Overall percentage blank blank blank 80.0
aThe cut Value is .500
Variables in the Equation
blank blank B S.E. Wald df Sig. Exp (B)
Step 1a Brand 1.274 .479 7.075 1 .008 3.575
blank Product .186 .322 .335 1 .563 1.205
blank Shopping .590 .491 1.442 1 .230 1.804
blank Constant -8.642 3.346 6.672 1 .010 .000
aVariable(s) entered on step 1: Brand, Product, Shopping.
SAS Enterprise Guide
Both two-group and multiple discriminant analysis can be performed using the
Discriminant Analysis task within SAS Enterprise Guide. To select this task, click:
Analyze>Multivariate>Discriminant Analysis
To run logit analysis or logistic regression using SAS Enterprise Guide, click:
One Shot!