What Is Wilks Lambda
What Is Wilks Lambda
Here, the determinant of the error sums of squares and cross products matrix E is
divided by the determinant of the total sum of squares and cross products matrix T =
H + E. If H is large relative to E, then |H + E| will be large relative to |E|. Thus, we will
reject the null hypothesis if Wilks lambda is small (close to zero).
Wilks’ lambda assesses the differences between two or more groups on multiple
variables at once. It is the multivariate version of the F-test statistic in one-way
ANOVA, which examines the differences between multiple groups on one variable.
In other words, Wilks’ lambda tests for differences between groups on a vector of
variables. Before introducing the method formally, it is important to understand the
assumptions behind it.
Assumptions behind the Method
All statistical tests rely on some underlying assumptions. If your data violate any of
the assumptions, you might still be able to perform the test, but the test results may
not have desirable statistical properties (unbiasedness, etc.) and so should be taken
with caution. Moreover, understanding the assumptions of a statistical will help you
improve your research-design and data-collection efforts. There
Are six critical assumptions underlying the use of Wilks’ lambda:
• The dependent variables in question are measured on an ordinal or
Continuous scale. A common example of ordinal variables is the Likert scale, e.g., a
5-point scale from “strongly disagree” to “strongly agree.” Continuous variables are
quite common, such as body weight, exam scores, blood pressure, heart rate, etc.
• The subjects are independent of each other. This relates to the sampling design of
the data collection process and hence cannot be directly evaluated with the data.
• Relationships between the continuous variables of interest, if they exist, are linear.
This can be visually inspected by plotting every pair of the variables with a
scatterplot.
• Variances and covariance’s of the continuous dependent variables are equal
across groups.
• There are no outliers in the sample. This can be visually inspected using boxplots
or identified quantitatively using Mahalanobis distance.
• The dependent variables are normally distributed within groups. In practice, it is
common that this condition is not met, but Wilks’ lambda is fairly robust to moderate
violations of the normality assumption.
Calculating Wilks’ Lambda
Wilks’ lambda assesses the proportion of variance in the dependent variables that is
not accounted for by the intergroup variations. If this proportion is small, then it
suggests that the dependent variables vary a lot across groups and the groups might
have different mean values for the dependent variables.
To calculate Wilks’ lambda, we first need to introduce a few notations. Suppose that
there are n subjects in total, divided into m groups with nj subjects in-group
j, and P dependent variables. Denote the value of variable p for subject i in group
j as xijp, and let Xij be the column vector(xij1, ⋯ , xijP)T. Wilks’ Lambda can be
calculated as the following:
• Calculate the intergroup cross product matrix H (the variations across
Groups):
Wilks’ lambda is small when |H| is larger than |E| or the intergroup variation is larger
than the residual variation. The distribution of Wilks’ lambda is complicated, but it
can be approximated by an F-distribution. One tends to reject the null hypothesis of
no difference if the resulting Wilks’ lambda is small with a significant p-value.