Slides Nmds
Slides Nmds
Non-metric Multidimensional
Scaling (NMDS)
Objective: Group data points into classes of similar points based on a
series of variables
BUT (unlike PCA which uses Euclidian distances) NMDS relies on rank orders
(distances) for ordination (i.e non-metric)
The use of distances omits some of the issues associated with using predictor
variables alone (e.g., sensitivity to transformation)
Allows for much more flexible technique that accepts a variety of data types
Shepard 1962
Kruskal 1964 Contributed to the development of
Tprgersen & Meuser 1962 multidimensional scaling
Guttman 1968
The math behind NMDS
NMDS is an iterative procedure which takes place over several steps:
1. Define the original data point positions in multidimensional space
2. Specify the number of reduced dimensions you want (typically 2)
3. Construct an initial configuration of the data in 2-dimensions
4. Compare distances in this initial 2D configuration against the calculated
distances
5. Determine the stress on data points
6. Correct the position of the points in 2D to optimize the stress for all points
The math behind NMDS
Consider a 3 variable analysis with 4 data points
Euclidian Plot in 2D by distance
Variable 2 (could be any distance matrix)
A B C D
D
A 0 1.6 2.6 2.4 2.6
A C
B 1.6 0 2.5 3.3
Variable 3
C 2.6 2.5 0 1.7
1.6
C D 2.4 3.3 1.7 0 2.6
A
B
3.3
D
B
Variable 1
Think of optimizing stress as: “Pulling on all points a little bit so no single point is
completely wrong, all points are a little off compared to distances”
Distance matrix
Mahalanobis is good for
correlated variables