
An Introduction to Locally Linear Embedding

Lawrence K. Saul
AT&T Labs – Research
180 Park Ave, Florham Park, NJ 07932 USA
[email protected]

Sam T. Roweis
Gatsby Computational Neuroscience Unit, UCL
17 Queen Square, London WC1N 3AR, UK
[email protected]

Abstract
Many problems in information processing involve some form of dimension-
ality reduction. Here we describe locally linear embedding (LLE), an unsu-
pervised learning algorithm that computes low dimensional, neighborhood
preserving embeddings of high dimensional data. LLE attempts to discover
nonlinear structure in high dimensional data by exploiting the local symme-
tries of linear reconstructions. Notably, LLE maps its inputs into a single
global coordinate system of lower dimensionality, and its optimizations—
though capable of generating highly nonlinear embeddings—do not involve
local minima. We illustrate the method on images of lips used in audiovisual
speech synthesis.

1 Introduction

Many problems in statistical pattern recognition begin with the preprocessing of multidimensional signals, such as images of faces or spectrograms of speech.
Often, the goal of preprocessing is some form of dimensionality reduction: to com-
press the signals in size and to discover compact representations of their variability.
Two popular forms of dimensionality reduction are the methods of principal com-
ponent analysis (PCA) [1] and multidimensional scaling (MDS) [2]. Both PCA and
MDS are eigenvector methods designed to model linear variabilities in high dimen-
sional data. In PCA, one computes the linear projections of greatest variance from the top eigenvectors of the data covariance matrix. In classical (or metric) MDS,
one computes the low dimensional embedding that best preserves pairwise dis-
tances between data points. If these distances correspond to Euclidean distances,
the results of metric MDS are equivalent to PCA. Both methods are simple to
implement, and their optimizations do not involve local minima. These virtues ac-
count for the widespread use of PCA and MDS, despite their inherent limitations
as linear methods.
Recently, we introduced an eigenvector method—called locally linear embedding
(LLE)—for the problem of nonlinear dimensionality reduction[4]. This problem
is illustrated by the nonlinear manifold in Figure 1. In this example, the dimen-
sionality reduction by LLE succeeds in identifying the underlying structure of the
manifold, while projections of the data by PCA or metric MDS map faraway data
points to nearby points in the plane. Like PCA and MDS, our algorithm is sim-
ple to implement, and its optimizations do not involve local minima. At the same
time, however, it is capable of generating highly nonlinear embeddings. Note that
mixture models for local dimensionality reduction[5, 6], which cluster the data and
perform PCA within each cluster, do not address the problem considered here—
namely, how to map high dimensional data into a single global coordinate system
of lower dimensionality.
In this paper, we review the LLE algorithm in its most basic form and illustrate a
potential application to audiovisual speech synthesis[3].
[Figure 1 appears here, with panels (A), (B), and (C).]

Figure 1: The problem of nonlinear dimensionality reduction, as illustrated for three dimensional data (B) sampled from a two dimensional manifold (A). An unsupervised learning algorithm must discover the global internal coordinates of the manifold without signals that explicitly indicate how the data should be embedded in two dimensions. The shading in (C) illustrates the neighborhood-preserving mapping discovered by LLE.

2 Algorithm

The LLE algorithm, summarized in Fig. 2, is based on simple geometric intuitions. Suppose the data consist of $N$ real-valued vectors $\vec{X}_i$, each of dimensionality $D$,
sampled from some smooth underlying manifold. Provided there is sufficient data
(such that the manifold is well-sampled), we expect each data point and its neigh-
bors to lie on or close to a locally linear patch of the manifold.
We can characterize the local geometry of these patches by linear coefficients that
reconstruct each data point from its neighbors. In the simplest formulation of LLE,
one identifies $K$ nearest neighbors per data point, as measured by Euclidean dis-
tance. (Alternatively, one can identify neighbors by choosing all points within a
ball of fixed radius, or by using more sophisticated rules based on local metrics.)
Reconstruction errors are then measured by the cost function:

\[
  \varepsilon(W) \;=\; \sum_i \Big| \vec{X}_i - \sum_j W_{ij}\, \vec{X}_j \Big|^2   \qquad (1)
\]
which adds up the squared distances between all the data points and their reconstructions. The weights $W_{ij}$ summarize the contribution of the $j$th data point to the $i$th reconstruction. To compute the weights $W_{ij}$, we minimize the cost function subject to two constraints: first, that each data point $\vec{X}_i$ is reconstructed only from its neighbors, enforcing $W_{ij} = 0$ if $\vec{X}_j$ does not belong to this set; second, that the rows of the weight matrix sum to one: $\sum_j W_{ij} = 1$. The reason for the
sum-to-one constraint will become clear shortly. The optimal weights subject
to these constraints are found by solving a least squares problem, as discussed in
Appendix A.
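As a concrete illustration of eq. (1), the following minimal numpy sketch (our own, not part of the original paper; the array names X and W are ours) evaluates the reconstruction cost for a given weight matrix.

```python
import numpy as np

def reconstruction_cost(X, W):
    """Evaluate eq. (1): sum_i | X_i - sum_j W_ij X_j |^2.

    X : (N, D) array of data points.
    W : (N, N) array of reconstruction weights; each row sums to one
        and is nonzero only on the K neighbors of that point.
    """
    residuals = X - W @ X          # each row is X_i minus its reconstruction
    return np.sum(residuals ** 2)  # squared distances, summed over all points
```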
Note that the constrained weights that minimize these reconstruction errors obey an
important symmetry: for any particular data point, they are invariant to rotations,
rescalings, and translations of that data point and its neighbors. The invariance to
rotations and rescalings follows immediately from the form of eq. (1); the invari-
ance to translations is enforced by the sum-to-one constraint on the rows of the
weight matrix. A consequence of this symmetry is that the reconstruction weights
characterize intrinsic geometric properties of each neighborhood, as opposed to
properties that depend on a particular frame of reference.

Suppose the data lie on or near a smooth nonlinear manifold of dimensionality $d \ll D$. To a good approximation, then, there exists a linear mapping—consisting of a translation, rotation, and rescaling—that maps the high dimensional coordinates of each neighborhood to global internal coordinates on the manifold. By design, the reconstruction weights $W_{ij}$ reflect intrinsic geometric properties of the data that are invariant to exactly such transformations. We therefore expect their characterization of local geometry in the original data space to be equally valid for local patches on the manifold. In particular, the same weights $W_{ij}$ that reconstruct the $i$th data point in $D$ dimensions should also reconstruct its embedded manifold coordinates in $d$ dimensions.
(Informally, imagine taking a pair of scissors, cutting out locally linear patches
of the underlying manifold, and placing them in the low dimensional embedding
space. Assume further that this operation is done in a way that preserves the angles
formed by each data point to its nearest neighbors. In this case, the transplantation
of each patch involves no more than a translation, rotation, and rescaling of its
data, exactly the operations to which the weights are invariant. Thus, when the
patch arrives at its low dimensional destination, we expect the same weights to
reconstruct each data point from its neighbors.)

LLE constructs a neighborhood preserving mapping based on the above idea. In the final step of the algorithm, each high dimensional observation $\vec{X}_i$ is mapped to a low dimensional vector $\vec{Y}_i$ representing global internal coordinates on the manifold. This is done by choosing $d$-dimensional coordinates $\vec{Y}_i$ to minimize the embedding cost function:

\[
  \Phi(Y) \;=\; \sum_i \Big| \vec{Y}_i - \sum_j W_{ij}\, \vec{Y}_j \Big|^2   \qquad (2)
\]

This cost function—like the previous one—is based on locally linear reconstruction errors, but here we fix the weights $W_{ij}$ while optimizing the coordinates $\vec{Y}_i$. The embedding cost in Eq. (2) defines a quadratic form in the vectors $\vec{Y}_i$. Subject to constraints that make the problem well-posed, it can be minimized by solving a sparse $N \times N$ eigenvector problem, whose bottom $d$ non-zero eigenvectors provide an ordered set of orthogonal coordinates centered on the origin. Details of this eigenvector problem are discussed in Appendix B.
Note that while the reconstruction weights for each data point are computed from its local neighborhood—independent of the weights for other data points—the embedding coordinates are computed by an $N \times N$ eigensolver, a global operation that
couples all data points in connected components of the graph defined by the weight
matrix. The different dimensions in the embedding space can be computed succes-
sively; this is done simply by computing the bottom eigenvectors from eq. (2) one
at a time. But the computation is always coupled across data points. This is how
the algorithm leverages overlapping local information to discover global structure.
Implementation of the algorithm is fairly straightforward, as the algorithm has only

one free parameter: the number of neighbors per data point, $K$. Once neighbors

LLE ALGORITHM

1. Compute the neighbors of each data point, $\vec{X}_i$.

2. Compute the weights $W_{ij}$ that best reconstruct each data point $\vec{X}_i$ from its neighbors, minimizing the cost in eq. (1) by constrained linear fits.

3. Compute the vectors $\vec{Y}_i$ best reconstructed by the weights $W_{ij}$, minimizing the quadratic form in eq. (2) by its bottom nonzero eigenvectors.

Figure 2: Summary of the LLE algorithm, mapping high dimensional data points, $\vec{X}_i$, to low dimensional embedding vectors, $\vec{Y}_i$.

are chosen, the optimal weights $W_{ij}$ and coordinates $\vec{Y}_i$ are computed by standard methods in linear algebra. The algorithm involves a single pass through the three steps in Fig. 2 and finds global minima of the reconstruction and embedding costs in Eqs. (1) and (2). As discussed in Appendix A, in the unusual case where the neighbors outnumber the input dimensionality ($K > D$), the least squares problem
for finding the weights does not have a unique solution, and a regularization term—
for example, one that penalizes the squared magnitudes of the weights—must be
added to the reconstruction cost.
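To make the three steps of Fig. 2 concrete, here is a minimal numpy sketch of the basic algorithm. This is our own illustrative code rather than the authors' implementation: the function and variable names are ours, the neighbor search is brute force, and the local covariance is always conditioned as in eq. (6) of Appendix A with an arbitrary constant.

```python
import numpy as np

def lle(X, K, d, reg=1e-3):
    """Minimal LLE sketch: X is an (N, D) data array, K neighbors, d output dims."""
    N, D = X.shape

    # Step 1: K nearest neighbors by Euclidean distance (brute force, O(D N^2)).
    dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    np.fill_diagonal(dists, np.inf)
    neighbors = np.argsort(dists, axis=1)[:, :K]

    # Step 2: reconstruction weights by constrained least squares (Appendix A).
    W = np.zeros((N, N))
    for i in range(N):
        Z = X[neighbors[i]] - X[i]            # neighbors shifted to the data point
        C = Z @ Z.T                           # local covariance matrix, eq. (4)
        C += np.eye(K) * reg * np.trace(C)    # conditioning, as in eq. (6)
        w = np.linalg.solve(C, np.ones(K))    # solve C w = 1 ...
        W[i, neighbors[i]] = w / w.sum()      # ... and rescale to sum to one

    # Step 3: bottom eigenvectors of M = (I - W)^T (I - W), see Appendix B.
    M = (np.eye(N) - W).T @ (np.eye(N) - W)
    eigvals, eigvecs = np.linalg.eigh(M)      # eigenvalues in ascending order
    # Discard the constant eigenvector (eigenvalue zero); keep the next d,
    # rescaled so the embedding has unit covariance as in eq. (10).
    return eigvecs[:, 1:d + 1] * np.sqrt(N)
```

A call such as Y = lle(X, 10, 2), with the number of neighbors chosen by the user, maps the rows of X to two dimensional coordinates.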


The algorithm, as described in Fig. 2, takes as input the high dimensional vectors, $\vec{X}_i$. In many settings, however, the user may not have access to data of this
form, but only to measurements of dissimilarity or pairwise distance between dif-
ferent data points. A simple variation of LLE, described in Appendix C, can be
applied to input of this form. In this way, matrices of pairwise distances can be
analyzed by LLE just as easily as MDS[2]; in fact only a small fraction of all pos-
sible pairwise distances (representing distances between neighboring points and
their respective neighbors) are required for running LLE.

3 Examples

The embeddings discovered by LLE are easiest to visualize for intrinsically two
dimensional manifolds. In Fig. 1, for example, the input to LLE consisted of $N$ data points sampled off the S-shaped manifold. The resulting embedding shows how the algorithm, using $K$ neighbors per data point, successfully unraveled the underlying two dimensional structure.
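Readers who want to reproduce a picture of this kind without implementing the algorithm themselves can use an off-the-shelf implementation; the snippet below is a rough analogue of this experiment using scikit-learn (the sample size and neighbor count are arbitrary choices of ours, not the values used in the paper).

```python
from sklearn.datasets import make_s_curve
from sklearn.manifold import LocallyLinearEmbedding

# Sample points from an S-shaped two dimensional manifold embedded in 3-D.
X, color = make_s_curve(n_samples=1000, random_state=0)

# Map them to two dimensions with LLE; the neighbor count is illustrative.
lle = LocallyLinearEmbedding(n_neighbors=8, n_components=2)
Y = lle.fit_transform(X)
print(Y.shape)  # (1000, 2)
```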

Fig. 3 shows another two dimensional manifold, this one living in a much higher
dimensional space. Here, we generated examples—shown in the middle panel of
the figure—by translating the image of a single face across a larger background
of random noise. The noise was uncorrelated from one example to the next. The
only consistent structure in the resulting images thus described a two-dimensional manifold parameterized by the face's center of mass. The input to LLE consisted of $N$ grayscale images, with each image containing a face superimposed on a background of noise. Note that while simple to visualize, the manifold of translated faces is highly nonlinear in the high dimensional vector space of pixel coordinates (of dimensionality $D$). The bottom portion of Fig. 3 shows the first two components discovered by LLE, with $K$ neighbors per data point.
By contrast, the top portion shows the first two components discovered by PCA. It
is clear that the manifold structure in this example is much better modeled by LLE.
Finally, in addition to these examples, for which the true manifold structure was known, we also applied LLE to images of lips used in the animation of talking heads[3]. Our database contained $N$ color (RGB) images of lips at a fixed resolution; dimensionality reduction of these images (of dimensionality $D$) is useful for faster and more efficient animation. The top and bottom panels of Fig. 4 show the first two components discovered, respectively, by PCA and LLE (with $K$ neighbors per data point).
If the lip images described a nearly linear manifold, these two methods would
yield similar results; thus, the significant differences in these embeddings reveal
the presence of nonlinear structure. Note that while the linear projection by PCA
has a somewhat uniform distribution about its mean, the locally linear embedding
has a distinctly spiny structure, with the tips of the spines corresponding to extremal
configurations of the lips.

4 Discussion

It is worth noting that many popular learning algorithms for nonlinear dimension-
ality reduction do not share the favorable properties of LLE. Iterative hill-climbing
methods for autoencoder neural networks[7, 8], self-organizing maps[9], and latent
variable models[10] do not have the same guarantees of global optimality or con-
vergence; they also tend to involve many more free parameters, such as learning
rates, convergence criteria, and architectural specifications.


The different steps of LLE have the following complexities. In Step 1, computing nearest neighbors scales (in the worst case) as $O(DN^2)$, or linearly in the input dimensionality, $D$, and quadratically in the number of data points, $N$.

Figure 3: The results of PCA (top) and LLE (bottom), applied to images of a single
face translated across a two-dimensional background of noise. Note how LLE
maps the images with corner faces to the corners of its two dimensional embedding,
while PCA fails to preserve the neighborhood structure of nearby images.

Figure 4: Images of lips mapped into the embedding space described by the first
two coordinates of PCA (top) and LLE (bottom). Representative lips are shown
next to circled points in different parts of each space. The differences between the
two embeddings indicate the presence of nonlinear structure in the data.

For many data distributions, however, and especially for data distributed on a thin submanifold of the observation space, constructions such as K-D trees can be used to compute the neighbors in $O(N \log N)$ time[13]. In Step 2, computing the reconstruction weights scales as $O(DNK^3)$; this is the number of operations required to solve a $K \times K$ set of linear equations for each data point. In Step 3, computing the bottom eigenvectors scales as $O(dN^2)$, linearly in the number of embedding dimensions, $d$, and quadratically in the number of data points, $N$. Methods for sparse eigenproblems[14], however, can be used to reduce the complexity to subquadratic in $N$. Note that as more dimensions are added to the embedding space, the existing ones do not change, so that LLE does not have to be rerun to compute higher dimensional embeddings. The storage requirements of LLE are limited by the weight matrix, which is of size $N \times K$.
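As a sketch of the K-D tree shortcut for Step 1 mentioned above (our own example, using scipy's cKDTree; the data sizes are arbitrary):

```python
import numpy as np
from scipy.spatial import cKDTree

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 10))        # N = 5000 points in D = 10 dimensions

tree = cKDTree(X)                      # build the tree once
# Query K+1 neighbors because each point is its own nearest neighbor.
dists, idx = tree.query(X, k=13)
neighbors = idx[:, 1:]                 # drop the self-match; K = 12 neighbors
```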
LLE illustrates a general principle of manifold learning, elucidated by Tenenbaum
et al[11], that overlapping local neighborhoods—collectively analyzed—can pro-
vide information about global geometry. Many virtues of LLE are shared by the
Isomap algorithm[11], which has been successfully applied to similar problems
in nonlinear dimensionality reduction. Isomap is an extension of MDS in which
embeddings are optimized to preserve “geodesic” distances between pairs of data
points; these distances are estimated by computing shortest paths through large
sublattices of data. A virtue of LLE is that it avoids the need to solve large dy-
namic programming problems. LLE also tends to accumulate very sparse matrices,
whose structure can be exploited for savings in time and space.
LLE is likely to be even more useful in combination with other methods in data
analysis and statistical learning. An interesting and important question is how to learn a parametric mapping between the observation and embedding spaces, given the results of LLE. One possible approach is to use $(\vec{X}_i, \vec{Y}_i)$ pairs as labeled ex-
amples for statistical models of supervised learning. The ability to learn such map-
pings should make LLE broadly useful in many areas of information processing.
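One simple instance of this idea, sketched below with stand-in data, is to fit a nonparametric regressor from observations to their LLE coordinates, which can then place previously unseen points in the embedding (the regressor choice and all names here are ours, not a recommendation from the paper):

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

# Stand-ins for the (X_i, Y_i) pairs produced by LLE (shapes are illustrative).
rng = np.random.default_rng(0)
X_train = rng.normal(size=(500, 20))   # high dimensional observations
Y_train = rng.normal(size=(500, 2))    # their low dimensional embeddings

# Fit a nonparametric map from observation space to embedding space ...
mapping = KNeighborsRegressor(n_neighbors=10).fit(X_train, Y_train)

# ... and use it to embed new points without rerunning LLE.
X_new = rng.normal(size=(5, 20))
Y_new = mapping.predict(X_new)         # shape (5, 2)
```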

A Constrained Least Squares Problem

The constrained weights that best reconstruct each data point from its neighbors can be computed in closed form. Consider a particular data point $\vec{x}$ with $K$ nearest neighbors $\vec{\eta}_j$ and reconstruction weights $w_j$ that sum to one. We can write the reconstruction error as:

\[
  \varepsilon \;=\; \Big| \vec{x} - \sum_j w_j \vec{\eta}_j \Big|^2
  \;=\; \Big| \sum_j w_j \big( \vec{x} - \vec{\eta}_j \big) \Big|^2
  \;=\; \sum_{jk} w_j w_k\, C_{jk}   \qquad (3)
\]
where in the first identity, we have exploited the fact that the weights sum to one,
and in the second identity, we have introduced the local covariance matrix,
\[
  C_{jk} \;=\; \big( \vec{x} - \vec{\eta}_j \big) \cdot \big( \vec{x} - \vec{\eta}_k \big)   \qquad (4)
\]

This error can be minimized in closed form, using a Lagrange multiplier to enforce the constraint that $\sum_j w_j = 1$. In terms of the inverse local covariance matrix, the optimal weights are given by:

\[
  w_j \;=\; \frac{\sum_k C^{-1}_{jk}}{\sum_{lm} C^{-1}_{lm}}   \qquad (5)
\]

The solution, as written in eq. (5), appears to require an explicit inversion of the local covariance matrix. In practice, a more efficient way to minimize the error is simply to solve the linear system of equations, $\sum_k C_{jk} w_k = 1$, and then to rescale the weights so that they sum to one (which yields the same result). By construction, the local covariance matrix in eq. (4) is symmetric and semipositive definite. If the covariance matrix is singular or nearly singular—as arises, for example, when there are more neighbors than input dimensions ($K > D$), or when the data points are not in general position—it can be conditioned (before solving the system) by adding a small multiple of the identity matrix,

\[
  C_{jk} \;\leftarrow\; C_{jk} + \delta_{jk}\, \Delta   \qquad (6)
\]

where $\Delta$ is small compared to the trace of $C$. This amounts to penalizing large
weights that exploit correlations beyond some level of precision in the data sam-
pling process.
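The equivalence of the two routes, the closed form of eq. (5) versus solving the linear system and rescaling, is easy to check numerically; the sketch below uses random data of our own choosing:

```python
import numpy as np

rng = np.random.default_rng(0)
K, D = 6, 10
x = rng.normal(size=D)                 # a data point ...
eta = rng.normal(size=(K, D))          # ... and its K nearest neighbors

# Local covariance matrix, eq. (4).
Z = x - eta
C = Z @ Z.T

# Route 1: closed form of eq. (5) via the inverse covariance matrix.
Cinv = np.linalg.inv(C)
w_closed = Cinv.sum(axis=1) / Cinv.sum()

# Route 2: solve C w = 1, then rescale the weights to sum to one.
w_solve = np.linalg.solve(C, np.ones(K))
w_solve /= w_solve.sum()

print(np.allclose(w_closed, w_solve))  # True
```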

B Eigenvector Problem
The embedding vectors $\vec{Y}_i$ are found by minimizing the cost function, eq. (2), for fixed weights $W_{ij}$:

\[
  \Phi(Y) \;=\; \sum_i \Big| \vec{Y}_i - \sum_j W_{ij}\, \vec{Y}_j \Big|^2   \qquad (7)
\]

Note that the cost defines a quadratic form,

\[
  \Phi(Y) \;=\; \sum_{ij} M_{ij} \, \big( \vec{Y}_i \cdot \vec{Y}_j \big)
\]

involving inner products of the embedding vectors and the $N \times N$ matrix $M$:

\[
  M_{ij} \;=\; \delta_{ij} - W_{ij} - W_{ji} + \sum_k W_{ki} W_{kj}   \qquad (8)
\]

where $\delta_{ij}$ is 1 if $i = j$ and 0 otherwise.

This optimization is performed subject to constraints that make the problem well posed. It is clear that the coordinates $\vec{Y}_i$ can be translated by a constant displacement without affecting the cost, $\Phi(Y)$. We remove this degree of freedom by requiring the coordinates to be centered on the origin:

\[
  \sum_i \vec{Y}_i \;=\; \vec{0}   \qquad (9)
\]

Also, to avoid degenerate solutions, we constrain the embedding vectors to have unit covariance, with outer products that satisfy

\[
  \frac{1}{N} \sum_i \vec{Y}_i \otimes \vec{Y}_i \;=\; I   \qquad (10)
\]

where $I$ is the $d \times d$ identity matrix. Note that there is no loss in generality in constraining the covariance of $\vec{Y}$ to be diagonal and of order unity, since the cost
function in eq. (2) is invariant to rotations and homogeneous rescalings. The further
constraint that the covariance is equal to the identity matrix expresses an assump-
tion that reconstruction errors for different coordinates in the embedding space
should be measured on the same scale.

The optimal embedding—up to a global rotation of the embedding space—is found by computing the bottom $d+1$ eigenvectors of the matrix $M$; this is a version of the Rayleigh-Ritz theorem[12]. The bottom eigenvector of this matrix, which we discard, is the unit vector with all equal components; it represents a free translation mode of eigenvalue zero. Discarding this eigenvector enforces the constraint that the embeddings have zero mean, since the components of other eigenvectors must sum to zero, by virtue of orthogonality. The remaining $d$ eigenvectors form the $d$ embedding coordinates found by LLE.
Note that the bottom $d+1$ eigenvectors of the matrix $M$ (that is, those corresponding to its smallest eigenvalues) can be found without performing a full matrix diagonalization[14]. Moreover, the matrix $M$ can be stored and manipulated as the sparse symmetric matrix

\[
  M \;=\; (I - W)^{\mathsf T} (I - W)   \qquad (11)
\]
giving substantial computational savings for large values of $N$. In particular, left multiplication by $M$ (the subroutine required by most sparse eigensolvers) can be performed as

\[
  M \vec{v} \;=\; \big( \vec{v} - W \vec{v} \big) - W^{\mathsf T} \big( \vec{v} - W \vec{v} \big)   \qquad (12)
\]

requiring just one multiplication by $W$ and one multiplication by $W^{\mathsf T}$, both of which are extremely sparse. Thus, the matrix $M$ never needs to be explicitly created or stored; it is sufficient to store and multiply the matrix $W$.
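A sketch of this strategy using scipy's sparse tools (our own illustration; the solver options are choices of ours): the LinearOperator implements eq. (12), so the matrix M is never formed explicitly.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import LinearOperator, eigsh

def bottom_embedding(W, d):
    """Bottom d+1 eigenvectors of M = (I - W)^T (I - W) without forming M.

    W is an (N, N) weight matrix (dense or sparse) whose rows sum to one.
    """
    N = W.shape[0]
    W = sp.csr_matrix(W)                     # store only the N*K nonzero weights

    def matvec(v):
        # Eq. (12): M v = (v - W v) - W^T (v - W v).
        r = v - W @ v
        return r - W.T @ r

    M = LinearOperator((N, N), matvec=matvec, dtype=float)
    vals, vecs = eigsh(M, k=d + 1, which='SM')   # smallest eigenvalues
    order = np.argsort(vals)
    return vecs[:, order[1:]]                # discard the constant eigenvector
```

In practice, building the sparse matrix of eq. (11) explicitly and handing it to a shift-invert eigensolver tends to converge faster; the operator form above is simply the most memory-frugal variant.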

C LLE from Pairwise Distances

LLE can be applied to user input in the form of pairwise distances. In this case,
nearest neighbors are identified by the smallest non-zero elements of each row in
the distance matrix. To derive the reconstruction weights for each data point, we need to compute the local covariance matrix $C_{jk}$ between its nearest neighbors,
as defined by eq. (4) in appendix A. This can be done by exploiting the usual
relation between pairwise distances and dot products that forms the basis of metric
MDS[2]. Thus, for a particular data point, we set:
\[
  C_{jk} \;=\; \frac{1}{2} \big( D_j + D_k - D_{jk} - D_0 \big)   \qquad (13)
\]

where $D_{jk}$ denotes the squared distance between the $j$th and $k$th neighbors, $D_j = \frac{1}{K} \sum_k D_{jk}$, and $D_0 = \frac{1}{K^2} \sum_{jk} D_{jk}$. In terms of this local covariance matrix, the
reconstruction weights for each data point are given by eq. (5). The rest of the
algorithm proceeds as usual.
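A sketch of this conversion (our own code, written against the reconstruction of eq. (13) above; D2 is a name we introduce for the submatrix of squared pairwise distances between one point's neighbors):

```python
import numpy as np

def local_covariance_from_distances(D2):
    """Eq. (13): local covariance matrix from squared neighbor distances.

    D2 : (K, K) matrix of squared pairwise distances between the K
         nearest neighbors of one data point.
    """
    Dj = D2.mean(axis=1)                   # D_j = (1/K)   sum_k  D_jk
    D0 = D2.mean()                         # D_0 = (1/K^2) sum_jk D_jk
    return 0.5 * (Dj[:, None] + Dj[None, :] - D2 - D0)
```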
Note that this variant of LLE requires significantly less user input than the com-
plete matrix of pairwise distances. Instead, for each data point, the user needs
only to specify its nearest neighbors and the submatrix of pairwise distances be-
tween those neighbors. Is it possible to recover manifold structure from even less
user input—say, just the pairwise distances between each data point and its near-
est neighbors? A simple counterexample shows that this is not possible. Consider the square lattice of three dimensional data points whose integer coordinates sum to zero. Imagine that points with even $x$-coordinates are colored black, and that points with odd $x$-coordinates are colored red. The “two point” embedding that maps all
black points to the origin and all red points to one unit away preserves the distance
between each point and its four nearest neighbors. Nevertheless, this embedding
completely fails to preserve the underlying structure of the original manifold.

Acknowledgements

The authors thank E. Cosatto, H.P. Graf, and Y. LeCun (AT&T Labs) and B. Frey
(U. Toronto) for providing data for these experiments. S. Roweis acknowledges
the support of the Gatsby Charitable Foundation, the National Science Foundation,
and the National Sciences and Engineering Research Council of Canada.

References
[1] I.T. Jolliffe, Principal Component Analysis (Springer-Verlag, New York, 1989).
[2] T. Cox and M. Cox. Multidimensional Scaling (Chapman & Hall, London, 1994).
[3] E. Cosatto and H.P. Graf. Sample-Based Synthesis of Photo-Realistic Talking-Heads.
Proceedings of Computer Animation, 103–110. IEEE Computer Society (1998).
[4] S. T. Roweis and L. K. Saul. Nonlinear dimensionality reduction by locally linear
embedding. Science 290, 2323-2326 (2000).
[5] K. Fukunaga and D. R. Olsen, An algorithm for finding intrinsic dimensionality of
data. IEEE Transactions on Computers 20(2), 176-193 (1971).
[6] N. Kambhatla and T. K. Leen. Dimension reduction by local principal component
analysis. Neural Computation 9, 1493–1516 (1997).
[7] D. DeMers and G.W. Cottrell. Nonlinear dimensionality reduction. In Advances in
Neural Information Processing Systems 5, D. Hanson, J. Cowan, L. Giles, Eds. (Mor-
gan Kaufmann, San Mateo, CA, 1993), pp. 580–587.
[8] M. Kramer. Nonlinear principal component analysis using autoassociative neural net-
works. AIChE Journal 37, 233–243 (1991).
[9] T. Kohonen. Self-organization and Associative Memory (Springer-Verlag, Berlin,
1988).
[10] C. Bishop, M. Svensen, and C. Williams. GTM: The generative topographic mapping.
Neural Computation 10, 215–234 (1998).
[11] J. B. Tenenbaum, V. de Silva, and J. C. Langford. A global geometric framework for
nonlinear dimensionality reduction. Science 290, 2319-2323 (2000).
[12] R. A. Horn and C. R. Johnson. Matrix Analysis (Cambridge University Press, Cam-
bridge, 1990).
[13] J. H. Friedman, J. L. Bentley and R. A. Finkel. An algorithm for finding best matches
in logarithmic expected time. ACM Transactions on Mathematical Software, 3(3),
209-226 (1977).
[14] Z. Bai, J. Demmel, J. Dongarra, A. Ruhe, and H. van der Vorst. Templates for the
Solution of Algebraic Eigenvalue Problems: A Practical Guide (Society for Industrial
and Applied Mathematics, Philadelphia, 2000).
