Module 2 Lab 3

This document provides a comprehensive guide to Manifold Learning, specifically focusing on ISOMAP, a non-linear dimensionality reduction technique. It explains the steps involved in ISOMAP, compares it with PCA, and discusses practical implementation tips and limitations. The guide is structured for beginners, with clear examples and explanations to facilitate understanding of complex data structures.

Detailed Explanation of Module 2 Lab 3: Manifold Learning Methods (Updated and Structured for Beginners)


This guide explains every concept and step in the Manifold Learning lab, integrating the content
provided and all your follow-up queries. It is organized for clarity, with practical examples and
beginner-friendly language.

Section 1: What is Manifold Learning?


Manifold learning is a set of techniques for reducing the dimensionality of data by finding a lower-dimensional "surface" (manifold) within a higher-dimensional space.
Many real-world datasets, though high-dimensional, actually lie on or near a much lower-dimensional curved surface (manifold).
The goal: Find a new, low-dimensional representation of the data that preserves its essential structure, especially for visualization or further analysis.

Section 2: Why Not Just Use PCA?


PCA (Principal Component Analysis) is a linear method: it works well if the data lies on a flat
(linear) subspace.
Drawbacks of PCA on curved manifolds:
PCA may need many more dimensions than the true manifold to capture the data structure.
PCA can project points that are far apart along the manifold to nearby locations, losing the true relationships [1] [2].
PCA cannot capture curved or non-linear relationships [1].

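The contrast is easy to see in code. Below is a minimal sketch using scikit-learn's `make_s_curve` as stand-in data (an assumption; the lab's own dataset may differ):

```python
from sklearn.datasets import make_s_curve
from sklearn.decomposition import PCA

# 1000 points in 3D that actually lie on a curved 2D sheet
X, color = make_s_curve(n_samples=1000, random_state=0)

# PCA projects along straight lines: points far apart *along* the curve
# can land close together in the 2D projection
X_pca = PCA(n_components=2).fit_transform(X)
print(X.shape, X_pca.shape)  # (1000, 3) (1000, 2)
```

Plotting `X_pca` colored by position along the curve would show distant parts of the S overlapping, which is exactly the failure mode described above.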
Section 3: What is ISOMAP?


ISOMAP stands for Isometric Mapping.
It is a non-linear dimensionality reduction technique based on spectral methods (it finishes with an eigendecomposition, as in classical MDS).
Key idea: Instead of preserving straight-line (Euclidean) distances, ISOMAP preserves geodesic distances, the shortest paths along the manifold rather than through the ambient space [1] [2] [3].
Result: ISOMAP can "unfold" curved data (like an S-curve) into a flat, low-dimensional
space while preserving meaningful relationships.

Section 4: How ISOMAP Works (Step-by-Step with Example)

Step 1: Construct the Neighborhood Graph


Goal: Capture local relationships by connecting each data point to its nearest neighbors [1] [2] [3] [4].

How:
For each data point, find its k nearest neighbors (using Euclidean distance).
Build a graph where each point is a node connected to its neighbors by edges weighted by their distances.
You can use either the k-nearest-neighbors method or an ε-ball (all points within a certain radius) [5].
Example:
Imagine 1000 points in 3D forming an S-curve. For each point, connect it to its 10 closest
points. The graph now represents local relationships.
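This step can be sketched with scikit-learn's `kneighbors_graph`; the S-curve data here is an assumed stand-in for the lab's dataset:

```python
from sklearn.datasets import make_s_curve
from sklearn.neighbors import kneighbors_graph

X, _ = make_s_curve(n_samples=1000, random_state=0)

# Sparse matrix: entry (i, j) holds the Euclidean distance from point i
# to point j when j is among i's 10 nearest neighbors, and is absent otherwise
graph = kneighbors_graph(X, n_neighbors=10, mode="distance")
print(graph.shape, graph.nnz)  # (1000, 1000) 10000
```

With 1000 points and 10 neighbors each, the graph stores exactly 10,000 directed edges.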

Step 2: Compute Geodesic (Shortest Path) Distances


Goal: Estimate the true "manifold" distance between all pairs of points: not the straight-line distance, but the shortest path along the graph [1] [2] [3].
How:
Use Dijkstra's algorithm (or a similar shortest-path algorithm) to find the shortest path between every pair of points in the graph [1] [5].
The sum of the edge weights along the shortest path gives the geodesic distance.
Example:
If points A and D are not directly connected, but A is connected to B, B to C, and C to D, the geodesic distance from A to D is the sum of the distances A–B, B–C, and C–D [5].

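A sketch of this step using SciPy's `shortest_path` (which implements Dijkstra's algorithm); the graph construction repeats Step 1 on assumed S-curve data:

```python
from scipy.sparse.csgraph import shortest_path
from sklearn.datasets import make_s_curve
from sklearn.neighbors import kneighbors_graph

X, _ = make_s_curve(n_samples=1000, random_state=0)
graph = kneighbors_graph(X, n_neighbors=10, mode="distance")

# method="D" selects Dijkstra; directed=False lets each edge be walked
# in both directions, so the geodesic distance matrix is symmetric
D = shortest_path(graph, method="D", directed=False)
print(D.shape)  # (1000, 1000)
```

Each entry D[i, j] is the approximate geodesic distance between points i and j; if the graph were disconnected, unreachable pairs would come out as infinity.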
Step 3: Find the Low-Dimensional Embedding (Using MDS)


Goal: Map the data into a lower-dimensional space (like 2D or 3D) while preserving geodesic distances as much as possible [1] [2].
How:
Square the geodesic distance matrix and double-center it.
Perform eigenvalue decomposition (like in PCA) to find the top eigenvectors (directions
with the most variance).
The top k eigenvectors (with the largest eigenvalues) become the axes of your new,
reduced space.
Project the data onto these axes to get the low-dimensional embedding [2] [3].
Example:
For the S-curve, ISOMAP "unfolds" the curve into a flat 2D space, revealing the underlying
2D structure.
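The double-centering and eigendecomposition described above is classical MDS. Here is a minimal NumPy sketch, checked on the four corners of a unit square, whose pairwise distances are exactly recoverable in 2D:

```python
import numpy as np

def classical_mds(D, n_components=2):
    """Embed points given a pairwise distance matrix D (classical MDS)."""
    n = D.shape[0]
    # Double-center the squared distances: B = -0.5 * J @ D^2 @ J
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ (D ** 2) @ J
    # Top eigenvectors of B, scaled by sqrt(eigenvalue), give the coordinates
    eigvals, eigvecs = np.linalg.eigh(B)          # ascending order
    idx = np.argsort(eigvals)[::-1][:n_components]
    scale = np.sqrt(np.maximum(eigvals[idx], 0))  # clip tiny negatives
    return eigvecs[:, idx] * scale

# Sanity check: 4 points at the corners of a unit square
pts = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
D = np.linalg.norm(pts[:, None] - pts[None, :], axis=-1)
Y = classical_mds(D, 2)
# The embedding should reproduce the original pairwise distances
D_rec = np.linalg.norm(Y[:, None] - Y[None, :], axis=-1)
print(np.allclose(D, D_rec))  # True
```

ISOMAP simply feeds the geodesic distance matrix from Step 2 into this procedure instead of Euclidean distances.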

Section 5: ISOMAP in Practice

Python Implementation (with scikit-learn)

from sklearn.manifold import Isomap

# X is your high-dimensional data, shape (n_samples, n_features)
embedding = Isomap(n_neighbors=10, n_components=2)
X_transformed = embedding.fit_transform(X)

n_neighbors: number of neighbors used to build the neighborhood graph.
n_components: number of dimensions of the output embedding.

Manual Steps (as in the lab notebook)


1. Compute pairwise Euclidean distances for all points.
2. Keep only the k nearest neighbors for each point to build the graph.
3. Use Dijkstra’s algorithm to compute shortest (geodesic) paths.
4. Center the squared geodesic distance matrix.
5. Perform eigenvalue decomposition and select top components.
6. Project data onto these components for the final embedding.
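Taken together, the six steps above can be sketched as follows (S-curve data is an assumed stand-in; `kneighbors_graph` handles steps 1-2 by keeping only the k nearest neighbors):

```python
import numpy as np
from scipy.sparse.csgraph import shortest_path
from sklearn.datasets import make_s_curve
from sklearn.neighbors import kneighbors_graph

# Steps 1-2: pairwise Euclidean distances, pruned to a k-NN graph
X, _ = make_s_curve(n_samples=300, random_state=0)
graph = kneighbors_graph(X, n_neighbors=10, mode="distance")

# Step 3: geodesic distances via Dijkstra over the graph
D = shortest_path(graph, method="D", directed=False)

# Steps 4-6: double-center the squared distances, eigendecompose, project
n = len(D)
J = np.eye(n) - np.ones((n, n)) / n
B = -0.5 * J @ (D ** 2) @ J
eigvals, eigvecs = np.linalg.eigh(B)
top = np.argsort(eigvals)[::-1][:2]
embedding = eigvecs[:, top] * np.sqrt(np.maximum(eigvals[top], 0))
print(embedding.shape)  # (300, 2)
```

The resulting `embedding` should closely match what `sklearn.manifold.Isomap` produces on the same data, up to rotation and sign flips.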

Section 6: ISOMAP vs. PCA


Aspect            PCA (Linear)             ISOMAP (Non-linear)
Preserves         Euclidean distances      Geodesic (manifold) distances
Handles curves?   No                       Yes
Good for          Flat, linear data        Curved, non-linear manifolds
Example           Flat plane               S-curve, Swiss roll

Section 7: Parameters and Practical Tips


Number of Neighbors (k):
Too low: the graph may break into disconnected pieces.
Too high: the graph may connect points that are not true neighbors, distorting the manifold [2] [3].
Tip: try different values and visualize the results.
Connected Graph:
Ensure the neighborhood graph is a single connected component, or results may be incoherent [2] [3].

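One way to check connectivity before running ISOMAP is SciPy's `connected_components`, sketched here on an assumed S-curve dataset:

```python
from scipy.sparse.csgraph import connected_components
from sklearn.datasets import make_s_curve
from sklearn.neighbors import kneighbors_graph

X, _ = make_s_curve(n_samples=1000, random_state=0)

# Count connected components of the neighborhood graph; more than one
# means some geodesic distances are undefined and ISOMAP will misbehave
for k in (2, 10):
    graph = kneighbors_graph(X, n_neighbors=k)
    n_parts, labels = connected_components(graph, directed=False)
    print(f"k={k}: {n_parts} connected component(s)")
```

Increasing k can only merge components, never split them, so raising k until the count reaches 1 is a simple tuning heuristic.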
Section 8: Limitations and Drawbacks


ISOMAP struggles if the manifold is poorly sampled or has holes [2] [3].
Careful parameter tuning (especially of k) is required.
It is computationally intensive for very large datasets (Landmark ISOMAP is a more efficient variant) [2].

Section 9: Other Manifold Learning Methods


LLE (Locally Linear Embedding): Preserves local relationships.
t-SNE: Focuses on preserving local structure for visualization.
UMAP: Similar to t-SNE, often faster and better at maintaining global structure.
Key difference: Each method preserves different aspects of the data’s structure.

Section 10: Example – ISOMAP on an S-Curve


Dataset: 1000 points forming an S-shaped curve in 3D.
Process:
1. Build the neighborhood graph (each point connects to 10 nearest neighbors).
2. Compute geodesic distances using shortest paths.
3. Use MDS/eigenvalue decomposition to embed data in 2D.
Result: The S-curve is “unfolded” into a flat 2D shape, revealing the true underlying
structure.

Section 11: Key Takeaways


Manifold learning (like ISOMAP) helps uncover the true, lower-dimensional structure of
complex data.
ISOMAP is powerful for non-linear dimensionality reduction, especially when data lies on a
curved manifold.
Choosing parameters (like k) and method is crucial for good results.
Visualization after reduction helps interpret and understand high-dimensional data.

If you want a deeper explanation of any step, or want to see code for a particular part, just
ask!

1. https://www.sjsu.edu/faculty/guangliang.chen/Math253S20/lec10ISOmap.pdf
2. https://www.centron.de/en/tutorial/dimension-reduction-isomap/
3. https://www.mililink.com/upload/article/1159096330aams_vol_215_march_2022_a6_p2371-2382_s._gnana_sophia,_k._k._thanammal_and_s._s._sujatha.pdf
4. https://labex.io/tutorials/ml-manifold-learning-with-scikit-learn-71115
5. https://www.youtube.com/watch?v=Xu_3NnkAI9s
