Feature Exploration PCA MNIST

This document outlines an experiment using Principal Component Analysis (PCA) on the MNIST dataset to reduce dimensionality and visualize handwritten digit images. The goal is to extract meaningful features, visualize them in a 2D space, and understand the variance captured by principal components. The results indicate effective feature extraction, with clusters of similar digits observed in the scatter plot.

Uploaded by

editorvar4444

Feature Exploration using PCA on MNIST Dataset

Objective:

The objective of this experiment is to:

- Perform feature exploration using Principal Component Analysis (PCA) on the MNIST dataset.

- Reduce the dimensionality of handwritten digit images while preserving essential features.

- Visualize the reduced features to understand patterns and clusters in the data.

Application Domain:

Feature exploration using PCA is widely used in:

- Computer Vision: Reducing image dimensions for efficient classification and clustering.

- Data Preprocessing: Improving model performance by reducing noise and redundancy.

- Finance: Analyzing trends and patterns in stock market data.

- Healthcare: Identifying clusters in medical images or patient data.

Target:

The target of this lab is to:

- Extract meaningful features from the MNIST dataset using PCA.

- Visualize the extracted features in a 2D space.

- Understand the variance captured by principal components.

Dataset:

- Dataset Used: MNIST Handwritten Digits Dataset

- Description: MNIST consists of 70,000 grayscale images of handwritten digits (0-9), split into 60,000 training images and 10,000 test images. Each image is 28x28 pixels.

- Classes: 10 classes (Digits 0 to 9).

- Source: The dataset is available in TensorFlow/Keras and can be directly loaded using the Keras datasets module.

Dataset Loading Code:

from tensorflow.keras.datasets import mnist

# Load the predefined train/test split
(x_train, y_train), (x_test, y_test) = mnist.load_data()

# Scale pixel values from [0, 255] to [0, 1]
x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.

# Flatten each 28x28 image into a 784-element vector
x_train_flat = x_train.reshape((x_train.shape[0], -1))
x_test_flat = x_test.reshape((x_test.shape[0], -1))

Description:

Principal Component Analysis (PCA) is a dimensionality reduction technique that projects high-dimensional data onto a lower-dimensional space spanned by the directions of maximum variance. In this experiment, we will:

- Flatten the 2D images into 1D vectors.

- Apply PCA to reduce dimensions to 2 principal components.

- Visualize the reduced features in a 2D scatter plot to observe clusters and patterns.
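The flatten-and-project steps listed above can be sketched with NumPy alone. This is a minimal illustration, not the scikit-learn implementation used in this experiment: `pca_project` is a hypothetical helper, and the random array is a synthetic stand-in for the MNIST images so the snippet runs without downloading anything.

```python
import numpy as np

def pca_project(X, n_components):
    """Project the rows of X onto the top principal components (hypothetical helper)."""
    mean = X.mean(axis=0)
    X_centered = X - mean  # PCA operates on mean-centered data
    # Rows of Vt are the principal directions, ordered by decreasing singular value
    _, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
    components = Vt[:n_components]
    explained_variance = S**2 / (X.shape[0] - 1)
    ratio = explained_variance / explained_variance.sum()
    return X_centered @ components.T, components, ratio[:n_components]

rng = np.random.default_rng(0)
images = rng.random((200, 28, 28))          # synthetic stand-in for MNIST images
flat = images.reshape(images.shape[0], -1)  # flatten 28x28 -> 784
projected, components, ratio = pca_project(flat, 2)
print(flat.shape, projected.shape)
```

Each image becomes one row of 784 values, and each row is then reduced to two coordinates along the directions of greatest variance.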

Implementation:

1. Import Libraries:

import numpy as np
import matplotlib.pyplot as plt
from tensorflow.keras.datasets import mnist
from sklearn.decomposition import PCA

2. Load and Preprocess Dataset:

# Load the predefined train/test split
(x_train, y_train), (x_test, y_test) = mnist.load_data()

# Scale pixel values to [0, 1]
x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.

# Flatten each 28x28 image into a 784-element vector
x_train_flat = x_train.reshape((x_train.shape[0], -1))
x_test_flat = x_test.reshape((x_test.shape[0], -1))

3. Apply PCA for Feature Extraction:

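# Fit PCA on the flattened test images and project them onto the first two components
pca = PCA(n_components=2)
x_test_pca = pca.fit_transform(x_test_flat)

# Fraction of the total variance explained by each component
print("Explained variance ratio:", pca.explained_variance_ratio_)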
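The step above fits PCA directly on the test images, and x_train_flat is not used. A common alternative, shown here only as a sketch and not as the method of this experiment, is to learn the components on the training split and reuse them to project the test split. The helpers `fit_pca` and `transform_pca` are hypothetical, and the random arrays stand in for the flattened MNIST splits.

```python
import numpy as np

def fit_pca(X, n_components):
    """Learn the mean and top principal directions from X (hypothetical helper)."""
    mean = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components]

def transform_pca(X, mean, components):
    """Project X using a previously learned mean and set of components."""
    return (X - mean) @ components.T

rng = np.random.default_rng(1)
train = rng.random((300, 784))  # synthetic stand-in for x_train_flat
test = rng.random((100, 784))   # synthetic stand-in for x_test_flat

mean, components = fit_pca(train, 2)
train_2d = transform_pca(train, mean, components)
test_2d = transform_pca(test, mean, components)
print(train_2d.shape, test_2d.shape)
```

With scikit-learn, the same pattern would be `pca.fit(x_train_flat)` followed by `pca.transform(x_test_flat)`.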

4. Visualize Reduced Features:

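# Color each point by its digit label; a categorical colormap keeps the 10 classes distinct
scatter = plt.scatter(x_test_pca[:, 0], x_test_pca[:, 1], c=y_test, cmap='tab10', s=5)
plt.colorbar(scatter, ticks=range(10), label='Digit')
plt.title('Feature Visualization using PCA on MNIST')
plt.xlabel('Principal Component 1')
plt.ylabel('Principal Component 2')
plt.show()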
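To complement the scatter plot with numbers, one option is to compute the mean 2D position of each digit class. The sketch below assumes a hypothetical helper `class_centroids` and uses synthetic points and labels in place of x_test_pca and y_test.

```python
import numpy as np

def class_centroids(points_2d, labels):
    """Mean 2D position per class label (hypothetical helper)."""
    classes = np.unique(labels)
    centroids = np.stack([points_2d[labels == c].mean(axis=0) for c in classes])
    return classes, centroids

rng = np.random.default_rng(3)
labels = rng.integers(0, 10, size=1000)
# Synthetic stand-in for x_test_pca: each class is drawn around its own center
centers = rng.normal(scale=5.0, size=(10, 2))
points = centers[labels] + rng.normal(scale=1.0, size=(1000, 2))

classes, centroids = class_centroids(points, labels)
for c, (pc1, pc2) in zip(classes, centroids):
    print(f"digit {c}: PC1={pc1:.2f}, PC2={pc2:.2f}")
```

Centroids that lie close together point to digit classes that overlap in the 2D projection.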

Output (Complete):

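- Explained Variance Ratio: The fraction of the total variance captured by each of the two principal components.

Example Output:

Explained variance ratio: [0.097, 0.084]

In this example the two components together retain about 18% of the total variance, so the 2D projection is a coarse summary of the 784-dimensional images rather than a faithful representation of them.

- Feature Visualization: The scatter plot shows clusters of digits in the reduced feature space. Digits with similar shapes (e.g., 0s and 6s) are grouped together, which suggests that the first two components capture part of the structure that distinguishes digit shapes.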
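The explained variance ratio can also be read as a statement about reconstruction error: the variance not retained by the first k components equals the relative squared error of rebuilding the centered data from those k components. The sketch below checks this identity with NumPy on synthetic data. The data, the value of k, and all variable names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
# Synthetic data with a few dominant directions (stand-in for flattened images)
latent = rng.normal(size=(500, 5)) * np.array([5.0, 3.0, 1.0, 0.5, 0.2])
mixing = rng.normal(size=(5, 50))
X = latent @ mixing + 0.1 * rng.normal(size=(500, 50))

mean = X.mean(axis=0)
Xc = X - mean
_, S, Vt = np.linalg.svd(Xc, full_matrices=False)
ratio = S**2 / np.sum(S**2)  # explained variance ratio per component

k = 2
components = Vt[:k]
# Project onto k components, then map back to the original space
X_rec = (Xc @ components.T) @ components + mean
rel_error = np.sum((X - X_rec) ** 2) / np.sum(Xc ** 2)
retained = ratio[:k].sum()
print(f"retained variance: {retained:.3f}, relative reconstruction error: {rel_error:.3f}")
```

The two printed values sum to 1, so a low retained variance means that most of the pixel-level detail is lost in the projection.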
Conclusion:

- PCA reduces the MNIST images from 784 dimensions to 2 while keeping the directions of greatest variance.

- The clusters observed in the 2D scatter plot indicate that digits with similar shapes are grouped together, showing the usefulness of PCA for feature exploration.

- This experiment demonstrates PCA as an unsupervised technique for feature extraction and visualization: the digit labels are used only to color the plot, not to compute the components.

Future Enhancements:

- Increase the number of principal components to capture more variance.

- Experiment with other dimensionality reduction techniques like t-SNE or UMAP for better visualization.

- Apply this method to other image datasets such as CIFAR-10 or Fashion MNIST.
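For the first enhancement, one way to choose how many components to keep is to look at the cumulative explained variance and take the smallest count that reaches a target. The sketch below is a NumPy-only illustration. `components_for_variance` is a hypothetical helper, the 95% target is an arbitrary example, and the data is synthetic.

```python
import numpy as np

def components_for_variance(X, target):
    """Smallest number of components whose cumulative variance ratio reaches target."""
    Xc = X - X.mean(axis=0)
    S = np.linalg.svd(Xc, compute_uv=False)
    cumulative = np.cumsum(S**2) / np.sum(S**2)
    # Index of the first cumulative ratio >= target, converted to a component count
    k = int(np.searchsorted(cumulative, target)) + 1
    return min(k, len(S)), cumulative

rng = np.random.default_rng(4)
scales = np.linspace(3.0, 0.1, 30)          # per-feature spread, largest first
X = rng.normal(size=(400, 30)) * scales     # synthetic stand-in for flattened images

k, cumulative = components_for_variance(X, 0.95)
print(f"{k} components retain {cumulative[k - 1]:.1%} of the variance")
```

scikit-learn offers a similar shortcut: passing a float between 0 and 1 as n_components to PCA selects the number of components needed to explain that fraction of the variance.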

