0% found this document useful (0 votes)

30 views5 pages

Mla 3

This report explores applying K-Means clustering to the MNIST dataset of handwritten digits after performing dimensionality reduction via PCA. The code loads the MNIST data, performs PCA to reduce dimensions to 2, standardizes features, runs K-Means clustering with 3 clusters, and visualizes the results by plotting the clusters and outliers. The analysis demonstrates clustering techniques can uncover inherent structures in high-dimensional data and reveals distinct digit clusters in the MNIST data with robust handling of outliers.

Uploaded by

Renat Zhamilov

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views5 pages

Mla 3

Uploaded by

Renat Zhamilov

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

ASSIGNMENT

REPORT
HOMEWORK 3
Machine Learning Algorithms
(work name)

Student’s names: Zhamilov Renat

Group: CS-2201
Period: 14.02.2024 – 20.02.2024
Date: 19.02.2024
Supervisor: Aigul B. Mimenbayeva

Astana IT University, 2024

1. Introduction:
This report explores the application of K-Means clustering to the
MNIST dataset, aiming to identify patterns within handwritten digit
images. By leveraging Principal Component Analysis (PCA) for
dimensionality reduction, the code partitions the dataset into clusters
and visualizes the clustering results. Through this analysis, we aim to
demonstrate the effectiveness of clustering techniques in uncovering
inherent structures within high-dimensional datasets.
2. Procedure
This code performs clustering on the MNIST dataset using K-Means
algorithm after reducing the dimensionality of the data using Principal
Component Analysis (PCA). Let's break it down step by step:

Import Libraries: The necessary libraries are imported including

numpy for numerical operations, matplotlib for plotting, PCA, KMeans,
and StandardScaler from scikit-learn, and the MNIST dataset from
TensorFlow.

Load MNIST Dataset: The MNIST dataset is loaded using

mnist.load_data() function from TensorFlow Keras. This dataset consists
of 28x28 grayscale images of handwritten digits (0 through 9) and their
corresponding labels.

Display Sample Images: Some sample images from the dataset are
displayed using matplotlib for visualization purposes.

Prepare Data for Clustering:

- The training images are reshaped into a 2D array where each row
represents a flattened image (28x28 = 784 pixels).

- The pixel values are normalized to the range [0, 1] by dividing by

255.0.
Principal Component Analysis (PCA):

- PCA is applied to reduce the dimensionality of the dataset to 2

dimensions (n_components=2).

- PCA helps in visualizing high-dimensional data in a lower-dimensional

space while preserving the variance as much as possible.

Standardize Features:

- The features obtained from PCA are standardized using StandardScaler

to have zero mean and unit variance.

Perform K-Means Clustering:

- K-Means clustering is applied to the standardized features.

- The number of clusters is set to 3 (n_clusters=3).

Visualize Clusters:

- Scatter plot is created where each point represents a data point in the
reduced 2D space.

- Points are colored based on their assigned cluster label.

- Noisy points (outliers) are marked separately.

- The title of the plot includes the number of clusters and the number of
noisy points.

- Print Cluster Information:

- The number of clusters and the number of noisy points (outliers) are
printed.

3. Code
4. Conclusion:
Clustering algorithms like K-Means offer valuable insights into the
MNIST dataset, aiding in digit recognition and classification tasks. PCA
effectively reduces dimensionality for visualization. Our analysis reveals
distinct digit clusters and robust handling of outliers. Future research
could explore parameter tuning and alternative algorithms for improved
performance.

This study highlights clustering's role in understanding complex

datasets, facilitating data-driven applications in various domains.
Link:
https://colab.research.google.com/drive/1pwVt3uDCkKCdKUfi4xE_RgbY
wOkplBoa?usp=sharing

Bachelor of Education Primary Program Code 3114 PDF
No ratings yet
Bachelor of Education Primary Program Code 3114 PDF
1 page
B.SC Nursing Medical Surgical Nursing - I Unit: Iv - Nursing Management of Patients With Disorders of Digestive System Portal Hypertension
100% (1)
B.SC Nursing Medical Surgical Nursing - I Unit: Iv - Nursing Management of Patients With Disorders of Digestive System Portal Hypertension
32 pages
Synopsis - Vinay Mohan
No ratings yet
Synopsis - Vinay Mohan
3 pages
Science 4 Q4 W3
No ratings yet
Science 4 Q4 W3
6 pages
Intro
No ratings yet
Intro
32 pages
Swami Vivekananda and Human Excellence - A Book Summary
100% (2)
Swami Vivekananda and Human Excellence - A Book Summary
6 pages
Request For Sound System
100% (6)
Request For Sound System
2 pages
CJR Elite 16 B
No ratings yet
CJR Elite 16 B
32 pages
Movie Review Doctors in The Barrios
No ratings yet
Movie Review Doctors in The Barrios
2 pages
Introduction To Machine Learning Prof. Anirban Santara Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
No ratings yet
Introduction To Machine Learning Prof. Anirban Santara Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
15 pages
Assignment # 1: Performance Timeline of Flynn Taxonomy
No ratings yet
Assignment # 1: Performance Timeline of Flynn Taxonomy
21 pages
AI Ass 2
No ratings yet
AI Ass 2
32 pages
In Memoriam - Sunil Dua
No ratings yet
In Memoriam - Sunil Dua
64 pages
Coland Systems Technolgy College, Inc
No ratings yet
Coland Systems Technolgy College, Inc
4 pages
An Overview of Needs Assessment in ESP by Kay Westerfield
No ratings yet
An Overview of Needs Assessment in ESP by Kay Westerfield
5 pages
Designing Online Learning Modules in Kinesiology
No ratings yet
Designing Online Learning Modules in Kinesiology
7 pages
Kernel Principal Component Analysis and Its Applications in Face Recognition and Active Shape Models
No ratings yet
Kernel Principal Component Analysis and Its Applications in Face Recognition and Active Shape Models
9 pages
Lab 8
No ratings yet
Lab 8
8 pages
Faculty Application Form: Mepco Schlenk Engineering College, Sivakasi
No ratings yet
Faculty Application Form: Mepco Schlenk Engineering College, Sivakasi
5 pages
The Middle Jurassic Oseberg Delta, Northern North Sea: A Sedimentological and Sequence Stratigraphic
No ratings yet
The Middle Jurassic Oseberg Delta, Northern North Sea: A Sedimentological and Sequence Stratigraphic
5 pages
Examples of A Thesis Literature Review
100% (1)
Examples of A Thesis Literature Review
4 pages
Mloa Exp2 C121
No ratings yet
Mloa Exp2 C121
20 pages
ABSTRACT - Career Guidance Program
No ratings yet
ABSTRACT - Career Guidance Program
4 pages
Ela - The Quilt Story Lesson Plan
No ratings yet
Ela - The Quilt Story Lesson Plan
3 pages
RACMA Approved Masters Programs - 2020
No ratings yet
RACMA Approved Masters Programs - 2020
1 page
Unit 3 - MLnotes-WPS Office
No ratings yet
Unit 3 - MLnotes-WPS Office
18 pages
Chanakya Brochure Email
No ratings yet
Chanakya Brochure Email
4 pages
Lab Assignment 7: Nishiv Singh (B20MT029) Google Colab Notebooks Link: Task 1
No ratings yet
Lab Assignment 7: Nishiv Singh (B20MT029) Google Colab Notebooks Link: Task 1
3 pages
Eigenfaces and Fisherfaces For Face Recognition
No ratings yet
Eigenfaces and Fisherfaces For Face Recognition
6 pages
Face Recognition Using PCA
No ratings yet
Face Recognition Using PCA
8 pages
Image Segmentation Using Clustering (Texture With PCA)
No ratings yet
Image Segmentation Using Clustering (Texture With PCA)
25 pages
Brent William
No ratings yet
Brent William
173 pages
Week 3
No ratings yet
Week 3
12 pages
Critical Urban Theory
No ratings yet
Critical Urban Theory
23 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
MLP - Week 5 - MNIST - Perceptron - Ipynb - Colaboratory
No ratings yet
MLP - Week 5 - MNIST - Perceptron - Ipynb - Colaboratory
31 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
Comenius Report
No ratings yet
Comenius Report
5 pages
B22EE010 Report
No ratings yet
B22EE010 Report
9 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
A COMPLETE GUIDE TO PRINCIPAL COMPONENT ANALYSIS in ML 1598272724
No ratings yet
A COMPLETE GUIDE TO PRINCIPAL COMPONENT ANALYSIS in ML 1598272724
16 pages
Pca&kmean
No ratings yet
Pca&kmean
6 pages
Dvpd11 Merged Merged 27 83
No ratings yet
Dvpd11 Merged Merged 27 83
57 pages
Banknote Authentication
100% (1)
Banknote Authentication
3 pages
Practical 5
No ratings yet
Practical 5
6 pages
4 B.Sc. (N.M)
No ratings yet
4 B.Sc. (N.M)
38 pages
AbidAdhikari26840 DWDM
No ratings yet
AbidAdhikari26840 DWDM
43 pages
Project LA
No ratings yet
Project LA
13 pages
Esam - DWM Lab 8
No ratings yet
Esam - DWM Lab 8
5 pages
ML Assignment-10
No ratings yet
ML Assignment-10
5 pages
Predictivemaintenance FaultDetection
No ratings yet
Predictivemaintenance FaultDetection
12 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
11 pages
Assignment 2 Documentation
No ratings yet
Assignment 2 Documentation
15 pages
DM Ass03
No ratings yet
DM Ass03
5 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
34 pages
Clinical Ward Rotation Final Year MBBS (New File)
No ratings yet
Clinical Ward Rotation Final Year MBBS (New File)
1 page
Cvresearchpaperfinalfinal
No ratings yet
Cvresearchpaperfinalfinal
5 pages
IDM Assignment
No ratings yet
IDM Assignment
15 pages
ML - Aat - Report 1
No ratings yet
ML - Aat - Report 1
8 pages
Love Report 1
No ratings yet
Love Report 1
10 pages
Colonial Colleges - Wikipedia
No ratings yet
Colonial Colleges - Wikipedia
55 pages
# Mix Data Into A 100-Dimensional State: Print
No ratings yet
# Mix Data Into A 100-Dimensional State: Print
25 pages
Assignment4 CH5650 CH21B112
No ratings yet
Assignment4 CH5650 CH21B112
3 pages
Talent Management
No ratings yet
Talent Management
35 pages
Drawback of Standard K-Means Algorithm
No ratings yet
Drawback of Standard K-Means Algorithm
5 pages
Classifying Chinese Handwritten Numerals Using Machine Learning Classification Methods
No ratings yet
Classifying Chinese Handwritten Numerals Using Machine Learning Classification Methods
13 pages
DALT7011 Assignment Report
No ratings yet
DALT7011 Assignment Report
11 pages
New Feature Exploration PCA MNIST
No ratings yet
New Feature Exploration PCA MNIST
4 pages
Feature Exploration PCA MNIST
No ratings yet
Feature Exploration PCA MNIST
4 pages
CSE316 221D14 LabReport04 kMeansClustering
No ratings yet
CSE316 221D14 LabReport04 kMeansClustering
7 pages
Lab6 Instruction
No ratings yet
Lab6 Instruction
3 pages
Untitled Document-2-1-13-7-11.4
No ratings yet
Untitled Document-2-1-13-7-11.4
5 pages
MNIST Autoencoder KMeans Report
No ratings yet
MNIST Autoencoder KMeans Report
2 pages
51 DA5400 - FML51 - 20250501 ProblemSet06
No ratings yet
51 DA5400 - FML51 - 20250501 ProblemSet06
4 pages
Week 8 DS Practical
No ratings yet
Week 8 DS Practical
13 pages
What Is PCA?: Image Source
No ratings yet
What Is PCA?: Image Source
17 pages
DCRUST B.tech First Counseling Results
No ratings yet
DCRUST B.tech First Counseling Results
72 pages
Cheat Sheet-Building Unsupervised Learning Models
No ratings yet
Cheat Sheet-Building Unsupervised Learning Models
3 pages
Calendar 2024 2025
No ratings yet
Calendar 2024 2025
2 pages
20 ENG 016 Assignment 8
No ratings yet
20 ENG 016 Assignment 8
4 pages
Assignment8 ML)
No ratings yet
Assignment8 ML)
4 pages
CSE455/CSE552 Machine Learning (Spring 2024) Homework #3: Hand-In Policy Collaboration Policy Grading
No ratings yet
CSE455/CSE552 Machine Learning (Spring 2024) Homework #3: Hand-In Policy Collaboration Policy Grading
2 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet

Mla 3

Uploaded by

Mla 3

Uploaded by

ASSIGNMENT

Student’s names: Zhamilov Renat

Astana IT University, 2024

Import Libraries: The necessary libraries are imported including

Load MNIST Dataset: The MNIST dataset is loaded using

Prepare Data for Clustering:

- The pixel values are normalized to the range [0, 1] by dividing by

- PCA is applied to reduce the dimensionality of the dataset to 2

- PCA helps in visualizing high-dimensional data in a lower-dimensional

- The features obtained from PCA are standardized using StandardScaler

Perform K-Means Clustering:

- K-Means clustering is applied to the standardized features.

- Points are colored based on their assigned cluster label.

- Noisy points (outliers) are marked separately.

- Print Cluster Information:

This study highlights clustering's role in understanding complex

You might also like