Seminar 10

This document provides code snippets and explanations for clustering handwritten digit data using k-means clustering. The code first loads handwritten digit data and finds k-means clusters. It then interprets the 10 cluster centers as prototype digits and plots them. Accuracy is calculated by assigning each datapoint to the most common label of its cluster. A confusion matrix plots the accuracy of this clustering-based digit labeling.

Uploaded by

Nishad Ahamed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

336 views3 pages

Seminar 10

Uploaded by

Nishad Ahamed

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

UWL SCE S2

L-6 Databases and Analytics (CP60056E)

Seminar 10

This relates to Lecture 9

Exercise 1 Using your preferred editor (colab is recommended) to fill the snippet gaps.
The following is a simple demonstration of using WSS to decide and plot the clusters
based on k-means clusters algorithm.

%% Import the necessary packages

%
import numpy as np
import pandas as pd
from matplotlib import pyplot as plt
from sklearn.datasets.samples_generator import make_blobs
from sklearn.cluster import KMeans

%% Generate 6 artificial clusters for illustration purpose

%% Hint: you may need to use make_blobs and scatter functions: check the Python
%% official resources for more information of their usages
%
Insert your code block here

%% Implement the WSS method and check through the number of clusters from 1
%% to 12, and plot the figure of WSS vs. number of clusters.
%% Hint: reference the plots in the lecture slides;
%% You may need to use inertia_ from property WCSS, and kmeans function
%
wcss = []
for i in range(1, 12):
kmeans = KMeans(n_clusters=i, init='k-means++', max_iter=300, n_init=10,
random_state=0)
Insert your code block here

%% Categorize the data using the optimum number of clusters (6)

%% we determined in the last step. Plot the fitting results
%% Hint: you may need to call fit_predict from kmeans; scatter
%
kmeans = KMeans(n_clusters=6, init='k-means++', max_iter=300, n_init=10,
random_state=0)
Insert your code block here
plt.scatter(X[:,0], X[:,1])
plt.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1], s=300,
c='red')
plt.show()
1
UWL SCE S2

Exercise 2 For the following code blocks and plots, run the code first; then provide your
interpretation/explanation for the required parts.

k-means on digits
We will attempt to use k-means to try to identify similar digits without using the original
label information; this might be similar to a first step in extracting meaning from a new
dataset about which you don't have any a priori label information.
We will start by loading the digits and then finding the k-Means clusters. The digits
consist of 1,797 samples with 64 features, where each of the 64 features is the
brightness of one pixel in an 8×8 image.

import seaborn as sns; sns.set() # for plot styling

from sklearn.datasets import load_digits
digits = load_digits()
digits.data.shape

## Provide your interpretation/explanation for the following block

#
kmeans = KMeans(n_clusters=10, random_state=0)
clusters = kmeans.fit_predict(digits.data)
kmeans.cluster_centers_.shape

## Provide your interpretation/explanation for the following block

#
fig, ax = plt.subplots(2, 5, figsize=(8, 3))
centers = kmeans.cluster_centers_.reshape(10, 8, 8)
for axi, center in zip(ax.flat, centers):
axi.set(xticks=[], yticks=[])
axi.imshow(center, interpolation='nearest', cmap=plt.cm.binary)

from scipy.stats import mode

labels = np.zeros_like(clusters)
for i in range(10):
mask = (clusters == i)
labels[mask] = mode(digits.target[mask])[0]

from sklearn.metrics import accuracy_score

accuracy_score(digits.target, labels)

## Provide your interpretation/explanation for the following block

2
UWL SCE S2

#
from sklearn.metrics import confusion_matrix
mat = confusion_matrix(digits.target, labels)
sns.heatmap(mat.T, square=True, annot=True, fmt='d', cbar=False,
xticklabels=digits.target_names,
yticklabels=digits.target_names)
plt.xlabel('true label')
plt.ylabel('predicted label');

Pattern Recognition Lab
No ratings yet
Pattern Recognition Lab
24 pages
NemoSens Briefsheet 008
No ratings yet
NemoSens Briefsheet 008
2 pages
SE KMeansClustering
No ratings yet
SE KMeansClustering
21 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
Assignment 6
No ratings yet
Assignment 6
4 pages
Tutorial 8
No ratings yet
Tutorial 8
12 pages
K.means Clustering
No ratings yet
K.means Clustering
8 pages
Experiment 3.1 K-Mean
No ratings yet
Experiment 3.1 K-Mean
8 pages
K-Means Algorithm
No ratings yet
K-Means Algorithm
29 pages
KDD WS 24 25 E4 Clustering I
No ratings yet
KDD WS 24 25 E4 Clustering I
2 pages
ML0101EN Clus DBSCN Weather Py v1
No ratings yet
ML0101EN Clus DBSCN Weather Py v1
16 pages
Maxbox - Starter68 Machine Learning
No ratings yet
Maxbox - Starter68 Machine Learning
5 pages
Clustering in Python-Dr. Afsaneh Javadi
No ratings yet
Clustering in Python-Dr. Afsaneh Javadi
8 pages
Practice Exam - Gradescope Ver.
No ratings yet
Practice Exam - Gradescope Ver.
19 pages
Pranav ML-8
No ratings yet
Pranav ML-8
4 pages
K Means
No ratings yet
K Means
3 pages
Maxbox Starter60 Machine Learning
No ratings yet
Maxbox Starter60 Machine Learning
8 pages
CS 611 Slides 4
No ratings yet
CS 611 Slides 4
25 pages
ML 3
No ratings yet
ML 3
24 pages
51 DA5400 - FML51 - 20250501 ProblemSet06
No ratings yet
51 DA5400 - FML51 - 20250501 ProblemSet06
4 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
20 pages
Data Science Analysis Final Project
No ratings yet
Data Science Analysis Final Project
10 pages
Lab Manual ML
No ratings yet
Lab Manual ML
23 pages
Chapter 4
No ratings yet
Chapter 4
30 pages
Detecting Patterns With Unsupervised Learning
No ratings yet
Detecting Patterns With Unsupervised Learning
21 pages
AIML Short Term Internship Session 9 Summary-1719044709410
No ratings yet
AIML Short Term Internship Session 9 Summary-1719044709410
14 pages
ML Lab Manual
No ratings yet
ML Lab Manual
24 pages
Record
No ratings yet
Record
23 pages
20 ENG 016 Assignment 8
No ratings yet
20 ENG 016 Assignment 8
4 pages
M PDF
No ratings yet
M PDF
13 pages
2.3 Aiml Rishit
No ratings yet
2.3 Aiml Rishit
7 pages
Application of Linear Algebra
No ratings yet
Application of Linear Algebra
7 pages
ML - LAB 2 - Jupyter Notebook
No ratings yet
ML - LAB 2 - Jupyter Notebook
9 pages
01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Classification Algorithms I
No ratings yet
Classification Algorithms I
14 pages
DMDW Lab8
No ratings yet
DMDW Lab8
3 pages
AI Unit 5
No ratings yet
AI Unit 5
103 pages
K - Means Clustering and Related Algorithms: Ryan P. Adams COS 324 - Elements of Machine Learning Princeton University
No ratings yet
K - Means Clustering and Related Algorithms: Ryan P. Adams COS 324 - Elements of Machine Learning Princeton University
18 pages
Assignment 6 ML
No ratings yet
Assignment 6 ML
4 pages
ML Lab Manual Completed
No ratings yet
ML Lab Manual Completed
56 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
Baidurya Debnath 4
No ratings yet
Baidurya Debnath 4
37 pages
Yunsu Han KNN K Means
No ratings yet
Yunsu Han KNN K Means
8 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
Task 2
No ratings yet
Task 2
3 pages
Region Segmentation Readings: Chapter 10: 10.1 Additional Materials Provided
No ratings yet
Region Segmentation Readings: Chapter 10: 10.1 Additional Materials Provided
47 pages
Machine Learning (ML)
No ratings yet
Machine Learning (ML)
35 pages
Drawback of Standard K-Means Algorithm
No ratings yet
Drawback of Standard K-Means Algorithm
5 pages
TD4 Unsupervised Machine Learning
No ratings yet
TD4 Unsupervised Machine Learning
10 pages
Introduction To (Statistical) Machine Learning
No ratings yet
Introduction To (Statistical) Machine Learning
30 pages
Machine Learning Programs
No ratings yet
Machine Learning Programs
10 pages
Python DM Lab Manual Part 2
No ratings yet
Python DM Lab Manual Part 2
8 pages
Clustering
No ratings yet
Clustering
1 page
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
18 pages
DM Lab Internal
No ratings yet
DM Lab Internal
37 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
Data Structures and Algorithm - Mid Term
No ratings yet
Data Structures and Algorithm - Mid Term
2 pages
DivUps Proposal University
No ratings yet
DivUps Proposal University
16 pages
MBA - Assignment
No ratings yet
MBA - Assignment
1 page
Assignment 4-1
No ratings yet
Assignment 4-1
6 pages
Hacking CDDVDBlu-ray For Fun and Scientific Research
No ratings yet
Hacking CDDVDBlu-ray For Fun and Scientific Research
71 pages
Disposition Plan: United States Mint
No ratings yet
Disposition Plan: United States Mint
12 pages
Introduction To Machine Learning PART 1
No ratings yet
Introduction To Machine Learning PART 1
6 pages
IHHA sts2011 - Turner
No ratings yet
IHHA sts2011 - Turner
9 pages
Operational Amplifier
No ratings yet
Operational Amplifier
18 pages
Natural Science and History Museum
No ratings yet
Natural Science and History Museum
24 pages
Sketchup 1pp PDF
No ratings yet
Sketchup 1pp PDF
115 pages
The Effect of Controlled Permeable Formwork Liner On The Mechanical Properties of Concrete
No ratings yet
The Effect of Controlled Permeable Formwork Liner On The Mechanical Properties of Concrete
11 pages
B - Com - II Money and Financial System Additional Sub Point
No ratings yet
B - Com - II Money and Financial System Additional Sub Point
32 pages
The Virtual File System (VFS)
No ratings yet
The Virtual File System (VFS)
60 pages
Running Head: DATA STRUCTURES 1: Course: Project Name: Student Name: Date
No ratings yet
Running Head: DATA STRUCTURES 1: Course: Project Name: Student Name: Date
7 pages
Emerging ICT Technologies and Cybersecurity: Kutub Thakur Al-Sakib Khan Pathan Sadia Ismat
No ratings yet
Emerging ICT Technologies and Cybersecurity: Kutub Thakur Al-Sakib Khan Pathan Sadia Ismat
291 pages
B11 - B12 - B13 - 0141 - MAT2002 - 100318 - Dr. Sheerin Kayenat - Fall 22-23 - TEE
No ratings yet
B11 - B12 - B13 - 0141 - MAT2002 - 100318 - Dr. Sheerin Kayenat - Fall 22-23 - TEE
2 pages
Block Diagram
No ratings yet
Block Diagram
6 pages
CP R80 CheckPoint API ReferenceGuide
No ratings yet
CP R80 CheckPoint API ReferenceGuide
6 pages
Option 1 Project Management Issues and Concerns About The Project Timeline
No ratings yet
Option 1 Project Management Issues and Concerns About The Project Timeline
8 pages
Solar Dryer
No ratings yet
Solar Dryer
25 pages
High School Students' Perceptions of Motivations For Cyberbullying An Exploratory Study
No ratings yet
High School Students' Perceptions of Motivations For Cyberbullying An Exploratory Study
6 pages
Iot 220112132928
No ratings yet
Iot 220112132928
31 pages
0613 CT 0001
No ratings yet
0613 CT 0001
180 pages
Spider-Man 3
No ratings yet
Spider-Man 3
19 pages
Chapter 19 - Continual Improvement Methods With Six Sigma and Lean
No ratings yet
Chapter 19 - Continual Improvement Methods With Six Sigma and Lean
8 pages
Message
No ratings yet
Message
30 pages
Activo PD503 004
No ratings yet
Activo PD503 004
4 pages
9852 2340 01b Manual Cement Unit Boltec M & L RCS 4.5
No ratings yet
9852 2340 01b Manual Cement Unit Boltec M & L RCS 4.5
56 pages
Fundamentals of Mathematics Unit 2 - V1
No ratings yet
Fundamentals of Mathematics Unit 2 - V1
21 pages
DL QB With Ans
No ratings yet
DL QB With Ans
38 pages
CEM How To - Final
No ratings yet
CEM How To - Final
84 pages
3.1 Usage of Ajax and Json
No ratings yet
3.1 Usage of Ajax and Json
18 pages