Digital Image Processing

This document proposes an unsupervised deep image clustering (DIC) method for image segmentation. It consists of two parts: 1) a feature transformation subnetwork that extracts features from the image using a CNN architecture, and 2) a deep clustering subnetwork that iteratively clusters the features into segments. The method is tested on the Berkeley Segmentation Dataset and achieves state-of-the-art performance according to multiple evaluation metrics, outperforming methods like K-means, mean-shift, and normalized cuts. Visual results also show it effectively merges similar regions and separates diverse ones.

Uploaded by

Unaixa Khan



Unsupervised Image Segmentation

using Deep Image Clustering

Monazza Qadeer Khan

206154
Introduction
• Object segmentation is the most vital operation in image processing
techniques prior to image analysis

• Object segmentation is a challenging problem in the field of computer
vision and it has been widely applied in areas such as object recognition
and image classification

• Generally speaking, object segmentation methods can be divided into
three categories: unsupervised, semi-supervised and fully supervised.
Introduction
• In fully supervised segmentation, an accurately labeled training dataset is
used

• In unsupervised segmentation, there are no ground truth labels

• The focus of this project is on unsupervised image segmentation

• It has two parts: extraction of features from the given image and division of
the image into different regions
Supervised vs. Unsupervised
Problem Statement
• Conventional clustering methods such as K-means, active contour,
normalized cut, MLSS and SAS can be used for segmentation
• These methods have two principal drawbacks: they are sensitive to
segmentation parameters such as the number of clusters, and the whole
procedure is complex and cannot be optimized easily
• To address this, a deep image clustering (DIC) network is designed and
implemented
• It consists of a feature transformation subnetwork and a
differentiable deep clustering subnetwork; it divides the image space
into different clusters
Objectives
• Encouraged by neural networks’ flexibility and their ability to model intricate
patterns, an unsupervised segmentation framework based on a novel deep image
clustering (DIC) model is proposed
• The DIC consists of a feature transformation subnetwork (FTS) and a trainable
deep clustering subnetwork (DCS) for unsupervised image clustering
• FTS is built on a simple and capable network architecture
• DCS can assign pixels with different cluster numbers by updating cluster
associations and cluster centers iteratively
Material
• Extensive experiments have been conducted on the Berkeley Segmentation
Database
• The experimental results show that DCS is more effective in aggregating features
during the clustering procedure
• DIC has also proven to be less sensitive to varying segmentation parameters and
to have lower computation cost
• DIC can achieve significantly better segmentation performance compared to the
state-of-the-art techniques
Material
Berkeley Segmentation Dataset (BSD)
• The dataset consists of 500 natural images, ground-truth human annotations
and benchmarking code
• The data is explicitly separated into disjoint train, validation and test
subsets
• The dataset is an extension of the BSDS300, where the original 300 images
are used for training / validation and 200 fresh images, together with human
annotations, are added for testing
• Each image was segmented by five different subjects on average
Flow Diagram
Illustration of the proposed DIC framework for unsupervised image segmentation. DIC
consists of an FTS and a DCS, and it is trained with an iterative refinement loss.
Methodology
Unsupervised image segmentation
• Includes technical details like preprocessing steps, features, how they
are extracted, their visualization, model training and testing
• Deep image clustering model consists of two modules:
1. a subnetwork for feature extraction
2. and a deep clustering subnetwork
• Super-pixel guided iterative refinement loss
• Over-fitting training protocol optimizing the network parameters in an end-to-end
way
Methodology
1. Network architecture for Feature Transformation subnetwork (FTS)

• An autoencoder architecture with a skip connection is used to
construct the feature transformation subnetwork (FTS)

• The CNN for feature extraction is composed of a series of convolution layers
interleaved with batch normalization (BN) and ReLU activations

• FTS consists of six convolution blocks, one max-pooling operation, one
deconvolution operation and a simple convolution operation.
Methodology
• We use max-pooling, which downsamples the input by a factor of 2, after the
2nd convolution block to increase the receptive field
• Then the 4th convolution block outputs are up-sampled by deconvolution and
concatenated with the 2nd convolution block outputs to pass onto the 5th
convolution block
• After the 6th convolution block and the simple convolution block, feature Y
with dimension C is generated
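The data flow described above can be checked with a small shape calculation. The sketch below is not the authors' implementation: it only tracks tensor shapes (channels, height, width) through the six convolution blocks, the max-pooling, the deconvolution and the final convolution. The per-block channel widths in `widths` and the value passed as `c_out` are assumptions chosen for illustration, since the slides only state that each block outputs 64, 128 or 192 channels. It also assumes padded 3×3 convolutions that preserve spatial size and an input with even height and width.

```python
def trace_fts_shapes(height, width, c_out,
                     widths=(64, 128, 128, 192, 128, 64)):
    """Trace (channels, H, W) through the FTS as described in the slides.

    `widths` is an assumed channel assignment for the six convolution
    blocks; the convolutions are assumed to keep H and W unchanged.
    """
    if height % 2 or width % 2:
        raise ValueError("this sketch assumes even height and width")

    shapes = []
    ch, h, w = 3, height, width              # RGB input
    shapes.append(("input", (ch, h, w)))

    # Convolution blocks 1 and 2 run at full resolution.
    for i in (0, 1):
        ch = widths[i]
        shapes.append((f"CB{i + 1}", (ch, h, w)))
    skip_ch = ch                              # CB2 output is reused later

    # Max-pooling with factor 2 after the 2nd convolution block.
    h_half, w_half = h // 2, w // 2
    shapes.append(("MP", (ch, h_half, w_half)))

    # Convolution blocks 3 and 4 run at half resolution.
    for i in (2, 3):
        ch = widths[i]
        shapes.append((f"CB{i + 1}", (ch, h_half, w_half)))

    # Deconvolution up-samples by 2; the result is concatenated
    # channel-wise with the CB2 output.
    shapes.append(("DC", (ch, h, w)))
    ch = ch + skip_ch
    shapes.append(("concat", (ch, h, w)))

    # Convolution blocks 5 and 6, then the final convolution to C channels.
    for i in (4, 5):
        ch = widths[i]
        shapes.append((f"CB{i + 1}", (ch, h, w)))
    shapes.append(("final conv", (c_out, h, w)))
    return shapes


if __name__ == "__main__":
    for name, shape in trace_fts_shapes(320, 480, c_out=100):
        print(f"{name:>10}: {shape}")
```

The concatenation step is the reason the 5th block sees more input channels than the 4th block produces: it receives the up-sampled 4th-block output together with the 2nd-block output.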
Methodology
• We use 3×3 convolution filters with the number of output channels set to 64,
128 or 192 in each block, except the last CNN layer which outputs C channels

• The resulting C-dimensional features Y can be taken as coarse cluster
associations

• In order to aggregate the features more effectively, Y will be passed onto the
following deep clustering module that iteratively updates the pixel-cluster
associations and cluster centers for 𝜏 iterations
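One way to read "Y as coarse cluster associations" is to give each pixel the index of its strongest channel. The snippet below is a minimal sketch of that reading; the function name `coarse_labels` and the list-of-lists layout are assumptions made for illustration, not part of the original slides.

```python
def coarse_labels(features):
    """Return, for each pixel, the index of its largest feature channel.

    `features` is a list of N per-pixel feature vectors of length C.
    """
    return [max(range(len(f)), key=f.__getitem__) for f in features]


if __name__ == "__main__":
    y = [[0.1, 0.7, 0.2],
         [0.9, 0.05, 0.05]]
    print(coarse_labels(y))
```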
Methodology
The flowchart of the feature transformation subnetwork.

• Convolution block (CB): 3×3 convolution, batch normalization and ReLU

• Max-pooling (MP) with factor 2
• Deconvolution (DC), which up-samples features by a factor of 2
Methodology
2. Deep Clustering Subnetwork

• First, the extracted feature Y is flattened to dimension N × C, where
N = H × W, H is the height of the image, W is the width of the image and C is the
channel number or super-pixel number (SPN). Then a neural-network-based
clustering procedure is designed

• The cluster centers Ω are defined as the initializations for feature
clustering. The cluster centers are denoted Ω = {Ω1, Ω2, Ω3, …, ΩM}, where M is
the default number of clusters and each Ωi has dimension C × 1
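The flattening step and the shape of the cluster centers can be illustrated with plain Python lists. This is a sketch under stated assumptions: the slides do not say how the centers are initialized, so `init_centers` simply picks M evenly spaced feature vectors, and both function names are hypothetical.

```python
def flatten_features(y):
    """Flatten an H x W x C nested list into N x C, with N = H * W."""
    return [list(pixel) for row in y for pixel in row]


def init_centers(features, m):
    """Pick M evenly spaced feature vectors as initial centers.

    Assumed initialization for illustration only; requires m <= N.
    Each center has C entries, matching the C x 1 dimension of each center.
    """
    n = len(features)
    if not 0 < m <= n:
        raise ValueError("need 0 < m <= number of pixels")
    step = n / m
    return [list(features[int(i * step)]) for i in range(m)]


if __name__ == "__main__":
    # A 2 x 3 image with C = 2 channels per pixel.
    y = [[[0.0, 1.0], [0.1, 0.9], [0.2, 0.8]],
         [[0.9, 0.1], [0.8, 0.2], [0.7, 0.3]]]
    flat = flatten_features(y)
    print(len(flat), len(flat[0]))
    print(init_centers(flat, 2))
```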
Methodology
The flowchart of the deep clustering subnetwork. DCS contains two iterative steps:
calculating cluster associations H and updating cluster centers Ω
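The two alternating steps can be sketched in a few lines. The slides do not give the exact update formulas, so this example assumes a soft k-means style rule: associations H are a softmax over negative squared distances to the centers, and each center is the association-weighted mean of the features. It is a stand-in to show the iteration pattern, not the DCS itself, and all function names are hypothetical.

```python
import math


def soft_associations(features, centers):
    """Compute H (N x M): softmax over negative squared distances (assumed rule)."""
    assoc = []
    for f in features:
        neg_d = [-sum((a - b) ** 2 for a, b in zip(f, c)) for c in centers]
        top = max(neg_d)                      # subtract max for numerical stability
        exps = [math.exp(v - top) for v in neg_d]
        total = sum(exps)
        assoc.append([e / total for e in exps])
    return assoc


def update_centers(features, assoc, centers):
    """Move each center to the association-weighted mean of the features."""
    dim = len(features[0])
    new_centers = []
    for j, old in enumerate(centers):
        weight = sum(row[j] for row in assoc)
        if weight == 0.0:
            new_centers.append(list(old))     # keep a center that attracts nothing
            continue
        new_centers.append([
            sum(row[j] * f[k] for row, f in zip(assoc, features)) / weight
            for k in range(dim)
        ])
    return new_centers


def cluster(features, centers, tau=3):
    """Alternate the two steps for tau iterations, then assign hard labels."""
    for _ in range(tau):
        assoc = soft_associations(features, centers)
        centers = update_centers(features, assoc, centers)
    assoc = soft_associations(features, centers)
    labels = [max(range(len(row)), key=row.__getitem__) for row in assoc]
    return labels, centers


if __name__ == "__main__":
    feats = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]]
    labels, centers = cluster(feats, [[0.0, 0.0], [5.0, 5.0]])
    print(labels)
```

Because both steps are built from differentiable operations (exponentials, sums and divisions), a procedure of this kind can sit inside a network and be trained end to end, which is the property the slides attribute to the DCS.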
Experimentation
• Segmentation results are reported on two Berkeley Segmentation Databases (BSDS300 and
BSDS500) [35], which consist of 300 and 500 natural images respectively.
• To quantitatively evaluate the segmentation results, five criteria are used:
1. Probabilistic Rand Index (PRI)
2. Variation of Information (VoI)
3. Global Consistency Error (GCE)
4. Boundary Displacement Error (BDE)
5. Segmentation Covering (SC)
• Segmentation performance is better when PRI and SC are larger and the other three
are smaller, measured against the ground truths
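Two of these criteria are simple enough to compute directly from flattened label lists. The sketch below uses the commonly used form of PRI, the Rand index averaged over the available human ground truths, and VoI as the sum of the two conditional entropies. It is an illustration with hypothetical function names, not the benchmark code shipped with the dataset.

```python
import math
from collections import Counter


def rand_index(seg, gt):
    """Fraction of pixel pairs on which two labelings agree."""
    n = len(seg)
    pairs = n * (n - 1) // 2

    def same_pairs(counter):
        return sum(c * (c - 1) // 2 for c in counter.values())

    both = same_pairs(Counter(zip(seg, gt)))   # together in both labelings
    in_seg = same_pairs(Counter(seg))
    in_gt = same_pairs(Counter(gt))
    apart_in_both = pairs - in_seg - in_gt + both
    return (both + apart_in_both) / pairs


def probabilistic_rand_index(seg, ground_truths):
    """Average Rand index over several ground-truth labelings."""
    return sum(rand_index(seg, gt) for gt in ground_truths) / len(ground_truths)


def variation_of_information(seg, gt):
    """VoI = H(seg | gt) + H(gt | seg) = 2 H(seg, gt) - H(seg) - H(gt)."""
    n = len(seg)

    def entropy(counter):
        return -sum((c / n) * math.log(c / n) for c in counter.values())

    h_joint = entropy(Counter(zip(seg, gt)))
    return 2 * h_joint - entropy(Counter(seg)) - entropy(Counter(gt))


if __name__ == "__main__":
    seg = [0, 0, 1, 1]
    gts = [[1, 1, 0, 0], [0, 1, 1, 1]]
    print(probabilistic_rand_index(seg, gts))
    print(variation_of_information(seg, gts[0]))
```

Both measures ignore the label values themselves, so a segmentation that matches a ground truth up to a renaming of the segments scores a Rand index of 1 and a VoI of 0.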
Experimentation
𝜏 is set to 3 according to the cross-validation experiments. The number of
training epochs T is set to 100, the learning rate is set to 2 and the
momentum is set to 0.9
Illustration of the iteration clustering process
Results
• To evaluate the proposed DIC method comprehensively, we compare its average
scores with those of sixteen benchmark algorithms, such as Ncut, Mean-shift,
gPb-owt-ucm, MLSS and W-Net. The optimal image scale (OIS) is selected for
segmenting images in the Berkeley Segmentation Database

• DIC works better in merging similar pixels and separating diverse regions by learning
from local image patterns adaptively
Results
Visual comparison between DIC and other state-of-the-art methods, such as
MLSS and SAS
Demo
• GitHub link: https://github.com/zmbhou/DIC
• BSD dataset link:
https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/
• Contour Detection and Image Segmentation
Resources:
http://web.archive.org/web/20160306133802/http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html#bsds500
Demo
Thank You
Q&A
