Learning-Based Segmentation
ITS 69204 Computer Vision and NLP
Prepared By: Dr. Toh Leow Bin
Learning Outcomes
Traditional Methods:
Thresholding, Edge Detection, Clustering, Region-Based Methods.
Depend on handcrafted features; limited scalability for complex tasks.
Deep Learning-Based Methods:
Learn hierarchical features directly from data.
Robust and adaptable for complex patterns.
Traditional Image Segmentation Techniques
Deep Learning-based Method
Semantic Segmentation
Semantic segmentation models output segment maps corresponding to the inputs they are fed.
These segment maps are often n-channeled, with n being the number of classes the model is supposed to segment.
Each of these n channels is binary in nature, with object locations "filled" with ones and empty regions consisting of zeros.
The ground truth map is a single-channel integer array of the same size as the input, with values ranging over the n classes; each segment is "filled" with the index of its corresponding class (classes are indexed from 0 to n-1).
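As a minimal sketch of this indexing convention (assuming NumPy; the to_one_hot helper is illustrative, not from any particular library), converting a ground-truth integer map into the n-channel binary format might look like this:

```python
import numpy as np

def to_one_hot(label_map: np.ndarray, num_classes: int) -> np.ndarray:
    """Convert an (H, W) integer ground-truth map into an
    (num_classes, H, W) binary (one-hot) segment map."""
    one_hot = np.zeros((num_classes, *label_map.shape), dtype=np.uint8)
    for c in range(num_classes):
        one_hot[c] = (label_map == c).astype(np.uint8)  # fill class c with ones
    return one_hot

# Example: a 3x3 ground truth map with classes 0..2 (n = 3)
gt = np.array([[0, 1, 1],
               [2, 2, 0],
               [0, 0, 1]])
print(to_one_hot(gt, num_classes=3).shape)  # (3, 3, 3)
```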
Deep Learning-based Method
Semantic Segmentation
The model output in this "n-channel" binary format is also known as a two-dimensional one-hot encoded representation of the predictions.
Neural networks that perform segmentation typically use an encoder-decoder structure, where the encoder is followed by a bottleneck and a decoder, or by upsampling layers applied directly to the bottleneck (as in the FCN).
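A minimal sketch of the second variant (assuming PyTorch; TinyFCN and its layer sizes are illustrative, not the original FCN architecture): the decoder is a single upsampling step applied directly to the bottleneck features.

```python
import torch
import torch.nn as nn

class TinyFCN(nn.Module):
    """Illustrative FCN-style model: encoder -> bottleneck scores -> upsample."""
    def __init__(self, num_classes: int):
        super().__init__()
        self.encoder = nn.Sequential(                       # downsample by 4x
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.classifier = nn.Conv2d(64, num_classes, 1)     # per-pixel class scores
        self.upsample = nn.Upsample(scale_factor=4, mode="bilinear",
                                    align_corners=False)    # back to input size

    def forward(self, x):
        return self.upsample(self.classifier(self.encoder(x)))

logits = TinyFCN(num_classes=3)(torch.randn(1, 3, 64, 64))
print(logits.shape)  # torch.Size([1, 3, 64, 64])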
Convolutional Encoder-Decoder Architecture
Encoder-decoder architectures for semantic segmentation became popular with works like SegNet (Badrinarayanan et al., 2015).
SegNet proposes a combination of convolutional and downsampling blocks to squeeze information into a bottleneck and form a representation of the input.
The decoder then reconstructs the input information to form a segment map, highlighting regions of the input and grouping them under their classes.
Finally, the decoder ends with a sigmoid activation that squeezes the output into the range (0, 1).
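A simplified sketch of this pipeline (assuming PyTorch; note the real SegNet unpools using the max-pooling indices saved by the encoder, for which plain transposed convolutions stand in here):

```python
import torch
import torch.nn as nn

class TinyEncoderDecoder(nn.Module):
    """Simplified SegNet-style sketch: symmetric encoder/decoder, sigmoid output."""
    def __init__(self, num_classes: int):
        super().__init__()
        self.encoder = nn.Sequential(   # conv + downsampling blocks -> bottleneck
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.decoder = nn.Sequential(   # reconstruct a full-resolution segment map
            nn.ConvTranspose2d(64, 32, 2, stride=2), nn.ReLU(),
            nn.ConvTranspose2d(32, num_classes, 2, stride=2),
            nn.Sigmoid(),               # squeeze each channel into (0, 1)
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

out = TinyEncoderDecoder(num_classes=3)(torch.randn(1, 3, 64, 64))
print(out.shape)  # torch.Size([1, 3, 64, 64])
```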
Convolutional Encoder-Decoder Architecture
SegNet was accompanied by another independent segmentation work released at the same time, U-Net (Ronneberger et al., 2015), which popularized skip connections as a solution for the information loss observed in the downsampling layers of typical encoder-decoder networks.
Skip connections are connections that go from the encoder directly to the decoder without passing through the bottleneck.
In other words, feature maps at various levels of the encoded representation are captured and concatenated to feature maps in the decoder. This helps reduce the information lost to aggressive pooling and downsampling in the encoder blocks of an encoder-decoder architecture, as sketched below.
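A minimal sketch of one skip connection (assuming PyTorch; TinySkipNet is illustrative, not the original U-Net): the encoder feature map bypasses the bottleneck and is concatenated with the upsampled features in the decoder.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinySkipNet(nn.Module):
    """One U-Net-style skip: encoder features concatenated into the decoder."""
    def __init__(self, num_classes: int):
        super().__init__()
        self.enc = nn.Conv2d(3, 32, 3, padding=1)           # encoder block
        self.bottleneck = nn.Conv2d(32, 64, 3, padding=1)   # after pooling
        self.dec = nn.Conv2d(64 + 32, num_classes, 3, padding=1)

    def forward(self, x):
        skip = F.relu(self.enc(x))                          # high-res feature map
        b = F.relu(self.bottleneck(F.max_pool2d(skip, 2)))  # low-res bottleneck
        up = F.interpolate(b, scale_factor=2, mode="bilinear",
                           align_corners=False)             # upsample decoder side
        return self.dec(torch.cat([up, skip], dim=1))       # concatenate the skip

print(TinySkipNet(3)(torch.randn(1, 3, 64, 64)).shape)  # [1, 3, 64, 64]
```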
U-Net explanation
Why Deep Learning for Segmentation?
Advantages:
Robust performance in noisy and variable conditions.
End-to-end learning with hierarchical features.
Scalable for diverse datasets.
Examples:
Medical Imaging: Automated cancer detection.
Autonomous Vehicles: Precise road marking detection.
Applications of Image Segmentation
Robotics (Machine Vision)
Aids machine perception and locomotion by pointing out objects in a robot's path of motion, enabling it to change paths effectively and understand the context of its environment.
Medical Imaging
Helps doctors identify potentially malignant features in images quickly and accurately.
Examples: X-ray, CT scan, dental, and pathology cell images.
Applications of Image Segmentation
Smart Cities
CCTV cameras for real-time monitoring of pedestrians, traffic, and crime.
Pedestrian detection, traffic analytics, license plate detection, and video surveillance.
Self-Driving/Autonomous Cars
Route planning and movement depend heavily on segmentation.
Drivable-surface semantic segmentation, car and pedestrian instance segmentation, in-vehicle object detection (items left behind by passengers), and pothole detection and segmentation.
Challenges in Deep Learning-Based Segmentation
Data Dependency:
Large annotated datasets are required.
Annotation is expensive and time-consuming.
Computational Requirements:
High-performance GPUs or TPUs are required for training.
Model Complexity:
Risk of overfitting with insufficient data.
Class Imbalance:
Small objects or regions may be overshadowed by larger ones in loss functions; see the weighted-loss sketch after this list.
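One common mitigation, shown here as a sketch (assuming PyTorch; the per-class pixel counts are hypothetical), is to weight the loss inversely to class frequency so that small classes are not drowned out by large ones:

```python
import torch
import torch.nn as nn

# Hypothetical pixel counts per class; rarer classes receive larger weights.
pixel_counts = torch.tensor([9.0e6, 8.0e5, 5.0e4])  # class 0 dominates
weights = pixel_counts.sum() / (len(pixel_counts) * pixel_counts)

criterion = nn.CrossEntropyLoss(weight=weights)      # class-weighted loss

logits = torch.randn(1, 3, 64, 64)                   # (N, classes, H, W) scores
target = torch.randint(0, 3, (1, 64, 64))            # (N, H, W) integer labels
print(criterion(logits, target).item())
```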
Summary