Computer Vision Experiential Learning Report

The document discusses image segmentation techniques applied to the Cityscapes dataset. It provides context on image segmentation and its applications. It also describes unsupervised clustering algorithms and supervised deep learning models like U-Net that were used for segmentation. Quantitative metrics to evaluate the different methods are mentioned.

Uploaded by

aditya.pande.btech2021

Computer Vision Experiential Learning Report

Aditya Pande
21070126001
AIML A1

Image Segmentation of Cityscapes Data with U-Net in PyTorch

Introduction to Image Segmentation:

Image segmentation is a fundamental task in computer vision, playing
a pivotal role in extracting meaningful information from images by
dividing them into semantically coherent regions. Unlike object
detection, which identifies and localizes objects within an image,
segmentation goes a step further by precisely outlining the boundaries
of individual objects or regions. This process is critical for various
applications, ranging from medical imaging and autonomous vehicles
to augmented reality and content-based image retrieval.

Significance in Computer Vision Applications:

1. Object Recognition and Tracking:

- Image segmentation facilitates precise identification and tracking
of objects within a scene, enabling applications like object recognition
and tracking in real-time video streams.

2. Medical Imaging:
- In medical fields, segmentation aids in the accurate delineation of
structures and organs, assisting in diagnosis, treatment planning, and
monitoring of diseases.

3. Autonomous Vehicles:
- For autonomous vehicles, accurate segmentation is crucial for
understanding the surrounding environment, identifying road lanes,
pedestrians, and other vehicles.

4. Augmented Reality:
- In augmented reality applications, segmentation helps distinguish
between the foreground and background, allowing virtual elements to
seamlessly interact with the real world.

Explanation of the Project:

In our project, we focus on image segmentation using the Cityscapes dataset,
which contains labeled urban scenes captured from vehicles in Germany. The
dataset provides a challenging yet realistic environment for testing and
evaluating segmentation techniques. Our project involves implementing various
image segmentation methods, encompassing traditional techniques such as
thresholding and clustering algorithms, as well as state-of-the-art deep
learning models like U-Net and Mask R-CNN.

One aspect of our project involves the application of clustering algorithms such
as K-means and DBSCAN to segment images. These algorithms group pixels
based on similarities in color, allowing us to explore their effectiveness in
extracting meaningful regions from the dataset. We will compare the results of
clustering algorithms with traditional and deep learning methods to understand
their respective advantages and limitations.
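
As a rough illustration of the color-based grouping described above, the sketch below runs K-means on a list of RGB pixels in plain Python. It is not the code used in the project: the function name kmeans_colors, the evenly spaced centroid initialization, and the fixed iteration count are all assumptions made for this example. In practice a library implementation with random or k-means++ initialization would normally be used.

```python
def kmeans_colors(pixels, k, iters=10):
    """Cluster RGB pixels into k groups by color.

    pixels: list of (r, g, b) tuples.
    Returns (centroids, labels), where labels[i] is the cluster of pixels[i].
    """
    # Assumed initialization for this sketch: k evenly spaced pixels from the list.
    centroids = [tuple(float(ch) for ch in pixels[i * len(pixels) // k])
                 for i in range(k)]
    labels = [0] * len(pixels)
    for _ in range(iters):
        # Assignment step: each pixel joins the nearest centroid
        # (squared Euclidean distance in RGB space).
        for i, p in enumerate(pixels):
            labels[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
        # Update step: move each centroid to the mean color of its members.
        for c in range(k):
            members = [pixels[i] for i, lab in enumerate(labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(ch) / len(members) for ch in zip(*members))
    return centroids, labels


# Three dark and three bright pixels should separate into two clusters.
pixels = [(10, 10, 10), (12, 11, 9), (8, 9, 11),
          (200, 200, 200), (205, 198, 202), (195, 202, 198)]
centroids, labels = kmeans_colors(pixels, k=2)
print(labels)
```

Because the grouping uses color alone, two pixels of the same color always land in the same cluster regardless of where they sit in the image, which is one limitation to keep in mind when comparing against the learned models.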

Our evaluation will not only focus on visual comparisons but will also include
quantitative assessments using metrics such as Intersection over Union (IoU)
and Dice Coefficient. These metrics provide insights into the accuracy and
precision of the segmentation methods, aiding in a comprehensive analysis of
their performance.
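
To make the two metrics concrete, here is a minimal plain-Python sketch that computes IoU and the Dice Coefficient for a single class. It assumes the predicted and ground-truth label maps have already been flattened into sequences of integer class IDs; the function name iou_and_dice and the convention of returning 1.0 when the class is absent from both maps are assumptions for this example, not part of the report's code.

```python
def iou_and_dice(pred, target, cls):
    """Return (IoU, Dice) for one class.

    pred, target: equal-length sequences of per-pixel class IDs.
    cls: the class ID to evaluate.
    """
    pred_mask = [p == cls for p in pred]
    target_mask = [t == cls for t in target]
    # Pixels labeled cls in both the prediction and the ground truth.
    intersection = sum(p and t for p, t in zip(pred_mask, target_mask))
    pred_area = sum(pred_mask)
    target_area = sum(target_mask)
    union = pred_area + target_area - intersection
    if union == 0:
        # Assumed convention: class absent from both maps counts as a perfect match.
        return 1.0, 1.0
    iou = intersection / union
    dice = 2 * intersection / (pred_area + target_area)
    return iou, dice


pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
iou, dice = iou_and_dice(pred, target, cls=1)
print(iou, dice)  # IoU = 2/4, Dice = 4/6
```

The two scores are related by Dice = 2 * IoU / (1 + IoU), so they always rank methods in the same order on a single image and class; Dice simply gives a higher number for the same overlap.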

Additionally, our project aims to explore the trade-offs between traditional and
deep learning approaches, taking into consideration factors such as
computational efficiency, robustness to variations, and interpretability. By
conducting this analysis, we seek to contribute insights into the effectiveness of
different segmentation techniques, offering a holistic understanding of the
challenges associated with image segmentation in complex urban environments.

Literature Review on Image Segmentation:

| Author | Title | Result |
| --- | --- | --- |
| Olaf Ronneberger, Philipp Fischer, Thomas Brox | U-Net: Convolutional Networks for Biomedical Image Segmentation | In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. |
| Vijay Badrinarayanan, Alex Kendall, Roberto Cipolla | SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation | The novelty of SegNet lies in the manner in which the decoder upsamples its lower resolution input feature map(s). Specifically, the decoder uses pooling indices computed in the max-pooling step of the corresponding encoder to perform non-linear upsampling. |
| Fausto Milletari, Nassir Navab, Seyed-Ahmad Ahmadi | V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation | In this work we propose an approach to 3D image segmentation based on a volumetric, fully convolutional, neural network. |
| S. Prabu, J.M. Gnanasekar | A Study on Image Segmentation Method for Image Processing | In this paper, different segmentation algorithms are reviewed and analyzed, and finally a comparison of all the algorithms is listed. This comparison study is useful for increasing accuracy and performance of segmentation methods in various image processing domains. |
| Refik Samet; Şahin Emrah Amrahov; Ali Hikmet Ziroğlu | Fuzzy Rule-Based Image Segmentation technique for rock thin section images | In this paper, we propose Fuzzy Rule-Based Image Segmentation technique to segment rock thin section images. |
| Ashwani Kumar Yadav; Ratnadeep Roy; Rajkumar; Vaishali; Devendra Somwanshi | Thresholding and morphological based segmentation techniques for medical images | The main objective of this work is to segment the medical image under various conditions and different backgrounds. |
| Sharifah Lailee Syed Abdullah; Hamirul'Aini Hambali; Nursuriati Jamil | An accurate thresholding-based segmentation technique for natural images | The traditional thresholding and clustering segmentation techniques that were widely used are Otsu and K-means. |
| Annegreet van Opbroek; M. Arfan Ikram; Meike W. Vernooij; Marleen de Bruijne | Transfer Learning Improves Supervised Image Segmentation Across Imaging Protocols | The variation between images obtained with different scanners or different imaging protocols presents a major challenge in automatic segmentation of biomedical images. |

About the Dataset :

Context
The Cityscapes dataset contains labelled videos taken from vehicles
driven in Germany. This version is a processed subsample created as part
of the Pix2Pix paper. It consists of still images extracted from the
original videos, with the semantic segmentation labels shown in images
alongside each original image. It is a widely used dataset for semantic
segmentation tasks.

Content
This dataset has 2,975 training image files and 500 validation image
files. Each image file is 256x512 pixels, and each file is a composite with
the original photo on the left half of the image, alongside the labeled
image (output of semantic segmentation) on the right half.
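
Since every file packs the photo and its label map side by side, each sample has to be cut in half before training. The sketch below shows one way to do that on an image stored as a list of pixel rows, assuming the 256x512 size means 256 rows by 512 columns. The function name split_composite is hypothetical and this is not the project's loader; with a NumPy array the same split would be a pair of column slices.

```python
def split_composite(image):
    """Split a side-by-side composite into (photo, label) halves.

    image: row-major list of rows, each row a list of pixels.
    The left half is the original photo, the right half the label image.
    """
    width = len(image[0])
    half = width // 2
    photo = [row[:half] for row in image]   # left half: original photo
    label = [row[half:] for row in image]   # right half: segmentation labels
    return photo, label


# Dummy 256-row by 512-column image whose pixels record their own (row, column).
composite = [[(r, c) for c in range(512)] for r in range(256)]
photo, label = split_composite(composite)
print(len(photo), len(photo[0]), len(label), len(label[0]))
```

Both halves come out as 256x256 images, so the photo and its label map stay pixel-aligned.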

Acknowledgements
This dataset is the same as the one made available by the Berkeley AI
Research group.

License
The Cityscapes data available from cityscapes-dataset.com has the
following license:

This dataset is made freely available to academic and non-academic
entities for non-commercial purposes such as academic research,
teaching, scientific publications, or personal experimentation. Permission
is granted to use the data given that you agree:

- That the dataset comes "AS IS", without express or implied warranty.
Although every effort has been made to ensure accuracy, we (Daimler
AG, MPI Informatics, TU Darmstadt) do not accept any responsibility for
errors or omissions.
- That you include a reference to the Cityscapes Dataset in any work that
makes use of the dataset. For research papers, cite our preferred
publication as listed on our website; for other media cite our preferred
publication as listed on our website or link to the Cityscapes website.
- That you do not distribute this dataset or modified versions. It is
permissible to distribute derivative works in as far as they are abstract
representations of this dataset (such as models trained on it or additional
annotations that do not directly include any of our data) and do not allow
to recover the dataset or something similar in character.
- That you may not use the dataset or any derivative work for commercial
purposes as, for example, licensing or selling the data, or using the data
with a purpose to procure a commercial gain.
- That all rights not expressly granted to you are reserved by (Daimler AG,
MPI Informatics, TU Darmstadt).

Code Snippets :
Github Repo :
https://github.com/adityapande403/CV_segmentation_UNET_EXPL/tree/main
