0% found this document useful (0 votes)
4 views

Image_Segmentation

This document is a minor project report submitted for the Bachelor of Technology in Computer Science and Engineering, focusing on evolutionary feature selection for optimizing image segmentation models. It discusses the integration of evolutionary algorithms and dimensionality reduction techniques to enhance model performance and efficiency in processing complex image data. The report includes sections on literature review, proposed models and algorithms, experimental results, and concludes with discussions on future work and implications for the field of computer vision.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Image_Segmentation

This document is a minor project report submitted for the Bachelor of Technology in Computer Science and Engineering, focusing on evolutionary feature selection for optimizing image segmentation models. It discusses the integration of evolutionary algorithms and dimensionality reduction techniques to enhance model performance and efficiency in processing complex image data. The report includes sections on literature review, proposed models and algorithms, experimental results, and concludes with discussions on future work and implications for the field of computer vision.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 22

Evolutionary feature selection for optimization of

Image Segmentation Model.

A MINOR PROJECT REPORT SUBMITTED IN PARTIAL


FULFILLMENT OF THE REQUIREMENT FOR THE DEGREE OF

BACHELOR OF TECHNOLOGY
IN
COMPUTER SCIENCE AND ENGINEERING

SUBMITTED BY
Name Univ. Roll No.
Subharaj Mishra 10800120067
Dipyaman Ghosal 10800121194
Abir Maji 10800121203
Arghya Konar 10800121193

UNDER THE GUIDANCE OF

Mr. Pranabes Gangopadhyay


(Assistant Professor of CS)

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING


ASANSOL ENGINEERING COLLAGE
AFFILIATED TO
MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY

December, 2023
Contents

Certificate of ii
Recommendation………………………………………………………………
Certificate of iii
Approval……………………………………………………………….............
Acknowledgement…………………………………………………………......................... iv
...
Abstract…………………………………………………………………………... v
…………..
List of vii
Figures……………………………………………………………………………….
List of Tables………………………………………………………………………………. vii
i

1. Preface………………………………………………………………………………
1.1 Introduction……………………………………………………... 1
……………
1.2 Motivation of the project…………………………………. 2
…………………..
1.3 Basic description of the project…………………………. 3
……………………
2. Literature Review
2.1 General……………………………………………………... 4
…………………
2.2 Review of related works …………………………. …………......….…... 6
……
3. Related Theories and Algorithms…………..……………………………………..
3.1 Fundamental theories underlying the work. 8
…………………………………..
3.2 Fundamental algorithms…………………………………. 10
……………………
4. Proposed model/algorithm…………………………………….………………….
4.1 Proposed 12
model………………………………………………………………..
4.2 Proposed algorithms………………………. 16
………………………………….
5. Simulation Results…………………………………………...…………………….

4.1 Experimental set up ……………………………………………..………….. 20
4.2 Experimental results………...……………...……… 25
6. Discussion and Conclusion ……...…………………………………………….
…….
6.1 Discussion…………………………………..…………………………. 30
……..
6.2 Future work…………………………………..….……………………... 32

ii
……..
6.3 Conclusion………………………………………………………………… 34
…..
References…………………………………………………………………………
….

DEPARTMENT OF COMPUTER SCIENCE AND


ENGINEERING
ASANSOL ENGINEERING COLLEGE
Vivekananda Sarani, Kanyapur, Asansol, West Bengal – 713305

Certificate of Recommendation

I hereby recommend that the Minor Project Report entitled, “Write Project Title
Here” carried out under my supervision by the group of students listed below may be
accepted in partial fulfilment of the requirement for the degree of “Bachelor of
Technology in Computer Science and Engineering” of Asansol Engineering College
under MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY.

Name Univ. Roll No.


Subharaj Mishra 10800120067
Dipyaman Ghosal 10800121194
Abir Maji 10800121203
Arghya Konar 10800121193

…………………………………
(Asst Prof. Mr. Pranabes Gangopadhyay)
Project Supervisor

iii
Dept. of Comp. Sc. & Engg,
Asansol Engineering College,
Asansol-713305

Countersigned:
………………………………
(Dr. Monish Chatterjee)
Head of the Department
Dept. of Comp. Sc. & Engg.
Asansol Engineering College,
Asansol-713305

DEPARTMENT OF COMPUTER SCIENCE AND


ENGINEERING

ASANSOL ENGINEERING COLLEGE


Vivekananda Sarani, Kanyapur, Asansol, West Bengal – 713305

Certificate of Approval

The Minor Project Report is hereby approved as a creditable study of an engineering


subject carried out and presented in a manner satisfactory to warrant its acceptance for
the degree for which it has been submitted. It is understood that by this approval the
undersigned does not necessarily endorse or approve any statement made, opinion
expressed or conclusion drawn therein but approve the report only for the purpose for
which it is submitted.

…………………………………

(Asst Prof. Mr. Pranabes Gangopadhyay)


Project Supervisor
iv
Dept. of Comp. Sc. & Engg.,
Asansol Engineering College,
Asansol-713305

Acknowledgement

It is our great privilege to express our profound and sincere gratitude to our Project
Supervisor, Asst Prof. Mr. Pranabes Gangopadhyay for providing us a very
cooperative and precious guidance at every stage of the present project work being
carried out under his/her supervision. His valuable advice and instructions in carrying
out the present study has been a very rewarding and pleasurable experience that has
greatly benefited us throughout the course of work.

We would like to convey our sincere gratitude towards Dr. Monish Chatterjee, Head of
the Department of Computer Science and Engineering of Asansol Engineering College
for providing us the requisite support for timely completion of our work. We would also
like to pay our heartiest thanks and gratitude to all the teachers of the Department of
Computer Science and Engineering, for various suggestions being provided in attaining
success in our work.

We would like to express our earnest thanks to Mr. Suman Mallick, of CSE Project Lab
for his technical assistance provided during our project work.

v
Finally, I would like to express my deep sense of gratitude to my parents for their
constant motivation and support throughout my work.

…………………………………
(Subharaj Mishra)

…………………………………
(Dipyaman Ghosal)

…………………………………
(Abir Maji)

…………………………………
(Arghya Konar)

Abstract

As the complexity and dimensionality of image data continue to grow, the demand for efficient
and accurate image segmentation models becomes increasingly crucial. This abstract explores the
application of evolutionary feature selection and dimensionality reduction techniques as powerful
strategies for optimizing image segmentation models. Evolutionary algorithms, inspired by
natural selection, offer a dynamic and adaptive approach to selecting relevant features by
iteratively evolving a population of potential solutions. Through the exploration of candidate
feature subsets, these algorithms aim to enhance the model's performance by identifying and
retaining the most informative features for image segmentation tasks.

In conjunction with evolutionary approaches, dimensionality reduction techniques play a pivotal


role in streamlining the computational burden and improving the interpretability of image
segmentation models. Methods such as Principal Component Analysis (PCA) and t-Distributed
Stochastic Neighbour Embedding (t-SNE) are explored within the context of image segmentation
optimization. By transforming high-dimensional image data into a reduced feature space, these
techniques facilitate more efficient model training and enhance the model's ability to capture
relevant patterns for accurate segmentation.

This abstract concludes by highlighting the potential synergy between evolutionary feature
selection and dimensionality reduction techniques, emphasizing their collective impact on

vi
improving the efficiency, accuracy, and interpretability of image segmentation models. The
integration of these strategies holds promise for advancing the field of computer vision and
empowering applications ranging from medical imaging to autonomous systems.

List of Figures

Fig. 1.0 Proposed Model for Image Segmentation. Page no: xi

Fig. 2.0 Proposed Algorithms For Image Segmentation(R-CNN). Page no: xii

Fig. 3.0 Accuracy and Precision Testing. Page no: xix

vii
1. Preface……………………………………………………………………………

1.1 Introduction:

The advent of sophisticated imaging technologies has fuelled the need for
advanced computer vision models, particularly in the domain of image
segmentation. As the complexity of visual data grows, optimizing
segmentation models becomes paramount. This preface introduces the key
elements of the upcoming exploration—evolutionary feature selection and
dimensionality reduction—as innovative techniques to address the challenges
of enhancing image segmentation models. It sets the stage for understanding
the dynamic interplay between these methodologies and their potential impact
on the field of computer vision.

1.2 Motivation of the Project:


The motivation behind this project stems from the pressing demand to
develop image segmentation models that can efficiently process and interpret
intricate visual data. The preface delves into the motivating factors, including
the expanding applications of segmentation in fields such as medical imaging
and autonomous systems. It emphasizes the urgency to optimize these models
for improved efficiency, accuracy, and interpretability. By laying out the
project's motivation, readers gain insights into the real-world implications and

viii
significance of exploring evolutionary feature selection and dimensionality
reduction in this context.

1.3 Basic Description of the Project:


This section provides a concise overview of the project's fundamental
components. It outlines the scope and purpose of integrating evolutionary
feature selection and dimensionality reduction techniques to optimize image
segmentation models. The preface establishes a baseline understanding of the
project's structure, guiding readers through the forthcoming exploration of
these methodologies. By offering a basic description, this section serves as a
roadmap for the subsequent chapters, encouraging readers to delve into the
intricate details of how these techniques can synergistically contribute to
advancing image segmentation in the realm of computer vision.

2. Literature Review…………………………………………………………………
2.1 General:
The literature review in this section provides a broad overview of the general
landscape surrounding image segmentation, evolutionary algorithms, and
dimensionality reduction techniques. It explores foundational concepts,
historical developments, and key theoretical frameworks that form the basis of
the subsequent investigation. By delving into the general literature, readers
will gain a comprehensive understanding of the evolution of image
segmentation models, the principles underlying evolutionary algorithms, and
the significance of dimensionality reduction in the context of computer vision.

2.2 Review of Related Works:


This subsection narrows the focus to a more specific examination of related
works within the realm of evolutionary feature selection and dimensionality
reduction for image segmentation optimization. It critically evaluates existing
research, methodologies, and findings that contribute to the current state of
knowledge in this field. By synthesizing insights from diverse sources, this
review aims to identify trends, gaps, and emerging patterns in the literature.

ix
Additionally, it provides a foundation for the project's methodology and
conceptual framework, setting the stage for the subsequent chapters that delve
into the practical application of these techniques in optimizing image
segmentation models.

3. Related Theories and Algorithms………………………………………………..


3.1 Fundamental Theories Underlying the Work:

This section explores the foundational theories that underpin the application
of TensorFlow in the context of image segmentation. It delves into the core
principles of deep learning, convolutional neural networks (CNNs), and the
specific architectural considerations within TensorFlow that make it a
powerful tool for image segmentation tasks. The review of fundamental
theories provides readers with the necessary background to comprehend the
intricacies of how TensorFlow processes image data, extracts features, and
performs segmentation. Insights into the theoretical underpinnings guide the
subsequent discussion on the practical implementation of image segmentation
using TensorFlow.

3.2 Fundamental Algorithms:

Building upon the theoretical foundation, this subsection focuses on the


fundamental algorithms integral to image segmentation using TensorFlow. It

x
covers key algorithms such as U-Net, Deep Lab, and Mask R-CNN, among
others, that have proven effective in segmenting images. The discussion
outlines the conceptual frameworks and algorithmic intricacies that contribute
to the success of these methods. By examining the core algorithms, readers
will gain a deeper understanding of the nuances involved in leveraging
TensorFlow for image segmentation. This knowledge serves as a crucial
precursor to the subsequent chapters, where the integration of evolutionary
feature selection and dimensionality reduction techniques will be explored to
enhance these fundamental algorithms and optimize image segmentation
models.

4. Proposed Model/Algorithm……………………………………………………….
4.1 Proposed Model of U-Net:

In this section, we present a detailed overview of the proposed model based


on the U-Net architecture for image segmentation using TensorFlow. U-Net,
renowned for its success in biomedical image segmentation, features an
encoder-decoder structure with skip connections, allowing for precise
localization of objects within an image. We discuss the adaptation and
enhancements made to the traditional U-Net model to cater to specific
requirements and challenges within the targeted domain. By elucidating the
modifications and justifications for our proposed U-Net model, readers gain
insights into the tailored approach taken to optimize image segmentation for
the given context.

xi
Fig: 1.0 : Proposed Model for Image Segmentation.
4.2 Proposed Algorithms For Image Segmentation(R-CNN):

Mask R-CNN (Region-based Convolutional Neural Network) is a popular and


powerful algorithm for instance segmentation, a task that involves identifying
and delineating individual objects within an image. Developed as an extension
of the Faster R-CNN architecture, Mask R-CNN introduces an additional
branch to predict segmentation masks alongside bounding boxes and class
labels. This allows the algorithm not only to identify objects but also to
provide pixel-level segmentation masks for each object instance.

Here's a step-by-step explanation of the Mask R-CNN algorithm:

xii
Backbone Network:
Mask R-CNN typically starts with a backbone network (e.g., U-Net or a
similar architecture) that extracts hierarchical features from the input image.
These features serve as a foundation for subsequent tasks.
Region Proposal Network (RPN):
Similar to Faster R-CNN, Mask R-CNN utilizes an RPN to generate region
proposals. These proposals are candidate bounding boxes that potentially
contain objects of interest. The RPN proposes regions based on anchor boxes
and assigns objectless scores to these proposals.
ROI Align:
Once region proposals are obtained, the algorithm uses ROI (Region of
Interest) Align to extract features from these proposed regions with sub-pixel
accuracy. Unlike the ROI pooling used in Faster R-CNN, ROI Align mitigates
information loss caused by quantization and improves the accuracy of the
subsequent tasks.
Object Classification:
The extracted features from the proposed regions are fed into a classification
head, which predicts the class label of the object contained within each
region. This step distinguishes different object categories.

Bounding Box Regression:


Simultaneously, another branch of the network performs bounding box
regression, refining the coordinates of the proposed bounding boxes to better
align with the actual object boundaries.

Mask Prediction:
Mask R-CNN introduces an additional branch for mask prediction. This
branch is responsible for generating pixel-level segmentation masks for each
instance within the proposed regions. It uses a small fully convolutional
network to predict the mask for each region independently.
Training:
The entire Mask R-CNN model is trained end-to-end. The training involves
optimizing three main components: the region proposal network, the object
classification, and bounding box regression, and the mask prediction. A multi-
task loss function combines these components, ensuring that the model learns
to simultaneously predict bounding boxes, class labels, and segmentation
masks.
Inference:

xiii
During inference, the trained Mask R-CNN model takes an input image,
generates region proposals using the RPN, performs ROI Align to extract
features, and then predicts class labels, bounding boxes, and segmentation
masks for each region. The final output includes the identified objects along
with their precise segmentation masks.
Mask R-CNN has proven to be highly effective in various applications,
including object detection and instance segmentation, making it a widely
adopted algorithm in the field of computer vision

Fig: 2.0

5. Simulation Results………………………………………………………………..
5.1 Experimental Setup:
In the experimental setup, images from a diverse dataset were pre-processed
with resizing and normalization.

1. Preparing the dataset:

The dataset namely Oxford-IIIT Pets dataset is loaded in the Program where
the images of different animals are give to work upon. In addition, the image
color values are normalized to the [0, 1] range. Finally, as mentioned above
the pixels in the segmentation mask are labelled either {1, 2, 3}. For the sake
of convenience, subtract 1 from the segmentation mask, resulting in labels
that are: {0, 1, 2}.

xiv
2. Normalizing the Image:
In image processing, normalization is a process that changes the range of
pixel intensity values. Applications include photographs with poor contrast
due to glare, for example. Normalization is sometimes called contrast
stretching or histogram stretching. In more general fields of data processing,
such as digital signal processing, it is referred to as dynamic range expansion.

3. Loading the Images for normalization:


This piece of code adjusts and load the images for their normalization. This
process helps in removing unnecessary elements and resize in order to fit the
model for predictions.

4. Preparing a Training set and Testing set:


A training and testing set is prepared for our model to perform the operation
by using TensorFlow’s splitting methods. The dataset already contains the
required training and test splits, so continue to use the same splits.

xv
5. Augmenting the data:
The data is further given to for augmentation. Data augmentation is a
technique to increase the diversity of your training set by applying random
(but realistic) transformations, such as image rotation. Here we are randomly-
flipping an image. The flipping is horizontal in nature.

6. Building Input pipelines:


The input pipeline is a quick and easy utility provided in tf.dataapi to make
complex input pipelines from simple and reusable codes and all in few lines
of code. It also allows handling a large amount of data, thus giving low-end
machines an advantage in computing them.

7. Visualising the Image dataset and masks:


Now comes the part where we are going to display the images along with
there masks. These masks are made in a photo processing software manually.
These are the particular data we will be using to train our model. The code is
shown along with the images.

xvi
5.2 Experimental Results:
1. First Phase (Running the model before Training):
First phase of testing is to run the model on the new images before
training the model. This is comparing the results before training.

xvii
2. Second Phase (Running the model when its Training):
Second phase of testing is to run the model with partial data while its
training on the training data to check the progress of our machine learning model.

3. Third Phase (Making prediction after completion of Training):


Third Phase of Testing is to run it on the new and tested images and
measuring there result to evaluate how well our machine learning model is
performing.

xvii
i
4. Fourth Phase (Accuracy and Precision of each testing data):
These are the following Iteration of the testing data along with there
validation loss and Accuracy.

xix
The graph showing the reduction of validation loss and Training loss over the
iteration in the model.

6. Discussion and Conclusion……………………………………………………….


6.1 Discussion:

xx
The discussion section delves into the implications of the simulation results,
addressing strengths and limitations of the proposed algorithm for image
segmentation. Key considerations include the algorithm's adaptability to
various datasets, the impact of hyperparameter choices on performance, and
potential challenges in handling complex scenes. Additionally, comparisons
with existing methods and insights into the algorithm's generalization
capabilities are explored.

6.2 Future Work:


This may involve exploring novel architectures, incorporating advanced
attention mechanisms, or adapting the algorithm for specific application
domains. Considerations for addressing potential challenges identified during
the experimental phase and scaling the algorithm for larger datasets are
discussed, paving the way for continued advancements in image
segmentation.

6.3 Conclusion:
It emphasizes the effectiveness of the proposed algorithm in achieving
accurate image segmentation results, as evidenced by comprehensive
evaluation metrics. The conclusion reinforces the algorithm's practical
applicability and potential for real-world deployment. Closing remarks may
also touch upon the broader implications of the research within the field of
computer vision and image processing.

References:

xxi
1. Kaggle - https://www.kaggle.com/

2. Coursera - https://www.coursera.org/

3. Geeksforgeeks - https://www.geeksforgeeks.org/

xxii

You might also like