0% found this document useful (0 votes)

13 views24 pages

Finalised Question 2

This document provides a template for loading and processing the Oxford-IIIT Pets dataset for classification and semantic segmentation tasks in Assignment 1B. It includes helper functions for data preprocessing, augmentation, and loading the MobileNetV3Small Network for fine-tuning. Additionally, it outlines the structure and requirements for the assignment, emphasizing the importance of following the provided instructions and reading the assignment brief.

Uploaded by

ido824488

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views24 pages

Finalised Question 2

Uploaded by

ido824488

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

CAB420 Assignment 1B Question 2: Template

Overview
This notebook provides a helper function to load in the Oxford-IIIT Pets dataset suitable for classification and semantic segmentation, to help with
Assignment 1B, Question 2.

It also provides an example of how to load in the MobileNetV3Small Network which you are required to fine tune for the second part of the
question.

Please read the comments and instructions within this notebook. It has been carefully designed to help you with many of the tasks required.

Please make sure you read the assignment brief on canvas, and check the FAQ for other information.

In [1]: !pip install --upgrade tensorflow_datasets

import os
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '3'

import matplotlib.pyplot as plt

from matplotlib import gridspec

import tensorflow as tf
import keras
from keras.layers import Input, Dense, Conv2D, MaxPooling2D, UpSampling2D, concatenate, BatchNormalization, SpatialDropout2D, Act
from keras.models import Model

import numpy as np
import pandas as pd
import tensorflow_datasets as tfds
import glob
Requirement already satisfied: tensorflow_datasets in c:\users\acer\anaconda3\lib\site-packages (4.9.8)
Requirement already satisfied: tqdm in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (4.64.1)
Requirement already satisfied: requests>=2.19.0 in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (2.28.1)
Requirement already satisfied: promise in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (2.3)
Requirement already satisfied: protobuf>=3.20 in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (4.21.12)
Requirement already satisfied: simple_parsing in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (0.1.7)
Requirement already satisfied: termcolor in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (2.5.0)
Requirement already satisfied: toml in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (0.10.2)
Requirement already satisfied: wrapt in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (1.14.1)
Requirement already satisfied: absl-py in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (2.1.0)
Requirement already satisfied: pyarrow in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (20.0.0)
Requirement already satisfied: psutil in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (5.9.0)
Requirement already satisfied: etils[edc,enp,epath,epy,etree]>=1.6.0 in c:\users\acer\anaconda3\lib\site-packages (from tensorfl
ow_datasets) (1.12.2)
Requirement already satisfied: dm-tree in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (0.1.9)
Requirement already satisfied: tensorflow-metadata in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (1.1
7.1)
Requirement already satisfied: numpy in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (1.26.4)
Requirement already satisfied: immutabledict in c:\users\acer\anaconda3\lib\site-packages (from tensorflow_datasets) (4.2.1)
Requirement already satisfied: typing_extensions in c:\users\acer\anaconda3\lib\site-packages (from etils[edc,enp,epath,epy,etre
e]>=1.6.0->tensorflow_datasets) (4.12.2)
Requirement already satisfied: zipp in c:\users\acer\anaconda3\lib\site-packages (from etils[edc,enp,epath,epy,etree]>=1.6.0->te
nsorflow_datasets) (3.11.0)
Requirement already satisfied: fsspec in c:\users\acer\anaconda3\lib\site-packages (from etils[edc,enp,epath,epy,etree]>=1.6.0->
tensorflow_datasets) (2022.11.0)
Requirement already satisfied: importlib_resources in c:\users\acer\anaconda3\lib\site-packages (from etils[edc,enp,epath,epy,et
ree]>=1.6.0->tensorflow_datasets) (6.5.2)
Requirement already satisfied: einops in c:\users\acer\anaconda3\lib\site-packages (from etils[edc,enp,epath,epy,etree]>=1.6.0->
tensorflow_datasets) (0.8.1)
Requirement already satisfied: certifi>=2017.4.17 in c:\users\acer\anaconda3\lib\site-packages (from requests>=2.19.0->tensorflo
w_datasets) (2024.12.14)
Requirement already satisfied: idna<4,>=2.5 in c:\users\acer\anaconda3\lib\site-packages (from requests>=2.19.0->tensorflow_data
sets) (3.4)
Requirement already satisfied: charset-normalizer<3,>=2 in c:\users\acer\anaconda3\lib\site-packages (from requests>=2.19.0->ten
sorflow_datasets) (2.0.4)
Requirement already satisfied: urllib3<1.27,>=1.21.1 in c:\users\acer\anaconda3\lib\site-packages (from requests>=2.19.0->tensor
flow_datasets) (1.26.14)
Requirement already satisfied: attrs>=18.2.0 in c:\users\acer\anaconda3\lib\site-packages (from dm-tree->tensorflow_datasets) (2
2.1.0)
Requirement already satisfied: six in c:\users\acer\anaconda3\lib\site-packages (from promise->tensorflow_datasets) (1.16.0)
Requirement already satisfied: docstring-parser<1.0,>=0.15 in c:\users\acer\anaconda3\lib\site-packages (from simple_parsing->te
nsorflow_datasets) (0.16)
Requirement already satisfied: colorama in c:\users\acer\anaconda3\lib\site-packages (from tqdm->tensorflow_datasets) (0.4.6)
Data loading and pre-processing functions
We first provide some helper functions to format the data in the way we need. You shouldn't need to change these, though you are welcome to if
you like.

One thing you may want to do is create additional augmentation functions, and the flip_lr_augmentation function below could be used as a
template to create additional augmentation types.

In [2]: def preprocess_segmentation_mask(segmentation_mask):

"""preprocess the semgentation mask

The original segmentation mask has three categories.

foreground, background and outline
This function will just convert it to foreground and background

The original segmentation mask is also 1-index, so will convert it

to 0-index.

the original mask is represented as:

1 - edge of dog/cat and things like leashes etc.
2 - background
3 - foreground

we want to just keep the merge the edges and foreground of the doggo/catto, and
then treat it as a binary semantic segmentation task.
To achieve this, we will just subtract two, converting to values of [-1, 0, 1],
and then apply the abs function to convert the -1 values (edges) to the foreground.

Will also convert it to 32 bit float which will be needed for working with tf.

Why am I doing it this way?

A reasonable question. Initially I tried to do it with just normal array indexing,
but this is a bit more work since the mask is a tensorflow tensor and not a np array.
We could alternatively convert it to an array, perform indexing and then map it back,
but this would have a performance overhead, which wouldn't be a big deal, but still.
With all that being said, I am doing it for you, so you don't have to.

Args:
segmentation_mask (array):
original segmentation mask
Returns:
preprocessed segmentation_mask
"""
return tf.abs(tf.cast(segmentation_mask, tf.float32) - 2)

def return_image_label_mask(ds_out):
""" function to return image, class label and segmentation mask

The original dataset contains additional information, such as the filename and
the species. We don't care about any of that for this work, so will
discard them and just keep the original image as our input, and then
a tuple of our outputs that will be the class label and the semantic
segmentation mask.

Whilst we are here, we will also preprocess the segmentation mask.

Args:
ds_out: dict
original dataset output

Returns:
RGB image
tuple of class label and preprocessed segmentation mask
"""
# preprocess the segmentation mask
seg_mask = preprocess_segmentation_mask(ds_out['segmentation_mask'])
image = tf.cast(ds_out['image'], tf.float32)
# image = standardise_image(image)
return image, (ds_out['label'], seg_mask)

def mobilenet_preprocess_image(image):
"""Apply preprocessing that is suitable for MobileNetV3.

Simply scales to ranges [-1, 1]

you should use this preprocessing for both your model and the mobilenet model
"""
image = (image - 127.5) / 255.0
return image

def unprocess_image(image):
""" undo preprocessing above so can plot images"""
image = image * 255.0 + 127.5
return image

def preprocess_and_resize(image, output, image_size):

"""apply preprocessing steps above to images and resize images and maps

Each image in the dataset is of a different size. The resizing will make sure
each image is the same size.
"""
# resize the image and the semantic segmentation mask
image = tf.image.resize(image, [image_size, image_size])
image = mobilenet_preprocess_image(image)
mask = tf.image.resize(output[1], [image_size, image_size])
return image, (output[0], mask)

def flip_lr_augmentation(image, output, flip_lr_prob):

""" function to return perform left-right flip augmentation

The function will flip the image along the left-right axis with
a defined probability.
"""

# randomly sample a value between 0 and 1

uniform_sample = tf.random.uniform([], minval=0, maxval=1)
# perform flip_lr with probability given by flip_lr_prob
flip_lr_cond = tf.math.less(uniform_sample, flip_lr_prob)
# output is a tuple of (class, segmentation_mask), pull out the segmentation mask
seg = output[1]

# wrapper fn for when we do the flip

def flip():
flipped_image = tf.image.flip_left_right(image)
flipped_seg = tf.image.flip_left_right(seg)
return flipped_image, flipped_seg

# wrapper fn for when we do NOT flip

def no_flip():
return image, seg

# apply augmentation
image, seg = tf.cond(flip_lr_cond, flip, no_flip)
# return the image, and output
return image, (output[0], seg)

def select_tasks(image, output, classification=True, segmentation=True):

"""select the tasks to include the data

By default for each input there are two outputs. This function allows
you to select which outputs to use, so the problem can be reduced to a
single task problem for initial experimenting.
"""
# both tasks
if classification and segmentation:
return image, output
# just classification
elif classification:
return image, output[0]
# just segmentation
elif segmentation:
return image, output[1]
# neither task, doesn't really make sense, so return the image
# for a self-supervised task
else:
return image, image

class TrainForTime(keras.callbacks.Callback):
"""callback to terminate training after a time limit is reached

Can be used to control how long training runs for, and will terminate
training once a specified time limit is reached.
"""
def __init__(
self,
train_time_mins=15,
):
super().__init__()

self.train_time_mins = train_time_mins
self.epochs = 0
self.train_time = 0
self.end_early = False

def on_train_begin(self, logs=None):

# save the start time
self.start_time = tf.timestamp()

def on_epoch_end(self, epoch, logs=None):

self.epochs += 1
current_time = tf.timestamp()
training_time = (current_time - self.start_time)
if (training_time / 60) > self.train_time_mins:
self.train_time = current_time - self.start_time
self.model.stop_training = True
self.end_early = True

def on_train_end(self, logs=None):

if self.end_early:
print('training time exceeded and ending early')
print(f'training ended on epoch {self.epochs}')
print(f'training time = {self.train_time / 60} mins')

Visualizing Data Augmentation (Before vs After)

In [3]: import matplotlib.pyplot as plt

# Load one raw example from Oxford-IIIT Pets dataset

raw_data = tfds.load("oxford_iiit_pet", split="train[:1]", as_supervised=False)
raw_sample = next(iter(raw_data))

# Preprocess image, label, and mask

image, (label, mask) = return_image_label_mask(raw_sample)
image_size = 160 # Use your actual image size here
image, (label, mask) = preprocess_and_resize(image, (label, mask), image_size)

# Apply horizontal flip augmentation (always flip for demo)

aug_image, (aug_label, aug_mask) = flip_lr_augmentation(image, (label, mask), flip_lr_prob=1.0)

# Undo preprocessing for visualization

image_vis = unprocess_image(image).numpy().astype("uint8")
aug_image_vis = unprocess_image(aug_image).numpy().astype("uint8")

# Plot side-by-side
plt.figure(figsize=(10, 4))
plt.subplot(1, 2, 1)
plt.imshow(image_vis)
plt.title("Original Image")
plt.axis('off')

plt.subplot(1, 2, 2)
plt.imshow(aug_image_vis)
plt.title("Horizontally Flipped Image")
plt.axis('off')
plt.show()

Data Loader
We will now put the above functions together into a data loader that we can use to feed directly to the network. You can you this directly as it is.
However, you may modify it to add some additional functionality such as further data augmentations.

In [5]: def load_oxford_pets(split,

batch_size=233,
classification=True,
segmentation=True,
shuffle=True,
augment=True,
image_size=300):
"""Load Oxford pets dataset for Assignment 1B

Function handles loading of data for 1b, included processing of images and
semantic segmentation masks. This function will
organise the tensorflow dataset to return an output that is a tuple, where
the tuple will be (classification_labels, segmentation_masks).

Parameters
----------
split : string
either train, val or test string
classification : bool
whether to include classification labels
segmentation : bool
whether to include semantic segmentation masks
batch_size : int
size of batches to use
shuffle : bool
whether to shuffle the dataset (WILL ONLY APPLY TO TRAIN)
augment : bool
whether to augment the dataset (WILL ONLY APPLY TO TRAIN)
image_size : int
new image size

Returns
-------
tf.Dataset containing the Oxford pets dataset
"""
# lets do some error checking first
# Check fior a valid dataset split, this must be train or test
if (split != 'train') and (split != 'val') and (split != 'test'):
raise ValueError('Arg for split must be either \'train\' or \'test\'')
if (not classification) and (not segmentation):
print("WARNING: One of the tasks (classification and segmentation) must be selected")
print("Setting both to enabled")
classification = True
segmentation = True

# check that if using the val split, shuffle if false. If not, print a warning and force shuffle to be false
if (split == 'val') and shuffle:
print("WARNING: shuffle is set to true, but have specified split to be \'val\'")
print('The shuffle argument will be ignored')
shuffle = False

# check that if using the test split, shuffle if false. If not, print a warning and force shuffle to be false
if (split == 'test') and shuffle:
print("WARNING: shuffle is set to true, but have specified split to be \'test\'")
print('The shuffle argument will be ignored')
shuffle = False

# check that if using the val split, augment if false. If not, print a warning and force augment to be false
if (split == 'val') and augment:
print("WARNING: augment is set to true, but have specified split to be \'val\'")
print('The augment argument will be ignored')
augment = False

# check that if using the test split, augment if false. If not, print a warning and force augment to be false
if (split == 'test') and augment:
print("WARNING: augment is set to true, but have specified split to be \'test\'")
print('The augment argument will be ignored')
augment = False

# the dataset by default only has train and test splits. If val is requested, pull the first 30% of the test set
if (split == 'val'):
split = 'test[:30%]'
# the test set then becomes the remaining 70% of the original test set
elif (split == 'test'):
split = 'test[30%:]'

# now start loading the dataset

ds = tfds.load('oxford_iiit_pet',
split=split,
with_info=False)

# remove unnecessary dataset info

ds = ds.map(return_image_label_mask)

# augmentation
# only apply if in the training split and augment has been set to True
if split == 'train' and augment:
# apply a left-right flip with 50% probability
flip_lr_prob = 0.5
# flip operation
ds = ds.map(lambda inp, out: flip_lr_augmentation(inp, out, flip_lr_prob), num_parallel_calls=tf.data.AUTOTUNE)

# more augmentation operations could go here .....

# Final processing of the data

# here we will resize the data, and add the preprocessing that is needed for compatability with the mobilenet models.
ds = ds.map(lambda inp, out: preprocess_and_resize(inp, out, image_size))

# and now remove any tasks that we don't want. Note that we call this last as it means that all the other functions
# can safely assume that data for both tasks is in the dataset
ds = ds.map(lambda inp, out: select_tasks(inp, out, classification, segmentation))

# if in the training split, and shuffle is true, shuffle the data

if split == 'train' and shuffle:
ds = ds.shuffle(1000)

# return the loaded and processed dataset

return ds.batch(batch_size).prefetch(tf.data.AUTOTUNE)

Testing the provided data loader.

We'll now test the data loader and plot some examples to confirm it's working. NOTE: some poor defaults are specified below for image size
and batch size. Set these to something more appropriate.

In [4]: # testing the data loader and plotting some images.

# NOTE: the image size set here is all but definitely too large. You will need
# to chage this yourself to something that is suitable given your constraints
# NOTE: The batch size is also too large. This done on purpose force you to
# pick a suitable batch size yourself
image_size = 300
batch_size = 273
# load training data, note that shuffle and augment are true
train_class_seg = load_oxford_pets('train', classification=True, segmentation=True, shuffle=True, augment=True,
batch_size=batch_size, image_size=image_size)
# load validation data, note that shuffle and augment are false (though if they weren't, the data loader would force these to be
val_class_seg = load_oxford_pets('val', classification=True, segmentation=True, shuffle=False, augment=False,
batch_size=batch_size, image_size=image_size)
# load testing data, note that shuffle and augment are false (though if they weren't, the data loader would force these to be fal
test_class_seg = load_oxford_pets('test', classification=True, segmentation=True, shuffle=False, augment=False,
batch_size=batch_size, image_size=image_size)

In [5]: # lets plot a few now to see some good kittens/doggos

fig, axs = plt.subplots(2, 3, figsize=(8, 4), layout="constrained")
num_plot = 3
i = 0

# each sample of our dataset will be of the format

# image, outputs
# where outputs[0] = label
# outputs[1] = segmentation mask
#
#
# lets get a single batch, and plot just a few of them
for image, output in train_class_seg.take(1).as_numpy_iterator():
for i in range(num_plot):
im = axs[0, i].imshow(np.squeeze(unprocess_image(image[i, ...])) / 255.0)
axs[0, i].set_title(output[0][i])
axs[0, i].axis('off')
im = axs[1, i].imshow(np.squeeze(output[1][i, ...]))
axs[1, i].axis('off')

print(output[1].shape)
i += 1
if i >= num_plot:
break

plt.savefig('doggos_cattos.png')

(273, 300, 300, 1)

(273, 300, 300, 1)
(273, 300, 300, 1)
The images are showing correctly.

NOTE: You can ignore the JPEG wearning.

We can use the classification and segmentation flags to pull out just one output as well, as the below demonstrates.

In [6]: # classification only; classification = True, segmentation = False (note batch size is 1 here)
train_class_only = load_oxford_pets('train', classification=True, segmentation=False, shuffle=True, augment=True, batch_size=1, i
# segmentation only; classification = False, segmentation = True (note batch size is 1 here)
train_seg_only = load_oxford_pets('train', classification=False, segmentation=True, shuffle=True, augment=True, batch_size=1, ima

# test the classification only dataset

# pull out one element
inp, out = next(iter(train_class_only))
# print the output
print(out.numpy())
# test the segmentation only dataset
# pull out one element
inp, out = next(iter(train_seg_only))
# print just the output shape for the segmentation output
print(out.numpy().shape)

[29]
(1, 300, 300, 1)

While for the question you do need to train networks to do both tasks simultaenously, when you starting playing with the problem it might be
easier to get things working for one task, and then add the second.

Loading MobileNetV3Small base for fine tuning

This model can be loaded directly from keras. By default, the model we download will be pre-trained on Imagenet dataset.

Note that we will need to set the preprocessing option when loading this base network to False. This is because the include_preprocessing
step is implemented in the Datasets we defined above.

We also set include_top=False , to avoid loading our model with the final Dense classification layer which is used for the original Imagenet
model.

More details are available in the keras documentation here.

In [7]: mobile_base = keras.applications.MobileNetV3Small(input_shape=(image_size, image_size, 3),

include_top=False,
include_preprocessing=False)

C:\Users\acer\anaconda3\lib\site-packages\keras\src\applications\mobilenet_v3.py:452: UserWarning: `input_shape` is undefined or

non-square, or `rows` is not 224. Weights for input shape (224, 224) will be loaded as the default.
return MobileNetV3(

For this task, can ignore the input_shape warning, though it is important to keep in mind the difference in size of data used for the pre-trained
model and our data may have an impact on our model (what that impact might be is for you to investigate :) ). Depending on what input shape
you select you may also be able to eliminate this.

For more information on fine-tuning models, can refer to many of the examples from class, or the Keras documentation
Data Loading
This section loads the Oxford-IIIT Pets dataset using the provided load_oxford_pets function. The batch size and image size are set
appropriately. Augmentation is turned on for training, and off for validation/testing. Additional augmentations have been added to improve
generalization.

In [7]: # Set image and batch size

image_size = 160
batch_size = 32

# Load training and test datasets

train_ds = load_oxford_pets(
split='train',
image_size=image_size,
batch_size=batch_size,
classification=True,
segmentation=True,
shuffle=True,
augment=True
)

test_ds = load_oxford_pets(
split='test',
image_size=image_size,
batch_size=batch_size,
classification=True,
segmentation=True,
shuffle=False,
augment=False
)

Fine-Tuned MobileNet Model

This model uses a pre-trained MobileNetV3Small as the backbone (frozen initially), with the same classification and segmentation heads. This
leverages transfer learning to improve performance with limited training data.

In [8]: from tensorflow.keras import layers, Model

def build_scratch_model(input_shape=(image_size, image_size, 3), num_classes=37):

inputs = layers.Input(shape=input_shape)

x = layers.Conv2D(32, 3, activation='relu', padding='same')(inputs)

x = layers.MaxPooling2D()(x)
x = layers.Conv2D(64, 3, activation='relu', padding='same')(x)
x = layers.MaxPooling2D()(x)
x = layers.Conv2D(128, 3, activation='relu', padding='same')(x)
x = layers.MaxPooling2D()(x)

# Classification branch
class_branch = layers.GlobalAveragePooling2D()(x)
class_output = layers.Dense(num_classes, activation='softmax', name='classification')(class_branch)

# Segmentation branch
# Segmentation branch — upsample to match 160x160
seg_branch = layers.Conv2D(128, 3, activation='relu', padding='same')(x) # assume x is ~20x20
seg_branch = layers.UpSampling2D(2)(seg_branch) # ~40x40
seg_branch = layers.Conv2D(64, 3, activation='relu', padding='same')(seg_branch)
seg_branch = layers.UpSampling2D(2)(seg_branch) # ~80x80
seg_branch = layers.Conv2D(32, 3, activation='relu', padding='same')(seg_branch)
seg_branch = layers.UpSampling2D(2)(seg_branch) # ~160x160
seg_branch = layers.Conv2D(3, 1, activation='softmax', name='segmentation')(seg_branch) # Output: (160, 160, 3)

model = Model(inputs=inputs, outputs=[class_output, seg_branch])

return model

scratch_model = build_scratch_model()
scratch_model.compile(
optimizer='adam',
loss={
'classification': 'sparse_categorical_crossentropy',
'segmentation': 'sparse_categorical_crossentropy' # <-- change this line
},
metrics={
'classification': 'accuracy',
'segmentation': 'accuracy'
}
)

# Train
scratch_model.fit(train_ds, epochs=5, validation_data=test_ds)
Epoch 1/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 186s 2s/step - classification_accuracy: 0.0231 - classification_loss: 3.6208 - loss: 4.3136 - segme
ntation_accuracy: 0.5973 - segmentation_loss: 0.6928 - val_classification_accuracy: 0.0327 - val_classification_loss: 3.6073 - v
al_loss: 4.1386 - val_segmentation_accuracy: 0.7455 - val_segmentation_loss: 0.5307
Epoch 2/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 182s 2s/step - classification_accuracy: 0.0354 - classification_loss: 3.6037 - loss: 4.0832 - segme
ntation_accuracy: 0.7666 - segmentation_loss: 0.4795 - val_classification_accuracy: 0.0370 - val_classification_loss: 3.5980 - v
al_loss: 4.0159 - val_segmentation_accuracy: 0.8129 - val_segmentation_loss: 0.4170
Epoch 3/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 168s 1s/step - classification_accuracy: 0.0382 - classification_loss: 3.5884 - loss: 4.0221 - segme
ntation_accuracy: 0.7975 - segmentation_loss: 0.4336 - val_classification_accuracy: 0.0323 - val_classification_loss: 3.5831 - v
al_loss: 3.9780 - val_segmentation_accuracy: 0.8206 - val_segmentation_loss: 0.3932
Epoch 4/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 172s 1s/step - classification_accuracy: 0.0617 - classification_loss: 3.5292 - loss: 3.9158 - segme
ntation_accuracy: 0.8217 - segmentation_loss: 0.3866 - val_classification_accuracy: 0.0572 - val_classification_loss: 3.5329 - v
al_loss: 3.9430 - val_segmentation_accuracy: 0.8131 - val_segmentation_loss: 0.4088
<keras.src.callbacks.history.History at 0x2ca959a72e0>
Out[8]:

In [14]: import numpy as np

from sklearn.metrics import classification_report, f1_score, jaccard_score

# Extract predictions
y_true_cls = []
y_pred_cls = []
y_true_seg = []
y_pred_seg = []

for images, (labels_cls, labels_seg) in test_ds:

preds_cls, preds_seg = scratch_model.predict(images)

# Classification
y_true_cls.extend(labels_cls.numpy())
y_pred_cls.extend(np.argmax(preds_cls, axis=-1))

# Segmentation
labels_seg_np = labels_seg.numpy().reshape(-1).astype(int)
preds_seg_np = np.argmax(preds_seg, axis=-1).reshape(-1).astype(int)

y_true_seg.extend(labels_seg_np.tolist())
y_pred_seg.extend(preds_seg_np.tolist())

# Classification evaluation
print("Classification Report (Scratch Model):")
print(classification_report(y_true_cls, y_pred_cls))

# Segmentation evaluation
print("Segmentation IoU:", jaccard_score(y_true_seg, y_pred_seg, average='macro', zero_division=0))
print("Segmentation F1 Score:", f1_score(y_true_seg, y_pred_seg, average='macro', zero_division=0))
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 475ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 413ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 488ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 425ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 378ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 423ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 385ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 383ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 397ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 436ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 375ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 351ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 338ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 358ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 381ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 471ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 443ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 559ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 614ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 517ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 408ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 416ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 567ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 457ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 439ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 536ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 487ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 506ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 487ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 864ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 676ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 481ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 405ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 498ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 574ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 454ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 429ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 457ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 559ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 497ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 507ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 454ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 419ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 489ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 419ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 569ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 747ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 594ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 718ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 739ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 686ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 574ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 514ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 577ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 627ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 524ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 591ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 756ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 590ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 701ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 643ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 515ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 505ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 494ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 647ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 893ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 667ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 1s/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 1s/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 636ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 561ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 704ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 657ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 746ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 633ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 681ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 625ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 723ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 648ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 1s 610ms/step
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 311ms/step
Classification Report (Scratch Model):
precision recall f1-score support

0 0.20 0.01 0.03 69

1 0.00 0.00 0.00 66
2 0.00 0.00 0.00 70
3 0.04 0.14 0.06 65
4 0.00 0.00 0.00 75
5 0.00 0.00 0.00 67
6 0.00 0.00 0.00 70
7 0.16 0.05 0.08 60
8 0.00 0.00 0.00 67
9 0.00 0.00 0.00 64
10 0.06 0.05 0.05 66
11 1.00 0.03 0.06 68
12 0.03 0.03 0.03 73
13 0.08 0.06 0.07 72
14 0.06 0.22 0.09 76
15 0.00 0.00 0.00 64
16 0.10 0.07 0.08 67
17 0.06 0.27 0.10 70
18 0.00 0.00 0.00 71
19 0.08 0.11 0.09 70
20 0.00 0.00 0.00 66
21 0.00 0.00 0.00 64
22 0.43 0.04 0.08 70
23 0.06 0.53 0.10 77
24 0.00 0.00 0.00 73
25 0.00 0.00 0.00 66
26 0.09 0.26 0.13 73
27 0.02 0.01 0.02 71
28 0.00 0.00 0.00 70
29 0.03 0.03 0.03 75
30 0.00 0.00 0.00 68
31 0.00 0.00 0.00 75
32 0.00 0.00 0.00 73
33 0.00 0.00 0.00 67
34 0.02 0.03 0.03 63
35 0.00 0.00 0.00 78
36 0.04 0.09 0.05 69

accuracy 0.06 2568

macro avg 0.07 0.06 0.03 2568
weighted avg 0.07 0.06 0.03 2568
C:\Users\acer\anaconda3\lib\site-packages\sklearn\metrics\_classification.py:1344: UndefinedMetricWarning: Precision and F-score
are ill-defined and being set to 0.0 in labels with no predicted samples. Use `zero_division` parameter to control this behavio
r.
_warn_prf(average, modifier, msg_start, len(result))
C:\Users\acer\anaconda3\lib\site-packages\sklearn\metrics\_classification.py:1344: UndefinedMetricWarning: Precision and F-score
are ill-defined and being set to 0.0 in labels with no predicted samples. Use `zero_division` parameter to control this behavio
r.
_warn_prf(average, modifier, msg_start, len(result))
C:\Users\acer\anaconda3\lib\site-packages\sklearn\metrics\_classification.py:1344: UndefinedMetricWarning: Precision and F-score
are ill-defined and being set to 0.0 in labels with no predicted samples. Use `zero_division` parameter to control this behavio
r.
_warn_prf(average, modifier, msg_start, len(result))
Segmentation IoU: 0.6883749694633189
Segmentation F1 Score: 0.8152755864263785

In [18]: from tensorflow.keras import layers, Model

def build_mobilenet_model(input_shape=(160, 160, 3), num_classes=37):

base_model = keras.applications.MobileNetV3Small(
input_shape=input_shape,
include_top=False,
weights='imagenet'
)
base_model.trainable = False

inputs = layers.Input(shape=input_shape)
x = base_model(inputs)

# Classification head
class_branch = layers.GlobalAveragePooling2D()(x)
class_output = layers.Dense(num_classes, activation='softmax', name='classification')(class_branch)

# Segmentation head — upsample from ~5x5 to 160x160

seg_branch = layers.Conv2D(256, 3, activation='relu', padding='same')(x) # ~5x5
seg_branch = layers.UpSampling2D(2)(seg_branch) # 5x5 → 10x10
seg_branch = layers.Conv2D(128, 3, activation='relu', padding='same')(seg_branch)
seg_branch = layers.UpSampling2D(2)(seg_branch) # 10x10 → 20x20
seg_branch = layers.Conv2D(64, 3, activation='relu', padding='same')(seg_branch)
seg_branch = layers.UpSampling2D(2)(seg_branch) # 20x20 → 40x40
seg_branch = layers.Conv2D(32, 3, activation='relu', padding='same')(seg_branch)
seg_branch = layers.UpSampling2D(2)(seg_branch) # 40x40 → 80x80
seg_branch = layers.Conv2D(16, 3, activation='relu', padding='same')(seg_branch)
seg_branch = layers.UpSampling2D(2)(seg_branch) # 80x80 → 160x160
seg_branch = layers.Conv2D(3, 1, activation='softmax', name='segmentation')(seg_branch)

model = Model(inputs=inputs, outputs=[class_output, seg_branch])

return model

mobilenet_model = build_mobilenet_model()
mobilenet_model.compile(
optimizer='adam',
loss={
'classification': 'sparse_categorical_crossentropy',
'segmentation': 'sparse_categorical_crossentropy' # use sparse loss for integer masks
},
metrics={
'classification': 'accuracy',
'segmentation': 'accuracy'
}
)

mobilenet_model.fit(train_ds, epochs=5, validation_data=test_ds)

Epoch 1/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 135s 615ms/step - classification_accuracy: 0.0241 - classification_loss: 3.6584 - loss: 4.2824 - se
gmentation_accuracy: 0.6977 - segmentation_loss: 0.6239 - val_classification_accuracy: 0.0296 - val_classification_loss: 3.6206
- val_loss: 4.1400 - val_segmentation_accuracy: 0.7406 - val_segmentation_loss: 0.5193
Epoch 2/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 81s 689ms/step - classification_accuracy: 0.0303 - classification_loss: 3.6310 - loss: 4.1411 - seg
mentation_accuracy: 0.7461 - segmentation_loss: 0.5100 - val_classification_accuracy: 0.0280 - val_classification_loss: 3.6204 -
val_loss: 4.1256 - val_segmentation_accuracy: 0.7495 - val_segmentation_loss: 0.5050
Epoch 3/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 70s 599ms/step - classification_accuracy: 0.0245 - classification_loss: 3.6240 - loss: 4.1266 - seg
mentation_accuracy: 0.7504 - segmentation_loss: 0.5025 - val_classification_accuracy: 0.0288 - val_classification_loss: 3.6132 -
val_loss: 4.1158 - val_segmentation_accuracy: 0.7513 - val_segmentation_loss: 0.5025
Epoch 4/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 73s 618ms/step - classification_accuracy: 0.0273 - classification_loss: 3.6180 - loss: 4.1163 - seg
mentation_accuracy: 0.7545 - segmentation_loss: 0.4983 - val_classification_accuracy: 0.0374 - val_classification_loss: 3.6082 -
val_loss: 4.0899 - val_segmentation_accuracy: 0.7640 - val_segmentation_loss: 0.4818
Epoch 5/5
115/115 ━━━━━━━━━━━━━━━━━━━━ 76s 655ms/step - classification_accuracy: 0.0326 - classification_loss: 3.6134 - loss: 4.1033 - seg
mentation_accuracy: 0.7594 - segmentation_loss: 0.4898 - val_classification_accuracy: 0.0327 - val_classification_loss: 3.5969 -
val_loss: 4.1494 - val_segmentation_accuracy: 0.7229 - val_segmentation_loss: 0.5536
<keras.src.callbacks.history.History at 0x168468d5a20>
Out[18]:
In [19]: # Evaluate scratch model
scratch_eval = scratch_model.evaluate(test_ds)
print("Scratch model evaluation:", scratch_eval)

# Evaluate mobilenet model

mobilenet_eval = mobilenet_model.evaluate(test_ds)
print("MobileNet model evaluation:", mobilenet_eval)

81/81 ━━━━━━━━━━━━━━━━━━━━ 26s 321ms/step - classification_accuracy: 0.0631 - classification_loss: 3.5424 - loss: 3.9710 - segme
ntation_accuracy: 0.7992 - segmentation_loss: 0.4286
Scratch model evaluation: [3.9609272480010986, 3.531313180923462, 0.4281884729862213, 0.06542056053876877, 0.7993440628051758]
81/81 ━━━━━━━━━━━━━━━━━━━━ 19s 236ms/step - classification_accuracy: 0.0323 - classification_loss: 3.5896 - loss: 4.1431 - segme
ntation_accuracy: 0.7223 - segmentation_loss: 0.5535
MobileNet model evaluation: [4.149447441101074, 3.596855640411377, 0.5536401271820068, 0.032710280269384384, 0.7228771448135376]

In [12]: # For your write-up, you should include:

# - A discussion of what pre-processing (i.e. resizing, colour conversion, augmentation, etc.) you apply to the data and why.
# - Details of two implemented methods. This should include a details of the final “from-scratch” approach and justification
# for the chosen design, and details of changes made to MobileNetV3Small for the “fine-tuned” approach. Details on how the
# models are trained are also to be provided.
# - An evaluation that compares the two models for the two tasks (classification and semantic segmentation). Your evaluation
# should discuss overall model performance, how it differs between the two approaches, and include figures if/where necessary.
# - A discussion of methods that were explored to improve performance for both models and mitigate identified issues, and potenti
# other methods that were considered but not implemented due to computational constraints. See the assignment brief for further
# details.
# Your write-up should be supported by appropriate figures and tables. Figures and tables should have numbers and meaningful capt
# Note that figures and tables are not included in the page limits.
#
# SEE THE ASSIGNMENT BRIEF ON CANVAS FOR MORE DETAILS AND NOTE THAT A NOTEBOOK FILE DOES NOT CONSTITUTE A VALID SUBMISSION.
# YOU SHOULD WRITE UP YOUR RESPONSE IN A SEPARATE DOCUMENT

Deep Learning Lab Manual
No ratings yet
Deep Learning Lab Manual
88 pages
C2 W1 Assignment
No ratings yet
C2 W1 Assignment
25 pages
Tensorflow 2 - 0 Slides PDF
No ratings yet
Tensorflow 2 - 0 Slides PDF
100 pages
Maher SEBAI Internship Presentation
No ratings yet
Maher SEBAI Internship Presentation
94 pages
Csc413 Project Semantic Segmentation
No ratings yet
Csc413 Project Semantic Segmentation
84 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
UNIT II - PPT - Part 1
No ratings yet
UNIT II - PPT - Part 1
41 pages
Building Comprehension Through Explicit Teaching of Comprehension Strategies
No ratings yet
Building Comprehension Through Explicit Teaching of Comprehension Strategies
27 pages
Group Discussion Evaluation Sheet YUVA
100% (3)
Group Discussion Evaluation Sheet YUVA
4 pages
Deeplearning Lab Manual
No ratings yet
Deeplearning Lab Manual
29 pages
C3W2 - Assignment - Ipynb - Colaboratory
No ratings yet
C3W2 - Assignment - Ipynb - Colaboratory
39 pages
Crash Course On Tensorflow!: Vincent Lepetit!
No ratings yet
Crash Course On Tensorflow!: Vincent Lepetit!
63 pages
Task1 Lakshya - Ipynb - Colab
No ratings yet
Task1 Lakshya - Ipynb - Colab
33 pages
Principles and Practice of Pedodontics 2nd Edition by Arathi Rao ISBN 8184483457 9788184483451 PDF Download
No ratings yet
Principles and Practice of Pedodontics 2nd Edition by Arathi Rao ISBN 8184483457 9788184483451 PDF Download
83 pages
Project Guidelines - AIML
No ratings yet
Project Guidelines - AIML
30 pages
Handwriting Recognition
No ratings yet
Handwriting Recognition
31 pages
Work Immersion Pertinent Papers
No ratings yet
Work Immersion Pertinent Papers
19 pages
Lab Manual
No ratings yet
Lab Manual
45 pages
TF Mannual
No ratings yet
TF Mannual
19 pages
TMA01 Question 1 (45 Marks)
No ratings yet
TMA01 Question 1 (45 Marks)
31 pages
CD 601 Lab Manual
No ratings yet
CD 601 Lab Manual
61 pages
Tensorflow
No ratings yet
Tensorflow
22 pages
C2 W1 Assignment
No ratings yet
C2 W1 Assignment
24 pages
C2 W1 Assignment
No ratings yet
C2 W1 Assignment
24 pages
Trainrealfill
No ratings yet
Trainrealfill
19 pages
Getting Started - TensorFlow
0% (1)
Getting Started - TensorFlow
14 pages
TensorFlow Tutorial
No ratings yet
TensorFlow Tutorial
65 pages
Cancer Peau
No ratings yet
Cancer Peau
23 pages
Main Python Code
No ratings yet
Main Python Code
31 pages
TMA01 Question 2 (55 Marks)
No ratings yet
TMA01 Question 2 (55 Marks)
26 pages
02 - Lecture Note - TensorFlow Ops
No ratings yet
02 - Lecture Note - TensorFlow Ops
21 pages
Notebook - Tensorflow Keras
No ratings yet
Notebook - Tensorflow Keras
25 pages
Code - Edge Impulse
No ratings yet
Code - Edge Impulse
13 pages
Curriculum of Israel
No ratings yet
Curriculum of Israel
61 pages
Tensorflow, Keras and Deep Learning
No ratings yet
Tensorflow, Keras and Deep Learning
51 pages
Tensorflow 2 Tutorial PDF
100% (4)
Tensorflow 2 Tutorial PDF
66 pages
Unit III
No ratings yet
Unit III
28 pages
21BCP167 Ai 9
No ratings yet
21BCP167 Ai 9
10 pages
Tensorflow From Scratch - 01
No ratings yet
Tensorflow From Scratch - 01
16 pages
Explore The Implementation of CNNs in Python
No ratings yet
Explore The Implementation of CNNs in Python
10 pages
Srafvana
No ratings yet
Srafvana
6 pages
Wa0029.
No ratings yet
Wa0029.
11 pages
CVDL Tae 63
No ratings yet
CVDL Tae 63
9 pages
Dcgan
No ratings yet
Dcgan
9 pages
Pytorch Waste Classification Using Densenet Jupyter Notebook
No ratings yet
Pytorch Waste Classification Using Densenet Jupyter Notebook
14 pages
FALLSEM2024-25 BCSE332P LO VL2024250102168 2024-09-09 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE332P LO VL2024250102168 2024-09-09 Reference-Material-I
9 pages
Not F: # Check If The File Exists
No ratings yet
Not F: # Check If The File Exists
7 pages
In This Hands-On You Will Be Performing CNN Operations Using Tensorflow Package
No ratings yet
In This Hands-On You Will Be Performing CNN Operations Using Tensorflow Package
6 pages
Interpreting SNT TC 1a - Part7
No ratings yet
Interpreting SNT TC 1a - Part7
2 pages
CNN TF Keras
No ratings yet
CNN TF Keras
6 pages
Image Classification Code
No ratings yet
Image Classification Code
4 pages
Pre-Trained Models: Objectives
No ratings yet
Pre-Trained Models: Objectives
12 pages
Dinushasan Courseproject04: Sign in
No ratings yet
Dinushasan Courseproject04: Sign in
19 pages
Lab 4-Image Segmentation Using U-Net
No ratings yet
Lab 4-Image Segmentation Using U-Net
9 pages
Apex For Bres 1
No ratings yet
Apex For Bres 1
6 pages
Tensorflow Neural Network Lab: Notmnist
No ratings yet
Tensorflow Neural Network Lab: Notmnist
15 pages
Speech Writing Techniques
No ratings yet
Speech Writing Techniques
1 page
Convolutional Neural Networks: Objectives
No ratings yet
Convolutional Neural Networks: Objectives
10 pages
Chapter02 Mathematical-Building-Blocks
No ratings yet
Chapter02 Mathematical-Building-Blocks
9 pages
MLG Tensor
No ratings yet
MLG Tensor
34 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Questionq and Answers 2024
No ratings yet
Questionq and Answers 2024
15 pages
PedagogySyllabus F11
No ratings yet
PedagogySyllabus F11
3 pages
Task VIII Quantum Vision Transformer
No ratings yet
Task VIII Quantum Vision Transformer
1 page
Cartography
No ratings yet
Cartography
3 pages
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
Tensor Flow 2
No ratings yet
Tensor Flow 2
3 pages
AlexNet Transfer Learning - Ipynb
No ratings yet
AlexNet Transfer Learning - Ipynb
5 pages
Mnist
No ratings yet
Mnist
3 pages
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet
CV
No ratings yet
CV
3 pages
CLASS X (2020-21) Mathematics Basic (241) Sample Paper-1
No ratings yet
CLASS X (2020-21) Mathematics Basic (241) Sample Paper-1
7 pages
Ielts
No ratings yet
Ielts
4 pages
Sample Business Plan For Starting A Nursery School
100% (1)
Sample Business Plan For Starting A Nursery School
13 pages
Senior Consultant - Management Consulting-1
No ratings yet
Senior Consultant - Management Consulting-1
1 page
Admit Card
No ratings yet
Admit Card
2 pages
Kalita & Deka (2024)
No ratings yet
Kalita & Deka (2024)
6 pages
Math Small Group Lesson Plan
No ratings yet
Math Small Group Lesson Plan
5 pages
Work Immersion Portfolio
No ratings yet
Work Immersion Portfolio
41 pages
King of Happy
No ratings yet
King of Happy
5 pages
A Special Supplement: The Question of Machiavelli - by Isaiah Berlin - The New York Review of Books
No ratings yet
A Special Supplement: The Question of Machiavelli - by Isaiah Berlin - The New York Review of Books
2 pages
Academic Excellence Award: Suianne B. Dorde
No ratings yet
Academic Excellence Award: Suianne B. Dorde
19 pages
Learning Kit para Sa Mga Bulilit
No ratings yet
Learning Kit para Sa Mga Bulilit
3 pages
Vocabulary Acquisition of A Four-Year-Old Child Through Piaget's Accommodation Theory
No ratings yet
Vocabulary Acquisition of A Four-Year-Old Child Through Piaget's Accommodation Theory
13 pages
Acp Presentation
No ratings yet
Acp Presentation
10 pages
Senior Java Developer
No ratings yet
Senior Java Developer
3 pages
Makesworth Accountants Scholarship For ACCA Students in Nepal 2024
No ratings yet
Makesworth Accountants Scholarship For ACCA Students in Nepal 2024
2 pages
First Grade: Newspaper Activity: Major Questions
No ratings yet
First Grade: Newspaper Activity: Major Questions
4 pages
English 10 Quarter 4 Lessons (Week1 - Week 2)
No ratings yet
English 10 Quarter 4 Lessons (Week1 - Week 2)
3 pages
STP Reflection
No ratings yet
STP Reflection
1 page

Finalised Question 2

Uploaded by

Finalised Question 2

Uploaded by

CAB420 Assignment 1B Question 2: Template

In [1]: !pip install --upgrade tensorflow_datasets

import matplotlib.pyplot as plt

In [2]: def preprocess_segmentation_mask(segmentation_mask):

The original segmentation mask has three categories.

The original segmentation mask is also 1-index, so will convert it

the original mask is represented as:

Why am I doing it this way?

Whilst we are here, we will also preprocess the segmentation mask.

Simply scales to ranges [-1, 1]

def preprocess_and_resize(image, output, image_size):

def flip_lr_augmentation(image, output, flip_lr_prob):

# randomly sample a value between 0 and 1

# wrapper fn for when we do the flip

# wrapper fn for when we do NOT flip

def select_tasks(image, output, classification=True, segmentation=True):

def on_train_begin(self, logs=None):

def on_epoch_end(self, epoch, logs=None):

def on_train_end(self, logs=None):

Visualizing Data Augmentation (Before vs After)

# Load one raw example from Oxford-IIIT Pets dataset

# Preprocess image, label, and mask

# Apply horizontal flip augmentation (always flip for demo)

# Undo preprocessing for visualization

In [5]: def load_oxford_pets(split,

# now start loading the dataset

# remove unnecessary dataset info

# more augmentation operations could go here .....

# Final processing of the data

# if in the training split, and shuffle is true, shuffle the data

# return the loaded and processed dataset

Testing the provided data loader.

In [4]: # testing the data loader and plotting some images.

In [5]: # lets plot a few now to see some good kittens/doggos

# each sample of our dataset will be of the format

(273, 300, 300, 1)

NOTE: You can ignore the JPEG wearning.

# test the classification only dataset

Loading MobileNetV3Small base for fine tuning

More details are available in the keras documentation here.

In [7]: mobile_base = keras.applications.MobileNetV3Small(input_shape=(image_size, image_size, 3),

C:\Users\acer\anaconda3\lib\site-packages\keras\src\applications\mobilenet_v3.py:452: UserWarning: `input_shape` is undefined or

In [7]: # Set image and batch size

# Load training and test datasets

Fine-Tuned MobileNet Model

In [8]: from tensorflow.keras import layers, Model

def build_scratch_model(input_shape=(image_size, image_size, 3), num_classes=37):

x = layers.Conv2D(32, 3, activation='relu', padding='same')(inputs)

model = Model(inputs=inputs, outputs=[class_output, seg_branch])

In [14]: import numpy as np

for images, (labels_cls, labels_seg) in test_ds:

0 0.20 0.01 0.03 69

accuracy 0.06 2568

In [18]: from tensorflow.keras import layers, Model

def build_mobilenet_model(input_shape=(160, 160, 3), num_classes=37):

# Segmentation head — upsample from ~5x5 to 160x160

model = Model(inputs=inputs, outputs=[class_output, seg_branch])

mobilenet_model.fit(train_ds, epochs=5, validation_data=test_ds)

# Evaluate mobilenet model

In [12]: # For your write-up, you should include:

You might also like