Capstone Speech
In this project, we will build a semantic segmentation model to identify the regions affected by colon cancer within 3D medical images. This model will help accurately pinpoint areas of concern, aiding medical professionals in diagnosis and treatment planning.
To improve model accuracy, we will employ transfer learning, a technique that leverages pre-trained models so the network starts from features learned on large datasets rather than from random weights.
While our ultimate goal is to segment 3D images, we will train and validate our model using 2D
images. These 2D images will be derived from the 3D medical scans by slicing them into individual 2D
sections.
Once trained, we will demonstrate the model's capability by performing segmentation on both 2D
slices and entire 3D images. This dual approach ensures the model's versatility and effectiveness in
real-world scenarios.
Code
import os

train_images_path = '/content/drive/MyDrive/Soumya S Moharana/Mock_Project/imagesTr'
train_image_files = [f for f in os.listdir(train_images_path)
                     if f.endswith('.nii.gz') and not f.startswith('._')]
print("Number of training images: {}".format(len(train_image_files)))

test_images_path = '/content/drive/MyDrive/Soumya S Moharana/Mock_Project/imagesTs'
test_image_files = [f for f in os.listdir(test_images_path)
                    if f.endswith('.nii.gz') and not f.startswith('._')]
print("Number of testing images: {}".format(len(test_image_files)))

train_labels_path = '/content/drive/MyDrive/Soumya S Moharana/Mock_Project/labelsTr'
train_labels_files = [f for f in os.listdir(train_labels_path)
                      if f.endswith('.nii.gz') and not f.startswith('._')]
print("Number of label files: {}".format(len(train_labels_files)))
The code scans specific directories (imagesTr, imagesTs, labelsTr) in Google Drive to collect the names
of files (images and labels) for training and testing.
It filters the files based on their extensions (.nii.gz) to ensure they are the correct type of medical
image files.
It counts and prints the number of files found in each directory. This provides an overview of the
dataset size, indicating there are 126 training images, 64 testing images, and 126 corresponding label
images.
Code
import os
import nibabel as nib
import numpy as np
import matplotlib.pyplot as plt

def display_nifti_slices(nifti_image_path, num_slices=5):
    # Load the NIfTI image and extract its data as a NumPy array
    nifti_image = nib.load(nifti_image_path)
    image_data = nifti_image.get_fdata()
    # Choose num_slices evenly spaced indices along the slice axis
    slice_indices = np.linspace(0, image_data.shape[2] - 1, num_slices, dtype=int)
    plt.figure(figsize=(15, 3))
    for i, slice_index in enumerate(slice_indices):
        plt.subplot(1, num_slices, i + 1)
        plt.imshow(image_data[:, :, slice_index])
        plt.title(f'Slice {slice_index}')
        plt.axis('off')
    plt.show()
The code defines a function display_nifti_slices that takes a path to a NIfTI image (nifti_image_path)
and an optional parameter num_slices (default is 5) that specifies the number of slices to display.
It loads the NIfTI image from the specified path using nib.load(nifti_image_path). This function
returns a NIfTI image object.
It extracts the image data as a NumPy array using nifti_image.get_fdata(). This step is crucial, as it converts the NIfTI image data into a format that can be easily manipulated and visualized using NumPy and Matplotlib.
It calculates the indices of the slices to display based on the number of slices requested (num_slices)
and the total number of slices in the image (image_data.shape[2]).
It then iterates over the selected slice indices, plots each slice using Matplotlib (plt.imshow), and
displays the slice number as the title. The plt.subplot function is used to arrange the slices
horizontally.
When the function is called with a path to a NIfTI image, it will display the specified number of slices
from the image, providing a visual representation of the image data.
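For example, it can be called on one of the training images (the specific file chosen here is just an illustration):
Code
display_nifti_slices(os.path.join(train_images_path, train_image_files[0]))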
Code
def display_all_nifti_slices(nifti_image_path):
    nifti_image = nib.load(nifti_image_path)
    image_data = nifti_image.get_fdata()
    num_slices = image_data.shape[2]
    print("Number of slices: {}".format(num_slices))
    # Grid layout: at most num_cols slices per row
    num_cols = 10
    num_rows = (num_slices // num_cols) + (1 if num_slices % num_cols != 0 else 0)
    plt.figure(figsize=(num_cols * 2, num_rows * 2))
    for i in range(num_slices):
        plt.subplot(num_rows, num_cols, i + 1)
        plt.imshow(image_data[:, :, i])
        plt.axis('off')
    plt.show()
The code defines a function display_all_nifti_slices that takes a path to a NIfTI image
(nifti_image_path) as input.
It loads the NIfTI image from the specified path using nib.load. This function returns a NIfTI image object.
It calculates the total number of slices in the image (num_slices) using the third dimension of the
image data array (image_data.shape[2]).
It calculates the number of columns (num_cols) to display the slices in each row and the number of
rows (num_rows) needed to display all slices. It ensures that there are at most num_cols slices in
each row, and calculates the number of rows accordingly.
It iterates over all slices and plots each slice using Matplotlib (plt.imshow). The plt.subplot function is
used to arrange the slices in a grid layout based on the calculated number of rows and columns.
Then I applied the same code to the labels and the testing data.
Code
if train_image_files:
    image_path = os.path.join(train_images_path, train_image_files[1])
    print(f"Displaying all slices for: {train_image_files[1]}")
    display_all_nifti_slices(image_path)
else:
    print("No training images found.")
The code checks whether train_image_files is not empty. If there are training image files available, it executes the following block of code; otherwise, it prints "No training images found."
If there are training image files, the code constructs the full path to the second image file in the train_image_files list (train_image_files[1]) by joining train_images_path (the directory where the images are stored) with the image file name.
Code
def get_num_slices(nifti_image_path):
    nifti_image = nib.load(nifti_image_path)
    image_data = nifti_image.get_fdata()
    return image_data.shape[2]
This function loads the NIfTI image located at nifti_image_path using nib.load(nifti_image_path). It then retrieves the image data from the NIfTI image. The function returns the number of slices in the image, which is the size of the third dimension of the image data array (image_data.shape[2]).
After defining the function, the code iterates through each file in train_image_files, which contains the list of training image file names. For each image file, it constructs the full path to the image file and calls the get_num_slices function to get the number of slices in that image. It then prints a message indicating the file name and the number of slices in that file, as sketched below.
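A minimal sketch of that loop, assuming train_image_files and train_images_path are defined as above:
Code
for image_file in train_image_files:
    image_path = os.path.join(train_images_path, image_file)
    print(f"{image_file}: {get_num_slices(image_path)} slices")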
Code
def display_image_and_label_slices(image_path, label_path, num_slices=5):
    # (function name assumed; the original snippet omitted the def line)
    image = nib.load(image_path).get_fdata()
    label = nib.load(label_path).get_fdata()
    slice_indices = np.linspace(0, image.shape[2] - 1, num_slices, dtype=int)
    plt.figure(figsize=(15, 6))
    for i, slice_index in enumerate(slice_indices):
        plt.subplot(2, num_slices, i + 1)               # top row: image slice
        plt.imshow(image[:, :, slice_index], cmap='gray')
        plt.title(f'Image Slice {slice_index}')
        plt.axis('off')
        plt.subplot(2, num_slices, num_slices + i + 1)  # bottom row: label slice
        plt.imshow(label[:, :, slice_index], cmap='gray')
        plt.axis('off')
    plt.show()
This function takes three arguments: image_path, label_path, and an optional argument num_slices
with a default value of 5.
It then calculates a list of slice_indices: num_slices equally spaced indices along the third dimension of the image (the slice axis).
It plots each image slice in the top row and the corresponding label slice (label[:, :, slice_index]) in the bottom row.
The code uses a for loop to iterate over a range of values. It limits the iteration to the minimum value
among num_images_to_display, the length of train_image_files, and the length of train_labels_files.
Inside the loop, it constructs the paths to the image and label files for the current iteration.
This function displays the image slices along with their corresponding label slices.
Code
def calculate_statistics(image_files, images_path):
    total_slices = 0
    min_slices, max_slices = float('inf'), 0
    slice_counts = []
    for image_file in image_files:
        num_slices = get_num_slices(os.path.join(images_path, image_file))
        total_slices += num_slices
        slice_counts.append(num_slices)
        min_slices = min(min_slices, num_slices)
        max_slices = max(max_slices, num_slices)
    mean_slices = total_slices / len(image_files)
    return {
        'total_slices': total_slices,
        'min_slices': min_slices,
        'max_slices': max_slices,
        'mean_slices': mean_slices,
        'slice_counts': slice_counts
    }

train_stats = calculate_statistics(train_image_files, train_images_path)
test_stats = calculate_statistics(test_image_files, test_images_path)
Here the function calculate_statistics computes summary statistics over the images. It iterates over each image_file in image_files. For each image_file, it calculates the number of slices (num_slices) using the get_num_slices function.
For each image_file, it updates the total_slices count and appends num_slices to slice_counts.
It updates min_slices and max_slices based on the minimum and maximum num_slices found.
After the loop, it calculates the mean_slices by dividing total_slices by the number of image_files.
The function returns a dictionary containing the calculated statistics: total_slices, min_slices,
max_slices, mean_slices, and slice_counts.
This function is useful for understanding the distribution of the number of slices in the dataset, which
can provide insights into the dataset's characteristics.
These statistics can help in further analysis and decision-making, such as determining the range of
slices to consider for training or evaluating the segmentation model.
Code
visualize_slice_distribution(train_stats['slice_counts'], 'Training Images Slice Distribution')
visualize_slice_distribution(test_stats['slice_counts'], 'Testing Images Slice Distribution')
The code generates histograms that visually represent the distribution of the number of slices across
all images in the training and testing datasets. These visualizations help in understanding how the
number of slices varies among the images.
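A minimal sketch of visualize_slice_distribution, assuming it is a thin wrapper around Matplotlib's hist (the bin count is an assumption):
Code
def visualize_slice_distribution(slice_counts, title):
    plt.figure(figsize=(10, 4))
    plt.hist(slice_counts, bins=20)
    plt.title(title)
    plt.xlabel('Number of slices')
    plt.ylabel('Number of images')
    plt.show()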
Code
import cv2

def convert_3d_to_2d(image_path, label_path, output_image_dir, output_label_dir):
    # (signature assumed from the surrounding description)
    os.makedirs(output_image_dir, exist_ok=True)
    os.makedirs(output_label_dir, exist_ok=True)
    image = nib.load(image_path).get_fdata()
    label = nib.load(label_path).get_fdata()
    num_slices = image.shape[2]
    for i in range(num_slices):
        image_slice = image[:, :, i]
        label_slice = label[:, :, i]
        # Strip the '.nii.gz' suffix (7 characters) and append the slice index
        image_slice_path = os.path.join(output_image_dir, f"{os.path.basename(image_path)[:-7]}_{i}.png")
        label_slice_path = os.path.join(output_label_dir, f"{os.path.basename(label_path)[:-7]}_{i}.png")
        cv2.imwrite(image_slice_path, image_slice)
        cv2.imwrite(label_slice_path, label_slice)
output_image_dir and output_label_dir are the directories where the 2D image slices and their corresponding label slices will be saved. The os.makedirs calls create these output directories if they do not already exist; the exist_ok=True parameter prevents errors when the directories are already present.
num_slices = image.shape[2] retrieves the number of slices in the 3D image by accessing the third dimension of the array.
The loop then iterates over each slice in the 3D image and its corresponding label. For each slice, it builds the output file paths from the base name of the original file with the slice index appended, creating unique filenames. OpenCV (cv2.imwrite) saves the image and label slices as PNG files in the specified directories.
A driver loop then iterates over all training images and their corresponding labels, converting each pair of 3D files into 2D slices and saving them using the convert_3d_to_2d function, as sketched below. This preprocessing ensures that the 3D medical images are appropriately prepared for effective model training, enabling accurate semantic segmentation.
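A minimal sketch of that driver loop, assuming the file lists and directory paths defined earlier:
Code
for image_file, label_file in zip(sorted(train_image_files), sorted(train_labels_files)):
    convert_3d_to_2d(os.path.join(train_images_path, image_file),
                     os.path.join(train_labels_path, label_file),
                     'output_images', 'output_labels')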
Code
import glob
import tensorflow as tf
from sklearn.model_selection import train_test_split

def load_image(image_path, label_path):
    # Load and preprocess one image/label pair
    image = tf.io.read_file(image_path)
    image = tf.image.decode_png(image, channels=1)
    image = tf.image.grayscale_to_rgb(image)
    image = tf.image.resize(image, [128, 128])
    image = tf.image.convert_image_dtype(image, tf.float32)
    label = tf.io.read_file(label_path)
    label = tf.image.decode_png(label, channels=1)
    label = tf.image.resize(label, [128, 128])
    label = tf.image.convert_image_dtype(label, tf.float32)
    return image, label

output_image_dir = 'output_images'
output_label_dir = 'output_labels'
image_files = sorted(glob.glob(os.path.join(output_image_dir, '*.png')))
label_files = sorted(glob.glob(os.path.join(output_label_dir, '*.png')))
Here I have prepared the image and label data for training a deep learning model, ensuring that the data is properly loaded, processed, and split into training and validation sets. The shapes of the images and labels are also checked to ensure they are compatible with the model architecture.
I loaded each image and label and decoded them into image tensors, converted the image from grayscale to RGB, resized both to 128x128 pixels, and converted them to float32 for model compatibility.
Then I created a dataset from the file paths, mapped the load_image function to load and preprocess the images and labels, shuffled the dataset with a buffer size of 1000, and prefetched data to improve performance during training.
I used glob to list all PNG files in the output directories for images and labels.
I split the image and label files into training (80%) and validation (20%) sets, and printed the shapes of the first batch of images and labels from the training and validation datasets.
This code prepares the dataset for training a semantic segmentation model and handles large datasets efficiently by using TensorFlow's data pipeline features (AUTOTUNE, shuffle, batch, and prefetch).
This systematic approach to dataset preparation and loading sets the stage for training a robust semantic segmentation model for colon cancer detection.
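A minimal sketch of that pipeline, assuming load_image as defined above and a batch size of 16 (make_dataset is a hypothetical helper, and random_state is an assumption):
Code
train_imgs, val_imgs, train_lbls, val_lbls = train_test_split(
    image_files, label_files, test_size=0.2, random_state=42)

def make_dataset(images, labels):
    ds = tf.data.Dataset.from_tensor_slices((images, labels))
    ds = ds.map(load_image, num_parallel_calls=tf.data.AUTOTUNE)
    return ds.shuffle(1000).batch(16).prefetch(tf.data.AUTOTUNE)

train_dataset = make_dataset(train_imgs, train_lbls)
val_dataset = make_dataset(val_imgs, val_lbls)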
Here I have defined a function to calculate the Dice score, a metric for evaluating the similarity between two sets of data. This function will be used to assess the performance of the segmentation model.
y_true holds the ground truth labels, and y_pred the model's predictions.
The function flattens the y_true tensor into a 1D array; this simplification allows for easier calculation of the intersection and union of the true and predicted labels.
It then computes the intersection between the flattened true and predicted labels; this intersection represents the overlap between the true and predicted segmentation areas, which is crucial for the Dice score calculation.
The Dice score ranges from 0 to 1, where 1 indicates perfect overlap between the predicted and true
labels. Adding the smooth term prevents division by zero and stabilizes the score.
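A minimal sketch of such a Dice score using the Keras backend (the exact smooth value is an assumption):
Code
from tensorflow.keras import backend as K

def dice_score(y_true, y_pred, smooth=1e-6):
    # Flatten both tensors and measure their overlap
    y_true_f = K.flatten(y_true)
    y_pred_f = K.flatten(y_pred)
    intersection = K.sum(y_true_f * y_pred_f)
    return (2.0 * intersection + smooth) / (K.sum(y_true_f) + K.sum(y_pred_f) + smooth)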
Code
from tensorflow.keras.applications import VGG16
from tensorflow.keras.layers import Input, Conv2D, UpSampling2D, Concatenate
from tensorflow.keras.models import Model

def build_unet(input_shape):
    inputs = Input(shape=input_shape)
    # VGG16 pre-trained on ImageNet, without the top classification layers
    base_model = VGG16(include_top=False, weights='imagenet', input_tensor=inputs)
    # Contracting path: feature maps from successive VGG16 blocks
    c1 = base_model.get_layer('block1_conv2').output
    c2 = base_model.get_layer('block2_conv2').output
    c3 = base_model.get_layer('block3_conv3').output
    c4 = base_model.get_layer('block4_conv3').output
    c5 = base_model.get_layer('block5_conv3').output
    # Expanding path: upsample and concatenate with contracting-path features
    u6 = UpSampling2D((2, 2))(c5)
    u6 = Concatenate()([u6, c4])
    c6 = Conv2D(512, (3, 3), activation='relu', padding='same')(u6)
    c6 = Conv2D(512, (3, 3), activation='relu', padding='same')(c6)
    u7 = UpSampling2D((2, 2))(c6)
    u7 = Concatenate()([u7, c3])
    c7 = Conv2D(256, (3, 3), activation='relu', padding='same')(u7)
    c7 = Conv2D(256, (3, 3), activation='relu', padding='same')(c7)
    u8 = UpSampling2D((2, 2))(c7)
    u8 = Concatenate()([u8, c2])
    c8 = Conv2D(128, (3, 3), activation='relu', padding='same')(u8)
    c8 = Conv2D(128, (3, 3), activation='relu', padding='same')(c8)
    u9 = UpSampling2D((2, 2))(c8)
    u9 = Concatenate()([u9, c1])
    c9 = Conv2D(64, (3, 3), activation='relu', padding='same')(u9)
    c9 = Conv2D(64, (3, 3), activation='relu', padding='same')(c9)
    # Final 1x1 convolution with sigmoid produces the binary segmentation mask
    outputs = Conv2D(1, (1, 1), activation='sigmoid')(c9)
    return Model(inputs, outputs)

input_shape = (128, 128, 3)
model = build_unet(input_shape)
model.compile(optimizer='adam', loss='binary_crossentropy',
              metrics=['accuracy', dice_score])
model.summary()
Then I created the U-Net model using the VGG16 architecture as a backbone. The U-Net is a popular architecture for image segmentation tasks, which in this case is used for segmenting medical images.
It loads the VGG16 model pre-trained on ImageNet, excluding the top classification layers.
c1, c2, c3, c4, c5 extract the outputs of specific convolutional layers from VGG16. These layers serve as the "contracting path" of the U-Net, capturing different levels of features.
The expanding path then upsamples the feature maps and concatenates them with the corresponding feature maps from the contracting path. This combines high-level features with spatial information from earlier layers, which is critical for precise segmentation.
The output layer uses a sigmoid activation function, producing the final segmentation mask with values between 0 and 1.
Code
batch_size = 16
epochs = 10
history = model.fit(train_dataset, validation_data=val_dataset, epochs=epochs)
Then I evaluated the model's performance on the validation dataset; this provides a final assessment of the model's ability to generalize to unseen data after training. The high validation accuracy and reasonable Dice score suggest that the model can reliably segment medical images.
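A minimal sketch of that evaluation step (the metric ordering follows the compile call above):
Code
val_loss, val_accuracy, val_dice = model.evaluate(val_dataset)
print(f"Validation accuracy: {val_accuracy:.4f}, Dice score: {val_dice:.4f}")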
Code
prediction = model.predict(image_slice)[0]
plt.figure(figsize=(15, 5))
plt.subplot(1, 2, 1)
plt.title("Input Image")
plt.imshow(image_slice.numpy().squeeze())
plt.subplot(1, 2, 2)
plt.title("Prediction")
plt.imshow(prediction.squeeze(), cmap='gray')
plt.show()
Here I loop through the first three test images and visualize their predictions, taking a random slice from each and running the prediction on it.
Code
def preprocess_slice(image_path, slice_idx):
    # (function name assumed; the original snippet omitted the def line)
    image = nib.load(image_path).get_fdata()
    image_slice = image[:, :, slice_idx]
    image_slice = np.expand_dims(image_slice, axis=-1)
    image_slice = cv2.resize(image_slice, (128, 128))
    image_slice = tf.convert_to_tensor(image_slice, dtype=tf.float32)
    # Ensure the last dimension is size 1 (cv2.resize drops a size-1 channel)
    if image_slice.shape[-1] != 1:
        image_slice = tf.expand_dims(image_slice, axis=-1)
    image_slice = tf.image.grayscale_to_rgb(image_slice)
    image_slice = tf.expand_dims(image_slice, axis=0)  # add the batch dimension
    return image_slice
Here I preprocessed a 3D medical image: loading the volume, extracting a specific slice, resizing it, converting it to a tensor, ensuring its shape is correct, converting it to RGB format, and expanding its dimensions to match the model's input shape.
Then I performed inference on a 3D medical image by processing each slice separately and collecting the predictions: I loaded the 3D image, iterated over each slice, preprocessed it, made a prediction with the model, and collected the results, as sketched below.
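A minimal sketch of that slice-by-slice inference loop (predict_3d_image is a hypothetical name; preprocess_slice is the function reconstructed above):
Code
def predict_3d_image(image_path):
    image = nib.load(image_path).get_fdata()
    predictions = []
    for slice_idx in range(image.shape[2]):
        image_slice = preprocess_slice(image_path, slice_idx)
        pred = model.predict(image_slice, verbose=0)[0]
        predictions.append(np.squeeze(pred))
    # Stack the 2D predictions back into a 3D volume along the slice axis
    return np.stack(predictions, axis=-1)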
The np.squeeze function removes dimensions of size 1 from the array: if any dimension of the array has only one element, that dimension is removed. This is useful for simplifying the shape of the array and making it easier to work with.
Code
output_path = '/content/drive/MyDrive/Soumya S Moharana/Mock_Project/predictions.nii.gz'
save_predictions_as_nifti(predictions, test_image_path, output_path)
Here is the function to save the model's predictions as a NIfTI file. It loads the original image to extract the affine transformation and header information, then creates a new NIfTI image from the model's predictions using the same affine and header, and saves it to the specified output path.
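A minimal sketch of such a save function, assuming nibabel as imported above:
Code
def save_predictions_as_nifti(predictions, original_image_path, output_path):
    original = nib.load(original_image_path)
    # Reuse the original affine and header so the predictions stay spatially aligned
    pred_image = nib.Nifti1Image(predictions.astype(np.float32),
                                 original.affine, original.header)
    nib.save(pred_image, output_path)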
Code
display_original_and_predicted_slices(test_image_path, output_path)
Finally, to display the original and predicted slices, the code loads the test image and its corresponding predicted image, then calculates the indices of the slices to display. This ensures that the selected slices are evenly spaced across the z-axis (the slice dimension) of the 3D image.
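A minimal sketch of that display function, under the same assumptions as the earlier plotting helpers (the num_slices default is an assumption):
Code
def display_original_and_predicted_slices(image_path, pred_path, num_slices=5):
    image = nib.load(image_path).get_fdata()
    pred = nib.load(pred_path).get_fdata()
    slice_indices = np.linspace(0, image.shape[2] - 1, num_slices, dtype=int)
    plt.figure(figsize=(15, 6))
    for i, idx in enumerate(slice_indices):
        plt.subplot(2, num_slices, i + 1)               # top row: original slice
        plt.imshow(image[:, :, idx], cmap='gray')
        plt.title(f'Original Slice {idx}')
        plt.axis('off')
        plt.subplot(2, num_slices, num_slices + i + 1)  # bottom row: prediction
        plt.imshow(pred[:, :, idx], cmap='gray')
        plt.axis('off')
    plt.show()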