CS401 24 Assign 2 Template Fixed
Tip
Hidden below is a useful snippet of HTML to set up a Restart button in case training gets out of hand.
Training a colour image classifier using Flux
Load the dataset
This is a slightly more complex learning task than the MNIST example. CIFAR10 is a dataset of 50k tiny coloured training images split into 10 classes.
Again, most of the steps are identical to those for the MNIST task, but some dimension adjustments are required because the images are slightly bigger and also involve three colour channels.
begin
    using Statistics
    using Flux, Flux.Optimise
    using MLDatasets: CIFAR10
    using Images.ImageCore
    using Flux: onehotbatch, onecold
    using Base.Iterators: partition
    using MLUtils
    using Plots
    using DataFrames
end
begin
    train_x, train_y = CIFAR10(split=:train)[:]
    train_labels = onehotbatch(train_y, 0:9)
    classes = ["airplane", "automobile", "bird", "cat",
               "deer", "dog", "frog", "horse", "ship", "truck"]
end;
The images are simply 32 x 32 matrices of numbers in 3 channels (R,G,B). The train_x array contains 50,000 images converted to 32 x 32 x 3 arrays, with the third dimension being the 3 channels (R,G,B). Let's take a look at a random image from train_x. To do this we define a function called image, which calls colorview on the training image after permuting it from 32x32x3 to 3x32x32 (colorview expects the channel dimension first):
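The definition of that helper is not visible in this export; a minimal sketch consistent with the description above (the name image and its use of colorview are as described, the rest is an assumption) could be:

    # Permute 32x32x3 -> 3x32x32 so colorview sees the channel dimension first,
    # then reinterpret the three channels as RGB pixels.
    image(x) = colorview(RGB, permutedims(x, (3, 1, 2)))

    # e.g. image(train_x[:, :, :, rand(1:50_000)])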
The first 49k images (in batches of 1,000) will be our training set, and the rest is for validation.
partition handily breaks down the set we give it into consecutive chunks (1,000 in this case).
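For instance (a quick illustration only, not part of the template):

    collect(partition(1:10, 3))   # => [1:3, 4:6, 7:9, 10:10] — consecutive chunks of 3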
Task 1
Partition train_x into training and validation parts, along the lines of the MNIST example. Note that train is an array of tuples, where the first tuple element is the image batch and the second is the label batch. This is the format in which the Flux-defined model expects its training data.
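One possible shape for this (a sketch only; the names train, validate_x and validate_y are assumptions chosen to match how they are used in the training cell further down):

    # batches of 1,000 (image, one-hot label) tuples for training
    train = [(train_x[:, :, :, i], train_labels[:, i]) for i in partition(1:49_000, 1_000)]

    # hold back the last 1,000 images for validation
    validate_x = train_x[:, :, :, 49_001:50_000]
    validate_y = train_labels[:, 49_001:50_000]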
A convolutional neural network is one which defines a kernel and slides it across a matrix to create an intermediate representation from which to extract features. It creates higher-order features as it goes into deeper layers, making it suitable for images, where the structure of the image helps determine which class it belongs to.
In this case we use two convolutional layers of 16 and 8 channels, respectively. Each convolution phase is passed through a pooling layer, which reduces the image's dimensionality. The SamePad() function is used to ensure appropriate padding is used to preserve the dimensions of the original image.
The 3D array is then flattened to a 512-element 1D vector (after two 2x2 poolings the 32x32 image becomes 8x8, and with 8 channels that gives 8 x 8 x 8 = 512 elements), which is passed through a sequence of fully-connected layers to reduce its length to 10. Finally a softmax transformation is applied to the 10-element output vector to transform the outputs to probabilities.
Model fix
I neglected to use padding in the last version of the template. This resulted in the convolution
not preserving the original dimensions of the image. The use of SamePad() to calculate the
required padding fixes this.
model = Chain(
    Conv((5, 5), 3 => 16, relu, pad=2),   # 1_216 parameters
    MaxPool((2, 2)),
    Conv((5, 5), 16 => 8, relu, pad=2),   # 3_208 parameters
    MaxPool((2, 2)),
    Flux.flatten,
    Dense(512 => 256),                    # 131_328 parameters
    Dense(256 => 10),                     # 2_570 parameters
    softmax,
)                  # Total: 8 arrays, 138_322 parameters, 541.023 KiB.
model = Chain(
    Conv((5,5), 3=>16, pad=SamePad(), relu),
    MaxPool((2,2)),
    Conv((5,5), 16=>8, pad=SamePad(), relu),
    MaxPool((2,2)),
    Flux.flatten,
    Dense(512, 256),
    Dense(256, 10),
    softmax)
Task 2
Make modifications to the network architecture above to (a) insert a new pair of convolutional and pooling layers between the existing 1st and 2nd ones, using 16 filters for the new kernel; (b) insert a new Dense layer just before the final one that goes from a width of 256 down to 128. Modify the final Dense layer appropriately.
Do these modifications separately and in each case measure the training time and classification accuracy. Note that each training run may take up to 30 minutes, depending on your machine.
Comment on and explain what differences, if any, there are between the baseline model and these two modifications.
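Not the required answer, but as a sanity check on the shapes involved the two modified Chains might look something like this (model_a and model_b are illustrative names):

    # (a) extra conv/pool pair: the image is pooled 32 -> 16 -> 8 -> 4, so flattening
    #     gives 4*4*8 = 128 features and the first Dense layer changes accordingly
    model_a = Chain(
        Conv((5,5), 3=>16, pad=SamePad(), relu),
        MaxPool((2,2)),
        Conv((5,5), 16=>16, pad=SamePad(), relu),
        MaxPool((2,2)),
        Conv((5,5), 16=>8, pad=SamePad(), relu),
        MaxPool((2,2)),
        Flux.flatten,
        Dense(128, 256),
        Dense(256, 10),
        softmax)

    # (b) extra Dense layer 256 -> 128, with the final layer adjusted to 128 -> 10
    model_b = Chain(
        Conv((5,5), 3=>16, pad=SamePad(), relu),
        MaxPool((2,2)),
        Conv((5,5), 16=>8, pad=SamePad(), relu),
        MaxPool((2,2)),
        Flux.flatten,
        Dense(512, 256),
        Dense(256, 128),
        Dense(128, 10),
        softmax)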
Test network
Use this partial network to check the dimensions of the output from each layer (use # to comment out layers not of interest).
(128, 1)
with_terminal() do
    # Test the model up to the flattening step
    x = rand(Float32, 32, 32, 3, 1)   # Example input of shape 32x32x3 (one image)
    model = Chain(
        Conv((5,5), 3=>16, pad=SamePad(), relu),
        MaxPool((2,2)),
        Conv((5,5), 16=>16, pad=SamePad(), relu),
        MaxPool((2,2)),
        Conv((5,5), 16=>8, pad=SamePad(), relu),
        MaxPool((2,2)),
        Flux.flatten
    )

    output = model(x)
    println(size(output))
end
We will use a cross-entropy loss and the Momentum optimiser here. Cross-entropy is a good option when each example belongs to exactly one of several classes. Momentum smooths out the noisy gradients and helps towards smooth convergence. Gradually lowering the learning rate along with momentum helps to maintain adaptivity in our optimisation, preventing overshooting of the error minimum.
begin
    using Flux: crossentropy, Momentum
    loss(x, y) = sum(crossentropy(model(x), y))
    optimiser = Momentum(0.01)
end;
We can now write our training loop, where we will keep track of some basic accuracy numbers for our model. We can define an accuracy function for it like so:
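The cell defining accuracy is not visible in this export; a minimal sketch consistent with how it is called below (the proportion of correct predictions, using onecold to undo the one-hot encoding and mean from Statistics) would be:

    # Fraction of columns where the most probable predicted class matches the true label.
    accuracy(x, y) = mean(onecold(model(x)) .== onecold(y))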
Training
Training is where we put together the operations we defined earlier and see what our net is capable of. We will loop over the dataset for a number of epochs (set to 100 in the cell below), feeding the inputs to the neural network and optimising.
(Terminal output: the validation accuracy printed after each epoch, settling at around 0.64.)
with_terminal() do
    correct = []
    epochs = 100
    for epoch = 1:epochs
        # one pass over the training batches, updating the parameters after each batch
        for d in train
            gradients = gradient(Flux.params(model)) do
                loss(d...)
            end
            update!(optimiser, Flux.params(model), gradients)
        end
        # record validation accuracy after each epoch
        acc = accuracy(validate_x, validate_y)
        push!(correct, acc)
        println(acc)
    end
    plot(correct, ylim=(0.0, 0.75),
         legend=:none, title="Accuracy", xlabel="epoch", ylabel="proportion correct")
end
We will check this by predicting the class labels that the neural network outputs for the test set.
We need to perform the exact same preprocessing on this set as we did on our training set.
Task 3
Partition the test set similarly to the training set.
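One possible sketch (the names test_x, test_y and test are assumptions chosen to match how test is used in the cells below):

    begin
        # load the 10k test images and apply the same preprocessing as for training
        test_x, test_y = CIFAR10(split=:test)[:]
        test_labels = onehotbatch(test_y, 0:9)
        # 10 batches of 1,000 (image, one-hot label) tuples
        test = [(test_x[:, :, :, i], test_labels[:, i]) for i in partition(1:10_000, 1_000)]
    end;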
Task 4
Test a random sample of 10 test images. Display a dataframe of outputs as below. Use a slider
to display each image and its predicted class.
The dataframe below contains probabilities for the 10 classes (left column). The model's
predictions are indicated by the column names.
Tip
Here's some of the code needed to create the DataFrame:
DataFrame(round.(model(rand_test), digits=2),
          Symbol.(rand_label),
          makeunique=true)
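rand_test and rand_label are not defined in the snippet; one reading, consistent with the note above that the column names show the model's predictions and using the test_x from the sketch earlier (all names here are assumptions), is:

    begin
        # pick 10 random test images
        idx = rand(1:size(test_x, 4), 10)
        rand_test = test_x[:, :, :, idx]
        # label each column with the class the model predicts for that image
        rand_label = classes[onecold(model(rand_test))]
    end;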
This looks similar to what we would expect. At this point, it's a good idea to see how our net actually performs on the new data we have prepared.
Overall accuracy
We iterate over the entire test set to calculate the overall model accuracy.
0.643
round(mean([accuracy(test[i]...) for i in 1:10]), digits=3)
This is much better than random chance, which would be 10% (since we only have 10 classes), and not bad at all for a small handcrafted network like ours.
Let's take a look at how the net performed on all the classes individually.
begin
    class_correct = zeros(10)
    class_total = zeros(10)
    for i in 1:10
        preds = model(test[i][1])
        lab = test[i][2]
        for j = 1:1000
            pred_class = findmax(preds[:, j])[2]
            actual_class = findmax(lab[:, j])[2]
            if pred_class == actual_class
                class_correct[pred_class] += 1
            end
            class_total[actual_class] += 1
        end
    end
end
     accuracy  class
 1      0.629  "airplane"
 2      0.702  "automobile"
 3      0.378  "bird"
 4      0.447  "cat"
 5      0.778  "deer"
 6      0.526  "dog"
 7      0.693  "frog"
 8      0.671  "horse"
 9      0.797  "ship"
10      0.813  "truck"
The per-class results look reasonable overall, though certain classes perform significantly better than others.