0% found this document useful (0 votes)

4 views

Machine Learning HW3 - Image Classification

The document outlines the objectives and tasks for a machine learning homework focused on image classification using convolutional neural networks, specifically on food classification using the food-11 dataset. It details rules for the assignment, including restrictions on using pretrained models and external datasets, as well as guidelines for data augmentation, model selection, and submission formats. Additionally, it provides grading criteria, deadlines, and resources for assistance.

Uploaded by

q1740497416

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Machine Learning HW3 - Image Classification

Uploaded by

q1740497416

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 48

Machine Learning HW3

ML TAs
[email protected]
Objective - Image Classification
1. Solve image classification with convolutional neural networks.
2. Improve the performance with data augmentations.
3. Understand popular image model techniques such as residual.
Task Introduction - Food Classification
● The images are collected from the food-11 dataset classified into 11
classes.
● Training set: 9866 labeled images
● Validation set: 3430 labeled images
● Testing set: 3347 images
Rules
● DO NOT attempt to find the original labels of the testing set.
● DO NOT use any external datasets.
● DO NOT use any pretrained models.
○ Also, do not attempt to “test how effective pretraining is” by submitting to kaggle.
Pretraining is very effective and you may test it after the competition ends.
● You may use any publicly available packages/code
○ But make sure you do not use pretrained models. Most code use those.
○ You may not upload your code/checkpoints to be publicly available during the timespan
of this homework.
Baseline
Simple : 0.50099

Medium : 0.73207 Training Augmentation + Train Longer

Strong : 0.81872 Training Augmentation + Model Design + Train Looonger (+

Cross Validation + Ensemble)

Boss : 0.88446 Training Augmentation + Model Design +Test Time

Augmentation + Train Looonger (+ Cross Validation + Ensemble)
Submission Format
The ﬁle should contain a header and have the following format:

Both type should be strings. Id corresponds to the jpg filenames in test. Follow
the sample code if you have trouble with formatting.
Model Selection
● Visit torchvision.models for a list of model structures, or go to timm for
the latest model structures.
● Pretrained weights are not allowed, specifically set pretrained=False to
ensure that the guideline is met.
Data Augmentation
● Modify the image data so non-identical inputs are given to the model each
epoch, to prevent overfitting of the model
● Visit torchvision.transforms for a list of choices and their corresponding
effect. Diversity is encouraged! Usually, stacking multiple transformations
leads to better results.
● Coding : fill in train_tfm to gain this effect
Advanced Data Augmentation - mixup

0.5* + 0.5 * =

Label

0 1 01
Advanced Data Augmentation - mixup
● Coding :
● In your torch.utils.Dataset, __getitem__()needs to return an
image that is the linear combination of two images.
● In your torch.utils.Dataset, __getitem__() needs to return a label
that is a vector, to assign probabilities to each class.
● You need to explicitly code out the math formula of the cross entropy
loss, as CrossEntropyLoss does not support multiple labels.
Test Time Augmentation
● The sample code tests images using a deterministic “test transformation”
● You may using the train transformation for a more diversified
representation of the images, and predict with multiple variants of the
test images.
● Coding : You need to fill in train_tfm, change the augmentation
method for test_dataset, and modify prediction code to gain this effect
train_tfm test_tfm test_tfm

+ >
Pred Pred Pred Pred Pred Pred Pred
Ensemble
Pred
Test Time Augmentation
● Usually, test_tfm will produce images that are more identiﬁable, so you
can assign a larger weight to test_tfm results for better performance.
train_tfm test_tfm

Pred Pred Pred Pred Pred test_tfm_pred

avg_train_tfm_pred

● Ex : Final Prediction = avg_train_tfm_pred * 0.5 + test_tfm_pred* 0.5

Cross Validation
● Cross-validation is a resampling method that uses diﬀerent portions of
the data to validate and train a model on diﬀerent iterations. Ensembling
multiple results lead to better performance.
● Coding : You need to merge the current train and validation paths, and
resample form those to form new train and validation sets.
Validation Train

Train Validation Train

Ensemble
Train Validation Train

Train Validation
Cross Validation
● Even if you don’t do cross validation, you are encouraged to resplit the
train/validation set to suitable proportions.
○ Currently, train : validation ~ 3 : 1, more training data could be valuable.
Ensemble
● Average of logits or probability : Need to save verbose output, less
ambiguous
● Voting : Easier to implement, need to break ties

● Coding : basic math operations with numpy or torch

Kaggle Tutorial
Kaggle Introduction
● Kaggle GPU : 16G NVIDIA TESLA P100
○ https://www.kaggle.com/docs/eﬃcient-gpu-usage
● Faster data IO
● Easier checkpoint reusing
● Limited to 30+ hrs/week depending on usage.
● Limited to 12hrs/run
● We strongly recommend that you run with Kaggle for this homework
How to run

Change sorting to
“Recently Run” if you
can’t ﬁnd the code
How to get data : In the input section, there should already be data titled
“ml2022spring-hw3”

If there isn’t, click on Add data and ﬁnd “ml2022spring-hw3”

How to use gpu : Change accelerator to “gpu” when you run your code.

Since GPU time is limited, It is advised to NOT utilize GPU while debugging
How to Run interactively : The commands are very similar to google colab

Any output writing to ./ will end up here, you can download it

How to Run in background: Execute code from start to end, all results would
be save permanently. (Output is limited to 20G, max run time = 12hrs)

Make sure your code is bug free, as any error in any code block would result in
early stopping
How to view “Run in background” results

You can view your results this way

Don’t worry if your run is “cancelled”, the

output will still be saved.
How to utilize your results

Upload your model to become kaggle dataset + New Dataset

Create a new notebook with the output as input
How to train and predict
1. Run your code in background
2. Find the output data “./submission.csv” and upload it to the submission
page
How to retrain from a checkpoint
1. Run your code in background
2. Find the output model and save it as a dataset
3. Import your dataset into the notebook via “Add data”
4. Modify your code to load your checkpoint
5. Run your code in background
6. Find the output data “./submission.csv” and upload it to the submission
page
Tips and tricks
Time management
● Kaggle will allocate more time for those who have utilized GPU resources
in the past week. Time resets every Saturday at 08:00, Taipei Time.

=> Run any code with GPU on kaggle today (3/4) to get (possible) extra
time next week.

● Time consumption is the sum of notebooks running interactively with gpu

and running in background with gpu.
● Please start early
Time management
● You can go over the time limit moderately. Kaggle will not interrupt your
code in the background if it is already running. If your time limit is
reached, you cannot run any code with GPU.

=> 時間快用完的時候在背景跑一隻程式，等於多12小時runtime

=> 時間快用完的時候在背景跑兩隻程式，等於多24小時runtime
Time management - Parallelization
● You can run two codes in the background
● If you are satisﬁed with your code, utilize this to run multiple random
seeds/multiple train validation split/multiple model structures, so
you can ensemble
A sample procedure for beating the boss baseline
The boss baseline could be beaten with a single model trained on kaggle for 12hrs

Train : 12h

Prediction

Your procedure can be ensemble of models with parallelization

Train : 12h Train : 12h

multiple random seeds

multiple train validation split Save checkpoint Ensemble Prediction

multiple model structures

Train : 12h Train : 12h
Experimental Tips
● Augmentation is a must to prevent overfitting. A good augmentation can
carry on to the testing phase with Test Time Augmentation.
● If you build your own network structure and have implemented
augmentation, don’t be afraid to scale up your model. (Many predefined
models structure are huge and perform great)
● In TA’s experiment, model structures with downsampling work better,
simply choosing the best performing models on ImageNet according to
websites is not always a good idea because pretrained weights are not
allowed.
Other tricks……
● on Classification
○ Label Smoothing Cross Entropy Loss
○ FocalLoss
● on Optimization
○ Dropout
○ Gradient Accumulation
○ BatchNorm
○ Image Normalization
Running with Google Colab
● We strongly recommend that you run with Kaggle for this homework
● If you would like to use colab, DO NOT store data in your drive and load
from there, the input/output is very slow. (store at ./ instead)
● If you mount your google drive in colab : G-suite google drive now has a
storage limit. Since models and data can be large, keep an eye on your
used space to prevent your account being suspended.
Report Questions
Q1. Augmentation Implementation (2%)
Implement augmentation by finishing train_tfm in the code with image size
of your choice. Copy your train_tfm code and paste it onto the
GradeScope.
● Your train_tfm must be capable of producing 5+ different results when
given an identical image multiple times.
● Your train_tfm in the report can be different from train_tfm in your
training code.
X
train_tfm

O
Q2. Residual Connection Implementation (2%)
Residual Connection is widely used in CNNs such as Deep Residual Learning
for Image Recognition. Residual is demonstrated in the following graph.

Image Source :
https://towardsdatascience.com/wh
at-is-residual-connection-efb07cab0
d55
Q2. Residual Connection Implementation (2%)
Implement Residual Connections in the Residual_Model, following the
graph below. Copy your Residual_Model code and paste it on Gradescope.
=Addition
● Your Residual_Model should connect like =ReLU
cnn_layer1

cnn_layer2

cnn_layer3

cnn_layer4

cnn_layer5

cnn_layer6

fc_layer
● You should only modify the forward part of the model
Submission Format
● train_tfm and Residual_Model are present in colab (scroll to
bottom) and kaggle (ML2022HW3 - Report Questions), you only need to
modify from our sample code.
Regulations and Grading Policy
Grading
● simple (public) +0.5 pts
● simple (private) +0.5 pts
● medium (public) +0.5 pts
● medium (private) +0.5 pts
● strong (public) +0.5 pts
● strong (private) +0.5 pts
● boss (public) +0.5 pts
● boss (private) +0.5 pts
● code submission +2 pts
● report +4 pts
Total : 10 pts
Code Submission
● NTU COOL
○ Compress your code and pack them into .zip ﬁle

<student_ID>_hw3.zip

● Your .zip ﬁle should include only

○ Code: either .py or .ipynb
● Do not submit models and data
● File Size Limit : 25MB
● Submit the code that corresponds to your chosen submission in Kaggle (One of the
best)
Report Submission
Answer the questions on GradeScope
Deadlines
● Kaggle, Code (NTU COOL), Report (GradeScope)

2022/03/25 23:59 (UTC+8)

Rules
● DO NOT attempt to find the original labels of the testing set.
● DO NOT use any external datasets.
● DO NOT use any pretrained models.
○ Also, do not attempt to “test how effective pretraining is” by submitting to kaggle.
Pretraining is very effective and you may test it after the competition ends.
● You may use any publicly available packages/code
○ But make sure you do not use pretrained models. Most code use those.
○ You may not upload your code/checkpoints to be publicly available during the timespan
of this homework.
Rules
● You should finish your homework on your own.
● You should not modify your prediction files manually
● Do not share codes or prediction files with any living creatures.
● Do not use any approaches to submit your results more than 5 times a
day.
● Your final grade x 0.9 and 0 pt for this HW if you violate any of the
above rules, final grade = Fail for repeat offenders
● Prof. Lee & TAs preserve the rights to change the rules & grades.
Links
Kaggle : https://www.kaggle.com/c/ml2022spring-hw3b

Kaggle code (join competition ﬁrst) :

https://www.kaggle.com/c/ml2022spring-hw3b/code?competitionId=34954&so
rtBy=dateRun

Colab :
https://colab.research.google.com/drive/15hMu9YiYjE_6HY99UXon2vKGk2Kwu
gWu
Contact us if you have problems…
● Kaggle Discussion (Recommended for this HW)
● NTU COOL
● Email
○ [email protected]
○ The title should begin with “[hw3]”

C1W3 Assignment
No ratings yet
C1W3 Assignment
7 pages
Dakin's Solution HHV278
100% (1)
Dakin's Solution HHV278
2 pages
Bosch MEG1 SMART PDF
No ratings yet
Bosch MEG1 SMART PDF
2 pages
DL Mannual For Reference
No ratings yet
DL Mannual For Reference
58 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
Project Documentation
No ratings yet
Project Documentation
24 pages
Assignment 3 DS5620
No ratings yet
Assignment 3 DS5620
11 pages
CIFAR_10_ Dataset_Using_CNN_Aniiiii_HTML
No ratings yet
CIFAR_10_ Dataset_Using_CNN_Aniiiii_HTML
8 pages
cat_dog_classification_CNN_Model
No ratings yet
cat_dog_classification_CNN_Model
13 pages
Anurag -2025 --
No ratings yet
Anurag -2025 --
18 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
DL Lab-final
No ratings yet
DL Lab-final
22 pages
How to Develop a CNN for MNIST Handwritten Digit Classification
No ratings yet
How to Develop a CNN for MNIST Handwritten Digit Classification
43 pages
Assignment3
No ratings yet
Assignment3
6 pages
Deep Learning
No ratings yet
Deep Learning
30 pages
Tensorflow and Deep Learning
No ratings yet
Tensorflow and Deep Learning
51 pages
NN From Scratch
No ratings yet
NN From Scratch
5 pages
Deep Learning for Vision Lab Manual 2024
100% (1)
Deep Learning for Vision Lab Manual 2024
25 pages
DL Lab-III-II
No ratings yet
DL Lab-III-II
98 pages
keras
No ratings yet
keras
4 pages
Deep Learning Project for Computer Vision with Python 2022
No ratings yet
Deep Learning Project for Computer Vision with Python 2022
297 pages
TMA01 Question 1 (45 Marks)
No ratings yet
TMA01 Question 1 (45 Marks)
31 pages
Chapter04 - Getting Started With Neural Networks
No ratings yet
Chapter04 - Getting Started With Neural Networks
9 pages
L2 - Basic ANN model building with TF-Keras.pptx
No ratings yet
L2 - Basic ANN model building with TF-Keras.pptx
16 pages
Assignment 2.4.1 Multiclass Classification
No ratings yet
Assignment 2.4.1 Multiclass Classification
5 pages
ASNM Program Explain
No ratings yet
ASNM Program Explain
4 pages
Capstone Project-1
No ratings yet
Capstone Project-1
15 pages
Deep Learning Practical File
No ratings yet
Deep Learning Practical File
36 pages
NNDL_RECORD_MANUAL
No ratings yet
NNDL_RECORD_MANUAL
36 pages
S-5
No ratings yet
S-5
10 pages
Experiement 3
No ratings yet
Experiement 3
4 pages
Project
No ratings yet
Project
4 pages
L8 - Image Classification
No ratings yet
L8 - Image Classification
20 pages
Explore the Implementation of CNNs in Python
No ratings yet
Explore the Implementation of CNNs in Python
10 pages
Assignment 2 Dl
No ratings yet
Assignment 2 Dl
10 pages
PDL Final Assignment-3 Aryan
No ratings yet
PDL Final Assignment-3 Aryan
8 pages
Lab 1 Assignment_W2022
No ratings yet
Lab 1 Assignment_W2022
7 pages
CTRL
No ratings yet
CTRL
5 pages
CSY3025 Artificial Intelligence Techniques: Deep Learning
No ratings yet
CSY3025 Artificial Intelligence Techniques: Deep Learning
42 pages
Plant Disease Identification
No ratings yet
Plant Disease Identification
17 pages
DL7 1
No ratings yet
DL7 1
19 pages
Deep Learning and Machine Learning: Lab Explanation
No ratings yet
Deep Learning and Machine Learning: Lab Explanation
34 pages
Image Classification of An American Sign Language Dataset: Objectives
No ratings yet
Image Classification of An American Sign Language Dataset: Objectives
11 pages
Cad and Dog 2
No ratings yet
Cad and Dog 2
5 pages
CNN TF Keras
No ratings yet
CNN TF Keras
6 pages
Unit III
No ratings yet
Unit III
28 pages
NN From Scratch PDF 1735495327
No ratings yet
NN From Scratch PDF 1735495327
19 pages
Handwritten Character Recognition With Neural Network
No ratings yet
Handwritten Character Recognition With Neural Network
12 pages
Image Classification: Keras
No ratings yet
Image Classification: Keras
21 pages
Dlv Lab Manual Print
No ratings yet
Dlv Lab Manual Print
29 pages
DL Record Merged
No ratings yet
DL Record Merged
113 pages
CCS355-Neural networks and deep learning_____Assignment 1
No ratings yet
CCS355-Neural networks and deep learning_____Assignment 1
15 pages
tutorial 4
No ratings yet
tutorial 4
6 pages
CH 02 Summary
No ratings yet
CH 02 Summary
3 pages
07 Milestone Project 1 Food Vision
No ratings yet
07 Milestone Project 1 Food Vision
20 pages
Assignment3 AL
No ratings yet
Assignment3 AL
23 pages
Crash Course On Tensorflow!: Vincent Lepetit!
No ratings yet
Crash Course On Tensorflow!: Vincent Lepetit!
63 pages
DL_EXP-6_16010422230
No ratings yet
DL_EXP-6_16010422230
8 pages
Neural_DEEP
No ratings yet
Neural_DEEP
39 pages
Google Aiml
No ratings yet
Google Aiml
50 pages
Microsoft Azure DevOps Engineer AZ 400
From Everand
Microsoft Azure DevOps Engineer AZ 400
Manish Soni
No ratings yet
Scrum Art Hand Book: Effective Tips & Techniques
From Everand
Scrum Art Hand Book: Effective Tips & Techniques
Durga Madiraju
No ratings yet
Boulder City High School
No ratings yet
Boulder City High School
4 pages
Life and Works of Rizal - Activities 1-4
No ratings yet
Life and Works of Rizal - Activities 1-4
8 pages
Gaussian Curvature
No ratings yet
Gaussian Curvature
8 pages
2013 OL ICT Model Paper @
No ratings yet
2013 OL ICT Model Paper @
4 pages
Chapter12-Business Communication
No ratings yet
Chapter12-Business Communication
23 pages
Interview Confirmation
No ratings yet
Interview Confirmation
3 pages
Research Roadmap Ayurvedic Perspectives
No ratings yet
Research Roadmap Ayurvedic Perspectives
26 pages
Letters - Keats
No ratings yet
Letters - Keats
7 pages
AhmedTanvir EvaluationoftheEIASysteminBangladesh
No ratings yet
AhmedTanvir EvaluationoftheEIASysteminBangladesh
7 pages
Final Bulletin 77
No ratings yet
Final Bulletin 77
57 pages
Practicum Verification Letter
No ratings yet
Practicum Verification Letter
1 page
Lecture11 19437 Zero Lecture MTH174
No ratings yet
Lecture11 19437 Zero Lecture MTH174
71 pages
GS3055-I CF: Installation Manual
No ratings yet
GS3055-I CF: Installation Manual
12 pages
Polarization
No ratings yet
Polarization
44 pages
Hitachi-Man B&W Diesel Engine Service Data: Accumulator Charge Pressure For ME Engines
No ratings yet
Hitachi-Man B&W Diesel Engine Service Data: Accumulator Charge Pressure For ME Engines
5 pages
2019 Mission Menu
No ratings yet
2019 Mission Menu
24 pages
2021 Mod Maths Specialist Complex Test
No ratings yet
2021 Mod Maths Specialist Complex Test
5 pages
Ielts Speaking A Collection of Common Topics: Unit 1 People Lesson 1 A Person Who Often Helps Others
No ratings yet
Ielts Speaking A Collection of Common Topics: Unit 1 People Lesson 1 A Person Who Often Helps Others
8 pages
Module 1 - Grade 6
No ratings yet
Module 1 - Grade 6
24 pages
Case Study Identification
No ratings yet
Case Study Identification
9 pages
Concept of Knowledge Management
No ratings yet
Concept of Knowledge Management
9 pages
Data Structures Abstraction and Design Using Java 2nd Edition by Elliot Koffman, Paul Wolfgang 0470128704 978-0470128701 - Read the ebook online or download it to own the full content
100% (8)
Data Structures Abstraction and Design Using Java 2nd Edition by Elliot Koffman, Paul Wolfgang 0470128704 978-0470128701 - Read the ebook online or download it to own the full content
32 pages
Unit 8 Presentation Ef310
No ratings yet
Unit 8 Presentation Ef310
9 pages
Masters in Adult Education
No ratings yet
Masters in Adult Education
9 pages
Sample Literature Review Summary Table
100% (2)
Sample Literature Review Summary Table
7 pages
kirzner manell syllabus
No ratings yet
kirzner manell syllabus
23 pages
Torrens University Business Bbus16-Course-Flyer
No ratings yet
Torrens University Business Bbus16-Course-Flyer
2 pages
Beat 182 03-2021 EN
No ratings yet
Beat 182 03-2021 EN
100 pages