Computer Vision 15 Exam Q and A

The document outlines the grading criteria and exam structure for a Computer Vision course at Utrecht University, emphasizing the importance of both practical assignments and written exams. It details the exam criteria, including theoretical and conceptual knowledge, types of questions, and preparation guidelines. Additionally, it lists key topics covered in the course, exam logistics, and encourages student feedback for course improvement.

Uploaded by

laughriotclip


COMPUTER VISION

2024 - 2025
EXAM Q&A
UTRECHT UNIVERSITY
RONALD POPPE
GRADING
Practical assignments: 60%, Written exam: 40%
• Retake only if exam grade is >= 4
• No assignment retakes!

To pass the course:

• Final score must be at least 5.5 to pass, and
• Minimum grade 4 for weighted average of assignments, and
• Minimum grade 4 for the exam.
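
The pass rule above can be written as a small check. This is only a sketch: the function name `passes_course` is hypothetical, and it assumes grades are combined without intermediate rounding, which the slides do not specify.

```python
def passes_course(assignment_avg: float, exam: float) -> tuple[float, bool]:
    """Return (final score, passed) under the rules listed above."""
    # Weights from the slide: practical assignments 60%, written exam 40%.
    final = 0.6 * assignment_avg + 0.4 * exam
    # All three conditions must hold at the same time.
    passed = final >= 5.5 and assignment_avg >= 4 and exam >= 4
    return final, passed


print(passes_course(7.0, 4.0))  # final of about 5.8, passes
print(passes_course(8.0, 3.5))  # final of about 6.2, but the exam is below 4, so it fails
```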
EXAM CRITERIA
You will be graded on:
• Theoretical knowledge
• Conceptual knowledge/insight

Open questions
• Open questions to test understanding, often cross-topic
• Some explanation questions, some development questions

Multiple-choice questions
• Focus on insights
• Multiple options (or none) can always be correct
EXAM CRITERIA (2)
Theoretical knowledge. Be able to explain:
• How a method works (dropout, voxel reconstruction)
• Different steps
• Input/output of each step
• Relevance of each step
• (Dis)advantages/limitations of the method

• Differences between methods (1- vs. 2-stage object detection)

• Relative (dis)advantages
EXAM CRITERIA (3)
Conceptual knowledge/insight:
• Why are things the way they are (why use batch norm, why use HSV)?
• Explain (dis)advantages/limitations
• Combinations/parallels between topics

• How would you address a certain problem?

• Step-by-step process
• Explain (pseudo-code or brief sentences) how it works
• You might be asked to write the pseudo-code for a problem
EXAM CRITERIA (4)
Typically:
• 4 MC questions (multiple answers possible)
• 5 open questions

How to answer:
• Concise: longer is not needed. But make sure all criteria are covered.
• Specific: I need to be sure (not guess) that you understood
EXAM CRITERIA (5)
I should be able to understand your answer just from the text
• No links, no references to slides, knowledge clips etc.

If you add irrelevant or incorrect information, I might deduct points

• Avoid “hitting all buttons”

Don’t use “vague” terms

• “much more”, “sometimes”, “almost always”, “better”
• “use an algorithm”
PREPARATION
Slides of the knowledge clips and lectures

Additional reading (for your own understanding):

• Links to books
• Links to websites
• Links to lectures

Insights that were gained while working on the assignments

PREPARATION (2)
In general, you should be able to:
• Understand each statement in a slide
• Be able to explain it
• Be able to give an example of how something should be applied
• Be able to give an example of a case in which something does/doesn’t work

If you cannot do this, use the additional reading material!

You can also post your questions on Teams, in the exam preparation channel
TOPICS
1. Pixels, images, video
2. Camera geometry & Image formation
3. 3D computer vision
4. Learning-based computer vision
5. Performance measures
6. Neural networks & Backpropagation
7. Convolutional neural networks
8. Training CNNs
9. CNN architectures
10. Object detection and segmentation
11. Vision transformers
12. Vision language models
13. Image and video generation
14. Computer vision challenges
15. Exam Q&A
1. PIXELS, IMAGES, VIDEO
General background:
• Applications of CV
• Challenges in CV (in which applications are these important)
• Image/video data structure
• Color spaces and distances
2. CAMERA GEOMETRY AND IMAGE FORMATION
Camera geometry:
• Intrinsics/extrinsics/camera matrix: how to calculate (equations), what is each element
• Calibration: how does it work (algorithm), which are the important parameters, which are the assumptions
• Experience from Assignment 1
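
As a reminder of how intrinsics and extrinsics combine, here is a minimal sketch of the pinhole projection x ~ K [R | t] X. The function name `project` and all numeric values are hypothetical, and the sketch assumes zero skew and no lens distortion.

```python
def project(K, R, t, X):
    """Project a 3D world point X to pixel coordinates with x ~ K [R | t] X."""
    # Extrinsics: rotate and translate the world point into the camera frame.
    Xc = [sum(R[i][j] * X[j] for j in range(3)) + t[i] for i in range(3)]
    # Intrinsics: map camera coordinates to homogeneous pixel coordinates.
    u, v, w = (sum(K[i][j] * Xc[j] for j in range(3)) for i in range(3))
    # Perspective division turns homogeneous coordinates into pixels.
    return u / w, v / w


# Hypothetical intrinsics: focal length 800 px, principal point (320, 240).
K = [[800.0, 0.0, 320.0],
     [0.0, 800.0, 240.0],
     [0.0, 0.0, 1.0]]
# Hypothetical extrinsics: camera aligned with the world frame, at the origin.
R = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
t = [0.0, 0.0, 0.0]

print(project(K, R, t, [0.5, -0.25, 2.0]))  # (520.0, 140.0)
```

A point on the optical axis, such as [0, 0, 1], lands on the principal point (320, 240).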

Camera radiometry:
• Sensors: how do they work, how do we measure color?
• Distortions: what are they and how/when do they occur?
3. 3D COMPUTER VISION
Depth from images:
• Which ways are there to get depth/3D from images?
• 3D reconstruction: Voxel vs. mesh models: (dis)advantages
• Silhouette-based reconstruction: how does it work (algorithm), look-up table, what can we model (limitations), how to improve speed/memory requirements, how to obtain a mesh model (algorithm)
• Experience from Assignment 2

Background subtraction
• How does it work, equation, assumptions, challenges
• Experience from Assignment 2
4. LEARNING-BASED COMPUTER VISION
Common vision tasks: image classification vs. object detection
• Role of image descriptors, intra-class vs. inter-class
• Supervised classification: generalization, overfitting
• Unsupervised classification: clustering, K-means (algorithm)

Object detection
• Sliding window, image pyramid, Selective Search
5. TRAINING, TESTING, AND PERFORMANCE MEASURES
Training
• Splits: training, validation, test sets
• Cross-validation, parameter tuning
• Hard negative mining (how to use), data augmentation (options, risks), data synthesis

Experiment design: parameter search: grid search, evolutionary optimization

Performance measures:
• Precision/recall, F1, PR-curve, average precision
• Single vs. multiclass: confusion matrix
• Detection: IOU, non-maximum suppression, AP
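
Two of these measures are easy to state in code. The sketch below uses the standard definitions of IoU, precision, recall, and F1; the function names and the (x1, y1, x2, y2) box format are assumptions for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    # Width and height of the overlap, clamped at zero for disjoint boxes.
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_recall_f1(tp, fp, fn):
    """Precision, recall and F1 from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


print(iou((0, 0, 2, 2), (1, 1, 3, 3)))  # overlap 1, union 7
print(precision_recall_f1(8, 2, 8))     # precision 0.8, recall 0.5
```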
6. NEURAL NETWORKS
Neurons:
• Activation functions, perceptrons, limitations, concepts

Networks:
• Feed-forward, hidden units, limitations, challenges, low vs. high-level features, non-linearity
• Training neural networks: backpropagation (no equations)
7. CONVOLUTIONAL NEURAL NETWORKS
Overall architecture: layers, inputs, outputs
• Convolution, pooling, fully connected, flatten layer, output layer

Receptive field

For all layers:

• Number of connections, parameters, activation volume size, how to connect them
• Experience from Assignment 3
8. TRAINING CNNS
Loss functions: all discussed
Learning rate: role, schedules
Optimizers: properties, (mini-batch) (stochastic) gradient descent, momentum
Regularization: L1/L2, batch normalization, dropout

Initialization: role (no equations)

Pretraining, fine-tuning
9. CNN ARCHITECTURES
Understand how the network works:
• AlexNet
• VGG

Inception: 1x1 convolution, auxiliary classifiers

ResNet/ DenseNet: vanishing gradient problem, skip connection

Two-stream network: video input, late/mid-level fusion

10. OBJECT DETECTION AND SEGMENTATION
Architectures:
• R-CNN: architecture, way of training
• SPP: spatial pyramid pooling
• Fast R-CNN: way of training
• Faster R-CNN: anchor boxes, end-to-end training
• YOLOv1: architecture, output, one-stage vs. two-stage object detection
• Mask R-CNN: additional segmentation head

Multi-task learning: concept, architecture, assumptions, multi-dataset, adversarial outputs

Insights from working on Assignment 4

11. VISION TRANSFORMERS
Vision transformer:
• Global processing steps, inductive bias
• Input preparation: patchify, embedding, class token, position encoding
• Encoder: architecture, self-attention mechanism
• Output processing: role of class token output
• Training

No Swin transformer
12. VISION-LANGUAGE MODELS
Aligning text and image:
• Importance and consequences

CLIP:
• Way of training, why to use it, zero-shot learning capabilities

Decoder:
• Architecture, process of outputting tokens, training options
PRACTICE EXAMS
Five test “exams” online:
• 2015: NOT 4-6
• 2016: NOT 3-7
• 2018 test: Answers at the end (NOT 2, this is also not a complete exam)
• 2018: NOT 5-9
• 2021 test: NOT 2, 3, 6
• Last one is the most representative one
• A new one will be provided

Don’t rely on these materials to “guess” which questions will be asked

REQUESTED TOPICS
TRAINABLE PARAMETERS
Trainable parameters are learned during training

All neural networks:

• Fully connected layer: weights between each input neuron and output neuron + bias

Convolutional neural networks:

• Convolution layer: kernel weights + bias
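
Counting these parameters is simple arithmetic. In the sketch below the function names and layer sizes are hypothetical, and each output neuron or kernel is assumed to have exactly one bias.

```python
def fc_params(n_in, n_out):
    # One weight per (input, output) pair plus one bias per output neuron.
    return n_in * n_out + n_out


def conv_params(kernel_h, kernel_w, depth_in, n_kernels):
    # Each kernel spans the full input depth and has a single bias.
    return (kernel_h * kernel_w * depth_in + 1) * n_kernels


print(fc_params(4096, 1000))       # 4097000
print(conv_params(3, 3, 64, 128))  # 73856
```

Note that the convolution count does not depend on the width or height of the input, only on its depth.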
TRAINABLE PARAMETERS (2)
Vision transformers:
Once per model (input and output processing of each image):
• Patch embedding matrix
• Positional encoding (when using learned encodings) (matrix)
• Initial class embedding (matrix)
• FC layers in the final MLP head

Per transformer block:

• Weight matrices WK, WQ, WV
• FC layers in MLP
• Weight matrix WO (for multi-head self-attention)
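
To get a feel for where the parameters of a vision transformer sit, the components listed above can be tallied as follows. This is a rough sketch under stated assumptions: the default sizes resemble a ViT-Base/16 style configuration and are not taken from the slides, every linear map is assumed to have a bias, the MLP head is assumed to be a single FC layer, and layer normalization parameters are left out because they are not in the list above.

```python
def linear(n_in, n_out):
    # Weights plus one bias per output unit.
    return n_in * n_out + n_out


def vit_params(patch=16, channels=3, dim=768, n_patches=196,
               n_blocks=12, mlp_hidden=3072, n_classes=1000):
    """Tally only the components named in the list above (assumed sizes)."""
    once = {
        "patch_embedding": linear(patch * patch * channels, dim),
        "position_encoding": (n_patches + 1) * dim,  # learned, includes class token slot
        "class_token": dim,
        "mlp_head": linear(dim, n_classes),          # assumed: a single FC layer
    }
    per_block = {
        "W_Q, W_K, W_V": 3 * linear(dim, dim),
        "W_O": linear(dim, dim),
        "mlp": linear(dim, mlp_hidden) + linear(mlp_hidden, dim),
    }
    total = sum(once.values()) + n_blocks * sum(per_block.values())
    return once, per_block, total


once, per_block, total = vit_params()
print(sum(per_block.values()))  # 7084800 parameters per transformer block
print(total)                    # 86529256 with the assumed sizes
```

With these assumed sizes, almost all parameters sit in the transformer blocks, and within a block the MLP is larger than the attention matrices.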
TRAINABLE PARAMETERS (3)
We distinguish between parameters and hyperparameters:
• Parameters: learned during training, part of the model
• Hyperparameters: govern the training process

Typically, we set hyperparameters ourselves before training

Bigger models usually have more parameters

• CNNs: more layers
• ViTs: larger embedding space, more transformer blocks
CONVOLUTION LAYERS
What are the inputs, parameters and outputs of a convolution layer?

Input: a volume WxHxD

• Each element in the input is a neuron

Output: a volume W’xH’xD’

• Again, each element is a neuron
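
The size of the output volume follows from the layer settings. The sketch below uses the common formula W' = (W - K + 2P) / S + 1. The kernel size K, stride S and padding P are not named on this slide and are introduced here as assumptions, as is the function name.

```python
def conv_output_volume(w, h, d, n_kernels, kernel, stride=1, padding=0):
    """Output volume (W', H', D') of a convolution layer with square kernels."""
    # The input depth d only sets the depth of each kernel; it does not
    # appear in the output size.
    w_out = (w - kernel + 2 * padding) // stride + 1
    h_out = (h - kernel + 2 * padding) // stride + 1
    # The output depth equals the number of kernels.
    return w_out, h_out, n_kernels


print(conv_output_volume(32, 32, 3, n_kernels=16, kernel=5))             # (28, 28, 16)
print(conv_output_volume(32, 32, 3, n_kernels=16, kernel=3, padding=1))  # (32, 32, 16)
```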
CONVOLUTION LAYERS (2)
Trainable parameters are in the convolution kernels
• These determine what kind of patterns are extracted
• All kernel elements + a bias term

The number of parameters is small relative to the input size and does not grow with it

• Consequence of weight sharing
CONVOLUTION LAYERS (3)
Neurons in input volume are connected to neurons in output volume
• Not fully connected but locally connected

Example:
• Output s2 depends on x1-x3
• Multiplied by the yellow, black, and blue weights
• Output is the weighted sum
• Output s3 depends on x2-x4
• Multiplied by the same yellow, black, and blue weights
• Output is the weighted sum
CONVOLUTION LAYERS (4)
Input and output layer connected with shared weights
• The same weight values are reused across many connections
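
Local connectivity and weight sharing can be seen in a one-dimensional sketch. The function name `conv1d` and the numbers are hypothetical, no padding is used, and the operation is written without flipping the kernel, as is usual in CNNs. The first output below plays the role of the output that depends on x1-x3 in the example.

```python
def conv1d(x, kernel, bias=0.0):
    """Slide one shared kernel over x; each output sees only len(kernel) inputs."""
    k = len(kernel)
    return [sum(kernel[j] * x[i + j] for j in range(k)) + bias
            for i in range(len(x) - k + 1)]


x = [1.0, 2.0, 3.0, 4.0, 5.0]  # inputs x1..x5
kernel = [0.5, 1.0, -1.0]      # the three shared weights
print(conv1d(x, kernel))       # [-0.5, 0.0, 0.5]
```

Five inputs and three outputs are connected with only three weights, whereas a fully connected layer of the same size would need fifteen.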
BACKPROPAGATION
The goal of backpropagation is to calculate the gradient of the loss with respect to each parameter
• The gradient determines (with optimizer and learning rate) the change in the value of the parameter (the weight update)

Developed for feed-forward neural networks

• Each output can be described as a (non-linear) function of the inputs
BACKPROPAGATION (2)
Simple mathematical formulation for a regular neural network at layer $L$:
• Input $a^L$
• Output $a^{L+1}$
• Weights between layer $L$ and layer $L+1$: $W^L$
• Activation function $f^L$

Output calculated as: $a^{L+1} = f^L(W^L a^L)$
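
The layer equation above can be sketched directly. The function names, weights and choice of activation functions below are hypothetical, and biases are left out to match the equation.

```python
import math


def layer_forward(W, a, f):
    """One layer: multiply the input a by the weight matrix W, then apply f."""
    return [f(sum(w_ij * a_j for w_ij, a_j in zip(row, a))) for row in W]


def relu(z):
    return max(0.0, z)


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


a1 = [1.0, 2.0]                              # network input
W1 = [[0.5, -1.0], [1.0, 1.0], [-1.0, 0.5]]  # 2 inputs -> 3 hidden units
W2 = [[1.0, -1.0, 2.0]]                      # 3 hidden units -> 1 output

a2 = layer_forward(W1, a1, relu)     # [0.0, 3.0, 0.0]
a3 = layer_forward(W2, a2, sigmoid)  # the output is a nested function of the input
print(a2, a3)
```

Chaining the calls shows why each output can be described as a non-linear function of the inputs.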

BACKPROPAGATION (3)
At the last layer $N$, the output is $a^N$

During training, output $a^N$ is compared to the actual output $y$
• Loss function $C$ used: $C(a^N, y)$
• $C$ is higher when $a^N$ deviates further from $y$

But $a^N$ is calculated from $W^{N-1}$ and $a^{N-1}$!

[Figure: network diagram with nodes $a_1^{N-1}$, $a_2^{N-1}$, $a^N$, $C$ and $y$]

BACKPROPAGATION (4)
Each input into $a^N$ contributes to the loss
• We can calculate the partial derivative of $C$ with respect to each input of $a^N$

Adjust $W^{N-1}$ and $a^{N-1}$ to move in the right direction
• But then we also need to change $W^{N-2}$ and $a^{N-2}$
• Until we reach the input

[Figure: network diagram with nodes $a_1^{N-1}$, $a_2^{N-1}$, $a^N$, $C$ and $y$]
BACKPROPAGATION (5)
In this backward pass, we have visited each parameter
• Received a “weight update” to move in the correct direction

But many parameters connect to multiple neurons

• Weight updates typically accumulated per parameter
BACKPROPAGATION (6)
For convolution layers, weights are shared
• Weight updates are accumulated over (many) different paths
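
Accumulation over paths can be illustrated for a one-dimensional convolution without padding. In this sketch the function name and the upstream gradients are hypothetical. Each output position contributes one term to the gradient of every kernel weight, and the terms are summed.

```python
def conv1d_kernel_grad(x, grad_s, k):
    """Gradient of the loss with respect to each of the k shared kernel weights."""
    grad_kernel = [0.0] * k
    for i, g in enumerate(grad_s):          # one path per output position
        for j in range(k):
            grad_kernel[j] += g * x[i + j]  # accumulate, do not overwrite
    return grad_kernel


x = [1.0, 2.0, 3.0, 4.0, 5.0]
grad_s = [1.0, 1.0, 1.0]  # hypothetical gradients of the loss w.r.t. the three outputs
print(conv1d_kernel_grad(x, grad_s, 3))  # [6.0, 9.0, 12.0]
```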
BACKPROPAGATION (7)
For pooling layers, updating the weights in preceding layers depends on pooling type

For max-pool, only a single value is selected

• Gradient of 1 for the selected (maximum) input
• Gradient of 0 for all other inputs

For average pooling, all inputs are used, but the gradient is divided by the number of inputs
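
Both cases can be sketched for a single pooling window. The function names are hypothetical, and for max-pooling the sketch assumes ties go to the first maximum, which is a convention and not something the slides specify.

```python
def max_pool_backward(inputs, grad_out):
    """Route the incoming gradient to the selected (maximum) input only."""
    winner = inputs.index(max(inputs))  # first maximum on ties (assumed convention)
    return [grad_out if i == winner else 0.0 for i in range(len(inputs))]


def avg_pool_backward(inputs, grad_out):
    """Spread the incoming gradient equally over all inputs."""
    n = len(inputs)
    return [grad_out / n] * n


window = [0.2, 1.5, -0.3, 0.9]
print(max_pool_backward(window, 2.0))  # [0.0, 2.0, 0.0, 0.0]
print(avg_pool_backward(window, 2.0))  # [0.5, 0.5, 0.5, 0.5]
```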
FINALLY…
ASSIGNMENT
Assignment 4:
• Deadline: Sunday March 30, 23:00
• Don’t underestimate the time required to prepare the outputs and implement the loss function

Need help?
• Use Teams for questions
EXAM
Monday April 7, 13:30-15:30, EDUC-Alfa
• Two hours (plus 20 if you’re eligible for extra time)
• No materials or calculators allowed
• Just a pen and food/drinks

For people who have the “minder massaal” (less crowded room) provision

• Ruppert-029, 13:30-15:30
COURSE EVALUATION
I hope you have enjoyed the course!

Please give us feedback by filling in the Caracal course evaluation form. We always like to improve the course:
• If you have suggestions
• If you thought something was bad
• If you enjoyed something
FINALLY…
Good luck with the exam and final assignment!

And thanks for your enthusiasm!
