Exercises INF 5860: Exercise 1 Linear Regression

This document contains 12 exercises covering a range of deep learning topics:
- Exercises 1-3 cover linear regression, logistic regression, and basic neural networks
- Exercises 4-6 discuss generalization, representations, and convolutional networks
- Exercises 7-8 focus on training deep networks and popular architectures
- Exercises 9-10 involve visualization, adversarial training, and recurrent neural networks
- Exercises 11-12 explore reinforcement learning, unsupervised learning, and embedding techniques

The exercises consist of conceptual questions about algorithms and their applications across deep learning domains.

Uploaded by

Patrick O'Rourke


Exercises INF 5860

Exercise 1 Linear regression

a) What is the loss function for linear regression?

b) Why would we want an iterative algorithm for the linear regression problem?

c) How does gradient descent update the estimate? Give the general formula.

d) Given x=
Plot x,y as points in a plot.
e) If we start with ϴ0=0 and ϴ1=0, compute the initial value of the loss function.

f) If we start with ϴ0=0 and ϴ1=0, compute the estimate after one iteration if the learning rate
is 1.
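
The computations asked for in parts c), e) and f) can be sketched as follows. The x and y values of part d) are missing from this copy, so the data below is purely hypothetical, and the 1/(2m) scaling of the squared-error loss is one common convention rather than something the exercise states. The function names `loss` and `gradient_step` are illustrative.

```python
# Hypothetical data: the actual x, y values of part d) are not available here.
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]


def loss(theta0, theta1, xs, ys):
    """Squared-error loss with an assumed 1/(2m) scaling."""
    m = len(xs)
    return sum((theta0 + theta1 * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)


def gradient_step(theta0, theta1, xs, ys, lr):
    """One batch gradient descent update: theta <- theta - lr * dJ/dtheta."""
    m = len(xs)
    residuals = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
    grad0 = sum(residuals) / m                               # dJ/dtheta0
    grad1 = sum(r * x for r, x in zip(residuals, xs)) / m    # dJ/dtheta1
    return theta0 - lr * grad0, theta1 - lr * grad1


initial_loss = loss(0.0, 0.0, xs, ys)
theta0, theta1 = gradient_step(0.0, 0.0, xs, ys, lr=1.0)
print(initial_loss, theta0, theta1)
```

With these made-up points the initial loss is 56/6 and a single step with learning rate 1 moves the estimate to ϴ0 = 4, ϴ1 = 28/3. That overshoots the exact fit ϴ0 = 0, ϴ1 = 2, which illustrates why the learning rate matters.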

Exercise 2 – Logistic classification

a) Given a trained logistic classifier for a single feature and 2 classes, what is the equation for the
decision boundary if W=2 and b=1?

b) With logistic classification, how is a new sample classified?

c) How can we generalize logistic classification to more than 2 classes?
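
For parts a) and b), a small sketch may help. It assumes the usual sigmoid model p(class 1 | x) = sigmoid(Wx + b) and a 0.5 probability threshold; the exercise states neither explicitly. Which class is labeled 1 and the name `classify` are also assumptions.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


W, b = 2.0, 1.0  # values given in part a)

# sigmoid(z) equals 0.5 exactly when z = 0, so the boundary solves W*x + b = 0.
boundary = -b / W


def classify(x, threshold=0.5):
    """Assign class 1 if the predicted probability reaches the threshold, else class 0."""
    return 1 if sigmoid(W * x + b) >= threshold else 0


print(boundary, classify(0.0), classify(-1.0))
```

Under these assumptions the boundary sits at x = -0.5: samples to its right are assigned class 1, and samples to its left class 0.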

Exercise 3: Basic neural networks

a) What are the problems with using a sigmoid activation function?

b) In what way does the tanh activation function share the same drawbacks?

c) Discuss briefly why feature scaling of the input features is important.

d) Why is initializing all the weights to zero problematic?

e) Assume that we have a 2-layer net (one hidden layer) with weights W(1), b(1) , W(2) and b(2) .
Assume that we use RELU-activations in the hidden layer, and no activation on the output
layer. Write down an equation for the output of the j’th node in the hidden layer, a(j) .

f) Explain briefly how maxnorm regularization works.

g) When using dropout, if you do not consider any scaling during training, how should you then
compensate during prediction of new data/test samples?

h) Explain briefly how momentum gradient descent works, and why this can be more robust
than regular gradient descent.
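
The network described in part e) can be sketched as a forward pass in plain Python. The weight values and the input are hypothetical. The indexing convention, where row j of W(1) holds the weights into hidden node j, is also an assumption, since the exercise does not fix one.

```python
def relu(z):
    return max(0.0, z)


def forward(x, W1, b1, W2, b2):
    """2-layer net: ReLU hidden layer, linear (no activation) output layer."""
    # Hidden node j: a_j = relu(sum_i W1[j][i] * x[i] + b1[j])
    hidden = [relu(sum(w * xi for w, xi in zip(row, x)) + bj)
              for row, bj in zip(W1, b1)]
    # Output node k: y_k = sum_j W2[k][j] * a_j + b2[k]
    output = [sum(w * a for w, a in zip(row, hidden)) + bk
              for row, bk in zip(W2, b2)]
    return hidden, output


# Hypothetical parameters: 2 inputs, 2 hidden nodes, 1 output.
x = [1.0, 2.0]
W1 = [[1.0, 0.5], [-1.0, -1.0]]
b1 = [0.0, 0.5]
W2 = [[2.0, -1.0]]
b2 = [0.1]

hidden, output = forward(x, W1, b1, W2, b2)
print(hidden, output)
```

The second hidden node has a negative pre-activation (-2.5), so ReLU clips it to zero and it contributes nothing to the output.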

Exercise 3: A simple network

a) Perform backpropagation on this graph

Exercise 4: Generalization

1. Why can testing out multiple models on your test data be a problem and when is it
problematic?

2. How does searching through more hypotheses affect the probability of searching through a
solution close to the correct solution?

3. Give an example of a way to measure model complexity.

4. What does the “No free lunch” theorem imply for machine learning?

5. Give three examples of common assumptions (priors) machine learning models make.

Exercise 5: Representations

1. What do we mean when we refer to the image manifold?

2. Explain why working with image gradients can be better than working with raw pixels. What
additional effect can you achieve by scaling the gradients based on the gradient magnitude?

3. How can multiple layers of discriminators/classifiers reduce the need for training examples in
image analysis?

Exercise 6: Convolutional nets

1. You have a 32x32x5 image and filter it with a 5x5x5 kernel, the way most convolutional
neural networks are implemented. If you use no padding, what will be the output size of the
activation map?

2. What do we mean by dilated convolutions and how are they used?

3. Why is the effective field-of-view usually smaller than the theoretical field-of-view? By
theoretical field-of-view we mean the size of the image patch that can influence each of the
output values in the activation map. Effective field-of-view is the size of the patch of pixels
that in practice influences the result of a given output value.

4. In deep learning frameworks, you usually operate on 4D tensors when working with 2D
convolutions. If you want to use such a framework to do an average (blur) filtering of images,
how would you have to construct the kernel for the convolution? You should treat each of
the color channels (RGB) independently.
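
Questions 1 and 4 both come down to shape bookkeeping, sketched below in plain Python. The output-size formula is the standard one for a convolution with a given padding and stride. The kernel layout (out_channels, in_channels, height, width) follows the PyTorch convention, which is an assumption here because other frameworks order the axes differently. The helper names and the 3x3 filter size are illustrative.

```python
def conv_output_size(in_size, kernel, padding=0, stride=1):
    """Spatial output size along one dimension of a convolution."""
    return (in_size + 2 * padding - kernel) // stride + 1


# Question 1: 32x32x5 input, one 5x5x5 kernel, no padding, stride 1.
side = conv_output_size(32, 5)
print(side, "x", side, "x 1")  # one kernel spans all 5 channels, giving one map


def average_kernel(channels=3, k=3):
    """4D averaging kernel, indexed [out_channel][in_channel][row][col].

    Output channel o reads only input channel o, so R, G and B are
    blurred independently; all cross-channel weights are zero.
    """
    w = 1.0 / (k * k)
    return [[[[w if o == i else 0.0 for _ in range(k)]
              for _ in range(k)]
             for i in range(channels)]
            for o in range(channels)]


kernel = average_kernel()
```

Each output channel's weights sum to 1, so the filter preserves the mean intensity of the channel it blurs.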

Exercise 7: Training deep networks

1. Gradient flow
a. Why is gradient flow important when training deep neural networks?
b. Give some common methods that help to ensure good gradient flow.

2. How does batch size relate to learning rate? Explain.

3. Why is it a problem to optimize accuracy directly with a deep neural network?

Exercise 8: Deep learning architectures

1. Give two possible explanations for why residual networks work better than standard
feedforward networks.

2. You want to find bounding boxes for cars in an image. You don’t know how many cars there
will be in each image, but you can safely assume it is between 0 and 100. Describe how you can
construct and train a deep neural network for this task.

3. What does it mean to use a “Fully-convolutional” architecture for image segmentation?

4. What is the reasoning behind the concatenation operations in U-Net for image
segmentation?

Exercise 9: Visualization and Adversarial training

1. You have a convolutional neural network trained for image classification. Describe a simple
way of detecting what parts of an image are responsible for a certain classification result,
without using the image gradients.

2. How can you get a simple estimate of how changing a set of pixel-values will affect the final
class probabilities?

3. For some visualization techniques, you apply a lowpass (blurring) filter between each
iteration of optimization. Why may this be a reasonable approach?

4. You have lots of training images for one application, but no labelled images for a similar
application. How can you use adversarial domain adaptation to improve your results on the
new data?

Exercise 10: Recurrent Neural networks (RNN)

1. Why are vanishing gradients and outputs a more common problem in basic RNNs than in
feedforward networks?

2. Why are vanishing gradients and outputs in RNNs less problematic than in feedforward neural
networks?

3. Why can you only do gradient descent for a certain number of iterations of an RNN, and when is
this a problem? Explain and provide an example.

4. Give an overview of some common solutions to using deep learning for video data.

Exercise 11: Reinforcement learning

1. Does reinforcement learning usually train faster or slower than standard supervised
learning?

2. In what kind of situations is it common to use Reinforcement learning? Explain why.

3. In what type of situation does Policy learning require a lot of memory?

4. How could you implement hard attention for image analysis in a fully supervised way,
without using reinforcement learning?

Exercise 12: Unsupervised learning

1. Draw and explain an example where t-SNE works better than PCA.

2. When you do a PCA of a dataset, you can easily transform new points with the same
transform. Why is it more difficult to transform new points with t-SNE?

3. Give a basic overview of what an autoencoder based on neural networks is.

4. Explain a typical situation where first learning an embedding unsupervised and then using
the embedding for supervised learning, can fail.
