Tutorial 1

Uploaded by

Он самый

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views11 pages

Tutorial 1

Uploaded by

Он самый

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Dynamic Programming &

Reinforcement Learning

Tutorial session 1
Exercise 1
Compute by hand the discounted (β=3/4) and long-run
average rewards for the following infinite series:
• 1,1,1,1,1,1,1,….
• 2,0,2,0,2,0,2,….
• 100 1’s, then ∞ many 0’s
• 100 0’s, then ∞ many 1’s
• Give a reason why you prefer discounting over
averaging
• Give a sufficient condition on the rewards under
which discounted series converge
• Give an example of a diverging series
Exercise 2
• Find the shortest path from A to D in the directed
graph below using dynamic programming (by hand)
B
3 13

A 10 D
1
5 5
C
• After how many iterations are you certain to have
found the optimal solution? Give 2 termination
conditions
Exercise 3
• Look up on the internet how Dijkstra’s
algorithm works and apply it to the
example of Exercise 2
• Construct a simple graph for which Dijkstra
does not work and backward recursion
does (hint: you are allowed to use negative
arc lengths)
Exercise 4
• Determine by enumeration (i.e., try all
combinations) the shortest path in the first
example (the drawing of the city) of the
transparencies of lecture 1
• Check the correctness by verifying the
backward recursion
• Implement the problem in some suitable
tool or language (R/Excel/…) and solve it
Exercise 5
• Consider an inventory problem with 10 items and
immediate replenishments
• Let N be the maximum stock level for each item
• What is the number of states?
• You have a computer with 200GB memory available
for computations
• How big can N be such that you can still execute a
backward recursion algorithm?
– Hint: argue first why it suffices to store only 2 vectors
the size of X in memory
Exercise 6
• Estimate the total number of positions of
chess pieces on the chess board (feasible
& unfeasible)
• Estimate the total number of feasible
positions
• Compare your estimation with information
you find on the web
Exercise 7
• Consider a knapsack problem with W=10,
T=4, w=(5,4,3,3) and v=(3,2,2,2)
• Solve the problem by:
a) dynamic programming
b) a decision tree in which you consider all
possibilities
c) What is the complexity (≈ number of
calculations) of both methods as a function of
W and T?
Exercise 8
• Prove the following property of knapsack
problems: Vt(x) ≤ Vt(y) for all t and x ≤ y
• Hint: use induction on t starting from T
Exercise 9
• A variant of the Ludo board game (Dutch:
Mens-erger-je-niet) has the following rules.
A token advances with the role of a die,
when a player roll a 6 he/she can roll again
until the outcome is less than 6. What is
the expected number of squares that the
token advances in a full turn?

German board (wikipedia)

Exercise 10
• Two common examples of discrete and continuous
distributions are the Poisson and exponential
distribution:
λ n −λ
N ∼ Poisson(λ) ⇔ P(N = n) = e
n!
X ∼ exp(μ) ⇔ P(X ≤ t) = 1 − e−μt
• What are their expectations EN and EX?
(Hint: for the exponential distribution first derive the density
and then use partial integration; or use a formula for the
expectation that uses the tail of F)

Painless Pre-Algebra
From Everand
Painless Pre-Algebra
Barron's Educational Series
3/5 (2)
Reinforcement Learning: Foundations Exam
No ratings yet
Reinforcement Learning: Foundations Exam
42 pages
RL Theory Tutorial
No ratings yet
RL Theory Tutorial
80 pages
Chapter3_odd
No ratings yet
Chapter3_odd
13 pages
Permodelan
No ratings yet
Permodelan
165 pages
Dynamic Programming Handout-IICPC
No ratings yet
Dynamic Programming Handout-IICPC
5 pages
MIT6 006F11 Lec20 PDF
No ratings yet
MIT6 006F11 Lec20 PDF
6 pages
RL UNIT PPT
No ratings yet
RL UNIT PPT
595 pages
Tut21 RL
No ratings yet
Tut21 RL
101 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
101 pages
Dynamic Programming
No ratings yet
Dynamic Programming
46 pages
Lec13 Dynamic Programming
No ratings yet
Lec13 Dynamic Programming
47 pages
Dynamic Programming: Md. Bakhtiar Hasan
No ratings yet
Dynamic Programming: Md. Bakhtiar Hasan
70 pages
CS648A 1 Overview of the Course 2025
No ratings yet
CS648A 1 Overview of the Course 2025
35 pages
What We Learned Last Time: 1. Intelligence Is The Computational Part of The Ability To Achieve Goals
No ratings yet
What We Learned Last Time: 1. Intelligence Is The Computational Part of The Ability To Achieve Goals
32 pages
RL Class Notes (4)
No ratings yet
RL Class Notes (4)
68 pages
notes-2
No ratings yet
notes-2
8 pages
Dynamic Progamming
No ratings yet
Dynamic Progamming
181 pages
Dynamic Programming
No ratings yet
Dynamic Programming
4 pages
Final Notes Chapter1 7 1
No ratings yet
Final Notes Chapter1 7 1
19 pages
dp
No ratings yet
dp
38 pages
Dynamic Programming
No ratings yet
Dynamic Programming
7 pages
Dynamic Programming: - A Session by Infero
No ratings yet
Dynamic Programming: - A Session by Infero
42 pages
Q-Learning and Deep Q Networks (DQN)
No ratings yet
Q-Learning and Deep Q Networks (DQN)
52 pages
Top-Down DP - G5 - I (With Code)
No ratings yet
Top-Down DP - G5 - I (With Code)
61 pages
Lecture 19: Dynamic Programming I: Memoization, Fibonacci, Shortest Paths, Guessing
No ratings yet
Lecture 19: Dynamic Programming I: Memoization, Fibonacci, Shortest Paths, Guessing
6 pages
05 DP1
No ratings yet
05 DP1
19 pages
A Guide To Competitive Programming
No ratings yet
A Guide To Competitive Programming
8 pages
DAA-Unit4-2025
No ratings yet
DAA-Unit4-2025
178 pages
AIML Lab manual final
No ratings yet
AIML Lab manual final
43 pages
Advanced Algorithms
No ratings yet
Advanced Algorithms
218 pages
Dynamic Programming Handout - : 14.451 Recitation, February 18, 2005 - Todd Gormley
No ratings yet
Dynamic Programming Handout - : 14.451 Recitation, February 18, 2005 - Todd Gormley
11 pages
Computational Complexity: Definition of Mixed-Integer Programming
No ratings yet
Computational Complexity: Definition of Mixed-Integer Programming
8 pages
CS3491- AI&ML Lab Record
No ratings yet
CS3491- AI&ML Lab Record
47 pages
Tutorial
No ratings yet
Tutorial
81 pages
L8_EC4070 (1)
No ratings yet
L8_EC4070 (1)
37 pages
3 DP PDF
No ratings yet
3 DP PDF
42 pages
AI&ML Lab Manual
No ratings yet
AI&ML Lab Manual
50 pages
Probab 10
No ratings yet
Probab 10
3 pages
6.006 Introduction To Algorithms: Mit Opencourseware
No ratings yet
6.006 Introduction To Algorithms: Mit Opencourseware
5 pages
Subex Linear Programming
No ratings yet
Subex Linear Programming
19 pages
50DynamicProgramming
No ratings yet
50DynamicProgramming
9 pages
Aiml Cse Record
No ratings yet
Aiml Cse Record
49 pages
HW2
No ratings yet
HW2
5 pages
15-17 Dynamic Programming - Algorithms (Series Lecture)
No ratings yet
15-17 Dynamic Programming - Algorithms (Series Lecture)
63 pages
Instant Access to Programming Interview Problems: Dynamic Programming (with solutions in Python) 1st Edition Leonardo Rossi ebook Full Chapters
100% (18)
Instant Access to Programming Interview Problems: Dynamic Programming (with solutions in Python) 1st Edition Leonardo Rossi ebook Full Chapters
65 pages
Exercises
No ratings yet
Exercises
62 pages
2025_MDPs 1
No ratings yet
2025_MDPs 1
62 pages
Detailed_Algorithms_Examples
No ratings yet
Detailed_Algorithms_Examples
5 pages
Lab07
No ratings yet
Lab07
5 pages
11-DL-Deep Learning For Reinforcement Learning
No ratings yet
11-DL-Deep Learning For Reinforcement Learning
47 pages
Dynamic_Programming_and_Optimal_Control
No ratings yet
Dynamic_Programming_and_Optimal_Control
62 pages
Home Exercise 3: Dynamic Programming and Randomized Algorithms
No ratings yet
Home Exercise 3: Dynamic Programming and Randomized Algorithms
5 pages
Toolbox Stat
No ratings yet
Toolbox Stat
32 pages
TOA-cheatsheet
No ratings yet
TOA-cheatsheet
43 pages
CS2311 Lec05 Function
No ratings yet
CS2311 Lec05 Function
82 pages
Vdoc.pub Discrete Optimization Spring 2015
No ratings yet
Vdoc.pub Discrete Optimization Spring 2015
82 pages
Week 14
No ratings yet
Week 14
35 pages
0282 Algorithms (1)
No ratings yet
0282 Algorithms (1)
90 pages
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
2.5/5 (2)
CR 88 Intro 2022
No ratings yet
CR 88 Intro 2022
6 pages
haskell
No ratings yet
haskell
3 pages
Masters-proj-gantt
No ratings yet
Masters-proj-gantt
1 page
windmillswinekebab_LATE_126681_5585159_windmillswinekebab-Assignment3
No ratings yet
windmillswinekebab_LATE_126681_5585159_windmillswinekebab-Assignment3
15 pages
CORDIS_project_688228_en
No ratings yet
CORDIS_project_688228_en
6 pages
s11219-015-9292-4
No ratings yet
s11219-015-9292-4
30 pages
Layering
No ratings yet
Layering
3 pages
Mandalas 3rd - 5th Grade: Teacher Name(s) Grade(s) /content Area Content Area(s) Integrated Unit Title
No ratings yet
Mandalas 3rd - 5th Grade: Teacher Name(s) Grade(s) /content Area Content Area(s) Integrated Unit Title
9 pages
Machinery's Handbook, Large Print, 30th Edition (eBook PDF) - Quickly download the ebook to start your content journey
100% (1)
Machinery's Handbook, Large Print, 30th Edition (eBook PDF) - Quickly download the ebook to start your content journey
44 pages
(Indefinite Integrals) : Problem 1
No ratings yet
(Indefinite Integrals) : Problem 1
1 page
MT+BBA+ATA+502+QP+101224
No ratings yet
MT+BBA+ATA+502+QP+101224
5 pages
Edci 332: Mathematics Teaching Methods Department: Ciem Group 4
No ratings yet
Edci 332: Mathematics Teaching Methods Department: Ciem Group 4
5 pages
CH 1
No ratings yet
CH 1
62 pages
Continuous Random Variable
No ratings yet
Continuous Random Variable
24 pages
1
No ratings yet
1
11 pages
What Is Matlab?: Introduction To Matlab Programming
No ratings yet
What Is Matlab?: Introduction To Matlab Programming
7 pages
Analisis Pengendalian Motor DC Menggunakan Logika Pid Dengan Mikro Kontroler ATMEGA 8535
No ratings yet
Analisis Pengendalian Motor DC Menggunakan Logika Pid Dengan Mikro Kontroler ATMEGA 8535
7 pages
Differential Algebra - Joseph Ritt
No ratings yet
Differential Algebra - Joseph Ritt
189 pages
Precalculus MATH 1730 Final Exam Review
No ratings yet
Precalculus MATH 1730 Final Exam Review
29 pages
3.1 Why Boolean Algebra?
No ratings yet
3.1 Why Boolean Algebra?
18 pages
SolidWORKS (FEA) Simulation Theory Manual
100% (4)
SolidWORKS (FEA) Simulation Theory Manual
115 pages
Numerical Method For Linear Algebra
No ratings yet
Numerical Method For Linear Algebra
16 pages
pc11 Sol c03 CP
No ratings yet
pc11 Sol c03 CP
3 pages
Assessment in Learning 1 Midterm Problem Set 2nd Sem 22-23
No ratings yet
Assessment in Learning 1 Midterm Problem Set 2nd Sem 22-23
7 pages
Chapter 1 Basic Concepts of Error Estimation
100% (1)
Chapter 1 Basic Concepts of Error Estimation
23 pages
2021 en Mathematics 10-11-12 STEM
No ratings yet
2021 en Mathematics 10-11-12 STEM
4 pages
Optimization-Based Control: Richard M. Murray Control and Dynamical Systems California Institute of Technology
No ratings yet
Optimization-Based Control: Richard M. Murray Control and Dynamical Systems California Institute of Technology
21 pages
Complex Analysis PDF
100% (2)
Complex Analysis PDF
162 pages
Studyguide360: Application of Derivatives Important Points To Remember
No ratings yet
Studyguide360: Application of Derivatives Important Points To Remember
14 pages
Liu 2009
No ratings yet
Liu 2009
4 pages
IGCSE Math Nov 2014 QP - 41
No ratings yet
IGCSE Math Nov 2014 QP - 41
20 pages
Chapter 8 Sts Reporting
No ratings yet
Chapter 8 Sts Reporting
19 pages
MATH 220 Course Notes 2022-09-06
No ratings yet
MATH 220 Course Notes 2022-09-06
133 pages
Concrete Art Manifesto 1930
No ratings yet
Concrete Art Manifesto 1930
70 pages
Eulers de Serie Lambertina Translated From Latin To English Wit
No ratings yet
Eulers de Serie Lambertina Translated From Latin To English Wit
22 pages
Combinatorics
No ratings yet
Combinatorics
110 pages
The Unit Circle - Radian Measure
100% (5)
The Unit Circle - Radian Measure
1 page

Tutorial 1

Uploaded by

Tutorial 1

Uploaded by

Dynamic Programming &

German board (wikipedia)

You might also like