Reinforcement Learning - Unit 6 - Week 4

This document describes an online course on reinforcement learning from NPTEL, including the course structure, weekly topics, and assignments. It also contains a 10-question practice quiz on reinforcement learning and Markov decision processes, covering state transition probabilities, value functions, optimal policies, and convergence of algorithms.


Week 4: Assignment 4

Assignment not submitted. Due date: 2022-02-23, 23:59 IST.


1) State True/False (1 point)

The state transition graph for any MDP is a directed acyclic graph.

True
False
2) Consider the following statements: (1 point)

(i) The optimal policy of an MDP is unique.
(ii) We can determine an optimal policy for an MDP using only the optimal value function (v∗), without accessing the MDP parameters.
(iii) We can determine an optimal policy for a given MDP using only the optimal q-value function (q∗), without accessing the MDP parameters.

Which of these statements are true?

Only (ii)
Only (iii)
Only (i), (ii)
Only (i), (iii)
Only (ii), (iii)
3) Which of the following statements are true for a finite MDP? (Select all that apply) (1 point)

The Bellman equation of a value function of a finite MDP defines a contraction in a Banach space (using the max norm).
If 0 ≤ γ < 1, then the eigenvalues of γPπ are less than 1.
We call a normed vector space 'complete' if Cauchy sequences exist in that vector space.
The sequence defined by $v_n = r_\pi + \gamma P_\pi v_{n-1}$ is a Cauchy sequence in a Banach space (using the max norm).

(Pπ is a stochastic matrix)
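As a quick numerical illustration of the contraction property referred to above, the following sketch (my own example, not from the course material; it assumes a randomly generated row-stochastic Pπ and an arbitrary reward vector rπ) applies the policy-evaluation operator to two different vectors and shows that their max-norm distance shrinks by at least a factor of γ at every step:

```python
import numpy as np

rng = np.random.default_rng(0)

n_states = 4
P_pi = rng.random((n_states, n_states))
P_pi /= P_pi.sum(axis=1, keepdims=True)   # normalise rows so P_pi is a stochastic matrix
r_pi = rng.standard_normal(n_states)      # arbitrary reward vector for the illustration
gamma = 0.9

def bellman_policy_op(v):
    """Apply the policy-evaluation operator v -> r_pi + gamma * P_pi v."""
    return r_pi + gamma * P_pi @ v

u = rng.standard_normal(n_states)
v = rng.standard_normal(n_states)
for _ in range(5):
    ratio = np.max(np.abs(bellman_policy_op(u) - bellman_policy_op(v))) / np.max(np.abs(u - v))
    print(ratio)                          # never exceeds gamma: the operator is a contraction
    u, v = bellman_policy_op(u), bellman_policy_op(v)
```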
4) Which of the following is a benefit of using RL algorithms for solving MDPs? (1 point)

They do not require the state of the agent for solving an MDP.
They do not require the action taken by the agent for solving an MDP.
They do not require the state transition probability matrix for solving an MDP.
They do not require the reward signal for solving an MDP.
5) Consider the following equations: (1 point)

(i) $v_\pi(s) = \mathbb{E}_\pi\left[\sum_{i=t}^{\infty} \gamma^{\,i-t} R_{i+1} \;\middle|\; S_t = s\right]$

(ii) $q_\pi(s, a) = \sum_{s'} p(s' \mid s, a)\, v_\pi(s')$

(iii) $v_\pi(s) = \sum_{a} \pi(a \mid s)\, q_\pi(s, a)$

Which of the above are correct?

Only (i)
Only (i), (ii)
Only (ii), (iii)
Only (i), (iii)
(i), (ii), (iii)
6) What is true about γ (the discount factor) in reinforcement learning? (1 point)

The discount factor can be any real number.
The value of γ cannot affect the optimal policy.
The lower the value of γ, the more myopic the agent gets, i.e. the agent maximises the rewards that it receives over a shorter horizon.
7) Consider the following statements for a finite MDP (I is an identity matrix with dimensions |S| × |S|, where S is the set of all states, and Pπ is a stochastic matrix): (1 point)

(i) An MDP with stochastic rewards may not have a deterministic optimal policy.
(ii) There can be multiple optimal stochastic policies.
(iii) If 0 ≤ γ < 1, then the rank of the matrix I − γPπ is equal to |S|.
(iv) If 0 ≤ γ < 1, then the rank of the matrix I − γPπ is less than |S|.

Which of the above statements are true?

Only (ii), (iii)
Only (ii), (iv)
Only (i), (iii)
Only (i), (ii), (iii)
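For statements (iii) and (iv), a small numerical check is sketched below (my own illustration, using a randomly generated row-stochastic matrix rather than any particular MDP from the course): when 0 ≤ γ < 1, every eigenvalue of γPπ has magnitude strictly below 1, so I − γPπ is invertible.

```python
import numpy as np

rng = np.random.default_rng(1)

n_states = 5                              # |S| for this illustration
P_pi = rng.random((n_states, n_states))
P_pi /= P_pi.sum(axis=1, keepdims=True)   # rows sum to 1, so P_pi is a stochastic matrix
gamma = 0.99

eigvals = np.linalg.eigvals(gamma * P_pi)
print(np.max(np.abs(eigvals)))            # largest eigenvalue magnitude, strictly below 1

rank = np.linalg.matrix_rank(np.eye(n_states) - gamma * P_pi)
print(rank)                               # rank of I - gamma * P_pi
```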

8) Consider an MDP with 3 states A, B, C. From each state we can go to either of the other two states, i.e. if we are in state A we can perform two actions: go to state B or go to state C. The rewards for each transition are r(A, B) = −2 (the reward if we go from A to B), r(B, A) = 3, r(B, C) = 10, r(C, B) = −7, r(A, C) = −2, r(C, A) = 4, and the discount factor is 0.9. Find the fixed point of the value function for the policy π(A) = B (i.e. if we are in state A we choose the action that goes to state B), π(B) = C, π(C) = A. vπ([A, B, C]) = ? (round to 1 decimal place) (1 point)

[35.2, 48.6, 10.7]

[37.8, 44.2, 38.0]

[37.8, 38.0, 44.2]

[40.6, 20.2, 75.3]
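A minimal sketch of how such a fixed point can be computed, assuming the deterministic policy is the one given symbolically (π(A) = B, π(B) = C, π(C) = A) with the rewards from the question: the fixed point of v = rπ + γPπv is obtained by solving the linear system (I − γPπ)v = rπ.

```python
import numpy as np

# States are ordered [A, B, C].
# Deterministic policy from the question: pi(A) = B, pi(B) = C, pi(C) = A,
# so P_pi[i, j] is 1 exactly when pi moves state i to state j.
P_pi = np.array([
    [0.0, 1.0, 0.0],   # A -> B
    [0.0, 0.0, 1.0],   # B -> C
    [1.0, 0.0, 0.0],   # C -> A
])

# Immediate rewards along the chosen transitions:
# r(A, B) = -2, r(B, C) = 10, r(C, A) = 4.
r_pi = np.array([-2.0, 10.0, 4.0])
gamma = 0.9

# Fixed point of v = r_pi + gamma * P_pi v, i.e. v = (I - gamma * P_pi)^{-1} r_pi.
v_pi = np.linalg.solve(np.eye(3) - gamma * P_pi, r_pi)
print(np.round(v_pi, 1))                  # v_pi for states [A, B, C]
```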

9) Which of the following is not a valid norm function? (x is a D-dimensional vector) (1 point)

$\max_{d \in \{1, \ldots, D\}} |x_d|$

$\sqrt{\sum_{d=1}^{D} x_d^2}$

$\min_{d \in \{1, \ldots, D\}} |x_d|$

$\sum_{d=1}^{D} |x_d|$
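For reference, the candidate functions above can be evaluated as follows (a generic illustration with an arbitrary example vector; it does not indicate which option fails the norm axioms):

```python
import numpy as np

x = np.array([3.0, -4.0, 1.0])            # an arbitrary D-dimensional example vector

print(np.max(np.abs(x)))                  # max_d |x_d|       (max / infinity norm)
print(np.sqrt(np.sum(x ** 2)))            # sqrt(sum_d x_d^2) (Euclidean / L2 norm)
print(np.min(np.abs(x)))                  # min_d |x_d|
print(np.sum(np.abs(x)))                  # sum_d |x_d|       (L1 norm)
```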

10) For an operator L, which of the following properties must be satisfied by x for it to be a fixed point of L? (Multi-Correct) (1 point)

Lx = x

L²x = x

∀λ > 0, Lx = λx

None of the above

You may submit any number of times before the due date. The final submission will be considered for grading.
