Week 10
Q1. Ram has the opportunity to make one of two bets (say A and B), invest equally in both bets, or make no bet; each bet is based on the outcome of a cricket match. The payoffs to Ram on winning/losing each bet are as described in the table below:
If Ram employs minimax regret to decide in this situation, what action does he take?
Makes bet A
Makes bet B
Invests equally in A and B
Makes no bet
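Since the question's payoff table is not reproduced here, the rule itself can still be illustrated: for each outcome, an action's regret is the gap between the best payoff available in that outcome and the action's own payoff, and minimax regret picks the action whose worst-case regret is smallest. A minimal sketch with purely hypothetical payoffs (not the question's actual numbers):

```python
# Minimax regret with a HYPOTHETICAL payoff table; the question's real
# table is not shown here, so these numbers are illustrative only.
payoffs = {                      # action -> (payoff if win, payoff if lose)
    "Bet A":   (100, -60),
    "Bet B":   (80, -20),
    "A and B": (90, -40),        # equal split of A and B
    "No bet":  (0, 0),
}
outcomes = [0, 1]                # index into the (win, lose) pair

# Best achievable payoff in each outcome.
best = {o: max(p[o] for p in payoffs.values()) for o in outcomes}

# Regret of an action = worst gap to the best payoff over all outcomes.
regret = {a: max(best[o] - p[o] for o in outcomes) for a, p in payoffs.items()}

# Minimax regret: choose the action with the smallest maximum regret.
choice = min(regret, key=regret.get)
```

With these made-up numbers Bet B would win (max regret 20), but the answer for the actual question depends entirely on the table in the quiz.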
O(|S|^2)
O(|S||A|)
O(|S|^2|A|)
O(|S||A|^2)
Accepted Answers:
O(|S|^2|A|)
For Questions 5–7:
The MDP has three states, S = {Standing, Moving, Fallen}, and two actions: moving the robot legs slowly (a) and moving the robot legs aggressively (b), denoted by the colours black and green respectively. The task is to perform policy iteration on this MDP with discount factor 1.
Q5. We start with a policy π(s) = a for all s in S and V^π(s) = 0 for all s. What is the value of the Fallen state after one iteration of the Bellman update during policy evaluation?
Accepted Answers: Risk-prone
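The robot MDP's transition diagram is not reproduced here, so the actual numeric answer cannot be recovered, but the computation Q5 asks for can be sketched. One Bellman update during policy evaluation, under the fixed initial policy π(s) = a and V = 0 everywhere, with purely hypothetical transitions and rewards standing in for the missing figure:

```python
# One Bellman update during policy evaluation for a FIXED policy pi.
# The robot MDP's real transitions/rewards come from a figure not shown
# here; the model below is an illustrative assumption only.
states = ["Standing", "Moving", "Fallen"]
pi = {s: "a" for s in states}          # initial policy: slow-leg action everywhere
V = {s: 0.0 for s in states}           # initial value function, V = 0
gamma = 1.0                            # discount factor from the question

# Hypothetical model: P[(s, a)] = list of (prob, next_state, reward).
P = {
    ("Fallen",   "a"): [(0.4, "Standing", 1.0), (0.6, "Fallen", -1.0)],
    ("Standing", "a"): [(1.0, "Moving",   1.0)],
    ("Moving",   "a"): [(0.8, "Moving",   1.0), (0.2, "Fallen", -1.0)],
}

def bellman_update(s):
    # V(s) <- sum over s' of P(s'|s, pi(s)) * [R(s, pi(s), s') + gamma * V(s')]
    return sum(p * (r + gamma * V[s2]) for p, s2, r in P[(s, pi[s])])

v_fallen = bellman_update("Fallen")    # = 0.4*1.0 + 0.6*(-1.0) = -0.2
```

Because V is initially zero, the first update reduces to the expected one-step reward under π; the same one-line computation with the figure's actual numbers gives the accepted answer.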
Q9. Which of the following statements are true regarding Markov Decision Processes
(MDPs)?
Accepted Answers:
We assume that the reward and cost models are independent of the
previous state transition history, given the current state.
MDPs assume full observability of the environment
Goal states may have transitions to other states in the MDP
Q10. Which of the following are true regarding value and policy iteration?
Value iteration is guaranteed to converge in a finite number of steps for any value of
epsilon and any MDP, if the MDP has a fixed point.
The convergence of policy iteration is dependent on the initial policy.
Value iteration is generally expected to converge in fewer iterations than policy iteration.
In each iteration of policy iteration, value iteration is run as a subroutine, using a fixed
policy
Accepted Answers:
Value iteration is guaranteed to converge in a finite number of steps for any
value of epsilon and any MDP, if the MDP has a fixed point.
In each iteration of policy iteration, value iteration is run as a subroutine,
using a fixed policy
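The accepted answer above describes the structure of policy iteration: each iteration runs iterative policy evaluation (Bellman updates with the action fixed by the current policy, i.e. value-iteration-style sweeps without the max) and then improves the policy greedily. A minimal sketch for a tabular MDP, assuming the same hypothetical `P[(s, a)] = [(prob, next_state, reward), ...]` interface as the earlier examples:

```python
def policy_iteration(states, actions, P, gamma, tol=1e-8):
    """Policy iteration: alternate evaluation of a fixed policy and greedy improvement."""
    pi = {s: actions[0] for s in states}   # arbitrary initial policy
    V = {s: 0.0 for s in states}
    while True:
        # Policy evaluation: Bellman updates under the FIXED policy pi,
        # swept until the value function stops changing (within tol).
        while True:
            delta = 0.0
            for s in states:
                v = sum(p * (r + gamma * V[s2]) for p, s2, r in P[(s, pi[s])])
                delta = max(delta, abs(v - V[s]))
                V[s] = v
            if delta < tol:
                break
        # Policy improvement: act greedily with respect to the evaluated V.
        stable = True
        for s in states:
            best = max(actions, key=lambda a: sum(
                p * (r + gamma * V[s2]) for p, s2, r in P[(s, a)]))
            if best != pi[s]:
                pi[s], stable = best, False
        if stable:                          # no action changed: pi is optimal
            return pi, V
```

Note that only the evaluation step resembles value iteration; the improvement step is a separate one-shot greedy pass, which is why the first and third options in Q10 are not both accepted.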