This document contains short answer and long answer questions about reinforcement learning. The short answer questions cover topics like reinforcement learning definitions, agents, active vs passive learning, utility-based vs Q-learning agents, prioritized sweeping, greedy agents, optimal vs suboptimal actions, bandit problems, and GLIE schemes. The long answer questions ask about passive reinforcement learning, direct utility estimates, adaptive dynamic programming, a passive learning algorithm using ADP, temporal-difference learning, differences between ADP and TD learning, active reinforcement learning, and explaining the Q-learning algorithm.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
254 views1 page
UNIT - V Question Bank
This document contains short answer and long answer questions about reinforcement learning. The short answer questions cover topics like reinforcement learning definitions, agents, active vs passive learning, utility-based vs Q-learning agents, prioritized sweeping, greedy agents, optimal vs suboptimal actions, bandit problems, and GLIE schemes. The long answer questions ask about passive reinforcement learning, direct utility estimates, adaptive dynamic programming, a passive learning algorithm using ADP, temporal-difference learning, differences between ADP and TD learning, active reinforcement learning, and explaining the Q-learning algorithm.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1
UNIT - V
QUESTION BANK
Short Answer Questions
1. What is Reinforcement learning? 2. Explain the significance of Reinforcement learning in Artificial Intelligence. 3. List the agents of Reinforcement learning. 4. Differentiate Active learning and Passive learning. 5. Differentiate Utility-based agent and Q-learning agent. 6. What is Prioritized Sweeping. 7. Define a Greedy Agent. 8. Discuss how choosing the optimal action leads to suboptimal results in Reinforcement learning. 9. What is Bandit Problem? 10. Explain greedy in the limit of infinite exploration (GLIE) scheme. 11. Define Q-learning. 12. Discuss the similarity and differences between Q-learning and SARSA (State-Action-Reward-State-Action).
Long Answer Questions
1. Explain Passive Reinforcement Learning. 2. How direct utility estimate is computed in Reinforcement learning? 3. Explain Adaptive Dynamic Programming (ADP). 4. Write and explain an algorithm for Passive Reinforcement learning agent based on Adaptive Dynamic Programming (ADP) 5. Explain Temporal-Difference Learning. 6. List and explain the differences between ADP and TD learning. 7. Explain Active Reinforcement Learning. 8. Write and explain Q-learning algorithm.