Lecture 22 - Value Function Approximation

The document outlines a lecture on Value Function Approximation in Reinforcement Learning, covering key topics such as Monte Carlo Learning, Temporal Difference Learning, and Q Learning. It emphasizes the importance of Gradient Descent and Stochastic Gradient Descent in the context of linear value function approximation. Additionally, it discusses the differences between tabular methods and approximation methods, and the use of neural networks as function approximators.

Uploaded by

Hadia Ramzan

AI-832 Reinforcement Learning

Instructor: Dr. Zuhair Zafar

Lecture # 22: Value Function Approximation


Recap

• Monte Carlo Learning

    • Prediction

    • Control

• Temporal Difference Learning

    • Prediction

        • TD(0), n-step TD, λ-return, TD(λ)

    • Control (On-policy Learning)

        • SARSA(0), n-step SARSA, SARSA(λ), Expected SARSA

    • Q-Learning (Off-policy Learning)


Today’s Agenda

• Value Function Approximation

• Gradient Descent

• Stochastic Gradient Descent

• Linear Value Function Approximation


Large-Scale Reinforcement Learning
Value Function Approximation
Types of Value Function Approximation
Tabular Methods vs. Approximation Methods

Monte Carlo / Temporal Difference Learning

Neural Network
Which Function Approximator?
Gradient Descent
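
As a minimal sketch of the idea behind this heading, gradient descent repeatedly moves a parameter vector w a small step against the gradient of an objective J(w). The objective, the step size `alpha`, and the function name `gradient_descent` below are illustrative assumptions and are not taken from the lecture.

```python
import numpy as np

def gradient_descent(grad, w0, alpha=0.1, steps=100):
    """Repeatedly step against the gradient: w <- w - alpha * grad(w)."""
    w = np.asarray(w0, dtype=float)
    for _ in range(steps):
        w = w - alpha * grad(w)
    return w

# Hypothetical objective J(w) = ||w - target||^2, with gradient 2 * (w - target).
target = np.array([1.0, -2.0])
w_star = gradient_descent(lambda w: 2.0 * (w - target), w0=[0.0, 0.0])
print(w_star)  # approaches `target` as the number of steps grows
```

Some presentations fold a factor of 1/2 into the step so that the 2 from differentiating a squared error cancels; that only rescales `alpha`.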
Value Function Approx. By Stochastic Gradient Descent
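
One standard way to write this down, assuming a differentiable approximator \hat{v}(s, \mathbf{w}) and the mean-squared error against the true value function v_\pi as the objective (the notation here is an assumption, not copied from the slides):

```latex
\begin{aligned}
J(\mathbf{w}) &= \mathbb{E}_\pi\!\left[\big(v_\pi(S) - \hat{v}(S,\mathbf{w})\big)^2\right] \\
\Delta\mathbf{w} &= -\tfrac{1}{2}\,\alpha\,\nabla_{\mathbf{w}} J(\mathbf{w})
  = \alpha\,\mathbb{E}_\pi\!\left[\big(v_\pi(S) - \hat{v}(S,\mathbf{w})\big)\,\nabla_{\mathbf{w}}\hat{v}(S,\mathbf{w})\right] \\
\Delta\mathbf{w} &= \alpha\,\big(v_\pi(S) - \hat{v}(S,\mathbf{w})\big)\,\nabla_{\mathbf{w}}\hat{v}(S,\mathbf{w})
  \qquad \text{(stochastic gradient descent: one sampled state)}
\end{aligned}
```

Since v_\pi(S) is not known in practice, it is replaced by a target such as the Monte Carlo return G_t or the TD(0) target R_{t+1} + \gamma\,\hat{v}(S_{t+1}, \mathbf{w}).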
Feature Vectors
Linear Value Function Approximation
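
A minimal sketch of linear value function approximation, assuming the value estimate is the dot product of a feature vector x(s) and a weight vector w, so that the gradient with respect to w is simply x(s). The feature map, the target values, and the names `features`, `v_hat`, and `sgd_update` are hypothetical; in reinforcement learning the target would be a sampled return or a TD target instead of the known value used here.

```python
import numpy as np

def features(state, n_states=4):
    """Hypothetical feature vector: a bias term and the normalised state index."""
    return np.array([1.0, state / (n_states - 1)])

def v_hat(state, w):
    """Linear value estimate: x(s)^T w."""
    return features(state) @ w

def sgd_update(w, state, target, alpha=0.1):
    """One SGD step: w <- w + alpha * (target - v_hat(s, w)) * x(s)."""
    x = features(state)
    return w + alpha * (target - x @ w) * x

# Illustrative targets that happen to be exactly linear in the features.
w_true = np.array([2.0, 3.0])
w = np.zeros(2)
for _ in range(1000):            # sweeps over the four states
    for s in range(4):
        w = sgd_update(w, s, target=features(s) @ w_true)

print(w)  # close to w_true
```

Because the targets in this toy setup are exactly representable by the features, the weights converge to `w_true`; with real returns the weights would instead settle near the best linear fit.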
Table Lookup Features
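
Table lookup can be viewed as a special case of linear approximation, assuming one-hot features: each state gets its own indicator feature, so the linear estimate x(s)^T w just reads out entry s of w, and an SGD update changes only that entry. The sketch below illustrates this; the state count, weights, and the name `one_hot` are illustrative assumptions.

```python
import numpy as np

def one_hot(state, n_states):
    """Table-lookup feature vector: 1 at the state's index, 0 elsewhere."""
    x = np.zeros(n_states)
    x[state] = 1.0
    return x

n_states = 4
w = np.array([0.5, -1.0, 2.0, 0.0])   # one weight per state, i.e. a value table

# The linear estimate x(s)^T w recovers the table entry for each state.
values = [float(one_hot(s, n_states) @ w) for s in range(n_states)]

# One SGD step towards a hypothetical target for state 2 touches only w[2].
alpha, s, target = 0.5, 2, 4.0
x = one_hot(s, n_states)
w_new = w + alpha * (target - x @ w) * x
```

Here `w_new` differs from `w` only in position 2, which moves halfway from 2.0 towards the target 4.0.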
