RL Sem 8
RL Sem 8
✅ Module 0: Prerequisites
Probability distributions and expected values
Basic linear algebra (e.g., inner products)
o SARSA
Exploration Strategies:
o Optimistic Initial Values
o Upper-Confidence-Bound (UCB)
o Gradient Bandits
📌 Strategy Tips:
Focus more on Modules 2 to 5 (they are both concept-heavy and
application-focused).
Practice numerical examples like reward computation and value
iteration.
Revise terminology and differences (e.g., TD vs MC, SARSA vs Q-
Learning).
Prepare case study applications for Module 6 to write high-scoring
descriptive answers.