Reinforcement Learning- Introduction
Reinforcement Learning- Introduction
Definition
Machine Learning
Reinforcement:
Supervised • Cause and Effect
• Agent learns to interact
with environment for
reward.
Unsupervised
Reinforcement
Intuitive example
Reinforcement
Value Based
Policy based
Model Based
Value Based
• Try to maximize a value function 𝑉(𝑠)
max 𝑉𝜋(𝑠)
• The value of reward which the agent expects to gain in the future
upon starting at that state s.
• 𝐸- 𝑅/01 + 𝛾𝑅/04 + 𝛾 4 𝑅/05 + ⋯ |𝑆/ = 𝑠
Policy Based
• Try to produce a policy such that the action performed at each state is
optimal to gain maximum reward in the future.
• 𝜋 𝑠, 𝑎
• Deterministic
• At any state s, same action a is produced by policy 𝜋