Reinforcement_Learning_Overview
Reinforcement_Learning_Overview
Overview
Reinforcement Learning (RL) is an area of machine learning concerned with how agents should take
where learning is driven by interactions with the environment and feedback in the form of rewards or
punishments.
Example
A classic example of reinforcement learning is training a robot to walk. The robot takes steps
(actions) in an environment (floor) and receives feedback (reward) based on whether it maintains
balance and moves forward. Over time, the robot learns a policy that maximizes its total reward.
- A set of states S
- A set of actions A
- A transition function T(s, a, s') which gives the probability of reaching state s' from state s using
action a
Values
Value functions are used to evaluate how good it is to be in a given state, or how good a particular
- Action Value Function Q(s, a): Expected return starting from state s and taking action a
Back on Holiday: Using Reinforcement Learning
Consider planning a holiday trip using reinforcement learning. The agent (you) wants to visit
locations that provide maximum enjoyment (reward). Based on previous experience and outcomes
(feedback), the agent updates its policy to choose better destinations and activities over time.
- Recommendation systems
- Autonomous vehicles
- Industrial automation