R22ML 5
R22ML 5
1
Reinforcement Learning
• Reinforcement Learning(RL) is a type of machine learning
technique that enables an agent to learn in an interactive
environment by trial and error using feedback from its own
actions and experiences.
• Consider building a learning robot. The robot, or agent, has
a set of sensors to observe the state of its environment, and
a set of actions it can perform to alter this state.
• For example, a mobile robot may have sensors such as a
camera and sensors, and actions such as "move forward"
and "turn." Its task is to learn a control strategy, or policy,
for choosing actions that achieve its goals.
2
3
• Environment: Physical world in which the agent
operates
• State: Current situation of the agent
• Reward: Feedback from the environment
• Policy: Method to map agent’s state to actions
• Value: Future reward that an agent would receive by
taking an action in a particular state
4
• The figure below represents the basic idea and
elements involved in a reinforcement learning model.
5
6
7
8
Reinforced vs Supervised Learning
• Though both supervised and reinforcement learning
use mapping between input and output, unlike
supervised learning where feedback provided to the
agent is correct set of actions for performing a task,
reinforcement learning uses rewards and punishment
as signals for positive and negative behavior.
• As compared to unsupervised learning, reinforcement
learning is different in terms of goals. While the goal
in unsupervised learning is to find similarities and
differences between data points, in reinforcement
learning the goal is to find a suitable action model
that would maximize the total cumulative reward of
the agent.
9
Hidden Markov Models
Markov Chains