
R22ML 5

Reinforcement Learning (RL) is a machine learning technique that allows an agent to learn through trial and error in an interactive environment, using feedback from its actions. Key components of RL include the environment, state, reward, policy, and value, which help the agent determine the best actions to achieve its goals. Unlike supervised learning, which provides correct actions, RL uses rewards and punishments to guide the agent's behavior towards maximizing cumulative rewards.

Uploaded by

mekalaanusha0218

UNIT-5

Reinforcement Learning
• Reinforcement Learning (RL) is a type of machine learning
technique that enables an agent to learn in an interactive
environment by trial and error, using feedback from its own
actions and experiences.
• Consider building a learning robot. The robot, or agent, has
a set of sensors to observe the state of its environment, and
a set of actions it can perform to alter this state.
• For example, a mobile robot may have sensors such as a
camera and sonars, and actions such as "move forward"
and "turn." Its task is to learn a control strategy, or policy,
for choosing actions that achieve its goals.
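The robot description above can be sketched as a small interaction loop. This is only an illustration under stated assumptions: the `CorridorRobot` environment, its corridor layout, and both policies are hypothetical and not from the slides; only the action names "move forward" and "turn" come from the text.

```python
import random

ACTIONS = ["move forward", "turn"]  # action names taken from the slide


class CorridorRobot:
    """Hypothetical toy environment: a robot in a corridor of `length` cells."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0   # cell index, 0 .. length - 1
        self.facing = 1     # +1 faces the goal end, -1 faces the start

    def observe(self):
        # Stand-in for a sensor reading: the state is (position, facing).
        return (self.position, self.facing)

    def step(self, action):
        if action == "turn":
            self.facing = -self.facing
        elif action == "move forward":
            # Move one cell, staying inside the corridor.
            moved = self.position + self.facing
            self.position = min(max(moved, 0), self.length - 1)
        done = self.position == self.length - 1
        return self.observe(), done


def random_policy(state, rng):
    # A policy maps a state to an action; this one ignores the state.
    return rng.choice(ACTIONS)


def forward_policy(state, rng):
    # A hand-written policy: face the goal, then keep moving.
    position, facing = state
    return "move forward" if facing == 1 else "turn"


def run_episode(policy, max_steps=200, seed=0):
    """Run one episode; return the number of steps taken, or None on timeout."""
    rng = random.Random(seed)
    env = CorridorRobot()
    state = env.observe()
    for t in range(max_steps):
        action = policy(state, rng)
        state, done = env.step(action)
        if done:
            return t + 1
    return None
```

Calling `run_episode(forward_policy)` runs the hand-written policy, while `run_episode(random_policy)` shows how an agent with no learned strategy behaves. A learning agent would replace these fixed policies with one improved from experience.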

• Environment: The world in which the agent operates
• State: The current situation of the agent
• Reward: Feedback from the environment after an action
• Policy: The method that maps the agent’s state to actions
• Value: The future reward an agent expects to receive by
taking an action in a particular state
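One way to see how these five elements fit together is a small tabular Q-learning sketch. Q-learning is not named in the slides; it is swapped in here as one common algorithm. The five-state chain, the reward of 1.0 at the goal, and the parameters `alpha`, `gamma`, and `epsilon` are all assumptions chosen for illustration.

```python
import random

N_STATES = 5         # states 0..4; state 4 is the goal (assumed toy task)
ACTIONS = [-1, +1]   # step left or step right


def step(state, action):
    """Environment: return (next_state, reward, done) for one move."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0   # reward: feedback from the environment
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    # Value table: q[state][i] estimates the future reward of ACTIONS[i].
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap the episode length
            # Policy: epsilon-greedy over the current value estimates.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Q-learning update toward reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Map each state to the action with the highest estimated value."""
    return [ACTIONS[0] if row[0] > row[1] else ACTIONS[1] for row in q]
```

Here `step` plays the role of the environment, the integer `state` is the agent's situation, `reward` is the feedback, the epsilon-greedy rule is the policy, and the table `q` holds the values. `greedy_policy(train())` reads the learned policy out of the value table.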

• The figure below represents the basic idea and
elements involved in a reinforcement learning model.

Reinforcement vs Supervised Learning
• Both supervised and reinforcement learning learn a
mapping from input to output. In supervised learning,
the feedback given to the agent is the correct set of
actions for performing a task; reinforcement learning
instead uses rewards and punishments as signals for
positive and negative behavior.
• Reinforcement learning differs from unsupervised
learning in its goal. Unsupervised learning aims to
find similarities and differences between data points,
whereas reinforcement learning aims to find a suitable
action model that maximizes the total cumulative
reward of the agent.
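The "total cumulative reward" mentioned above can be made concrete with a short sketch. The discount factor `gamma` is an assumption not stated in the slides; with `gamma=1.0` the function reduces to a plain sum of rewards.

```python
def cumulative_reward(rewards, gamma=1.0):
    """Return r_0 + gamma * r_1 + gamma**2 * r_2 + ... for one episode."""
    total = 0.0
    # Work backwards so each step adds its reward to the discounted future.
    for r in reversed(rewards):
        total = r + gamma * total
    return total
```

For the reward sequence `[1, 0, 2]`, the undiscounted total is 3.0, while `gamma=0.5` gives 1 + 0.5 * 0 + 0.25 * 2 = 1.5, so later rewards count for less.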
Hidden Markov Models
Markov Chains
