0% found this document useful (0 votes)
18 views7 pages

Reinforcement Learning

Reinforcement learning involves an agent taking actions in an environment to maximize rewards. It learns from trial and error interactions rather than from explicit examples or instructions. The agent receives feedback in terms of rewards and tries to learn a behavior or path that gives the maximum reward. It has applications in robotics, machine learning, education and other domains.

Uploaded by

Vignesh Senthil
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views7 pages

Reinforcement Learning

Reinforcement learning involves an agent taking actions in an environment to maximize rewards. It learns from trial and error interactions rather than from explicit examples or instructions. The agent receives feedback in terms of rewards and tries to learn a behavior or path that gives the maximum reward. It has applications in robotics, machine learning, education and other domains.

Uploaded by

Vignesh Senthil
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Reinforcement learning

Introduction
• Reinforcement learning is an area of Machine Learning. It is about taking
suitable action to maximize reward in a particular situation.
• It is employed by various software and machines to find the best
possible behavior or path it should take in a specific situation.
• Reinforcement learning differs from supervised learning in a way that in
supervised learning the training data has the answer key with it so the
model is trained with the correct answer itself whereas in reinforcement
learning, there is no answer but the reinforcement agent decides what
to do to perform the given task.
• In the absence of a training dataset, it is bound to learn from its
experience.
Example: The problem is as follows: We
have an agent and a reward, with many
hurdles in between. The agent is supposed
to find the best possible path to reach the
reward. The following problem explains the
problem more easily.

The image shows the robot, diamond, and fire.


The goal of the robot is to get the reward that is
the diamond and avoid the hurdles that are
fired. The robot learns by trying all the possible
paths and then choosing the path which gives
him the reward with the least hurdles. Each
right step will give the robot a reward and each
wrong step will subtract the reward of the
robot. The total reward will be calculated when
it reaches the final reward that is the diamond.
Main points in Reinforcement learning
• Input: The input should be an initial state from which the model will
start
• Output: There are many possible outputs as there are a variety of
solutions to a particular problem
• Training: The training is based upon the input, The model will return a
state and the user will decide to reward or punish the model based on
its output.
• The model keeps continuing to learn.
• The best solution is decided based on the maximum reward.
Difference between Reinforcement learning
and Supervised learning:
Types of Reinforcement

Positive Negative
• Positive Reinforcement is defined as when • Negative Reinforcement is defined as
an event, occurs due to a particular strengthening of behavior because a
behavior, increases the strength and the negative condition is stopped or
frequency of the behavior. In other words, avoided.
it has a positive effect on behavior.
• Advantages of reinforcement learning • Advantages of reinforcement
are: learning:
• Maximizes Performance • Increases Behavior
• Sustain Change for a long period of time • Provide defiance to a minimum standard
• Too much can lead to an overload of of performance
states which can diminish the results • It only provides enough to meet up the
minimum behavior
Various Practical applications of
Reinforcement Learning
• RL can be used in robotics for industrial automation.
• RL can be used in machine learning and data processing
• RL can be used to create training systems that provide custom
instruction and materials according to the requirement of students.

• RL can be used in large environments in the following situations:


• A model of the environment is known, but an analytic solution is not available;
• Only a simulation model of the environment is given (the subject of simulation-
based optimization)
• The only way to collect information about the environment is to interact with it.

You might also like