Ai PPT New
Ai PPT New
Ai PPT New
Learning
Presented by:
-Baljit kaur
-Loveleen kaur
-Kulwinder singh
-Lovepreet kaur
One subfield of machine learning is reinforcement
learning. It involves acting appropriately to
maximize reward in a certain circumstance. It is
utilized by a variety of software and devices to
determine the optimal course of action or behavior
for a given circumstance. Reinforcement learning
and supervised learning are different in that in
supervised learning, the model is trained with the
correct answer already present in the training data,
while in reinforcement learning, the model is
trained without an answer and is guided by the
reinforcement agent's decision on how to complete
the task at hand.
A self-governing, self-teaching system,
reinforcement learning basically learns by making
mistakes. It learns by doing in order to attain the
best results, or, to put it another way, it takes
actions with the intention of maximizing rewards.
For instance: The issue is this: There are numerous
obstacles between our agent and the reward. The
agent's job is to determine the most efficient route
to the reward. For easier understanding, consider
the problem that follows..
In reinforcement learning, creators come up with
a way to penalize bad conduct and reward good
behaviour. With this approach, the agent is
encouraged to use the desired activities by
assigning them positive values, while the
undesirable behaviours are discouraged by
assigning them negative values. In order to find
the best answer, this programs the agent to look
for long-term and maximum total rewards . By
setting long-term objectives, the agent can avoid
becoming bogged down in smaller, less crucial
ones. The agent eventually learns to look for the
positive and steer clear of the negative. Artificial
intelligence (AI) has embraced this learning
technique to guide unsupervised machine learning
using incentives, or positive reinforcement, and
punishments, or negative reinforcement.
Two categories of reinforcement exist:
Positive: Favorable Reinforcement is the process through
which an event that results from a specific behavior
intensifies and repeats the behavior. Put otherwise, it
produces a favorable impact on behavior. Among
reinforcement learning's benefits are: Optimizes
Performance Long-term Change Sustain ChangeAn excess of
reinforcement may result in an overabundance of states,
which could reduce the effectiveness.
Reward Signal: After each action taken by the agent, the environment provides a single numerical reward. This
reward depends on the agent’s action and the current state. The agent’s ultimate goal is to maximize the total
reward received over the long term. The reward signal guides the agent by indicating what decisions are
beneficial (high reward) or detrimental (low reward).
Value Function: While the reward signal informs the agent about immediate goodness or badness, the value
function looks ahead to the long run. It specifies what is advantageous over extended periods. Essentially, it
helps the agent evaluate the desirability of different states or state-action pairs. The value function
complements the reward signal by considering the cumulative impact of actions over time1.
Environment Model (Optional): Although not always present, an environment model can be useful. It represents
the agent’s understanding of how the environment behaves. This model allows the agent to simulate potential
Playing games: DeepMind's AlphaGo is a well-known example. It has
vanquished world champions in the game of Go by using reinforcement learning.
Reinforcement learning has also been applied to the mastery of other games,
such as Atari games and chess . Robotics: Reinforcement learning is widely
applied in robotics to accomplish tasks like object handling, navigation, and arm
control. In order to interact with their environment and complete tasks quickly,
robots learn by making mistakes . Healthcare: Personalized treatment
recommendations, hospital resource allocation, and clinical trial optimization
are just a few of the activities in which reinforcement learning is used
The model can correct the errors that occurred during the training
process.
Flexibility: In dynamic and unpredictable contexts, reinforcement learning enables agents to adjust and
gain knowledge from past experiences. Because of its adaptability, reinforcement learning (RL) can be
used in a variety of contexts where the environment may change or be unknown.
Autonomy:Once trained, reinforcement learning agents possess the ability to function independently,
including decision-making and action-taking without human guidance. This autonomy is very helpful for
applications like gaming, robotics, and driverless cars.
Adaptability: Real-time agents have the ability to modify their tactics and actions in response to
environmental input. They are able to improve their performance repeatedly by learning from both
positive reinforcement and negative consequences.