Ai PPT New

Uploaded by

ks8408783

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

92 views14 pages

Ai PPT New

Uploaded by

ks8408783

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Reinforcement

Learning

Presented by:
-Baljit kaur
-Loveleen kaur
-Kulwinder singh
-Lovepreet kaur
One subfield of machine learning is reinforcement
learning. It involves acting appropriately to
maximize reward in a certain circumstance. It is
utilized by a variety of software and devices to
determine the optimal course of action or behavior
for a given circumstance. Reinforcement learning
and supervised learning are different in that in
supervised learning, the model is trained with the
correct answer already present in the training data,
while in reinforcement learning, the model is
trained without an answer and is guided by the
reinforcement agent's decision on how to complete
the task at hand.
A self-governing, self-teaching system,
reinforcement learning basically learns by making
mistakes. It learns by doing in order to attain the
best results, or, to put it another way, it takes
actions with the intention of maximizing rewards.
For instance: The issue is this: There are numerous
obstacles between our agent and the reward. The
agent's job is to determine the most efficient route
to the reward. For easier understanding, consider
the problem that follows..
In reinforcement learning, creators come up with
a way to penalize bad conduct and reward good
behaviour. With this approach, the agent is
encouraged to use the desired activities by
assigning them positive values, while the
undesirable behaviours are discouraged by
assigning them negative values. In order to find
the best answer, this programs the agent to look
for long-term and maximum total rewards . By
setting long-term objectives, the agent can avoid
becoming bogged down in smaller, less crucial
ones. The agent eventually learns to look for the
positive and steer clear of the negative. Artificial
intelligence (AI) has embraced this learning
technique to guide unsupervised machine learning
using incentives, or positive reinforcement, and
punishments, or negative reinforcement.
Two categories of reinforcement exist:
Positive: Favorable Reinforcement is the process through
which an event that results from a specific behavior
intensifies and repeats the behavior. Put otherwise, it
produces a favorable impact on behavior. Among
reinforcement learning's benefits are: Optimizes
Performance Long-term Change Sustain ChangeAn excess of
reinforcement may result in an overabundance of states,
which could reduce the effectiveness.

Negative: Adverse The definition of reinforcement is when a

behavior gets stronger as a result of a bad circumstance
being avoided or halted. Reinforcement learning benefits
include: Boosts ConductShow disdain for a minimal level of
performanceIt only offers enough to satisfy the bare
minimum of conduct..
Policy: The policy defines how an agent behaves at any given moment. Formally, it’s a mapping from
environmental states to specific actions that the agent should take when in those states. Policies can be simple
functions, lookup tables, or involve more complex computations. Importantly, the policy alone determines the
agent’s behavior, and it often involves stochastic choices.

Reward Signal: After each action taken by the agent, the environment provides a single numerical reward. This
reward depends on the agent’s action and the current state. The agent’s ultimate goal is to maximize the total
reward received over the long term. The reward signal guides the agent by indicating what decisions are
beneficial (high reward) or detrimental (low reward).

Value Function: While the reward signal informs the agent about immediate goodness or badness, the value
function looks ahead to the long run. It specifies what is advantageous over extended periods. Essentially, it
helps the agent evaluate the desirability of different states or state-action pairs. The value function
complements the reward signal by considering the cumulative impact of actions over time1.

Environment Model (Optional): Although not always present, an environment model can be useful. It represents
the agent’s understanding of how the environment behaves. This model allows the agent to simulate potential
Playing games: DeepMind's AlphaGo is a well-known example. It has
vanquished world champions in the game of Go by using reinforcement learning.
Reinforcement learning has also been applied to the mastery of other games,
such as Atari games and chess . Robotics: Reinforcement learning is widely
applied in robotics to accomplish tasks like object handling, navigation, and arm
control. In order to interact with their environment and complete tasks quickly,
robots learn by making mistakes . Healthcare: Personalized treatment
recommendations, hospital resource allocation, and clinical trial optimization
are just a few of the activities in which reinforcement learning is used

Reinforcement learning can be applied to tasks like text summarization,

machine translation, and conversation synthesis in natural language processing
(NLP). From user interactions or environmental feedback, the agent learns to
produce coherent and contextually relevant replies . These illustrations show
how adaptable reinforcement learning is in a variety of contexts, where agents
are trained to make decisions by interacting with their surroundings in order to
accomplish predetermined goals.
Reinforcement learning can be used to solve very complex
problems that cannot be solved by conventional techniques.

The model can correct the errors that occurred during the training
process.

In RL, training data is obtained via the direct interaction of the

agent with the environment

Reinforcement learning can handle environments that are non-

deterministic, meaning that the outcomes of actions are not always
predictable. This is useful in real-world applications where the
environment may change over time or is uncertain.

Reinforcement learning can be used to solve a wide range of

problems, including those that involve decision making, control, and
optimization.

Reinforcement learning is a flexible approach that can be combined

with other machine learning techniques, such as deep learning
The difficulties of using reinforcement learningAlthough
it has a lot of potential, reinforcement learning has significant
drawbacks. Its application is still restricted and its
deployment can be challenging. This kind of machine
learning's reliance on environment exploration is one of its
deployment's obstacles.A robot that relies on reinforcement
learning to negotiate a complicated physical world, for
instance, would seek out new states and change its course as
it advances.This method can be computationally costly and
have limited effectiveness due to the amount of time needed
to achieve proper learning. The demands on time and
computational resources increase in tandem with the
complexity of the training environment.
Typical algorithms for reinforcement learningReinforcement learning is not limited to a
single algorithm; rather, it encompasses a variety of methods with somewhat diverse
methodologies. The primary cause of the variations is the various methods they employ
to investigate their surroundings:Reward-state-action-state-action. The first step in
this reinforcement learning process is to provide the agent with a policy. The agent is
not given any policies; instead, it discovers the worth of an activity by investigating its
surroundings. This strategy is more self-directed than model-based. Python
programming is frequently used to write Q-learning implementations in the real
world.Q-networks at depth. These algorithms employ reinforcement learning methods
along with neural networks and deep Q-learning. They employ the self-directed
reinforcement learning environment and go by the name "deep reinforcement learning"
as well.
There are numerous benefits to using reinforcement learning while tackling challenging jobs and
challenges.

Flexibility: In dynamic and unpredictable contexts, reinforcement learning enables agents to adjust and
gain knowledge from past experiences. Because of its adaptability, reinforcement learning (RL) can be
used in a variety of contexts where the environment may change or be unknown.

Autonomy:Once trained, reinforcement learning agents possess the ability to function independently,
including decision-making and action-taking without human guidance. This autonomy is very helpful for
applications like gaming, robotics, and driverless cars.

Adaptability: Real-time agents have the ability to modify their tactics and actions in response to
environmental input. They are able to improve their performance repeatedly by learning from both
positive reinforcement and negative consequences.

1: Reinforcement learning is not preferable to use for solving simple problems.

2: Reinforcement learning needs a lot of data and a lot of computation
3: Reinforcement learning is highly dependent on the quality of the reward function. If the
reward function is poorly designed, the agent may not learn the desired behavior.
4: Reinforcement learning can be difficult to debug and interpret. It is not always clear why
the agent is behaving in a certain way, which can make it difficult to diagnose and fix
problems.
supervised learning, algorithms In unsupervised learning, Reinforcement learning. This
train on a body of labeled data. developers turn algorithms loose takes a different approach. It
Supervised learning algorithms on fully unlabeled data. The situates an agent in an
can only learn attributes that algorithms learn by cataloging their environment with clear
are specified in the data set. A own observations about data parameters defining beneficial
common application of features without being told what to activity and nonbeneficial activity
supervised learning is image look for. and an overarching endgame to
recognition models. These reach.
models receive a set of labeled
images and learn to distinguish
common attributes of
predefined forms.
Reinforcement
use one or morelearning is projectedincluding
training approaches, to play reinforcement
a bigger rolelearning.
in the future of AI. The other
approaches to training machine learning algorithms require large amounts of
preexisting training data. Reinforcement learning agents, on the other hand, require
the time to gradually learn how to operate via interactions with their environments.
Despite the challenges, various industries are expected to continue exploring
reinforcement learning's potential.Reinforcement learning has already demonstrated
promise in various areas. For example, marketing and advertising firms are using
algorithms trained this way for recommendation engines. Manufacturers are using
reinforcement learning to train their next-generation robotic systems.Scientists at
Alphabet's AI subsidiary, Google DeepMind, have proposed that reinforcement
learning could bring the current state of AI -- often called narrow AI -- to its theoretical
final form of artificial general intelligence. They believe machines that learn through
reinforcement learning will eventually become sentient and operate independently of
human supervision.Machine learning algorithms
Thanks!
Do you have any
questions?

Reinforcement Learning 1
No ratings yet
Reinforcement Learning 1
14 pages
CMPE257 - W10C13 - Reinforcement Learning
No ratings yet
CMPE257 - W10C13 - Reinforcement Learning
161 pages
7.reinforcement Learning-Introduction-The Learning Task Q-Learning
No ratings yet
7.reinforcement Learning-Introduction-The Learning Task Q-Learning
34 pages
cs8691 - 2 Marks With Answers
100% (1)
cs8691 - 2 Marks With Answers
31 pages
Reinforcement Learning
100% (1)
Reinforcement Learning
25 pages
Reinforcement Learning With Python - Master Reinforcemearning in Python Without Being An Expert - Bob Story (Bob Story) (Z-Library)
No ratings yet
Reinforcement Learning With Python - Master Reinforcemearning in Python Without Being An Expert - Bob Story (Bob Story) (Z-Library)
58 pages
Introduction To Prolog-Unit3
No ratings yet
Introduction To Prolog-Unit3
30 pages
Unit V Reinforcement Learning and Genetic Algorithm
No ratings yet
Unit V Reinforcement Learning and Genetic Algorithm
40 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
11 pages
Lecture#1 - RL An Introduction 2023
No ratings yet
Lecture#1 - RL An Introduction 2023
44 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
7 pages
Unit 5
No ratings yet
Unit 5
58 pages
Reinforcement Learning B.Tech. IV Year I Sem. Unit - I
No ratings yet
Reinforcement Learning B.Tech. IV Year I Sem. Unit - I
27 pages
Unit 5 ML 3year
No ratings yet
Unit 5 ML 3year
17 pages
Lecture 9 - Reinforced Learning
No ratings yet
Lecture 9 - Reinforced Learning
18 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
5 pages
UNIT V Reinforcement Learning
No ratings yet
UNIT V Reinforcement Learning
8 pages
Exp-14 Reinforcement Learning
No ratings yet
Exp-14 Reinforcement Learning
11 pages
SL Week01
No ratings yet
SL Week01
13 pages
Module - 1 - Reinforcement Learning and Markov Decision Process
No ratings yet
Module - 1 - Reinforcement Learning and Markov Decision Process
19 pages
Introduction To Reinforcement Learning: Presented by - Rohit Mahto
No ratings yet
Introduction To Reinforcement Learning: Presented by - Rohit Mahto
9 pages
Unit 6
No ratings yet
Unit 6
34 pages
Reinforcement Learning, Q-Learning
No ratings yet
Reinforcement Learning, Q-Learning
20 pages
RL & DL Notes
No ratings yet
RL & DL Notes
43 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
7 pages
RL & DL Notes
No ratings yet
RL & DL Notes
73 pages
Unit 3
No ratings yet
Unit 3
29 pages
Lect 2
No ratings yet
Lect 2
26 pages
Unit 5-1
No ratings yet
Unit 5-1
8 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
17 pages
IntroductiontoRL BR
No ratings yet
IntroductiontoRL BR
22 pages
Winter Semester 2023-24 - CSE4037 - ETH - AP2023246000594 - 2024-01-05 - Reference-Material-I
No ratings yet
Winter Semester 2023-24 - CSE4037 - ETH - AP2023246000594 - 2024-01-05 - Reference-Material-I
35 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
5 pages
Module 1
No ratings yet
Module 1
72 pages
Module 01
No ratings yet
Module 01
66 pages
Reinforcement Learning (RL) : Agent
No ratings yet
Reinforcement Learning (RL) : Agent
35 pages
Reinforcement Learning-1
No ratings yet
Reinforcement Learning-1
13 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
19 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
29 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
25 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
15 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
25 pages
Reinforcemnet Learning
No ratings yet
Reinforcemnet Learning
8 pages
Unit 5 ML
No ratings yet
Unit 5 ML
15 pages
Reinforcement 2
No ratings yet
Reinforcement 2
2 pages
Unit-5 Mla
No ratings yet
Unit-5 Mla
22 pages
L-14 - Reinforcement-L-d-07062024-111949am
No ratings yet
L-14 - Reinforcement-L-d-07062024-111949am
22 pages
tiếng anhi
No ratings yet
tiếng anhi
7 pages
UNIT-V-Reinforcement Learning
No ratings yet
UNIT-V-Reinforcement Learning
4 pages
ML 10
No ratings yet
ML 10
9 pages
Reinforced Learning
No ratings yet
Reinforced Learning
25 pages
Reinforcement Learning Overview
No ratings yet
Reinforcement Learning Overview
2 pages
Reinforcement Learning Is An Autonomous
No ratings yet
Reinforcement Learning Is An Autonomous
3 pages
Unit-5 (AI)
No ratings yet
Unit-5 (AI)
21 pages
Reinforcement Learning MY101
No ratings yet
Reinforcement Learning MY101
15 pages
Reinforcement
No ratings yet
Reinforcement
9 pages
Unit 5
No ratings yet
Unit 5
45 pages
Assignment 15 Modern AI
No ratings yet
Assignment 15 Modern AI
3 pages
Unit 3
No ratings yet
Unit 3
14 pages
Unit - 1
No ratings yet
Unit - 1
65 pages
A Beginners Guide To Deep Reinforcement Learning PDF
No ratings yet
A Beginners Guide To Deep Reinforcement Learning PDF
9 pages
Artificial Intelligence
67% (3)
Artificial Intelligence
33 pages
AI Unit-4 Software Agents Communication
No ratings yet
AI Unit-4 Software Agents Communication
9 pages
Artificial Intelligence Tutorial PDF
0% (1)
Artificial Intelligence Tutorial PDF
68 pages
Ai Chap 1
No ratings yet
Ai Chap 1
62 pages
CS-1351 Artificial Intelligence - Two Marks
100% (1)
CS-1351 Artificial Intelligence - Two Marks
24 pages
Laudon Chapter 11: Knowledge Systems
100% (1)
Laudon Chapter 11: Knowledge Systems
35 pages
Latest Trends in IT Final
No ratings yet
Latest Trends in IT Final
35 pages
CDS2002 Outline 2025
No ratings yet
CDS2002 Outline 2025
3 pages
AI in 6 Hours
No ratings yet
AI in 6 Hours
287 pages
AI Answer
No ratings yet
AI Answer
15 pages
AI Lectures
No ratings yet
AI Lectures
81 pages
AI Handout For CS by Mengistu E Mod
No ratings yet
AI Handout For CS by Mengistu E Mod
29 pages
AI Notes
No ratings yet
AI Notes
8 pages
Agents and Environments Rationality Peas (Performance Measure, Environment, Actuators, Sensors) Class of Environment Agent Types
No ratings yet
Agents and Environments Rationality Peas (Performance Measure, Environment, Actuators, Sensors) Class of Environment Agent Types
32 pages
Saj 7 2010
No ratings yet
Saj 7 2010
119 pages
Aiml Solved QA
No ratings yet
Aiml Solved QA
54 pages
Robotics, AI, and Humanity: Science, Ethics, and Policy Joachim Von Braun - Instantly Access The Complete Ebook With Just One Click
100% (2)
Robotics, AI, and Humanity: Science, Ethics, and Policy Joachim Von Braun - Instantly Access The Complete Ebook With Just One Click
69 pages
Artificial Intelligence MCQ
No ratings yet
Artificial Intelligence MCQ
9 pages
Seminar Report ON: Ai and Its Intelligent Agents
No ratings yet
Seminar Report ON: Ai and Its Intelligent Agents
17 pages
Intelligent System 1
No ratings yet
Intelligent System 1
43 pages
AI Unit-1
No ratings yet
AI Unit-1
72 pages
Lecture-2 (Intelligent Agents)
No ratings yet
Lecture-2 (Intelligent Agents)
42 pages
L01 - Introduction To Intelligent Systems
No ratings yet
L01 - Introduction To Intelligent Systems
42 pages
Notes 1
No ratings yet
Notes 1
24 pages
Ia 1 Scheme
No ratings yet
Ia 1 Scheme
10 pages
Ai Viva
No ratings yet
Ai Viva
6 pages
Fuzzy Logic in Agent-Based Game Design: Yifan Li, Petr Musilek and Loren Wyard-Scott
No ratings yet
Fuzzy Logic in Agent-Based Game Design: Yifan Li, Petr Musilek and Loren Wyard-Scott
6 pages
Intermediate AI Prompting – Reinforcement Learning
From Everand
Intermediate AI Prompting – Reinforcement Learning
Eric Centore
No ratings yet
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
From Everand
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
Luka Nikolic
No ratings yet