Reinforcement Learning Is An Autonomous

Reinforcement learning is a self-teaching system that learns through trial and error to maximize rewards by performing actions in an environment. It involves an agent that interacts with the environment, receives feedback in the form of rewards or penalties, and updates its policy to improve future actions. Key concepts include positive and negative reinforcement, various algorithms, and practical applications in fields such as robotics, autonomous vehicles, and AI development.

Uploaded by

surya.s2710153

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views3 pages

Reinforcement Learning Is An Autonomous

Uploaded by

surya.s2710153

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Reinforcement learning

Reinforcement learning is an autonomous, self-teaching system that essentially learns by trial and error. It
performs actions with the aim of maximizing rewards, or in other words, it is learning by doing in order to
achieve the best outcomes.

How Does Reinforcement Learning Work?

1. Start in a state.
2. Take an action.
3. Receive a reward or penalty from the environment.
4. Observe the new state of the environment.
5. Update your policy to maximize future rewards.

Here what do you see?

You can see a dog and a master. Let’s imagine you are training your dog to get the stick. Each time the
dog gets a stick successfully, you offered him a feast (a bone let’s say). Eventually, the dog understands
the pattern, that whenever the master throws a stick, it should get it as early as it can to gain a reward (a
bone) from a master in a lesser time.

Terminologies used in Reinforcement Learning

Agent – is the sole decision-maker and learner

Environment – a physical world where an agent learns and decides the actions to be performed
Action – a list of action which an agent can perform
State – the current situation of the agent in the environment
Reward – For each selected action by agent, the environment gives a reward. It’s usually a scalar value
and nothing but feedback from the environment
Policy – the agent prepares strategy(decision-making) to map situations to actions.
Value Function – The value of state shows up the reward achieved starting from the state until the policy
is executed
Model – Every RL agent doesn’t use a model of its environment. The agent’s view maps state-action
pairs probability distributions over the states

Reinforcement Learning Workflow

– Create the Environment
– Define the reward
– Create the agent
– Train and validate the agent
– Deploy the policy

Characteristics of Reinforcement Learning

– No supervision, only a real value or reward signal
– Decision making is sequential
– Time plays a major role in reinforcement problems
– Feedback isn’t prompt but delayed
– The following data it receives is determined by the agent’s actions

Reinforcement Learning Algorithms

There are 3 approaches to implement reinforcement learning algorithms

Fig: Reinforcement Learning Algorithms

Value-Based – The main goal of this method is to maximize a value function. Here, an agent through a
policy expects a long-term return of the current states. Eg- robot learning to navigate a maze.

Policy-Based – In policy-based, you enable to come up with a strategy that helps to gain maximum
rewards in the future through possible actions performed in each state. Two types of policy-based
methods are deterministic and stochastic.eg- Training a self-driving car to navigate traffic.

Model-Based – In this method, we need to create a virtual model for the agent to help in learning to
perform in each specific environment. Eg- Teaching a robot to manipulate objects in the real world.

Types of Reinforcement Learning

 Positive reinforcement: Adding

something pleasant to increase the
likelihood of a behaviour.(eg-
Training a dog to sit on
command.)
 Negative reinforcement: Removing
something unpleasant to increase
the likelihood of a behaviour.(eg -
you have a headache, and you
take pain medication.)

1. Positive Reinforcement

Positive reinforcement is defined as when an event, occurs due to specific behaviour, increases the
strength and frequency of the behaviour. It has a positive impact on behaviour.
Advantages
– Maximizes the performance of an action
– Sustain change for a longer period

Disadvantage
– Excess reinforcement can lead to an overload of states which would minimize the results.

2. Negative Reinforcement

Negative Reinforcement is represented as the strengthening of a behaviour. In other ways, when a

negative condition is barred or avoided, it tries to stop this action in the future.
Advantages
– Maximized behaviour
– Provide a decent to minimum standard of performance
Disadvantage
limits itself enough to meet up a minimum behaviour

Widely used models for reinforcement learning

1. Markov Decision Process (MDP’s)
2. Q Learning

Practical Applications of reinforcement learning

– Robotics for Industrial Automation
– Text summarization engines, dialogue agents (text, speech), gameplays
– Autonomous Self Driving Cars
– Machine Learning and Data Processing
– Training system which would issue custom instructions and materials with respect to the requirements
of students
– AI Toolkits, Manufacturing, Automotive, Healthcare, and Bots
– Aircraft Control and Robot Motion Control
– Building artificial intelligence for computer games

CMPE257 - W10C13 - Reinforcement Learning
No ratings yet
CMPE257 - W10C13 - Reinforcement Learning
161 pages
AI Unit - 3
No ratings yet
AI Unit - 3
102 pages
Sara Reinforcement Learning
No ratings yet
Sara Reinforcement Learning
69 pages
Unit 6
No ratings yet
Unit 6
34 pages
RL & DL Notes
No ratings yet
RL & DL Notes
73 pages
RL & DL Notes
No ratings yet
RL & DL Notes
43 pages
RL Week - 1
No ratings yet
RL Week - 1
53 pages
Unit 5
No ratings yet
Unit 5
58 pages
Reinforcemnet Learning
No ratings yet
Reinforcemnet Learning
8 pages
Module - 1 - Reinforcement Learning and Markov Decision Process
No ratings yet
Module - 1 - Reinforcement Learning and Markov Decision Process
19 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
7 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
15 pages
Winter Semester 2023-24 - CSE4037 - ETH - AP2023246000594 - 2024-01-05 - Reference-Material-I
No ratings yet
Winter Semester 2023-24 - CSE4037 - ETH - AP2023246000594 - 2024-01-05 - Reference-Material-I
35 pages
Unit V Reinforcement Learning and Genetic Algorithm
No ratings yet
Unit V Reinforcement Learning and Genetic Algorithm
40 pages
L11 Reinforcement Learning 1
No ratings yet
L11 Reinforcement Learning 1
18 pages
AI Week 15
No ratings yet
AI Week 15
3 pages
Unit 4
No ratings yet
Unit 4
56 pages
Reinforcement Learning
100% (1)
Reinforcement Learning
25 pages
Reinforcement Learning-1
No ratings yet
Reinforcement Learning-1
19 pages
Module 1
No ratings yet
Module 1
72 pages
Reinforcement Learning With Python - Master Reinforcemearning in Python Without Being An Expert - Bob Story (Bob Story) (Z-Library)
No ratings yet
Reinforcement Learning With Python - Master Reinforcemearning in Python Without Being An Expert - Bob Story (Bob Story) (Z-Library)
58 pages
Unit 3
No ratings yet
Unit 3
29 pages
UNIT V Reinforcement Learning
No ratings yet
UNIT V Reinforcement Learning
8 pages
3GP ML Reinforcement Learning
No ratings yet
3GP ML Reinforcement Learning
3 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
29 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
17 pages
Exp-14 Reinforcement Learning
No ratings yet
Exp-14 Reinforcement Learning
11 pages
UNIT-V-Reinforcement Learning
No ratings yet
UNIT-V-Reinforcement Learning
4 pages
Unit4 (AI) 2024 Docx-1
No ratings yet
Unit4 (AI) 2024 Docx-1
22 pages
21ai020 & Reinforcement Learning UNIT 1-LM:1
No ratings yet
21ai020 & Reinforcement Learning UNIT 1-LM:1
8 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
11 pages
L-14 - Reinforcement-L-d-07062024-111949am
No ratings yet
L-14 - Reinforcement-L-d-07062024-111949am
22 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
8 pages
Types of Data:: Reference Website
No ratings yet
Types of Data:: Reference Website
15 pages
Conceptual Approach
50% (2)
Conceptual Approach
22 pages
Unit 5 ML 3year
No ratings yet
Unit 5 ML 3year
17 pages
Ausubel's Meaningful Verbal Learning
No ratings yet
Ausubel's Meaningful Verbal Learning
30 pages
Unit 5-1
No ratings yet
Unit 5-1
8 pages
ML 10
No ratings yet
ML 10
9 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
5 pages
Unit-5 (AI)
No ratings yet
Unit-5 (AI)
21 pages
Reinforcement Learning: Nazia Bibi
100% (1)
Reinforcement Learning: Nazia Bibi
61 pages
Unit - 5 Re-Inforcement Learning
No ratings yet
Unit - 5 Re-Inforcement Learning
3 pages
Introduction To Reinforcement Learning: Presented by - Rohit Mahto
No ratings yet
Introduction To Reinforcement Learning: Presented by - Rohit Mahto
9 pages
Assignment 15 Modern AI
No ratings yet
Assignment 15 Modern AI
3 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
4 pages
Unit-5 Mla
No ratings yet
Unit-5 Mla
22 pages
Ai PPT New
No ratings yet
Ai PPT New
14 pages
Unit 5
No ratings yet
Unit 5
45 pages
Unit Iii
No ratings yet
Unit Iii
16 pages
Java Programming Lab Manual
No ratings yet
Java Programming Lab Manual
46 pages
Reinforcement learning-WPS Office
No ratings yet
Reinforcement learning-WPS Office
1 page
Reinforced Learning
No ratings yet
Reinforced Learning
25 pages
Reinforcement
No ratings yet
Reinforcement
9 pages
Unit 5 - Reinforcement Learning
No ratings yet
Unit 5 - Reinforcement Learning
15 pages
Reinforcement Learning - Basics
No ratings yet
Reinforcement Learning - Basics
7 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
5 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
12 pages
Unit-III Notes Updated
No ratings yet
Unit-III Notes Updated
32 pages
RL Unit 1
100% (1)
RL Unit 1
26 pages
RL Vishnu Sankar
No ratings yet
RL Vishnu Sankar
26 pages
Unit Ii
No ratings yet
Unit Ii
23 pages
ML Assignment 2
No ratings yet
ML Assignment 2
6 pages
Unit 2
No ratings yet
Unit 2
25 pages
Strategic Management Final
No ratings yet
Strategic Management Final
16 pages
Unit I
No ratings yet
Unit I
24 pages
MC Unit 1
No ratings yet
MC Unit 1
20 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
32 pages
Woro Widiastuti
No ratings yet
Woro Widiastuti
42 pages
Effects of Augmented Reality Game-Based Learning On Students Engagement
No ratings yet
Effects of Augmented Reality Game-Based Learning On Students Engagement
18 pages
THE SYLLABUS, Schemes of Work, Notes
No ratings yet
THE SYLLABUS, Schemes of Work, Notes
13 pages
Elements of Experiential Consumption
No ratings yet
Elements of Experiential Consumption
13 pages
Psychosocial Interventions For Dementia: From Evidence To Practice
No ratings yet
Psychosocial Interventions For Dementia: From Evidence To Practice
11 pages
Note 1
No ratings yet
Note 1
2 pages
Lesson Plan - Indefinite Pronouns Aug 16
100% (5)
Lesson Plan - Indefinite Pronouns Aug 16
4 pages
Needs Analysis Planning A Syllabus For A PDF
No ratings yet
Needs Analysis Planning A Syllabus For A PDF
17 pages
Lesson Plan Writing CPD
No ratings yet
Lesson Plan Writing CPD
4 pages
IOSH - Level - 6 - Diploma
No ratings yet
IOSH - Level - 6 - Diploma
2 pages
Thuyết trình
No ratings yet
Thuyết trình
2 pages
Reviewer Info Texts
No ratings yet
Reviewer Info Texts
7 pages
Cross Cultural Communication Assignment Topics
No ratings yet
Cross Cultural Communication Assignment Topics
5 pages
Learning and Development: Trainer and Trainee Assessment
No ratings yet
Learning and Development: Trainer and Trainee Assessment
10 pages
Intelligent Construction
No ratings yet
Intelligent Construction
4 pages
DLL Philo Week 1
No ratings yet
DLL Philo Week 1
5 pages
Mental Health of Teachers
No ratings yet
Mental Health of Teachers
13 pages
Oral Communication
No ratings yet
Oral Communication
5 pages
TOR-Levels of Comprehension
No ratings yet
TOR-Levels of Comprehension
2 pages
Perfect Partners: Activity Type
100% (1)
Perfect Partners: Activity Type
2 pages
Effective Communication Skill PPT at Bec Doms Mba
100% (4)
Effective Communication Skill PPT at Bec Doms Mba
16 pages
Business Entrepreneurship Project
No ratings yet
Business Entrepreneurship Project
4 pages
Rizza Frez Mae C. Biares #27 South Central Aurora Hill, Baguio City Contact #: 09668399012 E-Mail Add
No ratings yet
Rizza Frez Mae C. Biares #27 South Central Aurora Hill, Baguio City Contact #: 09668399012 E-Mail Add
2 pages
BC KMBN 107 Put
No ratings yet
BC KMBN 107 Put
3 pages
Minnesota Satisfaction Questionnaire
No ratings yet
Minnesota Satisfaction Questionnaire
2 pages
AP 2023-2024 Kibalus
No ratings yet
AP 2023-2024 Kibalus
2 pages
Module 4 Professional Ed
No ratings yet
Module 4 Professional Ed
2 pages
Teaching Young Children Briody McGarry
No ratings yet
Teaching Young Children Briody McGarry
4 pages
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
From Everand
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
Luka Nikolic
No ratings yet
Intermediate AI Prompting – Reinforcement Learning
From Everand
Intermediate AI Prompting – Reinforcement Learning
Eric Centore
No ratings yet