
Lecture_01 - Introduction - I

Reinforcement Learning (RL) is a type of machine learning where an agent learns to perform tasks through trial and error, receiving positive or negative feedback based on its actions. Key concepts include the agent, environment, states, observations, action spaces, and policies, with the ultimate goal of maximizing cumulative rewards. RL is distinct from other learning methods due to its lack of supervision, delayed feedback, and the importance of sequential actions.

Uploaded by

attaurrehman1017
Copyright
© All Rights Reserved

Reinforcement Learning
Supervised (inductive) learning is the simplest and
most studied type of learning
How can an agent learn behaviors when it doesn’t
have a teacher to tell it how to perform?
◼ The agent has a task to perform
◼ It takes some actions in the world
◼ At some later point, it gets feedback telling it how well it did
on performing the task
◼ The agent performs the same task over and over again
This problem is called reinforcement learning:
◼ The agent gets positive reinforcement for tasks done well
◼ The agent gets negative reinforcement for tasks done poorly
Reinforcement Learning (cont.)
The goal is to get the agent to act in the
world so as to maximize its rewards
The agent has to figure out what it did that
made it get the reward/punishment
◼ This is known as the credit assignment problem
Reinforcement learning approaches can be
used to train computers to do many tasks
◼ backgammon and chess playing
◼ Autonomous cars
◼ controlling robot limbs
Characteristics of
Reinforcement Learning
What makes RL different from other
machine learning algorithms?
◼ There is no supervision, only a reward signal
◼ Feedback is delayed, not instantaneous
◼ Time really matters: the data are sequential, not i.i.d.
◼ The agent's actions affect the subsequent data it
receives
Key Concepts and Terminologies
Main characters of RL
◼ Agent
◼ Environment: World that
the agent lives in and
interacts with
◼ At every step of interaction, the agent sees an
observation of the state of the world, and
then decides on an action to take.
◼ The environment changes when the agent
acts on it, but may also change on its own.
Key Concepts and Terminologies
Main characters of RL
◼ Agent
◼ Environment
◼ Reward: Agent perceives a reward signal from
the environment, a number that tells it how
good or bad the current world state is.
◼ The goal of the agent is to maximize its
cumulative reward, called return.
Reinforcement learning methods are ways
that the agent can learn behaviors to
achieve its goal.
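The return can be made concrete with a few lines of code. The sketch below sums a reward sequence into a return; the discount factor gamma is an assumption not stated in the slides (gamma = 1 gives the plain, undiscounted sum):

```python
# Sketch: computing the return (cumulative reward) from a reward sequence.
# The discount factor gamma is an illustrative assumption; gamma = 1 gives
# the plain undiscounted sum of rewards.

def compute_return(rewards, gamma=1.0):
    """Return G = r_0 + gamma*r_1 + gamma^2*r_2 + ..."""
    g = 0.0
    for t, r in enumerate(rewards):
        g += (gamma ** t) * r
    return g

print(compute_return([1, 0, -1, 2]))        # undiscounted sum -> 2.0
print(compute_return([1, 0, -1, 2], 0.9))   # discounted: later rewards count less
```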
Key Concepts and Terminologies
To talk more specifically about what RL does,
we need to introduce additional
terminology. We need to talk about
States and observations
Action spaces
Policies
Trajectories
RL optimization problem
Value functions
Key Concepts and Terminologies
State: A state s is a complete description of the world.
No information about the world is hidden from the state.
Observation: An observation o is a partial description of
the state.
Action space: The set of all valid actions in a given
environment is often called the action space.
Some environments, like Atari and Go, have discrete
action spaces, while other environments, like one where
an agent controls a robot in the physical world, have
continuous action spaces. In continuous spaces, actions
are real-valued vectors.
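The distinction can be sketched in plain Python (no RL library); the action names and torque bounds below are illustrative assumptions:

```python
import random

# Sketch: sampling from discrete vs. continuous action spaces.
# Action names and bounds are illustrative assumptions.

# Discrete action space (Atari/Go-style): a finite set of choices.
discrete_actions = ["up", "down", "left", "right"]
a = random.choice(discrete_actions)
assert a in discrete_actions

# Continuous action space (robot-control-style): a real-valued vector,
# here two joint torques, each bounded to [-1.0, 1.0].
low, high, dim = -1.0, 1.0, 2
torques = [random.uniform(low, high) for _ in range(dim)]
assert all(low <= t <= high for t in torques)

print(a, torques)
```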
Key Concepts and Terminologies
Policy Example
[Figure: a 3×4 grid world with rows numbered 1–3 and columns 1–4; the cell at column 4, row 3 holds reward +1 and the cell at column 4, row 2 holds reward -1]
• A policy π is a complete mapping from states to actions
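Such a complete mapping can be written out as a lookup table. A minimal sketch for a small grid world like the one above (the coordinates, action choices, and blocked cell are illustrative assumptions):

```python
# A policy as a complete state -> action table for a small grid world.
# Coordinates are (column, row); the action choices are illustrative,
# not necessarily the optimal policy.

policy = {
    (1, 1): "up",   (1, 2): "up",   (1, 3): "right",
    (2, 1): "left", (2, 3): "right",
    (3, 1): "left", (3, 2): "up",   (3, 3): "right",
    (4, 1): "left",
}
# (4, 3) and (4, 2) are the +1 / -1 terminal cells, so they need no action;
# (2, 2) is assumed to be a blocked cell, as in the classic grid world.

def act(state):
    """Look up the action the policy prescribes for a state."""
    return policy[state]

print(act((1, 1)))  # -> up
```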


Formalization
Given:
◼ a state space S
◼ a set of actions a1, …, ak
◼ reward value at the end of each trial (may
be positive or negative)
Output:
◼ a mapping from states to actions

Example: ALVINN (driving agent)
◼ state: configuration of the car
◼ learn a steering action for each state
Reactive Agent Algorithm
Repeat:
◼ s ← sensed state (the accessible or observable state)
◼ If s is terminal then exit
◼ a ← choose action (given s)
◼ Perform a
RL Agent Algorithm
Repeat:
◼ s ← sensed state
◼ If s is terminal then exit
◼ a ← π(s)
◼ Perform a
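The agent loop above can be sketched with a toy environment; the corridor environment and the fixed policy below are illustrative assumptions, since the slides do not specify either:

```python
# Minimal sketch of the RL agent loop: a 5-state corridor where state 4 is
# terminal and a fixed policy always moves right. Environment and policy
# are illustrative assumptions.

def pi(s):
    """A trivial policy: always move right."""
    return "right"

def perform(s, a):
    """Environment step: move right or left along the corridor."""
    return min(4, s + 1) if a == "right" else max(0, s - 1)

s = 0                      # s <- sensed state
trajectory = [s]
while s != 4:              # if s is terminal then exit
    a = pi(s)              # a <- pi(s)
    s = perform(s, a)      # perform a, then sense the new state
    trajectory.append(s)

print(trajectory)          # -> [0, 1, 2, 3, 4]
```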
Approaches
Learn the policy directly: a function mapping
from states to actions
Learn utility values for states (i.e., the
value function)
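The second approach, learning utility values for states, can be sketched with simple value iteration on a toy chain; the environment, rewards, and discount factor below are illustrative assumptions, not from the slides:

```python
# Sketch: learning utility values for states by value iteration on a toy
# 5-state chain. States 0..4; entering terminal state 4 yields reward +1.
# Environment, rewards, and discount factor are illustrative assumptions.

GAMMA = 0.9

def step(s, a):
    """Deterministic transition: returns (next_state, reward)."""
    s2 = min(4, s + 1) if a == "right" else max(0, s - 1)
    r = 1.0 if s2 == 4 else 0.0
    return s2, r

V = [0.0] * 5                 # utility estimate per state; terminal stays 0
for _ in range(50):           # sweep until the values converge
    for s in range(4):        # skip the terminal state 4
        V[s] = max(r + GAMMA * V[s2]
                   for s2, r in (step(s, a) for a in ("left", "right")))

print([round(v, 3) for v in V])   # -> [0.729, 0.81, 0.9, 1.0, 0.0]
```

States closer to the +1 terminal state end up with higher utility, each step away discounting the value by GAMMA.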
RL Summary
Active area of research
Approaches from both OR (operations research) and AI
There are many more sophisticated
algorithms that we have not discussed
Applicable to game-playing, robot
controllers, others
