Reinforcement ML

Machine learning concept

Uploaded by

Nikitha Vadlamani

Topic: Introduction to ML Course: Machine Learning

Machine Learning (VR17)

IV B.Tech – I Semester
UNIT-1
Lecture: 3
Topic: Reinforcement Machine Learning

COURSE INSTRUCTOR:
Dr. R. UMamaheswari
Assoc. Prof. & HoD, ECM

Department of Electronics and Computer Engineering Slide No. 1

Topic: Reinforcement learning Course: Machine Learning

Reinforcement learning

Reinforcement learning is a goal-directed computational approach where a
computer learns to perform a task by interacting with an unknown dynamic
environment.

This learning approach enables a computer to make a series of decisions to
maximize the cumulative reward for the task without human intervention and
without being explicitly programmed to achieve the task.
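
As a rough illustration of the quantity being maximized, the sketch below adds up the rewards collected during one run of a task. The function name `cumulative_reward` and the optional `discount` parameter are assumptions made for this example; the slides speak only of a cumulative reward and do not mention discounting.

```python
from typing import Sequence


def cumulative_reward(rewards: Sequence[float], discount: float = 1.0) -> float:
    """Sum the rewards of one run, optionally down-weighting later rewards.

    With discount == 1.0 this is the plain sum. The discount factor is an
    illustrative assumption, not something the slides define.
    """
    total = 0.0
    weight = 1.0
    for reward in rewards:
        total += weight * reward
        weight *= discount
    return total


if __name__ == "__main__":
    print(cumulative_reward([0.0, 0.0, 1.0]))
    print(cumulative_reward([1.0, 1.0, 1.0], discount=0.5))
```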

The goal of reinforcement learning is to train an agent to complete a task within an
unknown environment. The agent receives observations and a reward from the
environment and sends actions to the environment. The reward is a measure of
how successful an action is with respect to completing the task goal.

The agent contains two components: a policy and a learning algorithm.

The policy is a mapping that selects actions based on the observations from the
environment. Typically, the policy is a function approximator with tunable
parameters, such as a deep neural network.

The learning algorithm continuously updates the policy parameters based on the
actions, observations, and reward. The goal of the learning algorithm is to find an
optimal policy that maximizes the cumulative reward received during the task.
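
The exchange of observations, actions, and rewards described above can be sketched as a simple loop. This is a minimal illustration, not a reference implementation: `LineWorld` is a hypothetical toy environment invented for this example, the policy is a fixed function rather than a neural network, and the learning algorithm is deliberately left out so that only the interaction itself is shown.

```python
from typing import Callable, Tuple


class LineWorld:
    """Hypothetical toy environment: walk along positions 0..goal."""

    def __init__(self, goal: int = 4) -> None:
        self.goal = goal
        self.position = 0

    def reset(self) -> int:
        self.position = 0
        return self.position  # initial observation

    def step(self, action: int) -> Tuple[int, float, bool]:
        """Apply an action (-1 or +1); return (observation, reward, done)."""
        self.position = max(0, min(self.goal, self.position + action))
        done = self.position == self.goal
        reward = 1.0 if done else 0.0
        return self.position, reward, done


def run_episode(env: LineWorld, policy: Callable[[int], int],
                max_steps: int = 20) -> float:
    """Let the policy act until the task ends; return the cumulative reward."""
    observation = env.reset()
    total = 0.0
    for _ in range(max_steps):
        action = policy(observation)                   # agent sends an action
        observation, reward, done = env.step(action)   # environment responds
        total += reward
        if done:
            break
    return total


if __name__ == "__main__":
    print(run_episode(LineWorld(), lambda observation: +1))
```

A learning algorithm would sit around this loop and adjust the policy between or during runs, using the rewards it collects.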

In other words, reinforcement learning involves an agent learning the optimal
behaviour through repeated trial-and-error interactions with the environment
without human involvement.
As an example, consider the task of parking a vehicle using an automated driving
system.
The goal of this task is for the vehicle computer (agent) to park the vehicle in the
correct position and orientation.
To do so, the controller uses readings from cameras, accelerometers,
gyroscopes, a GPS receiver, and lidar (observations) to generate steering, braking,
and acceleration commands (actions).
The action commands are sent to the actuators that control the vehicle.
The resulting observations depend on the actuators, sensors, vehicle dynamics,
road surface, wind, and many other less important factors.
All these factors, that is, everything that is not the agent, make up
the environment in reinforcement learning.
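
One way to picture the boundary between agent and environment in the parking example is to write down the data that crosses it. The sketch below is only illustrative: the field names, shapes, and value ranges are assumptions, and `placeholder_policy` is a hand-written stand-in rather than a learned policy.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Observation:
    """Sensor readings the agent receives (field shapes are assumptions)."""
    camera_frame: bytes
    acceleration: Tuple[float, float, float]
    angular_rate: Tuple[float, float, float]
    gps_position: Tuple[float, float]
    lidar_ranges: Tuple[float, ...]


@dataclass
class Action:
    """Commands the agent sends to the actuators (ranges are assumptions)."""
    steering: float      # -1.0 (full left) to 1.0 (full right)
    braking: float       # 0.0 to 1.0
    acceleration: float  # 0.0 to 1.0


def placeholder_policy(observation: Observation) -> Action:
    """Stand-in policy: brake when an obstacle is within 1 m, else creep forward."""
    if observation.lidar_ranges and min(observation.lidar_ranges) < 1.0:
        return Action(steering=0.0, braking=1.0, acceleration=0.0)
    return Action(steering=0.0, braking=0.0, acceleration=0.1)
```

Everything that produces the next `Observation` from an `Action` (actuators, vehicle dynamics, road, wind, sensors) belongs to the environment.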

To learn how to generate the correct actions from the observations, the computer
repeatedly tries to park the vehicle using a trial-and-error process.

To guide the learning process, you provide a signal that is one when the car
successfully reaches the desired position and orientation and zero otherwise
(reward).

During each trial, the computer selects actions using a mapping (policy) initialized
with some default values.

After each trial, the computer updates the mapping to maximize the reward
(learning algorithm).

This process continues until the computer learns an optimal mapping that
successfully parks the car.
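
The trial-and-error procedure above can be sketched on a toy one-dimensional "parking" task. The slides do not name a specific learning algorithm, so this example uses a simple random-search update chosen only for brevity: after each trial a slightly changed mapping is kept if its reward is no worse. The constants `GOAL` and `MAX_STEPS` and the function names are hypothetical.

```python
import random
from typing import List

GOAL = 4          # hypothetical position of the parking slot
MAX_STEPS = 10    # moves allowed in one trial


def run_trial(policy: List[int]) -> float:
    """Drive from position 0 using the policy; reward is 1.0 only if parked."""
    position = 0
    for _ in range(MAX_STEPS):
        action = policy[position]                       # mapping: position -> move
        position = max(0, min(GOAL, position + action))
        if position == GOAL:
            return 1.0
    return 0.0


def train(max_trials: int = 2000, seed: int = 0) -> List[int]:
    """Random-search learning: perturb the mapping, keep it if reward is no worse."""
    rng = random.Random(seed)
    policy = [-1] * GOAL                 # default values: always move backwards
    best_reward = run_trial(policy)
    for _ in range(max_trials):
        if best_reward == 1.0:
            break                        # the mapping already parks the car
        candidate = policy.copy()
        index = rng.randrange(GOAL)
        candidate[index] = -candidate[index]   # flip one entry of the mapping
        reward = run_trial(candidate)
        if reward >= best_reward:        # update the mapping after the trial
            policy, best_reward = candidate, reward
    return policy


if __name__ == "__main__":
    learned = train()
    print("learned mapping:", learned)
    print("reward with learned mapping:", run_trial(learned))
```

Because the reward is zero until the car actually parks, the search receives no hint about partial progress. Practical reinforcement learning algorithms, and better-shaped reward signals, are designed to cope with this more efficiently.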

Reinforcement Learning Workflow

Formulate problem — Define the task for the agent to learn, including
how the agent interacts with the environment and any primary and
secondary goals the agent must achieve.
Create environment — Define the environment within which the agent
operates, including the interface between agent and environment and the
environment dynamic model.

Define reward — Specify the reward signal that the agent uses to
measure its performance against the task goals and how to calculate
this signal from the environment.
Create agent — Create the agent, which includes defining a policy
representation and configuring the agent learning algorithm.
Train agent — Train the agent policy representation using the
defined environment, reward, and agent learning algorithm.
Validate agent — Evaluate the performance of the trained agent by
simulating the agent and environment together.
Deploy policy — Deploy the trained policy representation using, for
example, generated GPU code.
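
The seven workflow steps can be lined up against a very small program. The sketch below is an assumption-laden illustration, not the toolchain the slides have in mind: the task is a hypothetical two-action problem with no observations, the agent uses an epsilon-greedy sample-average update picked for simplicity, and "deployment" just means freezing the learned choice into a plain function.

```python
import random
from typing import Callable

# 1. Formulate problem: learn which of two actions pays off more often.
# 2. Create environment: a hypothetical two-action task with fixed payout odds.
PAYOUT_PROBABILITY = [0.2, 0.8]


def environment_step(action: int, rng: random.Random) -> float:
    # 3. Define reward: 1.0 when the chosen action pays out, else 0.0.
    return 1.0 if rng.random() < PAYOUT_PROBABILITY[action] else 0.0


class Agent:
    # 4. Create agent: a value table (policy representation) plus an
    #    epsilon-greedy, sample-average update (learning algorithm).
    def __init__(self, n_actions: int, epsilon: float = 0.1) -> None:
        self.values = [0.0] * n_actions
        self.counts = [0] * n_actions
        self.epsilon = epsilon

    def greedy_action(self) -> int:
        return max(range(len(self.values)), key=lambda a: self.values[a])

    def act(self, rng: random.Random) -> int:
        if rng.random() < self.epsilon:
            return rng.randrange(len(self.values))  # explore
        return self.greedy_action()                 # exploit

    def learn(self, action: int, reward: float) -> None:
        self.counts[action] += 1
        self.values[action] += (reward - self.values[action]) / self.counts[action]


def train(agent: Agent, steps: int, rng: random.Random) -> None:
    # 5. Train agent: act, observe the reward, update the value table.
    for _ in range(steps):
        action = agent.act(rng)
        agent.learn(action, environment_step(action, rng))


def validate(agent: Agent, episodes: int, rng: random.Random) -> float:
    # 6. Validate agent: average reward of the greedy policy in simulation.
    action = agent.greedy_action()
    return sum(environment_step(action, rng) for _ in range(episodes)) / episodes


def deploy(agent: Agent) -> Callable[[], int]:
    # 7. Deploy policy: freeze the learned choice into a plain function.
    chosen = agent.greedy_action()
    return lambda: chosen


if __name__ == "__main__":
    rng = random.Random(0)
    agent = Agent(n_actions=2)
    train(agent, steps=2000, rng=rng)
    print("average reward in validation:", validate(agent, 1000, rng))
    print("deployed action:", deploy(agent)())
```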

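Training an agent using reinforcement learning is an iterative process. Decisions
and results in later stages can require you to return to an earlier stage in the
learning workflow.

For example, if training does not produce a satisfactory policy, you may need to
revise one or more of the following before retraining the agent:
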
Training settings

Learning algorithm configuration

Policy representation

Reward signal definition

Action and observation signals

Environment dynamics
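
Since any of these items may change between training runs, it can help to keep them together in one place. The sketch below is one possible way to do that; the class `ExperimentConfig`, its fields, and all default values are hypothetical and serve only to show one field per item in the list.

```python
from dataclasses import dataclass, replace
from typing import Tuple


@dataclass(frozen=True)
class ExperimentConfig:
    """Hypothetical bundle of the items listed above."""
    max_episodes: int = 500                               # training settings
    learning_rate: float = 0.1                            # learning algorithm configuration
    policy_kind: str = "table"                            # policy representation
    success_reward: float = 1.0                           # reward signal definition
    observation_names: Tuple[str, ...] = ("position",)    # observation signals
    action_names: Tuple[str, ...] = ("move",)             # action signals
    environment_name: str = "toy-parking"                 # environment dynamics


def revise(config: ExperimentConfig, **changes) -> ExperimentConfig:
    """Return a changed copy; the original stays available for comparison."""
    return replace(config, **changes)


if __name__ == "__main__":
    first = ExperimentConfig()
    second = revise(first, max_episodes=2000, learning_rate=0.05)
    print(first)
    print(second)
```

Keeping each run's configuration immutable makes it easier to tell which change was responsible when a retrained agent behaves differently.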

Thank You
