Reinforcement Learning 1
Group Members
Ayesha
Kainat
Komal
Maryam
Topics
Reinforcement learning
Steps and Working
Types
Example
Working
• Policy: A strategy that the agent uses to determine the next action based on the current state.
• Reward Function: A function that provides feedback on the actions taken, guiding the agent towards its goal.
• Value Function: Estimates the future cumulative rewards the agent will receive from a given state.
• Model of the Environment: A representation of the environment that predicts future states and rewards, aiding in planning.
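The "future cumulative rewards" estimated by the value function are commonly written as an expected discounted sum. The notation below is the standard textbook form and is assumed here rather than taken from the slides: π is the policy, r is the reward signal, and γ is a discount factor that weights near-term rewards more heavily than distant ones.

```latex
V^{\pi}(s) = \mathbb{E}_{\pi}\left[ \sum_{t=0}^{\infty} \gamma^{t}\, r_{t+1} \,\middle|\, s_0 = s \right], \qquad 0 \le \gamma < 1
```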
Example: Navigating a Maze
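A minimal sketch of how the four components could fit together in a maze is given below. The maze layout, the reward values (+10 at the goal, -1 per move), and the hyperparameters are illustrative assumptions, and tabular Q-learning is only one common choice of learning rule, not necessarily the one intended here. In the sketch, `step` plays the role of the environment model and reward function, the table `q` stores value estimates, and the policy picks the action with the highest estimate.

```python
import random

# Hypothetical 4x4 maze: 'S' start, 'G' goal, '#' wall, '.' free cell.
MAZE = [
    "S.#.",
    ".##.",
    "....",
    "#.#G",
]
START = (0, 0)
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(state, action):
    """Environment model plus reward function: returns (next_state, reward, done)."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    n_row, n_col = row + d_row, col + d_col
    inside = 0 <= n_row < len(MAZE) and 0 <= n_col < len(MAZE[0])
    if not inside or MAZE[n_row][n_col] == "#":
        n_row, n_col = row, col  # bumping into a wall leaves the agent in place
    if MAZE[n_row][n_col] == "G":
        return (n_row, n_col), 10.0, True  # assumed reward for reaching the goal
    return (n_row, n_col), -1.0, False  # assumed small penalty for every other move


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy policy."""
    rng = random.Random(seed)
    q = {}  # (state, action) -> estimated future cumulative reward
    for _ in range(episodes):
        state = START
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.choice(list(ACTIONS))  # explore
            else:
                action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))  # exploit
            nxt, reward, done = step(state, action)
            best_next = 0.0 if done else max(q.get((nxt, a), 0.0) for a in ACTIONS)
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the learned policy (always the highest-valued action) from the start."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

The per-move penalty is what pushes the agent towards short routes: every extra step lowers the total reward it collects before reaching the goal.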
Types
• 1. Positive Reinforcement
• Positive reinforcement occurs when an event that follows a particular behavior increases the strength and frequency of that behavior. In other words, it has a positive effect on the behavior.
• Advantages: Maximizes performance and helps sustain change over time.
• Disadvantages: Overuse can overload the agent with states, which may reduce effectiveness.
• 2. Negative Reinforcement
• Negative reinforcement is the strengthening of a behavior because a negative condition is stopped or avoided.
• Advantages: Increases the frequency of the behavior and ensures a minimum performance standard.
• Disadvantages: It may encourage only just enough action to avoid penalties.
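The two types can be contrasted with a small sketch. The action names (`press_lever`, `do_nothing`), the reward values, and the running-average update are hypothetical choices made for illustration only. In both setups the reinforced behavior ends up with the higher estimated value, but for different reasons: a pleasant outcome is added in one case, an unpleasant one is avoided in the other.

```python
def update(value, reward, alpha=0.5):
    """Move a running value estimate a fraction alpha toward the observed reward."""
    return value + alpha * (reward - value)


def simulate(reward_for, steps=10):
    """Try each action repeatedly and track its estimated value."""
    values = {"press_lever": 0.0, "do_nothing": 0.0}
    for _ in range(steps):
        for action in values:
            values[action] = update(values[action], reward_for(action))
    return values


# Positive reinforcement: the behavior adds a pleasant outcome (+1), otherwise nothing happens.
positive = simulate(lambda a: 1.0 if a == "press_lever" else 0.0)

# Negative reinforcement: an unpleasant condition (-1) persists unless the behavior stops it.
negative = simulate(lambda a: 0.0 if a == "press_lever" else -1.0)

if __name__ == "__main__":
    print("positive:", positive)
    print("negative:", negative)
```

Under negative reinforcement the best achievable value in this sketch is zero, which matches the disadvantage noted above: the agent has no incentive to do more than what is needed to avoid the penalty.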
Applications