0% found this document useful (0 votes)
76 views

Reinforcement Learning

Reinforcement learning is a machine learning method where an agent learns from interactions with an environment by receiving rewards or penalties for actions. The agent aims to maximize rewards by learning which actions work best in different situations. Key aspects include the agent, which perceives and acts in the environment, and the environment itself, which can be unpredictable. Common reinforcement learning approaches are value-based, which learns optimal values for states; policy-based, which directly learns optimal policies for actions; and model-based, which uses a model of the environment.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
76 views

Reinforcement Learning

Reinforcement learning is a machine learning method where an agent learns from interactions with an environment by receiving rewards or penalties for actions. The agent aims to maximize rewards by learning which actions work best in different situations. Key aspects include the agent, which perceives and acts in the environment, and the environment itself, which can be unpredictable. Common reinforcement learning approaches are value-based, which learns optimal values for states; policy-based, which directly learns optimal policies for actions; and model-based, which uses a model of the environment.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Reinforcement Learning

Reinforcement learning is a feedback-based machine learning approach in


which an agent(An entity that can perceive) learns to behave in an environment
by executing actions and seeing the outcomes of actions. The agent receives
positive feedback for each successful action, and negative feedback or a
penalty for each unsuccessful action.
We must consider two key factors in order to fully understand how the RL
operates:
 Agent: An entity with the capacity to see, investigate, and react to its
surroundings. An intelligent agent such as AI robot

 Environment: an environment or scenario in which an agent is present


or surrounded by. Since the environment is assumed to be unpredictable
in RL, it is essentially random. It can be anything such as a room, maze,
football ground, etc

Approaches to implement Reinforcement Learning:

1. Value-based:
The value-based approach is about to find the optimal value function, which is
the maximum value at a state under any policy. Therefore, the agent expects
the long-term return at any state(s) under policy π.
2. Policy-based:
Policy-based approach is to find the optimal policy for the maximum future
rewards without using the value function. In this approach, the agent tries to
apply such a policy that the action performed in each step helps to maximize
the future reward.
The policy-based approach has mainly two types of policy:
o Deterministic: The same action is produced by the policy (π) at any
state.
o Stochastic: In this policy, probability determines the produced action.
3. Model-based: In the model-based approach, a virtual model is created for the
environment, and the agent explores that environment to learn it. There is no
particular solution or algorithm for this approach because the model
representation is different for each environment.

You might also like