Lecture 02 Updated
Fiona French
Recap - Intelligent agents
• So, AI is about trying to produce intelligent agents
Think rationally
What is an agent?
OUTLINE
• Agent
• Rational Agent
• Task Environment
• Environment Types
What is an agent?
An entity that PERCEIVES its environment using SENSORS and acts on its
environment using ACTUATORS
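The definition above can be sketched in code. This is a minimal illustration only, assuming a hypothetical thermostat-style agent; the class name `Agent`, the `step` method, and the 20-degree threshold are not from the lecture.

```python
from typing import Callable


class Agent:
    """An entity that perceives through a sensor and acts through an actuator."""

    def __init__(self, sensor: Callable[[], float], actuator: Callable[[str], None]):
        self.sensor = sensor      # hypothetical temperature sensor
        self.actuator = actuator  # hypothetical heater switch

    def step(self) -> str:
        percept = self.sensor()  # PERCEIVE the environment
        # Decide what to do (illustrative threshold, an assumption)
        action = "heat_on" if percept < 20.0 else "heat_off"
        self.actuator(action)    # ACT on the environment
        return action


# Usage with stand-in sensor and actuator functions
log = []
agent = Agent(sensor=lambda: 18.5, actuator=log.append)
agent.step()
```

The sensor and actuator are passed in as plain functions so the same agent code could be attached to a different environment.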
• Human agent: eyes, ears, and other organs as sensors.
E.g. software agent: key inputs and arriving data packets as sensors; screen display, writing files, and sending data as actuators.
Percept
• Percept: the agent's perceptual inputs at any given time, i.e. what
the agent senses at that moment.
Depends on:
(Step 1).
(Steps 2-3).
(Steps 4-6).
A task environment refers to everything an agent interacts with while performing its
tasks. It includes:
• Sensors – The tools used to perceive and gather information from the environment.
P.E.A.S.
• Performance measure – which qualities should it have?
•Environment:
•Actuators:
•Sensors:
Medical Diagnosis System
•Agent
Performance measure: Diagnosis accuracy, treatment suggestions,
speed, patient safety.
•Environment:
•Actuators:
•Sensors:
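A P.E.A.S. description can be recorded as a small data structure. The sketch below is illustrative only: the `PEAS` class and its `missing` method are hypothetical names. The performance measures are the ones listed above; the other three components are left empty, as on the slide, so the helper reports what still has to be filled in.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PEAS:
    agent: str
    performance: List[str] = field(default_factory=list)
    environment: List[str] = field(default_factory=list)
    actuators: List[str] = field(default_factory=list)
    sensors: List[str] = field(default_factory=list)

    def missing(self) -> List[str]:
        """Return the names of the P.E.A.S. components that are still empty."""
        parts = {
            "performance": self.performance,
            "environment": self.environment,
            "actuators": self.actuators,
            "sensors": self.sensors,
        }
        return [name for name, values in parts.items() if not values]


medical = PEAS(
    agent="Medical Diagnosis System",
    performance=["diagnosis accuracy", "treatment suggestions", "speed", "patient safety"],
)
print(medical.missing())  # ['environment', 'actuators', 'sensors']
```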
On-line math tutor agent
•Performance measure: Maximize student's score on test,
Accuracy of solutions, clarity of explanations, student
engagement, learning progress, response time.
•Performance Measure?
•Environment ?
•Actuators ?
•Sensors ?
Chatbot (e.g., ChatGPT, Google Bard,
Customer Support Bots)
• Performance Measure: Accuracy, relevance, response time,
user satisfaction.
Task environment: Properties
1. Fully / partially observable (unobservable) – relevance
Fully observable
• Can detect all aspects that are relevant to the choice of action
Partially observable
E.g., a robot whose sensors detect only smoke, and cannot detect water in the same
environment, is working in a partially observable environment.
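The smoke-only robot can be sketched as follows. This is an illustration under assumptions: `RoomState`, `full_percept`, and `smoke_only_percept` are hypothetical names. The point it shows is that two different environment states produce the same percept when a relevant aspect cannot be sensed.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class RoomState:
    smoke: bool
    water: bool


def full_percept(state: RoomState) -> Dict[str, bool]:
    """Fully observable: every relevant aspect of the state is sensed."""
    return {"smoke": state.smoke, "water": state.water}


def smoke_only_percept(state: RoomState) -> Dict[str, bool]:
    """Partially observable: the robot has no water sensor."""
    return {"smoke": state.smoke}


dry = RoomState(smoke=False, water=False)
flooded = RoomState(smoke=False, water=True)

# The smoke-only robot cannot tell these two states apart.
same_for_robot = smoke_only_percept(dry) == smoke_only_percept(flooded)
```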
2. Single / multi-agent - competitive or cooperative?
(communication may be needed; randomised behaviour is hard for other agents to predict)
3. Deterministic / stochastic - uncertain outcomes, probability
• If the next state of the environment is completely determined by the current state
and the agent’s action, then it’s Deterministic, otherwise Stochastic
E.g., a game such as chess is generally considered deterministic: many moves may be
available, but each move leads to exactly one next state. Taxi driving is clearly
stochastic in this sense, because one can never predict the behaviour of traffic exactly.
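The distinction can be illustrated with two transition functions. This is a sketch only: the one-dimensional position, the function names, and the 0.2 slip probability are assumptions chosen for the example.

```python
import random


def deterministic_step(position: int, action: int) -> int:
    """The next state is fully determined by the current state and the action."""
    return position + action


def stochastic_step(position: int, action: int, rng: random.Random,
                    slip_prob: float = 0.2) -> int:
    """The same state and action can lead to different next states."""
    if rng.random() < slip_prob:
        return position  # the action has no effect this time
    return position + action


rng = random.Random(0)  # seeded so the run is repeatable
deterministic_outcomes = {deterministic_step(3, 1) for _ in range(1000)}
stochastic_outcomes = {stochastic_step(3, 1, rng) for _ in range(1000)}
```

Repeating the deterministic step always yields one outcome, while the stochastic step yields more than one from the same starting state and action.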
4. Episodic / sequential - short-term actions have long-term consequences
Episodic
• The agent’s experience is divided into atomic episodes
• Each episode consists of the agent perceiving and then performing a single
action
• The next episode does not depend on the actions taken in previous
episodes (each decision depends only on the episode itself)
Eg., spotting defective parts on an assembly line is episodic
Many classification tasks are episodic.
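The assembly-line example can be sketched as a function that judges each part on its own. The function name, the width measurement, and the tolerance values are hypothetical, chosen only for illustration.

```python
from typing import List, Tuple


def inspect_part(width_mm: float, tolerance: Tuple[float, float] = (9.9, 10.1)) -> str:
    """One episode: perceive a single part, then take a single action."""
    low, high = tolerance
    return "accept" if low <= width_mm <= high else "reject"


parts: List[float] = [10.0, 10.3, 9.95]
decisions = [inspect_part(w) for w in parts]
print(decisions)  # ['accept', 'reject', 'accept']
```

Because no state is carried from one call to the next, inspecting the parts in a different order gives the same decision for each part.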
Sequential
• The current decision could affect all future decisions.
E.g., chess and taxi driving are sequential: in both cases, short-term actions can have
long-term consequences.
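A toy taxi example shows how an early action changes what is possible later. This is a sketch under assumptions: the `drive` function, the fuel model, and the action names are hypothetical.

```python
from typing import List


def drive(actions: List[str], fuel: int = 2) -> int:
    """Return the number of completed trips.

    Each "trip" costs 1 unit of fuel; "refuel" sets the fuel level to 3.
    """
    completed = 0
    for action in actions:
        if action == "refuel":
            fuel = 3
        elif action == "trip":
            if fuel == 0:
                break  # stranded: an earlier choice ended the run
            fuel -= 1
            completed += 1
    return completed


greedy = drive(["trip", "trip", "trip", "trip"])     # never refuels
planned = drive(["refuel", "trip", "trip", "trip"])  # refuels first
print(greedy, planned)  # 2 3
```

Unlike the episodic case, the result of each step here depends on the fuel level left by earlier steps.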
5. Static / dynamic (semi-dynamic) - changing environment
• If the environment can change while the agent is deliberating, so that the agent
must keep deciding what to do next, the environment is dynamic; otherwise it is
static.
E.g.
• Driving is dynamic
• Crossword puzzle is static
6. Discrete / continuous - state of environment over time
If the environment has a finite number of distinct states, and the agent has a finite
set of percepts and actions, then the environment is discrete; otherwise it is continuous.
E.g., the chess environment has a finite number of distinct states (excluding the clock).
Chess also has a discrete set of percepts and actions.
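The classifications given in the examples above can be collected in one structure. This is a sketch only: the `TaskEnvironment` class and the `sequential_environments` helper are hypothetical names, and a value of `None` means the slides do not state that property for that example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class TaskEnvironment:
    name: str
    deterministic: Optional[bool] = None
    episodic: Optional[bool] = None
    static: Optional[bool] = None
    discrete: Optional[bool] = None


# Only properties stated in the examples are filled in.
EXAMPLES = [
    TaskEnvironment("chess", episodic=False, discrete=True),
    TaskEnvironment("taxi driving", deterministic=False, episodic=False, static=False),
    TaskEnvironment("crossword puzzle", static=True),
    TaskEnvironment("part inspection", episodic=True),
]


def sequential_environments(envs: List[TaskEnvironment]) -> List[str]:
    """Names of the environments explicitly marked as sequential (not episodic)."""
    return [env.name for env in envs if env.episodic is False]


print(sequential_environments(EXAMPLES))  # ['chess', 'taxi driving']
```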
The architecture makes the percepts from the sensors available to the program,
runs the program, and feeds the program's action choices to the actuators as they
are generated.
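The role of the architecture can be sketched as a loop around the agent program. This is an illustration only: `run_architecture`, `echo_program`, and the string percepts are hypothetical stand-ins for real sensors, programs, and actuators.

```python
from typing import Callable, Iterable, List


def run_architecture(sensor_readings: Iterable[str],
                     program: Callable[[str], str],
                     actuator: Callable[[str], None]) -> None:
    """Make each percept available to the program, run it, and pass on its action."""
    for percept in sensor_readings:
        action = program(percept)  # run the agent program
        actuator(action)           # feed the chosen action to the actuator


def echo_program(percept: str) -> str:
    """A trivial stand-in agent program."""
    return f"respond-to:{percept}"


performed: List[str] = []
run_architecture(["p1", "p2"], echo_program, performed.append)
print(performed)  # ['respond-to:p1', 'respond-to:p2']
```

Keeping the program separate from the loop means the same architecture could run a different agent program without changes.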
Agent Types
1. Simple reflex agent - Condition action rule
if car-in-front is braking, then initiate braking
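The braking rule above can be written as a condition-action table. This is a minimal sketch: the percept is assumed to be a dictionary, and the key `car_in_front_is_braking`, the action names, and the `no_op` default are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Percept = Dict[str, bool]
Rule = Tuple[Callable[[Percept], bool], str]

# Each rule pairs a condition on the current percept with an action.
RULES: List[Rule] = [
    (lambda p: p.get("car_in_front_is_braking", False), "initiate_braking"),
]


def simple_reflex_agent(percept: Percept, rules: List[Rule] = RULES,
                        default: str = "no_op") -> str:
    """Return the action of the first rule whose condition matches the percept."""
    for condition, action in rules:
        if condition(percept):
            return action
    return default


print(simple_reflex_agent({"car_in_front_is_braking": True}))  # initiate_braking
```

The agent uses only the current percept; it keeps no memory of earlier percepts.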
5. Learning agent
Learning agents
[Figure: learning agent architecture. Boxes: performance standard, critic, sensors, learning element, performance element, problem generator, actuators, environment. Arrow labels: feedback, changes, knowledge, learning goals.]
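The four components named in the diagram can be sketched as methods of one class. This is an illustration under stated assumptions, not the lecture's design: the running-average update, the exploration probability, and the two-action toy environment are all hypothetical choices.

```python
import random
from typing import Dict, List


class LearningAgent:
    def __init__(self, actions: List[str], performance_standard: float,
                 rng: random.Random, explore_prob: float = 0.1):
        self.values: Dict[str, float] = {a: 0.0 for a in actions}  # knowledge
        self.counts: Dict[str, int] = {a: 0 for a in actions}
        self.standard = performance_standard
        self.rng = rng
        self.explore_prob = explore_prob

    def performance_element(self) -> str:
        """Select the action that current knowledge rates highest."""
        return max(self.values, key=self.values.get)

    def problem_generator(self) -> str:
        """Suggest an exploratory action."""
        return self.rng.choice(list(self.values))

    def critic(self, outcome: float) -> float:
        """Compare an outcome with the performance standard to give feedback."""
        return outcome - self.standard

    def learning_element(self, action: str, feedback: float) -> None:
        """Change the knowledge: keep a running average of feedback per action."""
        self.counts[action] += 1
        self.values[action] += (feedback - self.values[action]) / self.counts[action]

    def choose(self) -> str:
        if self.rng.random() < self.explore_prob:
            return self.problem_generator()
        return self.performance_element()


# Toy environment (assumption): action "b" pays 1.0, action "a" pays 0.0.
payoff = {"a": 0.0, "b": 1.0}
agent = LearningAgent(["a", "b"], performance_standard=0.5, rng=random.Random(0))
for _ in range(20):
    action = agent.choose()
    agent.learning_element(action, agent.critic(payoff[action]))
```

After the loop, the performance element prefers the action that the critic rated above the performance standard.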
Weekly exercise – Mind Map 03