02 Agents
Artificial Intelligence
Intelligent Agents
AIMA Chapter 2

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Image: "Robot at the British Library Science Fiction Exhibition" by BadgerGravling
Outline
• What is an intelligent agent?
• Rationality
• PEAS (Performance measure, Environment, Actuators, Sensors)
• Environment types
• Agent types
What is an Agent?
• An agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators.
• The agent function maps the percept sequence to an action: f : P* → A, i.e., a = f(p).
• An agent is characterized by its:
  • Sensors
  • Memory
  • Computational power
Example: Vacuum-cleaner World
• Percepts: location and status, e.g., [A, Dirty]
• Actions: Left, Right, Suck, NoOp
The agent acts based on the most recent percept p.
Rational Agents: What is Good Behavior?
Foundation:
• Consequentialism: Evaluate behavior by its consequences.
• Utilitarianism: Maximize happiness and well-being.
a = argmax_{a ∈ A} E[U(a)]
Rational Agents
Rule: Pick the action that maximizes the expected utility:
a = argmax_{a ∈ A} E[U(a)]
This means:
• It is rational to explore and learn, i.e., use percepts to supplement prior knowledge and become autonomous.
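The rule above can be sketched in a few lines of Python. The actions, outcome probabilities, and utilities below are made-up numbers purely for illustration; a real agent would get them from a model of its environment.

```python
# Hypothetical outcome model: each action leads to possible outcomes,
# each a (probability, utility) pair. All numbers are illustrative.
outcomes = {
    "explore": [(0.5, 10.0), (0.5, -2.0)],
    "stay":    [(1.0, 3.0)],
}

def expected_utility(action):
    """E[U(a)]: sum of P(outcome) * U(outcome) over all outcomes of a."""
    return sum(p * u for p, u in outcomes[action])

# Rational choice: a = argmax_{a in A} E[U(a)]
best_action = max(outcomes, key=expected_utility)
print(best_action)  # explore (E[U] = 4.0 beats 3.0)
```

Note that the rational choice here is "explore" even though it can yield a negative outcome; rationality is about expected, not guaranteed, utility.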
Problem Specification: PEAS
(Performance measure, Environment, Actuators, Sensors)

Example: Automated Taxi
• Performance measure: safe, fast, legal, comfortable trip, maximize profits
• Environment: roads, other traffic, pedestrians, customers
• Actuators: steering wheel, accelerator, brake, signal, horn
• Sensors: cameras, sonar, speedometer, GPS, odometer, engine sensors, keyboard
Example: Spam Filter
• Performance measure: accuracy, i.e., minimizing false positives and false negatives
• Environment: a user's email account, the email server
• Actuators: mark as spam, delete, etc.
• Sensors: incoming messages, other information about the user's account
Environment Types
• Fully observable: The agent's sensors give it access to the complete state of the environment; the agent can "see" the whole environment.
  vs. Partially observable: The agent cannot see all aspects of the environment, e.g., it cannot see through walls.
• Known: The agent knows the rules of the environment and can predict the outcomes of actions.
  vs. Unknown: The agent cannot predict the outcomes of actions.
Environment Types
• Static: The environment does not change while the agent is deliberating.
  vs. Dynamic: The environment changes while the agent is deliberating.
  Semidynamic: The environment is static, but the agent's performance score depends on how fast it acts.
• Discrete: The environment provides a fixed number of distinct percepts, actions, and environment states. Time can also evolve in a discrete or continuous fashion.
  vs. Continuous: Percepts, actions, state variables, or time are continuous, leading to an infinite state, percept, or action space.
• Single agent: An agent operating by itself in an environment.
  vs. Multi-agent: Agents cooperate or compete in the same environment.
Examples of Different Environments
* Can be modeled as a single-agent problem with the other agent(s) as part of the environment.
Designing a Rational Agent
Remember the definition of a rational agent:
"For each possible percept sequence, a rational agent should select an action that maximizes its expected performance measure, given the evidence provided by the percept sequence and the agent's built-in knowledge."

The Agent Function
The agent function f receives a percept and returns an action. To do so, it must:
• Assess the performance measure
• Remember the percept sequence
• Use built-in knowledge
Note: Everything outside the agent function can be seen as the environment.
Hierarchy of Agent Types
• Utility-based agents
• Goal-based agents
• Model-based reflex agents
• Simple reflex agents

Simple Reflex Agent
a = f(p)
The interaction is a sequence: p0, a0, p1, a1, p2, a2, …, pt, at, …
Example: A simple vacuum cleaner that uses rules based on its current sensor input.
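A minimal sketch of such a reflex vacuum agent for the two-location world described earlier: the action depends only on the current percept p = (location, status), so a = f(p).

```python
# Simple reflex agent for the vacuum-cleaner world.
# The percept is (location, status), e.g., ("A", "Dirty");
# the actions are Suck, Left, Right (NoOp is never chosen here).
def reflex_vacuum_agent(percept):
    location, status = percept
    if status == "Dirty":
        return "Suck"
    if location == "A":
        return "Right"  # square A is clean, move on
    return "Left"       # square B is clean, move back

print(reflex_vacuum_agent(("A", "Dirty")))  # Suck
print(reflex_vacuum_agent(("A", "Clean")))  # Right
```

Note the agent has no memory: it cannot know whether the other square is already clean, so it may shuttle back and forth forever.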
Model-based Reflex Agent
• Maintains a state variable to keep track of aspects of the environment that cannot currently be observed. I.e., it has memory and knows how the environment reacts to actions (via the so-called transition function).
• The state is updated using the percept.
• There is now more information for the rules to make better decisions.
s' = T(s, a)
a = f(p, s)
The interaction is a sequence: s0, a0, p1, s1, a1, p2, s2, a2, p3, …, pt, st, at, …
[Figure: state diagram with the states "Light is off" and "Light is on", connected by switch actions such as "switch off".]
• States change because of:
a. System dynamics of the environment.
b. The actions of the agent.
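The light-switch figure can be turned into a small model-based agent sketch. The domain, percepts, and rules below are made up to match the figure; the structure (a = f(p, s) with state update s' = T(s, a)) is the point.

```python
# Model-based reflex agent sketch for the light-switch example.
def transition(state, action):
    """T(s, a): how the agent believes the light responds to actions."""
    if action == "switch on":
        return "on"
    if action == "switch off":
        return "off"
    return state  # no-op leaves the state unchanged

def agent_rule(percept, state):
    """f(p, s): turn the light on when the room is dark and the light is off."""
    if percept == "dark" and state == "off":
        return "switch on"
    return "no-op"

state = "off"                        # internal state s (memory)
action = agent_rule("dark", state)   # a = f(p, s)
state = transition(state, action)    # update internal state: s' = T(s, a)
print(action, state)  # switch on, then state is "on"
```

Unlike the simple reflex agent, this agent remembers whether the light is on even when it cannot currently observe it.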
We often construct atomic labels from factored information. E.g.: If the agent’s state is
the coordinate x = 7 and y = 3, then the atomic state label could be the string “(7, 3)”.
With the atomic representation, we can only compare if two labels are the same. With
the factored state representation, we can reason more and calculate the distance
between states!
The set of all possible states is called the state space 𝑺𝑺. This set is typically very large!
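The difference between the two representations is easy to see in code, using the "(7, 3)" example from above:

```python
# Atomic labels only support equality comparison.
atomic_a, atomic_b = "(7, 3)", "(2, 5)"
print(atomic_a == atomic_b)  # False -- that is all we can say

# Factored states support reasoning, e.g., a Manhattan distance.
factored_a, factored_b = (7, 3), (2, 5)
distance = abs(factored_a[0] - factored_b[0]) + abs(factored_a[1] - factored_b[1])
print(distance)  # 7
```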
Old-school vs. Smart Thermostat
Goal-based Agent
The interaction is a sequence: s0, a0, p1, s1, a1, p2, s2, a2, …, s_goal, where each action incurs a cost.
Example: Solving a puzzle. What action gets me closer to the solution?
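A toy sketch of the idea: the agent picks the action whose successor state is closest to the goal. The one-dimensional "puzzle", its actions, and the cost function are invented for illustration, and the greedy rule here stands in for the full search/planning algorithms covered later.

```python
# Greedy goal-based agent on a toy number-line puzzle: reach state 10.
GOAL = 10
ACTIONS = ["increment", "decrement"]

def successor(state, action):
    """Predicted next state for each action."""
    return state + (1 if action == "increment" else -1)

def cost_to_goal(state):
    """Estimated cost: distance from the goal state."""
    return abs(GOAL - state)

def goal_based_agent(state):
    """Pick the action whose successor is closest to the goal."""
    return min(ACTIONS, key=lambda a: cost_to_goal(successor(state, a)))

print(goal_based_agent(4))   # increment
print(goal_based_agent(12))  # decrement
```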
Utility-based Agent
• The agent uses a utility function to evaluate the desirability of each possible state. This is typically expressed as the reward of being in a state, R(s).
• Choose actions to stay in desirable states.
• Performance measure: The discounted sum of expected utility over time.
a = argmax_{a0 ∈ A} E[ Σ_{t=0}^∞ γ^t r_t ]
Utility is the expected future discounted reward.
The interaction is a sequence: s0, a0, p1, s1, a1, p2, s2, a2, …, with a reward r at each step.
Example: An autonomous Mars rover prefers states where its battery is not critically low.
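The discounted sum in the formula above is easy to compute for a known reward sequence. The rewards below are made-up numbers; γ = 0.9 is a typical choice of discount factor.

```python
# Utility as future discounted reward: U = sum over t of gamma^t * r_t.
gamma = 0.9
rewards = [1.0, 0.0, 5.0, 2.0]  # r_0, r_1, r_2, r_3 (illustrative)

utility = sum(gamma**t * r for t, r in enumerate(rewards))
print(round(utility, 3))  # 1 + 0 + 0.81*5 + 0.729*2 = 6.508
```

The discount factor γ < 1 makes near-term rewards count more than distant ones and keeps the infinite sum finite.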
Agents that Learn
• Learning requires exploration of the environment.
Example: Smart Thermostat
Old-school thermostat rule: change the temperature when you are too cold/warm.

Smart thermostat:
Percepts:
• Temperature (deg. F)
• Outside temperature
• Weather report
• Energy curtailment
• Someone walking by
• Someone changes the temperature
• Day & time
• …
Factored states:
• Estimated time to cool the house
• Someone home?
• How long till someone is coming home?
• A/C: on, off
Example: Modern Vacuum Robot
Features are:
• Control via App
• Cleaning Modes
• Navigation
• Mapping
• Boundary blockers
Source: https://www.techhive.com/article/3269782/best-robot-vacuum-cleaners.html
PEAS Description of a Modern Robot Vacuum
Performance measure | Environment | Actuators | Sensors
What Type of Intelligent Agent is a Modern Robot Vacuum?
How does ChatGPT work?

What Type of Intelligent Agent is ChatGPT?
• Utility-based agent? Does it collect utility over time? How would the utility for each state be defined?
• Is it learning?
Environment Types
• Deterministic: Percepts are 100% reliable, and changes in the environment are completely determined by the current state of the environment and the agent's action.
  vs. Stochastic:
  • Percepts are unreliable (noise distribution, sensor failure probability, etc.). This is called a stochastic sensor model.
  • The transition function is stochastic, leading to transition probabilities and a Markov process.
• Known: The agent knows the transition function.
  vs. Unknown: The agent needs to learn the transition function by trying actions.
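A stochastic transition function can be sketched as a table of probabilities. The vacuum-world states and the 0.9/0.1 probabilities below are invented for illustration of the idea that T(s, a) yields a distribution over next states, not a single state.

```python
import random

# Stochastic transition model: T(s, a) is a distribution over next states.
# Here, "suck" fails to clean a dirty square 10% of the time (illustrative).
transition_probs = {
    ("dirty", "suck"): [("clean", 0.9), ("dirty", 0.1)],
    ("clean", "suck"): [("clean", 1.0)],
}

def sample_next_state(state, action, rng):
    """Draw a next state according to the transition probabilities."""
    states, probs = zip(*transition_probs[(state, action)])
    return rng.choices(states, weights=probs, k=1)[0]

rng = random.Random(42)  # fixed seed for reproducibility
samples = [sample_next_state("dirty", "suck", rng) for _ in range(1000)]
print(samples.count("clean"))  # roughly 900 of 1000
```

An agent in an unknown environment could estimate exactly such a table from the frequencies of observed (s, a, s') triples.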
We will spend the whole semester discussing algorithms that can deal with environments that have different combinations of these properties.
Conclusion