AI Lecture 2-3

The document discusses agents and environments. It defines an agent as anything that can perceive its environment and act upon it. Environments can have different properties such as being fully or partially observable, deterministic or stochastic. Rational agents aim to maximize their performance based on their percepts and knowledge. The document also introduces different types of agents from simple reflex agents to more complex goal-based and utility-based agents.


Intelligent Agents and Environments

Outline
 Agents and environments
 Good behavior: rationality
 PEAS (Performance measure, Environment, Actuators, Sensors)
 Environment types
 Agent types
Agents
 An agent is anything that can perceive its environment through sensors and act upon that environment through actuators.

 Human agent: eyes, ears, and other organs for sensors; hands, legs, mouth, and other body parts for actuators.

 Robotic agent: cameras and microphones for sensors; various motors for actuators.
Agents
 Percept: the agent's perceptual inputs at any given instant.

 Percept sequence: the complete history of everything the agent has ever perceived.

 The agent's choice of action at any given instant can depend on the entire percept sequence observed to date.
Agents and environments

 The agent function maps from percept histories to actions:

   f: P* → A

 The agent program runs on the physical architecture to produce f.

 The agent function is an abstract mathematical description; the agent program is a concrete implementation, running on the agent architecture.
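A minimal Python sketch of this distinction (not from the lecture; the percept and action types are illustrative assumptions): the agent function is the abstract mapping f: P* → A over whole percept sequences, while the agent program is concrete code fed one percept at a time.

# Illustrative sketch only: agent function (abstract) vs. agent program (concrete).
from typing import Callable, List, Tuple

Percept = Tuple[str, str]                          # e.g. ("A", "Dirty")
Action = str                                       # e.g. "Suck"
AgentFunction = Callable[[List[Percept]], Action]  # f: P* -> A

class AgentProgram:
    """Concrete implementation running on the agent architecture.
    It is fed one percept per step and accumulates the percept sequence."""
    def __init__(self, f: AgentFunction):
        self.f = f
        self.percepts: List[Percept] = []

    def __call__(self, percept: Percept) -> Action:
        self.percepts.append(percept)
        return self.f(self.percepts)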
The vacuum-cleaner world

 Environment: squares A and B
 Percepts: [location, status], e.g. [A, Dirty]
 Actions: Left, Right, Suck, and NoOp
The vacuum-cleaner world

Percept sequence                  Action
[A, Clean]                        Right
[A, Dirty]                        Suck
[B, Clean]                        Left
[B, Dirty]                        Suck
[A, Clean], [A, Clean]            Right
[A, Clean], [A, Dirty]            Suck
…                                 …

Partial tabulation of a simple agent function for the vacuum-cleaner world
The vacuum-cleaner world

function REFLEX-VACUUM-AGENT([location, status]) returns an action
  if status == Dirty then return Suck
  else if location == A then return Right
  else if location == B then return Left
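The same reflex agent, transcribed directly into Python (a sketch; the string constants are illustrative):

def reflex_vacuum_agent(percept):
    location, status = percept        # e.g. ("A", "Dirty")
    if status == "Dirty":
        return "Suck"
    elif location == "A":
        return "Right"
    elif location == "B":
        return "Left"

# reflex_vacuum_agent(("A", "Dirty")) -> "Suck"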
Rational agents
 For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has.
Rationality
 What is rational at any given time depends on four things:
  The performance measure that defines the criterion of success
  The agent's prior knowledge of the environment
  The actions that the agent can perform
  The agent's percept sequence to date
Rational agents
 Agents can perform actions in order to modify future percepts so as to obtain useful information (information gathering, exploration).

 An agent is autonomous if its behavior is determined by its own experience (with the ability to learn and adapt).

 The right action is the one that causes the agent to be most successful. Therefore, we need some way to measure success.
Performance measures
 Performance measure: an objective criterion for success of an agent's behavior.

 E.g., the performance measure of a vacuum-cleaner agent could be the amount of dirt cleaned up, time taken, electricity consumed, noise generated, etc.
Environments
 To design an agent we must specify its task environment.
 PEAS description of the task environment:
  Performance measure
  Environment
  Actuators
  Sensors
PEAS
 Consider, e.g., the task of designing an automated taxi driver:
  Performance measure: Safe, fast, comfortable, maximize profits
  Environment: Roads, pedestrians, customers
  Actuators: Steering wheel, accelerator, brake, signal, horn
  Sensors: Cameras, sonar, speedometer, GPS, engine sensors
PEAS
 Agent: Medical diagnosis system
  Performance measure: Healthy patient, minimize costs, lawsuits
  Environment: Patient, hospital, staff
  Actuators: Screen display (questions, tests, diagnoses, treatments, referrals)
  Sensors: Keyboard (entry of symptoms, findings, patient's answers)
PEAS
 Agent: Part-picking robot
  Performance measure: Percentage of parts in correct bins
  Environment: Conveyor belt with parts, bins
  Actuators: Jointed arm and hand
  Sensors: Camera, joint angle sensors
PEAS
 Agent: Interactive English tutor
  Performance measure: Maximize student's score on test
  Environment: Set of students
  Actuators: Screen display (exercises, suggestions, corrections)
  Sensors: Keyboard
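A PEAS description can be captured as a simple data structure; a minimal sketch (the taxi values come from the slides, but the class itself is an illustrative assumption):

from dataclasses import dataclass
from typing import List

@dataclass
class PEAS:
    performance_measure: List[str]
    environment: List[str]
    actuators: List[str]
    sensors: List[str]

taxi_driver = PEAS(
    performance_measure=["safe", "fast", "comfortable", "maximize profits"],
    environment=["roads", "pedestrians", "customers"],
    actuators=["steering wheel", "accelerator", "brake", "signal", "horn"],
    sensors=["cameras", "sonar", "speedometer", "GPS", "engine sensors"],
)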
Environment Types
 The range of task environments in AI is vast.

 Task environments can be categorized along several dimensions.

 These dimensions determine the appropriate agent design and its implementation.
Environment types
Fully observable vs. partially observable
 An environment is fully observable if the agent's sensors give it access to the complete state of the environment at each point in time.
 A task environment is effectively fully observable if the sensors detect all aspects that are relevant to the choice of action.
 An environment may be partially observable because of noisy and inaccurate sensors.
 For example, Solitaire is fully observable; automated taxi driving is partially observable.
Environment types
Deterministic vs. stochastic
 If the next state of the environment is completely determined by the current state and the action executed by the agent, then the environment is deterministic.
 If the environment is partially observable, it may appear stochastic to the agent, e.g. when the environment is complex and it is hard to keep track of all the unobserved aspects.
 If the environment is deterministic except for the actions of other agents, then the environment is strategic.
 For example, the vacuum world is deterministic, while taxi driving is stochastic, e.g. tyres blow out or the engine seizes.


Environment types
Episodic vs. sequential
 In an episodic environment, the agent's experience is divided into atomic "episodes" (each episode consists of the agent perceiving and then performing a single action), and the choice of action in each episode depends only on the episode itself.
 E.g. an agent that has to pick defective parts off a conveyor belt: the decision depends only on the current part.
 In a sequential environment, the current decision could affect all future decisions.
 Chess and taxi driving are sequential: short-term actions have long-term consequences.


Environment types
Static vs. dynamic
 If the environment is unchanged while the agent is deliberating, it is called static; otherwise it is dynamic.
 The environment is semidynamic if the environment itself does not change with the passage of time but the agent's performance score does.
 Taxi driving is dynamic: the other cars and the taxi itself keep moving while the driving algorithm dithers about what to do next.
 A crossword puzzle is static.
Environment types
Discrete vs. continuous
 A discrete environment has a limited number of distinct, clearly defined percepts and actions. E.g. the chess environment has a finite number of states, percepts and actions, whereas taxi driving is continuous, e.g. the taxi's speed and location take continuous values.

Single agent vs. multiagent
 A single-agent environment has one agent operating by itself, e.g. an agent solving a crossword puzzle on its own.
 An agent playing chess operates in a two-agent environment, an example of a multiagent environment.
Environment types

 The simplest environment is:
  Fully observable, deterministic, episodic, static, discrete and single-agent.

 Most real situations are:
  Partially observable, stochastic, sequential, dynamic, continuous and multi-agent.
Environment types

                   Chess with a clock   Chess without a clock   Taxi driving
Fully observable   Yes                  Yes                     No
Deterministic      Strategic            Strategic               No
Episodic           No                   No                      No
Static             Semi                 Yes                     No
Discrete           Yes                  Yes                     No
Single agent       No                   No                      No

 The environment type largely determines the agent design.


Agent functions and programs
 An agent is completely specified by the agent function mapping percept sequences to actions.
 Aim: find a way to implement the rational agent function concisely.

 agent = architecture + program
Agent Program
function TABLE-DRIVEN-AGENT(percept) returns an action
  static: percepts, a sequence, initially empty
          table, a table of actions, indexed by percept sequence

  append percept to the end of percepts
  action ← LOOKUP(percepts, table)
  return action
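A minimal Python sketch of the table-driven agent (illustrative; the table encoding is an assumption, partially filled from the vacuum-world tabulation earlier):

def make_table_driven_agent(table):
    percepts = []                              # the percept sequence, initially empty
    def agent(percept):
        percepts.append(percept)               # append percept to the end of percepts
        return table.get(tuple(percepts))      # LOOKUP(percepts, table)
    return agent

vacuum_table = {
    (("A", "Clean"),): "Right",
    (("A", "Dirty"),): "Suck",
    (("B", "Clean"),): "Left",
    (("B", "Dirty"),): "Suck",
    (("A", "Clean"), ("A", "Dirty")): "Suck",
}
agent = make_table_driven_agent(vacuum_table)
# agent(("A", "Dirty")) -> "Suck"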
Agent Program
 Drawbacks:
  Huge table
  Takes a long time to build the table
  Even with learning, it would take a long time to learn the table entries
Agent types
 Four basic kinds of agent programs will be discussed:
  Simple reflex agents
  Model-based reflex agents
  Goal-based agents
  Utility-based agents

 All of these can be turned into learning agents.
1. Simple reflex agents
 Select actions on the basis of the current percept only.
 E.g. the vacuum agent
 Implemented through condition-action rules:
  If dirty then suck
  If the car in front is braking then initiate braking
Simple reflex agents
The vacuum-cleaner world

function REFLEX-VACUUM-AGENT([location, status]) returns an action
  if status == Dirty then return Suck
  else if location == A then return Right
  else if location == B then return Left
Agent types; simple reflex
function SIMPLE-REFLEX-AGENT(percept) returns an action
  static: rules, a set of condition-action rules

  state ← INTERPRET-INPUT(percept)
  rule ← RULE-MATCH(state, rules)
  action ← RULE-ACTION[rule]
  return action

INTERPRET-INPUT generates an abstracted description of the current state from the percept.
RULE-MATCH returns the first rule in the set of rules that matches the given state description.

This will only work if the environment is fully observable. For example, under partial observability the car would either brake continuously and unnecessarily or, worse, never brake at all.
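A Python sketch of the same structure (illustrative; rules are encoded here as (condition, action) pairs, which is an assumption rather than the lecture's notation):

def make_simple_reflex_agent(rules, interpret_input):
    def agent(percept):
        state = interpret_input(percept)       # abstracted description of the current state
        for condition, action in rules:        # first rule whose condition matches wins
            if condition(state):
                return action
        return "NoOp"
    return agent

# Vacuum-world instance: the state is just the percept itself.
vacuum_rules = [
    (lambda s: s[1] == "Dirty", "Suck"),
    (lambda s: s[0] == "A",     "Right"),
    (lambda s: s[0] == "B",     "Left"),
]
vacuum_agent = make_simple_reflex_agent(vacuum_rules, interpret_input=lambda p: p)
# vacuum_agent(("B", "Clean")) -> "Left"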
2. Model-based reflex agents
 To tackle partially observable environments:
  Maintain internal state that depends on the percept history
  The internal state reflects at least some of the unobserved aspects of the current state
  Over time, update the state using world knowledge: a model of the world

 An agent that uses such a model is called a model-based agent.

 The current percept is combined with the old internal state to generate the updated description of the current state.
Model-based reflex agents
Agent types; reflex and state
function REFLEX-AGENT-WITH-STATE(percept) returns an action
  static: rules, a set of condition-action rules
          state, a description of the current world state
          action, the most recent action

  state ← UPDATE-STATE(state, action, percept)
  rule ← RULE-MATCH(state, rules)
  action ← RULE-ACTION[rule]
  return action

UPDATE-STATE is responsible for creating the new internal state description.
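A Python sketch of a model-based reflex agent (illustrative assumptions: update_state is supplied by the caller, and rules use the same (condition, action) encoding as the simple reflex sketch above):

def make_model_based_agent(rules, update_state, initial_state=None):
    state, last_action = initial_state, None
    def agent(percept):
        nonlocal state, last_action
        state = update_state(state, last_action, percept)   # fold the percept into the internal model
        for condition, action in rules:
            if condition(state):
                last_action = action
                return action
        last_action = "NoOp"
        return last_action
    return agent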
Goal-based agents
 The agent needs a goal to know which situations are desirable.
 Typically investigated in search and planning research.
 Major difference: the future is taken into account.


Goal-based agents
Utility-based agents
 Certain goals can be reached in different ways.
  Some ways are better, i.e. have a higher utility.
  For example, there are many action sequences that will get the taxi to its destination, but some are quicker, safer, more reliable and cheaper than others.

 A utility function maps a (sequence of) state(s) onto a real number, which describes the associated degree of happiness.
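A short sketch of utility-based action selection (illustrative; predict_outcome and utility are assumed to be supplied by the caller):

def utility_based_choice(state, actions, predict_outcome, utility):
    # Pick the action whose predicted resulting state has the highest utility.
    return max(actions, key=lambda a: utility(predict_outcome(state, a)))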
Utility-based agents
Learning agents
 All of the previous agent programs describe methods for selecting actions.
  Yet they do not explain the origin of these programs.
  Learning mechanisms can be used to perform this task.
  Teach agents instead of instructing them.
  The advantage is the robustness of the program toward initially unknown environments.
Learning agents
Learning agents
 Performance element: selects actions based on percepts.
  Corresponds to the previous agent programs.

 Learning element: introduces improvements in the performance element.
  Critic: provides feedback on the agent's performance based on a fixed performance standard.

 Problem generator: suggests actions that will lead to new and informative experiences.
  Suggests experiments.
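A structural sketch of how the four components might fit together (the interfaces are illustrative assumptions, not the lecture's code):

class LearningAgent:
    def __init__(self, performance_element, learning_element, critic, problem_generator):
        self.performance_element = performance_element   # selects actions based on percepts
        self.learning_element = learning_element         # improves the performance element
        self.critic = critic                             # feedback vs. a fixed performance standard
        self.problem_generator = problem_generator       # suggests new, informative experiences

    def step(self, percept):
        feedback = self.critic(percept)
        self.learning_element(self.performance_element, feedback)
        exploratory = self.problem_generator(percept)    # may suggest an experiment
        return exploratory if exploratory is not None else self.performance_element(percept)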
