Artificial Intelligence: CS482, CS682, MW 1 - 2:15, SEM 201, MS 227 Prerequisites: 302, 365 Instructor: Sushil Louis
Artificial Intelligence: CS482, CS682, MW 1 - 2:15, SEM 201, MS 227 Prerequisites: 302, 365 Instructor: Sushil Louis
• Now
• if you see (observe) [B, Dirty] you are in {2}
• If you observe [b, Clean] you are in {4}
• Transition Model is more complicated, otherwise this is not
very different from other search problems
3-stage transition model
•• Prediction
stage
• Predicted belief state is b^ = Predict(b, a)
• Observation prediction stage
• Possible-Percepts(b^) = {o : o = Percept(s) and s ϵ b^}
• Update stage
• = Update(b^, o) = {s : o = Percept(s) and s ϵ b^}
• So, Results(b, a)
• = { : = Update(Predict(b, a), o) and o ϵ Possible-Percepts(Predict(b, a))}
Example: Slippery vacuum
And-Or solution
• Given this problem formulation, we can use the And-Or search
algorithm to come up with a plan to solve the problem
• Given [A, Dirty], Plan = {Suck, Right, if Bstate = {6} then Suck else []}
Partially observable environments
• An agent in a partially observable environment must update belief
state from percept
• b’ = Update(Predict(b, a), o)
• So the agent is only looking at the current o (percept) not the entire
history, as we considered earlier. This is recursive state estimation
• Example: Kindergarten vacuum world
Localization in robotics
• Maintaining belief states is a core function of any Intelligent Agent
• Monitoring, filtering, state estimation
• Robot:
• Four sonar sensors (NSWE) give correct data
• Robot has correct map of environment
• Move is broken Robot moves to random adjacent square
• Robot must determine current location
• Can be very bad (two ways back for every way forward above)
• Let’s augment HC with memory
• Learning real-time A* (LRTA*)
• Updates cost estimates, g(s), for the state it leaves
• Likes unexplored states
• f(s) = h(s) not g(s) + h(s) for unexplored states
LRTA* Example
• We are in shaded state
LRTA* algorithm
Questions
• DFS always expands at least as many nodes as A* with an
admissable heuristic (True/False). Explain.
• H(n) = 0 is an admissible heuristic for the 8-puzzle
• BFS is complete even if 0 step costs are allowed
Types of task environments
Task Env Observable Agents Deterministic Episodic Static Discrete
Soccer
Explore Titan
Shopping for
used AI books
on the Net
Playing tennis
Playing tennis
against a wall
Performing a
high jump
Knitting a
sweater
Bidding on an
item in an
auction