Adversarial Search
CS 221: Artificial Intelligence
Do the Right Thing under Uncertainty
Problems of Agent Uncertainty
Problem: Stochasticity
Solution: MDPs → a policy π(s)
Problem: Partial observability
Solution: operate in belief space: POMDPs → a policy
Problem: Unknown model
Solution: exploration; reinforcement learning
Problem: Computational limitation
Solution: heuristics; A*; Monte Carlo approximations
Problem: Other agents; adversaries
Solution: adversarial search (this lecture)
Environments and Agents
What is the environment?
Something that evolves from one state to the next in response to actions: Result(s, a) → s′
Decision to Model as an Agent
Model an object as an agent iff:
The object’s actions depend not just on the state, but on my belief state (especially my beliefs about my own actions)
“Now, a clever man would put the poison into his own goblet, because he would know that only a great fool would reach for what he was given. I am not a great fool, so I can clearly not choose the wine in front of you. But you must have known I was not a great fool, you would have counted on it, so I can clearly not choose the wine in front of me.”
(The Princess Bride: Vizzini vs. Westley)
Other Agents
Cooperation
Competition
Types of Games
Deterministic (Chess)
Stochastic (Soccer)
Pacman: unknown
1: Deterministic, Fully Observable
Many possible formalizations; one is:
States: S (start at s₀)
Players: P = {1…N} (usually take turns; often N = 2)
Actions: A (may depend on player / state)
Transition Function: S × A → S (or S × {Aᵢ} → S)
Terminal Test: S → {true, false}
Terminal Utilities: S × P → ℝ
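A minimal sketch of this formalization as a Python interface; the names (GameState, actions, result, is_terminal, utility) are illustrative, not from any particular library or the course code:

# Skeleton of the game formalization above (illustrative names only).
from dataclasses import dataclass

@dataclass(frozen=True)
class GameState:
    board: tuple        # some encoding of the position
    to_move: int        # which player acts next (0 .. N-1)

def actions(state):
    """Legal actions A for the player to move; may depend on player and state."""
    raise NotImplementedError

def result(state, action):
    """Transition function: S x A -> S."""
    raise NotImplementedError

def is_terminal(state):
    """Terminal test: S -> {true, false}."""
    raise NotImplementedError

def utility(state, player):
    """Terminal utilities: S x P -> R, defined only at terminal states."""
    raise NotImplementedError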
Deterministic Single-Player
Deterministic, single player (solitaire), perfect information:
Know the rules
Know what actions do
Know when you win
E.g. FreeCell, Rubik’s Cube
… it’s just search!
Slight reinterpretation:
Each node stores a value: the best outcome it can reach
This is the maximal outcome of its children (the max value)
Note that we don’t have path sums as before (utilities are only at the end)
(Example tree: leaves labeled lose, win, lose.)
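A tiny runnable illustration of this reinterpretation; encoding the leaves as 0 = lose and 1 = win is just a convenience here:

# Single-player value of a node: the best outcome reachable from it.
def best_outcome(node):
    if isinstance(node, (int, float)):       # leaf: terminal utility (0 = lose, 1 = win)
        return node
    # No path costs: the node's value is the max of its children's values.
    return max(best_outcome(child) for child in node)

# The slide's tiny tree: three leaves, lose / win / lose.
print(best_outcome([0, 1, 0]))               # -> 1: the win is reachable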
Deterministic Two-Player
Deterministic, zero-sum games: tic-tac-toe, chess, checkers
One player maximizes the result; the other minimizes the result
Minimax search:
A state-space search tree
Players alternate turns
Each node has a minimax value: the best achievable utility against a rational adversary
Minimax values are computed recursively; terminal values are part of the game
(Example tree: MAX root with value 5 over MIN nodes with values 2 and 5, from leaves 8, 2, 5, 6.)
Computing Minimax Values
Two recursive functions:
max-value maxes over the values of successors
min-value mins over the values of successors

def value(state):
    if the state is a terminal state: return the state’s utility
    if the next agent is MAX: return max-value(state)
    if the next agent is MIN: return min-value(state)

def max-value(state):
    initialize v = -∞
    for each successor of state:
        v ← max(v, value(successor))
    return v

def min-value(state):
    initialize v = +∞
    for each successor of state:
        v ← min(v, value(successor))
    return v
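A concrete, runnable version of the same recursion on a tiny hand-built tree; the tree encodes only terminal utilities at the leaves and matches the 8, 2, 5, 6 example above:

# Runnable minimax on a small explicit tree (illustrative example).
def minimax(node, maximizing):
    if isinstance(node, (int, float)):          # leaf: terminal utility
        return node
    children = (minimax(child, not maximizing) for child in node)
    return max(children) if maximizing else min(children)

# MAX moves at the root; each inner list is a MIN (opponent) choice.
tree = [[8, 2], [5, 6]]
print(minimax(tree, maximizing=True))           # -> 5: MAX picks the right branch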
Tic-tac-toe Game Tree
Minimax Example
(Worked tree: MIN nodes take values 3, 2, and 1 from the leaf groups 3 12 8, 2 3 9, and 14 1 8; the MAX root takes value 3.)
Minimax Properties
Optimal against a perfect player. Against a non-perfect player?
(Example tree: the left MIN subtree has leaves 10 and 10, value 10; the right has leaves 9 and 100, value 9. Minimax picks the left branch, even though an imperfect opponent might have let us reach 100 on the right.)
Time complexity? O(b^m)
Space complexity? O(bm)
Overcoming Computational Limits
Cannot search to the leaves in most games
Depth-limited search: instead, search only to a limited depth of the tree
Replace terminal utilities with an evaluation function for non-terminal nodes
The guarantee of optimal play is gone
(Example with limit = 2: a MAX root over MIN nodes with leaves -1, -2 and 4, 9; the MIN values are -2 and 4, so the root value is 4.)
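A sketch of depth-limited minimax, assuming the game interface sketched earlier plus a heuristic evaluation(state); this is an illustration, not the course's reference implementation:

# Depth-limited minimax: use evaluation() at the depth cutoff (sketch).
def dl_value(state, depth, maximizing, player=0):
    if is_terminal(state):
        return utility(state, player)
    if depth == 0:
        return evaluation(state)                 # heuristic estimate, not a true utility
    values = (dl_value(result(state, a), depth - 1, not maximizing, player)
              for a in actions(state))
    return max(values) if maximizing else min(values)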
α-β Pruning in Depth-Limited Search
General configuration:
α is the best value that the maximizing Player can get so far along the current path
If the value of node n becomes worse than α, MAX will avoid it, so Player can stop considering n’s other children
(Figure: Player at the top, the Opponent choosing at node n below.)
Another α-β Pruning Example
(Worked tree example; pruned nodes carry bounds such as ≤2, ≤1, and ≥8; leaf values include 3, 12, 2, 14, 5, 1, and 8.)
α-β Pruning Algorithm
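The algorithm itself did not survive extraction; below is a standard α-β formulation in the style of the minimax pseudocode above, assuming the same game interface (a reconstruction, not copied from the course slides):

# Alpha-beta pruning over the same max/min recursion (standard formulation).
import math

def ab_value(state, alpha, beta, maximizing, player=0):
    if is_terminal(state):
        return utility(state, player)
    if maximizing:
        v = -math.inf
        for a in actions(state):
            v = max(v, ab_value(result(state, a), alpha, beta, False, player))
            if v >= beta:            # MIN above will never allow this branch
                return v
            alpha = max(alpha, v)
        return v
    else:
        v = math.inf
        for a in actions(state):
            v = min(v, ab_value(result(state, a), alpha, beta, True, player))
            if v <= alpha:           # MAX above will never allow this branch
                return v
            beta = min(beta, v)
        return v

# Initial call from the root: ab_value(start, -math.inf, math.inf, maximizing=True)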
α-β Pruning Properties
Pruning has no effect on the final minimax value or action computed at the root
Stochasticity
Expectimax Search Trees
What if we don’t know what the result of an action will be? E.g.:
In solitaire, the next card is unknown
In backgammon, the dice roll
In minesweeper, the mine locations
In Pacman, the random ghost moves
Reminder: Expectations
We can define a function f(X) of a random variable X; its expected value is the probability-weighted average over outcomes:
E[f(X)] = Σₓ P(X = x) · f(x)
Expectimax Search
In expectimax search, we have a probabilistic model of how the opponent (or environment) will behave in any state
The model could be a simple uniform distribution (roll a die)
The model could be sophisticated and require a great deal of computation
We have a node for every outcome that is out of our control: opponent or environment
The model might say that adversarial actions are likely!
For now, assume that for any state we magically have a distribution that assigns probabilities to opponent actions / environment outcomes
Having a probabilistic belief about an agent’s action does not mean that agent is flipping any coins!
Expectimax Algorithm

def value(s):
    if s is a terminal node: return evaluation(s)
    if s is a max node: return max_value(s)
    if s is an exp node: return exp_value(s)

def max_value(s):
    values = [value(t) for t in successors(s)]
    return max(values)

def exp_value(s):
    values = [value(t) for t in successors(s)]
    weights = [probability(s, t) for t in successors(s)]
    return sum(w * v for w, v in zip(weights, values))   # expectation of the values
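A concrete, runnable instance of the same recursion on a tiny explicit tree; the tree encoding and probabilities are illustrative, with leaves 8, 4, 5, 6 as in the slide's example:

# Runnable expectimax on a tiny explicit tree (illustrative example).
def expectimax(node):
    kind = node[0]
    if kind == "leaf":                       # ("leaf", utility)
        return node[1]
    if kind == "max":                        # ("max", [child, ...])
        return max(expectimax(c) for c in node[1])
    # ("exp", [(probability, child), ...]): probability-weighted average
    return sum(p * expectimax(c) for p, c in node[1])

tree = ("max", [
    ("exp", [(0.5, ("leaf", 8)), (0.5, ("leaf", 4))]),   # expected value 6
    ("exp", [(0.5, ("leaf", 5)), (0.5, ("leaf", 6))]),   # expected value 5.5
])
print(expectimax(tree))                      # -> 6.0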
Expectimax for Pacman
Notice that we’ve gotten away from thinking that the ghosts are trying to minimize Pacman’s score
Instead, they are now a part of the environment
Pacman has a belief (a distribution) over how they will act
Quiz: Can we see minimax as a special case of expectimax?
Quiz: What would Pacman’s computation look like if we assumed that the ghosts were doing 1-ply minimax and taking that result 80% of the time, otherwise moving randomly? (One possible sketch follows below.)
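One possible answer sketch for that second quiz, using the same assumed helpers as the expectimax pseudocode above (successors, value, evaluation); this is an illustration, not the course's reference answer:

# Ghost node for the quiz: 80% of the time the ghost plays its 1-ply minimizing
# move (judged by the evaluation function), 20% of the time it moves uniformly
# at random. Pacman's own value() recursion is unchanged. (Illustrative sketch.)
def ghost_value(s):
    succs = list(successors(s))
    ghost_pick = min(succs, key=evaluation)          # ghost's 1-ply minimizing choice
    vals = {t: value(t) for t in succs}              # Pacman still evaluates fully
    uniform_avg = sum(vals.values()) / len(vals)
    return 0.8 * vals[ghost_pick] + 0.2 * uniform_avg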
If you take this further, you end up calculating belief distributions over your opponents’ belief distributions over your belief distributions, etc.
This can get unmanageable very quickly!
Expectimax for Pacman: Results
Results from playing 5 games against a minimizing ghost and against a random ghost (the score table is not reproduced here)
Pacman used depth-4 search with an evaluation function that avoids trouble
The ghost used depth-2 search with an evaluation function that seeks Pacman
Expectimax Example
Expectimax Pruning?
In general, no: without bounds on the leaf values, any unseen outcome could change a chance node’s expected value, so α-β-style pruning does not carry over.
Expectimax Evaluation
Evaluation functions quickly return an estimate of a node’s true value (which value: expectimax or minimax?)
For minimax, the scale of the evaluation function doesn’t matter: we just want better states to have higher evaluations (get the ordering right)
For expectimax, we need the magnitudes to be meaningful (see the worked example below)
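A small illustration, with numbers invented for this point: suppose chance node A has outcomes 30 and 30 (each with probability 0.5) and chance node B has outcomes 0 and 50. Expectimax prefers A (expected value 30 vs. 25). Now apply the order-preserving transform x → x²: A's expectation becomes 900 while B's becomes 1250, so expectimax flips to B, even though the ordering of individual outcomes never changed. Minimax, which only compares worst cases (30 vs. 0, then 900 vs. 0), picks A either way.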
ExpectiMinimax-Value(state):
= Utility(state)  if state is terminal
= max over actions a of ExpectiMinimax-Value(Result(state, a))  if the MAX player moves
= min over actions a of ExpectiMinimax-Value(Result(state, a))  if the MIN player moves
= Σ over outcomes r of P(r) · ExpectiMinimax-Value(Result(state, r))  if state is a chance node
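A compact sketch of that recursion in the style of the expectimax pseudocode above; the node-type tests (is_max_node, is_min_node) and the other helpers are assumed, not defined in the slides:

# Expectiminimax: interleaved max, min, and chance nodes (sketch).
def expectiminimax_value(s):
    if is_terminal(s):
        return evaluation(s)                 # true utility at terminals (or heuristic at a cutoff)
    succs = list(successors(s))
    if is_max_node(s):
        return max(expectiminimax_value(t) for t in succs)
    if is_min_node(s):
        return min(expectiminimax_value(t) for t in succs)
    # chance node: probability-weighted average over outcomes
    return sum(probability(s, t) * expectiminimax_value(t) for t in succs)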
Stochastic Two-Player
E.g. backgammon
Dice rolls increase b: 21 possible rolls with 2 dice, and about 20 legal moves per position
Depth 2: 20 × (21 × 20)^3 ≈ 1.2 × 10^9 nodes
As depth increases, the probability of reaching a given search node shrinks
So the usefulness of deep search is diminished
So limiting depth is less damaging
But pruning is trickier…
TD-Gammon uses depth-2 search + a very good evaluation function + reinforcement learning:
world-champion-level play
The first AI world champion in any game!