Artificial Intelligence: Adversarial Search

This document discusses adversarial search and games in artificial intelligence. Key points:

• Adversarial search problems involve games with an opponent that cannot be controlled and that plans against the agent. The optimal solution is a strategy rather than a single sequence of actions.
• Games present difficult search problems for AI due to their large branching factors and search spaces. Heuristic evaluation functions are used to evaluate non-terminal positions when searching to limited depths.
• Minimax search is commonly used for two-player zero-sum games. It recursively evaluates the utility of states, assuming optimal play from both players, until reaching terminal states.

Uploaded by

Khawir Mahmood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
104 views62 pages

Artificial Intelligence: Adversarial Search

This document discusses adversarial search and games in artificial intelligence. Some key points: - Adversarial search problems involve games with an opponent that cannot be controlled and is planning against the agent. The optimal solution is a strategy rather than a single sequence of actions. - Games present difficult search problems for AI due to their large branching factors and search spaces. Heuristic evaluation functions are used to evaluate non-terminal positions when searching to limited depths. - Minimax search is commonly used for two-player zero-sum games. It recursively evaluates the utility of states assuming optimal play from both players until reaching terminal states.

Uploaded by

Khawir Mahmood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 62

Adversarial Search
• Adversarial search problems ≡ games

• They occur in multiagent competitive environments

• There is an opponent we can’t control, planning against us!

• Game vs. search: the optimal solution is not a sequence of actions but a strategy (policy): if the opponent does a, the agent does b; else if the opponent does c, the agent does d; etc.

• Tedious and fragile if hard-coded (i.e., implemented with rules)

• Good news: games are modeled as search problems and use heuristic evaluation functions.
Games: hard topic
• Games are a big deal in AI

• Games are interesting to AI because they are too hard to solve

• Chess has a branching factor of 35, with 35^100 nodes ≈ 10^154

• We need to make some decision even when computing the optimal decision is infeasible.
Adversarial Search
Checkers:
• Chinook ended the 40-year reign of human world champion Marion Tinsley in 1994.

• Used an endgame database defining perfect play for all positions involving 8 or fewer pieces on the board, a total of 443,748,401,247 positions.
Adversarial Search
Chess:
• In 1949, Claude E. Shannon, in his paper “Programming a Computer for Playing Chess”, suggested chess as an AI problem for the community.
• Deep Blue defeated human world champion Garry Kasparov in a six-game match in 1997.
• In 2006, Vladimir Kramnik, the undisputed world champion, was defeated 4–2 by Deep Fritz.
Adversarial Search
Go: b > 300! Google DeepMind’s AlphaGo project: in 2016, AlphaGo beat both Fan Hui, the European Go champion, and Lee Sedol, the world’s best player.
Othello: Several computer Othello programs exist, and human champions refuse to compete against computers, which are too good.

(Images by Donarreiskoffer and by Paul 012, via Wikimedia Commons.)
Types of games

We are mostly interested in deterministic games, in fully observable environments, zero-sum, where two agents act alternately.
Zero-sum Games
• Adversarial: Pure competition.
• Agents assign different values to the outcomes.
• One agent maximizes one single value, while the other minimizes it.
• Each move by one of the players is called a “ply.”

One function: one agent maximizes it and one minimizes it!


Embedded thinking...
Embedded thinking or backward reasoning!

• One agent is trying to figure out what to do.


• How to decide? He thinks about the consequences of the
possible actions.
• He needs to think about his opponent as well...
• The opponent is also thinking about what to do etc.
• Each will imagine what would be the response from the opponent to their actions.
• This entails embedded thinking.
Formalization
• The initial state

• Player(s): defines which player has the move in state s. Players usually take turns.

• Actions(s): returns the set of legal moves in s

• Transition function: S × A → S defines the result of a move

• Terminal test: True when the game is over, False otherwise. States where the game ends are called terminal states.

• Utility(s, p): utility function or objective function for a game that ends in terminal state s for player p. In chess, the outcome is a win, loss, or draw, with values +1, 0, or 1/2 respectively. For tic-tac-toe we can use utilities of +1, −1, and 0.
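To make this formalization concrete, the sketch below expresses it as a Python interface. The class and method names are hypothetical; they simply mirror the components listed above, and the later sketches in this document assume this interface.

```python
# A minimal sketch of the game formalization (hypothetical interface).
class Game:
    def initial_state(self):
        """Return the initial state."""
    def player(self, s):
        """Return which player has the move in state s, e.g. 'MAX' or 'MIN'."""
    def actions(self, s):
        """Return the set of legal moves in state s."""
    def result(self, s, a):
        """Transition function S x A -> S: the state reached by move a in s."""
    def terminal_test(self, s):
        """Return True when the game is over, False otherwise."""
    def utility(self, s, p):
        """Return the payoff for player p when the game ends in terminal state s."""
```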
Single player...
Assume we have tic-tac-toe with one player.
Let’s call him Max and have him play three moves only, for the sake of the example.
Single player...

In the case of one player, nothing will prevent Max from winning (he chooses the path that leads to the desired utility, here +1), unless there is another player who will do everything to make Max lose; let’s call him Min (the Mean :)).
Adversarial search: minimax
• Two players: Max and Min
• Players alternate turns
• Max moves first
• Max maximizes the result
• Min minimizes the result
• Compute each node’s minimax value: the best achievable utility against an optimal adversary
• Minimax value ≡ best achievable payoff against best play
Minimax example
Adversarial search: minimax
• Find the optimal strategy for Max:

– Depth-first search of the game tree


– An optimal leaf node could appear at any depth of the tree
– Minimax principle: compute the utility of being in a state
assuming both players play optimally from there until the
end of the game
– Propagate minimax values up the tree once terminal nodes
are discovered
Adversarial search: minimax

• If state is a terminal node: value is Utility(state)

• If state is a MAX node: value is the highest value of all successor node values (children)

• If state is a MIN node: value is the lowest value of all successor node values (children)
Adversarial search: minimax

For a state s:

$$\text{minimax}(s) = \begin{cases} \text{Utility}(s) & \text{if Terminal-Test}(s) \\ \max_{a \in \text{Actions}(s)} \text{minimax}(\text{Result}(s,a)) & \text{if } \text{Player}(s) = \text{Max} \\ \min_{a \in \text{Actions}(s)} \text{minimax}(\text{Result}(s,a)) & \text{if } \text{Player}(s) = \text{Min} \end{cases}$$
The minimax algorithm
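In code, the algorithm is a depth-first recursion over the game tree. The following is a minimal sketch, assuming the hypothetical Game interface from the formalization section.

```python
# Minimax value of state s from Max's point of view (sketch).
def minimax(game, s):
    if game.terminal_test(s):
        return game.utility(s, "MAX")
    values = [minimax(game, game.result(s, a)) for a in game.actions(s)]
    # Max picks the highest child value, Min the lowest.
    return max(values) if game.player(s) == "MAX" else min(values)

# Max's decision at the root: the action with the best minimax value.
def minimax_decision(game, s):
    return max(game.actions(s),
               key=lambda a: minimax(game, game.result(s, a)))
```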
Minimax example
Properties of minimax
• Optimal (opponent plays optimally) and complete (finite tree)
• DFS time: O(b^m)
• DFS space: O(bm)

– Tic-Tac-Toe
  * ≈ 5 legal moves on average, a total of 9 moves (9 plies)
  * 5^9 = 1,953,125
  * 9! = 362,880 terminal nodes
– Chess
  * b ≈ 35 (average branching factor)
  * d ≈ 100 (depth of game tree for a typical game)
  * b^d ≈ 35^100 ≈ 10^154 nodes
– Go: branching factor starts at 361 (19×19 board)
Case of limited resources
• Problem: In real games, we are limited in time, so we can’t search all the way to the leaves.

• To be practical and run in a reasonable amount of time, minimax can only search to some depth.

• More plies make a big difference.

• Solution:
1. Replace terminal utilities with an evaluation function for
non-terminal positions.
2. Use Iterative Deepening Search (IDS).
3. Use pruning: eliminate large parts of the tree.
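Before turning to pruning, here is a minimal sketch of solution 1: minimax cut off at a fixed depth, falling back on an evaluation function for non-terminal positions. eval_fn is a hypothetical heuristic (discussed under real-time decisions below); the Game interface is the one assumed earlier.

```python
# Depth-limited minimax (sketch): stop at depth 0 and estimate with eval_fn.
def minimax_cutoff(game, s, depth, eval_fn):
    if game.terminal_test(s):
        return game.utility(s, "MAX")
    if depth == 0:
        return eval_fn(s)  # heuristic estimate instead of searching deeper
    values = [minimax_cutoff(game, game.result(s, a), depth - 1, eval_fn)
              for a in game.actions(s)]
    return max(values) if game.player(s) == "MAX" else min(values)
```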
α-β pruning

A two-ply game tree.


α-β pruning
Which values are necessary?
α-β pruning

$$\begin{aligned} \text{Minimax}(root) &= \max(\min(3, 12, 8),\ \min(2, X, Y),\ \min(14, 5, 2)) \\ &= \max(3,\ \min(2, X, Y),\ 2) \\ &= \max(3,\ Z,\ 2) \quad \text{where } Z = \min(2, X, Y) \le 2 \\ &= 3 \end{aligned}$$

Minimax decisions are independent of the values of X and Y.
α-β pruning
• Strategy: Just like minimax, it performs a DFS.
• Parameters: Keep track of two bounds
– α: the largest value for Max across seen children (current lower bound on Max’s outcome).
– β: the lowest value for Min across seen children (current upper bound on Min’s outcome).
• Initialization: α = −∞, β = +∞
• Propagation: Send α, β values down during the search to be used for pruning.
– Update α, β values by propagating upwards the values of terminal nodes.
– Update α only at Max nodes and update β only at Min nodes.
• Pruning: Prune any remaining branches whenever α ≥ β.
α-β pruning

• If α is better than a for Max, then Max will avoid it, that is, prune that branch.

• If β is better than b for Min, then Min will avoid it, that is, prune that branch.
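Putting the strategy, the two bounds, and the pruning rule together, here is a minimal sketch of α-β search, again under the hypothetical Game interface assumed earlier.

```python
# Alpha-beta pruning (sketch): minimax with two bounds passed down the DFS.
def alphabeta(game, s, alpha=float("-inf"), beta=float("inf")):
    if game.terminal_test(s):
        return game.utility(s, "MAX")
    if game.player(s) == "MAX":
        v = float("-inf")
        for a in game.actions(s):
            v = max(v, alphabeta(game, game.result(s, a), alpha, beta))
            alpha = max(alpha, v)   # alpha is only updated at Max nodes
            if alpha >= beta:
                break               # prune the remaining branches
        return v
    else:
        v = float("inf")
        for a in game.actions(s):
            v = min(v, alphabeta(game, game.result(s, a), alpha, beta))
            beta = min(beta, v)     # beta is only updated at Min nodes
            if alpha >= beta:
                break               # prune the remaining branches
        return v
```

The function returns the same value as plain minimax; pruning only skips branches that cannot influence the final decision.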
Move ordering

• Move ordering matters, as it affects the effectiveness of α-β pruning.

• Example: We could not prune any successor of D because the worst successors for Min were generated first. If the third one (leaf 2) had been generated first, we would have pruned the other two (14 and 5).

• Idea of ordering: examine first the successors that are likely best.


Move ordering
• Worst ordering: no pruning happens (best moves are on the right of the game tree). Complexity O(b^m).

• Ideal ordering: lots of pruning happens (best moves are on the left of the game tree). This solves a tree twice as deep as minimax in the same amount of time. Complexity O(b^{m/2}) (in practice). The search can go deeper in the game tree.

• How to find a good ordering? (one simple approach is sketched after this list)
– Remember the best moves from the shallowest nodes.
– Order the nodes so that the best are checked first.
– Use domain knowledge: e.g., for chess, try this order: captures first, then threats, then forward moves, then backward moves.
– Bookkeep the states, as they may repeat!
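One simple realization of these ideas is to sort the children by a shallow heuristic before expanding them, so the likely-best successors are examined first. eval_fn is the hypothetical evaluation function used in the earlier sketches.

```python
# Move ordering (sketch): expand the most promising successors first,
# which increases the amount of pruning done by alpha-beta.
def ordered_actions(game, s, eval_fn):
    best_first = (game.player(s) == "MAX")  # Max wants high values first
    return sorted(game.actions(s),
                  key=lambda a: eval_fn(game.result(s, a)),
                  reverse=best_first)
```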
Real-time decisions
• Minimax: generates the entire game search space

• α-β algorithm: prunes large chunks of the tree

• BUT α-β still has to go all the way down to the leaves

• Impractical in real time (moves have to be made in a reasonable amount of time)

• Solution: bound the depth of search (cut off the search) and replace Utility(s) with Eval(s), an evaluation function that estimates the value of the current board configuration
Real-time decisions
• eval(s) is a heuristic at state s
E.g., Othello: # white pieces − # black pieces
E.g., Chess: value of all white pieces − value of all black pieces
It turns non-terminal nodes into terminal leaves!

• An ideal evaluation function would rank terminal states in the same way as the true utility function; but it must be fast

• It is typical to define features and make the function a linear weighted sum of the features

• Use domain knowledge to craft the best and most useful features.
Real-time decisions

• How does it work?

– Select useful features f1, . . . , fn; e.g., for chess: # pieces on board, value of pieces (1 for pawn, 3 for bishop, etc.)
– Weighted linear function: $\text{eval}(s) = \sum_{i=1}^{n} w_i f_i(s)$
– Learn the weights w_i from examples
– Deep Blue used about 6,000 features!
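As a minimal sketch of such a weighted linear evaluation: the feature functions and weights below are hypothetical placeholders, not Deep Blue's actual features.

```python
# Weighted linear evaluation (sketch): eval(s) = sum_i w_i * f_i(s).
def make_eval(weights, features):
    def eval_fn(s):
        return sum(w * f(s) for w, f in zip(weights, features))
    return eval_fn

# Usage sketch with hypothetical chess features:
# eval_fn = make_eval([1.0, 0.1], [material_balance, mobility])
```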
Stochastic games
• Include a random element (e.g., throwing a die).
• Include chance nodes.
• Backgammon: an old board game combining skill and chance.
• The goal is that each player tries to move all of his pieces off the board before his opponent does.

(Image: Ptkfgs [Public domain], via Wikimedia Commons.)


Stochastic games

Partial game tree for Backgammon.


Stochastic games
The Expectiminimax algorithm generalizes minimax to handle chance nodes as follows:

• If state is a Max node, then return the highest Expectiminimax-Value of Successors(state)

• If state is a Min node, then return the lowest Expectiminimax-Value of Successors(state)

• If state is a chance node, then return the probability-weighted average of the Expectiminimax-Values of Successors(state)
Stochastic games
Example with coin-flipping:
Expectiminimax

For a state s:

$$\text{Expectiminimax}(s) = \begin{cases} \text{Utility}(s) & \text{if Terminal-Test}(s) \\ \max_{a \in \text{Actions}(s)} \text{Expectiminimax}(\text{Result}(s,a)) & \text{if } \text{Player}(s) = \text{Max} \\ \min_{a \in \text{Actions}(s)} \text{Expectiminimax}(\text{Result}(s,a)) & \text{if } \text{Player}(s) = \text{Min} \\ \sum_{r} P(r)\, \text{Expectiminimax}(\text{Result}(s,r)) & \text{if } \text{Player}(s) = \text{Chance} \end{cases}$$

where r represents all chance events (e.g., a dice roll), and Result(s,r) is the same state as s with the result of the chance event being r.
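A minimal sketch of this recursion follows. The chance-node case assumes a hypothetical extension of the earlier Game interface, game.chance_events(s), returning (event, probability) pairs.

```python
# Expectiminimax (sketch): minimax extended with chance nodes.
def expectiminimax(game, s):
    if game.terminal_test(s):
        return game.utility(s, "MAX")
    mover = game.player(s)
    if mover == "MAX":
        return max(expectiminimax(game, game.result(s, a))
                   for a in game.actions(s))
    if mover == "MIN":
        return min(expectiminimax(game, game.result(s, a))
                   for a in game.actions(s))
    # Chance node: probability-weighted average over random outcomes r.
    return sum(p * expectiminimax(game, game.result(s, r))
               for r, p in game.chance_events(s))
```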
Games: conclusion
• Games are modeled in AI as search problems and use heuristics to evaluate the game.

• The minimax algorithm chooses the best move given optimal play from the opponent.

• Minimax goes all the way down the tree, which is not practical given game time constraints.

• Alpha-beta pruning can reduce the game tree search, which allows going deeper in the tree within the time constraints.

• Pruning, bookkeeping, evaluation heuristics, node re-ordering, and IDS are effective in practice.
Games: conclusion
• Games are an exciting and fun topic for AI.

• Devising adversarial search agents is challenging because of the huge state space.

• We have just scratched the surface of this topic.

• Further topics to explore include partially observable games (card games such as bridge, poker, etc.).

• Except for robot football (a.k.a. soccer), there has not been much interest from AI in physical games (see http://www.robocup.org/).

• Interested in chess? Check out the evaluation functions in Claude Shannon’s paper.

• You will implement a game in your homework assignment.
