AI Lec07 Adversarial Search

Introduction to

Artificial Intelligence
Lecture: Adversarial Search
Outline

● Games
● Optimal Decisions in Games
● α-β Pruning
● Imperfect, Real-time Decisions
● Stochastic Games

Multiagent environments

● Each agent needs to consider the actions of other agents and how they affect its own welfare.
● The unpredictability of other agents introduces contingencies into the agent's problem-solving process.
● Game theory views any multiagent environment as a game.
○ The impact of each agent on the others is "significant," regardless of whether the agents are cooperative or competitive.
● Types of games:
○ Perfect information vs. imperfect information
○ Deterministic vs. chance
A sequential game has perfect information if each player, when making any decision, is perfectly informed of all the events that have previously occurred, including the "initialization event" of the game.
Types of game

                      | Deterministic                | Chance
Perfect information   | Chess, Checkers, Go, Othello | Backgammon, Monopoly
Imperfect information |                              | Bridge, poker, Scrabble, nuclear war
Adversarial search

● Adversarial search (often simply called games) covers competitive environments in which the agents' goals are in conflict.
● Zero-sum games of perfect information
○ Deterministic, fully observable, turn-taking, two-player environments
○ The utility values at the end of the game are always equal and opposite.
Primary assumptions

● Two players only, called MAX and MIN.
○ MAX moves first, and then they take turns moving until the game ends.
○ The winner receives a reward; the loser receives a penalty.
● Both players have complete knowledge of the game's state.
● No element of chance.
● Zero-sum games
○ The total payoff to all players is the same for every instance of the game.
● Rational players
○ Each player always tries to maximize his/her own utility.
Games as search

● S0 – Initial state: How the game is set up at the start.
● PLAYER(s): Which player, MAX or MIN, has the move in state s.
● ACTIONS(s): The set of legal moves in state s.
● RESULT(s, a) – Transition model: The state resulting from taking move a in state s.
● TERMINAL-TEST(s): Is the game finished?
● UTILITY(s, p) – Utility function: A numerical value of a terminal state s for a player p.
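As a concrete illustration (not from the slides), the six formal elements above can be realized for a simple Nim variant: players alternately remove 1-3 counters from a pile of 21, and whoever takes the last counter wins. The class and state encoding below are illustrative choices.

```python
# Illustrative sketch of the formal game elements for a simple Nim variant.
# A state is the pair (counters_left, player_to_move).

class NimGame:
    def initial_state(self):                      # S0: how the game starts
        return (21, "MAX")

    def player(self, s):                          # PLAYER(s): who moves in s
        return s[1]

    def actions(self, s):                         # ACTIONS(s): legal moves in s
        return [k for k in (1, 2, 3) if k <= s[0]]

    def result(self, s, a):                       # RESULT(s, a): transition model
        return (s[0] - a, "MIN" if s[1] == "MAX" else "MAX")

    def terminal_test(self, s):                   # TERMINAL-TEST(s): game over?
        return s[0] == 0

    def utility(self, s, p):                      # UTILITY(s, p) at a terminal s:
        winner = "MIN" if s[1] == "MAX" else "MAX"   # the player who just moved
        return 1 if winner == p else -1              # took the last counter

g = NimGame()
s = g.result(g.initial_state(), 3)   # MAX removes 3 counters
print(s)                             # → (18, 'MIN')
```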
Games vs. Search problems

● Complexity: games are usually too hard to solve exactly.
● Time limits: a decision must be made even when computing the optimal decision is infeasible.
● Efficiency: inefficiency is penalized severely.
○ Several interesting ideas on how to make the best possible use of time have been spawned.
Search Tree of Tic-Tac-Toe

[Figure: partial search tree for tic-tac-toe, from the empty board down to terminal states]
Optimal decision in games

● Normal search problem
○ An optimal solution is a sequence of actions leading to a goal state.
● Games
○ A strategy that guarantees a win for a player
○ The optimal strategy can be determined from the minimax value of each node.
● MINIMAX(s) =
    UTILITY(s)                                    if TERMINAL-TEST(s)
    max_{a ∈ ACTIONS(s)} MINIMAX(RESULT(s, a))    if PLAYER(s) = MAX
    min_{a ∈ ACTIONS(s)} MINIMAX(RESULT(s, a))    if PLAYER(s) = MIN
Example

[Figure: a two-ply game tree annotated with MAX's best move, MIN's best move, and the utility values for MAX]
The minimax algorithm

● Compute the minimax decision from the current state


● Use a simple recursive computation of the minimax values of
each successor state
○ The recursion proceeds all the way down to the leaves of
the tree, and then the minimax values are backed up
through the tree as the recursion unwinds.

The minimax algorithm

function MINIMAX-DECISION(state) returns an action
  return argmax_{a ∈ ACTIONS(state)} MIN-VALUE(RESULT(state, a))

function MAX-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← -∞
  for each a in ACTIONS(state) do
    v ← MAX(v, MIN-VALUE(RESULT(state, a)))
  return v

function MIN-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for each a in ACTIONS(state) do
    v ← MIN(v, MAX-VALUE(RESULT(state, a)))
  return v
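The pseudocode above can be transcribed almost line for line into Python and run on a small hand-built game tree. The dict-based tree encoding is an illustrative choice: inner dicts are MIN nodes keyed by action name, and numbers are leaf utilities (the values match a common three-branch lecture example).

```python
# Minimax on a hand-built two-ply tree: dicts are internal nodes, numbers leaves.

def minimax_decision(tree):
    # MAX picks the action whose successor has the highest min-value.
    return max(tree, key=lambda a: min_value(tree[a]))

def max_value(node):
    if isinstance(node, (int, float)):   # TERMINAL-TEST / UTILITY
        return node
    return max(min_value(child) for child in node.values())

def min_value(node):
    if isinstance(node, (int, float)):
        return node
    return min(max_value(child) for child in node.values())

tree = {
    "a1": {"b1": 3, "b2": 12, "b3": 8},
    "a2": {"c1": 2, "c2": 4,  "c3": 6},
    "a3": {"d1": 14, "d2": 5, "d3": 2},
}
print(minimax_decision(tree))   # → a1  (the min-values of a1, a2, a3 are 3, 2, 2)
```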
The minimax algorithm

● A complete depth-first exploration of the game tree


● Completeness
○ Yes (if tree is finite)
● Optimality
○ Yes (against an optimal opponent)
● Time complexity
○ O(b^m), with branching factor b and maximum tree depth m
● Space complexity
○ O(bm) (depth-first exploration)
Problem with minimax search

● The number of game states is exponential in the depth of the tree.
→ Do not examine every node.
● Alpha-beta pruning: prune away branches that cannot possibly influence the final decision.
● Bounded lookahead
○ Limit the depth of each search.
○ This is what chess players do: look ahead a few moves and see what looks best.
Alpha-beta pruning

[Figure: alpha-beta pruning applied step by step to an example game tree]
Alpha-beta pruning

function ALPHA-BETA-SEARCH(state) returns an action
  v ← MAX-VALUE(state, -∞, +∞)
  return the action in ACTIONS(state) with value v

function MAX-VALUE(state, α, β) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← -∞
  for each a in ACTIONS(state) do
    v ← MAX(v, MIN-VALUE(RESULT(state, a), α, β))
    if v ≥ β then return v
    α ← MAX(α, v)
  return v

function MIN-VALUE(state, α, β) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for each a in ACTIONS(state) do
    v ← MIN(v, MAX-VALUE(RESULT(state, a), α, β))
    if v ≤ α then return v
    β ← MIN(β, v)
  return v
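A compact Python sketch of the pseudocode above, using the same illustrative dict-based tree encoding (inner dicts are nodes, numbers are leaves). It records which leaves are evaluated so the effect of pruning is visible.

```python
import math

# Alpha-beta search with a `visited` list counting leaf evaluations.

def alphabeta(node, alpha=-math.inf, beta=math.inf, maximizing=True, visited=None):
    if visited is None:
        visited = []
    if isinstance(node, (int, float)):    # leaf: return its utility
        visited.append(node)
        return node
    if maximizing:
        v = -math.inf
        for child in node.values():
            v = max(v, alphabeta(child, alpha, beta, False, visited))
            if v >= beta:                 # beta cut: MIN would never allow this
                return v
            alpha = max(alpha, v)
    else:
        v = math.inf
        for child in node.values():
            v = min(v, alphabeta(child, alpha, beta, True, visited))
            if v <= alpha:                # alpha cut: MAX already has better
                return v
            beta = min(beta, v)
    return v

tree = {
    "a1": {"b1": 3, "b2": 12, "b3": 8},
    "a2": {"c1": 2, "c2": 4,  "c3": 6},
    "a3": {"d1": 14, "d2": 5, "d3": 2},
}
leaves = []
print(alphabeta(tree, visited=leaves))   # → 3
print(len(leaves))                       # → 7  (c2 and c3 were pruned)
```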
Alpha-beta pruning

● Pruning does not affect the final result.
● Good move ordering improves the effectiveness of pruning.
● Killer move heuristic: try first the moves that produced cutoffs in similar positions.
● A transposition table avoids re-evaluating a state that can be reached by different move orderings.
Heuristic minimax

● Both minimax and alpha-beta pruning search all the way to terminal states.
○ This depth is usually impractical because moves must be made in a reasonable amount of time (~minutes).
● Cut off the search earlier with some depth limit.
● Use an evaluation function.
● H-MINIMAX(s, d) =
    EVAL(s)                                               if CUTOFF-TEST(s, d)
    max_{a ∈ ACTIONS(s)} H-MINIMAX(RESULT(s, a), d+1)     if PLAYER(s) = MAX
    min_{a ∈ ACTIONS(s)} H-MINIMAX(RESULT(s, a), d+1)     if PLAYER(s) = MIN
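H-MINIMAX can be sketched as follows on a small Nim-style counting game (take 1 or 2 counters; whoever takes the last one wins). The game, the depth limit of 3, and the evaluation heuristic are all illustrative assumptions, not from the slides.

```python
def player(s):  return s[1]                     # s = (counters_left, mover)
def actions(s): return [a for a in (1, 2) if a <= s[0]]
def result(s, a): return (s[0] - a, "MIN" if s[1] == "MAX" else "MAX")

def cutoff_test(s, d):                          # CUTOFF-TEST(s, d)
    return s[0] == 0 or d >= 3                  # terminal, or depth limit reached

def eval_fn(s):                                 # EVAL(s), from MAX's viewpoint
    # Heuristic: with moves {1, 2}, the mover loses iff counters % 3 == 0.
    losing_for_mover = s[0] % 3 == 0
    mover_is_max = s[1] == "MAX"
    return -1 if losing_for_mover == mover_is_max else 1

def h_minimax(s, d):
    if cutoff_test(s, d):
        return eval_fn(s)
    vals = [h_minimax(result(s, a), d + 1) for a in actions(s)]
    return max(vals) if player(s) == "MAX" else min(vals)

# MAX to move with 7 counters: taking 1 leaves 6, a losing count for MIN.
best = max(actions((7, "MAX")), key=lambda a: h_minimax(result((7, "MAX"), a), 1))
print(best)   # → 1
```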
Evaluation Functions

● The evaluation function should order the terminal states in the same way as the true utility function does.
○ States that are wins must evaluate better than draws, which in turn must evaluate better than losses.
● The computation must not take too long!
● For nonterminal states, the ordering should be strongly correlated with the actual chances of winning.
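One common way to meet these requirements (a standard technique from the literature, not stated on this slide) is a weighted linear sum of features. The sketch below scores chess positions by material balance, using the traditional illustrative piece values.

```python
# Hypothetical material-balance evaluation function for chess,
# a weighted linear sum of piece-count features from MAX's viewpoint.
MATERIAL = {"pawn": 1, "knight": 3, "bishop": 3, "rook": 5, "queen": 9}

def eval_material(own_counts, opp_counts):
    """Score a state by material balance; positive favors MAX."""
    own = sum(MATERIAL[p] * n for p, n in own_counts.items())
    opp = sum(MATERIAL[p] * n for p, n in opp_counts.items())
    return own - opp

# 8 pawns + 2 rooks (18) vs. 7 pawns + 1 queen (16):
print(eval_material({"pawn": 8, "rook": 2}, {"pawn": 7, "queen": 1}))  # → 2
```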
Cutting off search

● MINIMAX-CUTOFF is identical to MINIMAX-VALUE except that
○ TERMINAL-TEST is replaced by CUTOFF-TEST
○ UTILITY is replaced by EVAL

if CUTOFF-TEST(state, depth) then return EVAL(state)
Stochastic Games

● Uncertain outcomes are controlled by chance, not by an adversary!
● Why wouldn't we know what the result of an action will be?
○ Explicit randomness: rolling dice
○ Unpredictable opponents: the ghosts respond randomly
○ Actions can fail: when moving a robot, the wheels might slip
Expectimax search

● Values reflect average-case (expectimax) outcomes, not worst-case (minimax) outcomes.
● Expectimax search: compute the average score under optimal play.
○ Max nodes behave as in minimax search.
○ Chance nodes are like min nodes, but their outcome is uncertain.
○ Chance nodes compute expected utilities, i.e., the probability-weighted average of their children's values.
● The underlying uncertain-outcome problems can be formalized as Markov Decision Processes.
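The chance-node rule above can be sketched as follows; the tuple-based tree encoding and the example probabilities are illustrative inventions.

```python
# Expectimax: max nodes take the best child; chance nodes take the
# probability-weighted average of their children.
# Node formats: ("max", [children]) | ("chance", [(p, child), ...]) | leaf value.

def expectimax(node):
    if isinstance(node, (int, float)):             # leaf utility
        return node
    kind, children = node
    if kind == "max":
        return max(expectimax(c) for c in children)
    # chance node: expected utility = sum of p * value over outcomes
    return sum(p * expectimax(c) for p, c in children)

# MAX picks between two chance nodes (e.g., two dice-dependent moves):
tree = ("max", [
    ("chance", [(0.5, 8), (0.5, 2)]),      # expected value 5.0
    ("chance", [(0.75, 12), (0.25, 8)]),   # expected value 11.0
])
print(expectimax(tree))    # → 11.0
```

Note that minimax would prefer the first branch (worst case 2 vs. worst case 8 is actually the second branch here, but with probabilities the averages, not the extremes, decide).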
Expectimax search

● Some pruning is possible in expectimax search, although exact alpha-beta-style cuts require known bounds on the leaf utilities.
● Common techniques for pruning in expectimax include:
○ Depth limiting: bounding the depth of the search tree effectively prunes branches that are too deep to explore in practice.
○ Evaluation function: estimate the value of a state without fully exploring its subtree; if further exploration is unlikely to change the decision, the subtree can be pruned.
○ Probabilistic pruning: branches with very low probability can be pruned, since they contribute little to the overall expectation.
○ Iterative deepening: combined with the techniques above, deeper parts of the tree are explored only when necessary, based on the current state of the search.
References

● Stuart Russell and Peter Norvig. 2009. Artificial Intelligence: A Modern Approach (3rd ed.). Prentice Hall Press, Upper Saddle River, NJ, USA.
● Lê Hoài Bắc, Tô Hoài Việt. 2014. Giáo trình Cơ sở Trí tuệ nhân tạo (Fundamentals of Artificial Intelligence textbook). Khoa Công nghệ Thông tin, Trường ĐH Khoa học Tự nhiên, ĐHQG-HCM.
● Nguyễn Ngọc Thảo, Nguyễn Hải Minh. 2020. Bài giảng Cơ sở Trí tuệ Nhân tạo (Fundamentals of Artificial Intelligence lecture notes). Khoa Công nghệ Thông tin, Trường ĐH Khoa học Tự nhiên, ĐHQG-HCM.
