Adversarial Search and Game Playing: Games

The document introduces games and adversarial search. Games arise in multi-agent environments where agents' actions affect one another; competitive multi-agent environments give rise to adversarial search, commonly referred to simply as "games". The document covers why games are studied, the different types of games, and how games relate to search problems, then works through a partial game tree for tic-tac-toe, optimal strategies, and the minimax algorithm for finding the best strategy.


Adversarial Search and Game Playing
Russell and Norvig, Chapter 5

Games

- Games: multi-agent environments.
  - What do other agents do, and how do they affect our success?
  - Cooperative vs. competitive multi-agent environments.
  - Competitive multi-agent environments give rise to adversarial search, a.k.a. games.
- Why study games?
  - Fun!
  - They are hard.
  - They are easy to represent, and the agents are sometimes restricted to a small number of actions.

http://xkcd.com/601/

Relation of Games to Search

- Search: no adversary.
  - The solution is a (heuristic) method for finding a goal.
  - Heuristics and CSP techniques can find the optimal solution.
  - Evaluation function: an estimate of the cost from the start to the goal through a given node.
  - Examples: path planning, scheduling activities.
- Games: adversary.
  - The solution is a strategy (a strategy specifies a move for every possible opponent reply).
  - Time limits force approximate solutions.
  - Examples: chess, checkers, Othello, backgammon.

Types of Games

                         Deterministic                   Chance
  Perfect information    chess, go, checkers, othello    backgammon
  Imperfect information  bridge, hearts                  poker, canasta, scrabble

Our focus: deterministic, turn-taking, two-player, zero-sum games of perfect information.
- Zero-sum game: a participant's gain (or loss) is exactly balanced by the losses (or gains) of the other participant.
- Perfect information: the environment is fully observable.

Game setup

- Two players: MAX and MIN.
- MAX moves first, and the players take turns until the game is over.
- Games as search:
  - Initial state: e.g., the starting board configuration.
  - PLAYER(s): which player has the move in state s.
  - ACTIONS(s): the set of legal moves in state s.
  - RESULT(s, a): the state resulting from taking move a in state s.
  - TERMINAL-TEST(s): is the game over? (terminal states)
  - UTILITY(s, p): the value of terminal state s to player p, e.g., win (+1), lose (-1), and draw (0) in chess.
- Players use the search tree to determine their next move.

Partial Game Tree for Tic-Tac-Toe

(figure not reproduced)
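The formal elements PLAYER, ACTIONS, RESULT, TERMINAL-TEST, and UTILITY can be sketched as plain functions. The tiny "take 1 or 2 sticks" game below is an illustrative stand-in, not a game from the slides; whoever takes the last stick wins.

```python
# Minimal sketch of the formal game elements, using a toy game
# (an assumed example): a state is (sticks_remaining, player_to_move).

MAX_PLAYER, MIN_PLAYER = "MAX", "MIN"

def player(state):
    """PLAYER(s): which player has the move in this state."""
    _sticks, to_move = state
    return to_move

def actions(state):
    """ACTIONS(s): legal moves, i.e. take 1 or 2 sticks (never more than remain)."""
    sticks, _ = state
    return [n for n in (1, 2) if n <= sticks]

def result(state, action):
    """RESULT(s, a): state after the move; the turn passes to the other player."""
    sticks, to_move = state
    nxt = MIN_PLAYER if to_move == MAX_PLAYER else MAX_PLAYER
    return (sticks - action, nxt)

def terminal_test(state):
    """TERMINAL-TEST(s): the game is over when no sticks remain."""
    return state[0] == 0

def utility(state):
    """UTILITY(s): if it is MAX's turn with 0 sticks, MIN took the last
    stick and won, so the value to MAX is -1; otherwise +1."""
    _, to_move = state
    return -1 if to_move == MAX_PLAYER else +1
```

Any concrete game (tic-tac-toe, chess) plugs into minimax through exactly this interface; only the five functions change.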

The Tic-Tac-Toe Search Space

- Is this search space a tree or a graph?
- What is the minimum search depth?
- What is the maximum search depth?
- What is the branching factor?

http://xkcd.com/832/

Optimal strategies

- Find the best strategy for MAX assuming an infallible MIN opponent.
- Assumption: both players play optimally.
- Given a game tree, the optimal strategy can be determined from the minimax value of each node:

  MINIMAX(s) =
    UTILITY(s)                                    if s is terminal
    max_{a in ACTIONS(s)} MINIMAX(RESULT(s, a))   if PLAYER(s) = MAX
    min_{a in ACTIONS(s)} MINIMAX(RESULT(s, a))   if PLAYER(s) = MIN

- Definition: a ply is one player's turn in a two-player game.

Two-Ply Game Tree

(figure not reproduced: the slides annotate a two-ply tree with backed-up values)

- The minimax value at a MIN node is the minimum of the backed-up values, because your opponent will do what is best for them (and worst for you).
- Minimax maximizes the worst-case outcome for MAX.
- The move chosen at the root is the minimax decision.
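As a concrete check, take the leaf values from the standard Russell and Norvig two-ply figure (an assumption here, since the figure itself is not reproduced): the three MIN nodes B, C, D sit above leaves (3, 12, 8), (2, 4, 6), and (14, 5, 2). Backing up the values gives:

```latex
\begin{align*}
\text{MINIMAX}(B) &= \min(3, 12, 8) = 3 \\
\text{MINIMAX}(C) &= \min(2, 4, 6) = 2 \\
\text{MINIMAX}(D) &= \min(14, 5, 2) = 2 \\
\text{MINIMAX}(\text{root}) &= \max(3, 2, 2) = 3
\end{align*}
```

so MAX's minimax decision is the move leading to B, with guaranteed value 3.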

Minimax Algorithm

function MINIMAX-DECISION(state) returns an action
  return arg max_{a in ACTIONS(state)} MIN-VALUE(RESULT(state, a))

function MAX-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← -∞
  for each a in ACTIONS(state) do
    v ← MAX(v, MIN-VALUE(RESULT(state, a)))
  return v

function MIN-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for each a in ACTIONS(state) do
    v ← MIN(v, MAX-VALUE(RESULT(state, a)))
  return v

Properties of Minimax

- Minimax explores the tree using DFS.
- Therefore, with branching factor b and maximum depth m:
  - Time complexity: O(b^m) (bad)
  - Space complexity: O(bm) (good)
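The MINIMAX-DECISION pseudocode above translates almost line for line into Python. This minimal sketch hard-codes a two-ply tree (three MAX actions a1-a3, each leading to a MIN node over three leaf utilities); the leaf values are borrowed from the standard textbook figure, an assumption since the slides' figures are not reproduced.

```python
import math

# Each node is either a terminal utility (an int) or a list of children.
TREE = {"a1": [3, 12, 8], "a2": [2, 4, 6], "a3": [14, 5, 2]}

def max_value(node):
    if isinstance(node, int):        # TERMINAL-TEST: return UTILITY
        return node
    v = -math.inf
    for child in node:
        v = max(v, min_value(child))
    return v

def min_value(node):
    if isinstance(node, int):
        return node
    v = math.inf
    for child in node:
        v = min(v, max_value(child))
    return v

def minimax_decision(tree):
    """Pick the action whose backed-up MIN-VALUE is highest."""
    return max(tree, key=lambda a: min_value(tree[a]))

best = minimax_decision(TREE)   # "a1" on this tree: max(3, 2, 2) = 3
```

Note that the DFS structure of the pseudocode is preserved: each call fully evaluates one subtree before moving to the next, which is what gives the O(bm) space bound.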

Problem of minimax search

- The number of game states is exponential in the number of moves.
  - Solution: do not examine every node.
  - ==> Alpha-beta pruning.

Alpha-Beta Pruning

- α: the highest (i.e., best for MAX) value possible.
- β: the lowest (i.e., best for MIN) value possible.
- Initially, (α, β) = (-∞, +∞).
- Remove branches that do not influence the final decision.
- General idea: you can bracket the highest/lowest value at a node even before all of its successors have been evaluated.

Alpha-Beta Example

(figure sequence not reproduced: the slides step through alpha-beta pruning on a two-ply tree, maintaining at each node a range of possible values. The root starts at [-∞, +∞]; the first MIN node narrows from [-∞, 3] to [3, 3]; the second MIN node stops at [-∞, 2], and since this node is already worse for MAX, its remaining successors are pruned; the third MIN node narrows through [-∞, 14] and [-∞, 5] to [2, 2]; the root finishes at [3, 3], which yields the minimax decision.)

Alpha-Beta Algorithm

function ALPHA-BETA-SEARCH(state) returns an action
  v ← MAX-VALUE(state, -∞, +∞)
  return the action in ACTIONS(state) with value v

function MAX-VALUE(state, α, β) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← -∞
  for each a in ACTIONS(state) do
    v ← MAX(v, MIN-VALUE(RESULT(state, a), α, β))
    if v ≥ β then return v
    α ← MAX(α, v)
  return v

Alpha-Beta Algorithm (continued)

function MIN-VALUE(state, α, β) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for each a in ACTIONS(state) do
    v ← MIN(v, MAX-VALUE(RESULT(state, a), α, β))
    if v ≤ α then return v
    β ← MIN(β, v)
  return v

Alpha-beta pruning

- When enough is known about a node n, it can be pruned.
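A Python sketch of the two functions above, run on a hard-coded two-ply tree whose leaf values are borrowed from the standard textbook figure (an assumption, since the slides' figures are not reproduced). A leaf counter makes the pruning visible.

```python
import math

# Nested lists: the root is a MAX node over three MIN nodes; ints are leaves.
TREE = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
leaves_seen = 0

def max_value(node, alpha, beta):
    global leaves_seen
    if isinstance(node, int):        # TERMINAL-TEST: return UTILITY
        leaves_seen += 1
        return node
    v = -math.inf
    for child in node:
        v = max(v, min_value(child, alpha, beta))
        if v >= beta:
            return v                 # prune: MIN will never allow this branch
        alpha = max(alpha, v)
    return v

def min_value(node, alpha, beta):
    global leaves_seen
    if isinstance(node, int):
        leaves_seen += 1
        return node
    v = math.inf
    for child in node:
        v = min(v, max_value(child, alpha, beta))
        if v <= alpha:
            return v                 # prune: MAX already has a better option
        beta = min(beta, v)
    return v

value = max_value(TREE, -math.inf, math.inf)
```

On this tree, alpha-beta evaluates 7 of the 9 leaves while still returning the minimax value 3: the second MIN node is cut off after its first leaf (2 ≤ α = 3). Good move ordering maximizes exactly these cutoffs, which is why it matters so much in practice.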

Final Comments about Alpha-Beta Pruning

- Pruning does not affect the final result.
- Entire subtrees can be pruned, not just leaves.
- Good move ordering improves the effectiveness of pruning.
- With perfect ordering, the time complexity is O(b^(m/2)):
  - Effective branching factor of sqrt(b).
  - Consequence: alpha-beta pruning can look twice as deep as minimax in the same amount of time.

Is this practical?

- Minimax and alpha-beta pruning still have exponential complexity.
- They may be impractical within a reasonable amount of time.
- SHANNON (1950):
  - Terminate the search at a lower depth.
  - Apply a heuristic evaluation function EVAL instead of the UTILITY function.

Cutting off search

- Change:
    if TERMINAL-TEST(state) then return UTILITY(state)
  into:
    if CUTOFF-TEST(state, depth) then return EVAL(state)
- This introduces a fixed depth limit.
  - The limit is selected so that the time used will not exceed what the rules of the game allow.
- When the cutoff occurs, the evaluation function is applied.

Heuristic EVAL

- Idea: produce an estimate of the expected utility of the game from a given position.
- Performance depends on the quality of EVAL.
- Requirements:
  - EVAL should order terminal nodes in the same way as UTILITY.
  - It should be fast to compute.
  - For non-terminal states, EVAL should be strongly correlated with the actual chance of winning.
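The CUTOFF-TEST change above can be sketched as a depth-limited minimax. The tree encoding (nested lists, integer leaves) and the placeholder EVAL, which just averages a subtree's leaves, are illustrative assumptions, not a method from the slides.

```python
# Depth-limited minimax: TERMINAL-TEST/UTILITY replaced by
# CUTOFF-TEST/EVAL, as described on the slide.
DEPTH_LIMIT = 1   # fixed depth limit, chosen to respect time constraints

def cutoff_test(node, depth):
    """Cut off at true terminals or once the depth limit is reached."""
    return isinstance(node, int) or depth >= DEPTH_LIMIT

def eval_fn(node):
    """Placeholder heuristic (assumed): average of the subtree's leaves.
    A real EVAL would estimate the expected utility of the position."""
    if isinstance(node, int):
        return node
    vals = [eval_fn(c) for c in node]
    return sum(vals) / len(vals)

def max_value(node, depth):
    if cutoff_test(node, depth):
        return eval_fn(node)
    return max(min_value(c, depth + 1) for c in node)

def min_value(node, depth):
    if cutoff_test(node, depth):
        return eval_fn(node)
    return min(max_value(c, depth + 1) for c in node)

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
approx = max_value(tree, 0)   # heuristic estimate, not the exact value
```

With DEPTH_LIMIT = 1 the search returns about 7.67 on this tree, while the full minimax value is 3: a crude EVAL trades accuracy for speed, which is why the correlation requirement above matters.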

Heuristic EVAL example

A weighted linear evaluation function:

  Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)

The addition assumes the features are independent.

In chess, for example:

  Eval(s) = w1 · material + w2 · mobility + w3 · king safety + w4 · center control + …

How good are computers?

- Let's look at the state-of-the-art computer programs that play games such as chess, checkers, Othello, and Go.
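The weighted linear form Eval(s) = Σ wi fi(s) above is trivial to sketch. The chess-like feature names, feature values, and weights below are invented numbers purely for illustration, not taken from any real chess program.

```python
def linear_eval(weights, features):
    """Weighted sum of feature values (assumes feature independence)."""
    return sum(w * f for w, f in zip(weights, features))

# Hypothetical feature extractors for a state s: material balance,
# mobility, king safety, center control. All values are made up.
features = [2.0, 5.0, -1.0, 3.0]     # f1(s)..f4(s)
weights  = [9.0, 0.1, 0.5, 0.3]      # w1..w4
score = linear_eval(weights, features)
```

Because the weights are fixed, the hard part in practice is not this sum but choosing features that actually correlate with winning, and tuning the weights (by hand or by learning).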

Checkers

- Chinook: the first program to win the world champion title in a competition against a human (1994).

Chinook

- Components of Chinook:
  - Search (a variant of alpha-beta). The search space has 10^20 states.
  - An evaluation function.
  - An endgame database (for all states with 4 vs. 4 pieces; roughly 40 billion positions).
  - An opening book: a database of opening moves.
- Chinook can determine the final result of the game within the first 10 moves.
- The authors have since shown that several openings lead to a draw.

Jonathan Schaeffer, Neil Burch, Yngvi Bjornsson, Akihiro Kishimoto, Martin Muller, Rob Lake, Paul Lu and Steve Sutphen. "Checkers is Solved," Science, 2007.
http://www.cs.ualberta.ca/~chinook/publications/solving_checkers.html

Chess

- 1997: Deep Blue wins a 6-game match against Garry Kasparov.
  - It searches using iterative-deepening alpha-beta; its evaluation function has over 8000 features; it uses an opening book of 4000 positions and an endgame database.
- FRITZ plays world champion Vladimir Kramnik and wins the 6-game match.

Othello

- The best Othello computer programs can easily defeat the best humans (e.g., Logistello, 1997).

Go

- Go: humans are still much better!

Games that include chance

(backgammon figure not reproduced)

- Possible moves: (5-10, 5-11), (5-11, 19-24), (5-10, 10-16), and (5-11, 11-16).

Games that include chance (continued)

- The game tree now contains chance nodes.
- Possible moves: (5-10, 5-11), (5-11, 19-24), (5-10, 10-16), and (5-11, 11-16).
- Doubles [1,1], …, [6,6] each have probability 1/36; all other rolls have probability 1/18.
- We cannot calculate a definite minimax value, only an expected value.

Expected minimax value

EXPECTIMINIMAX(s) =
  UTILITY(s)                                    if s is terminal
  max_{a} EXPECTIMINIMAX(RESULT(s, a))          if PLAYER(s) = MAX
  min_{a} EXPECTIMINIMAX(RESULT(s, a))          if PLAYER(s) = MIN
  Σ_{r} P(r) · EXPECTIMINIMAX(RESULT(s, r))     if PLAYER(s) = CHANCE

where r is a chance event (e.g., a roll of the dice).

These values can be computed recursively, in a similar way to the MINIMAX algorithm.
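The four cases above can be sketched as one recursive function. The tiny example tree (a sure payoff versus a fair coin flip) is an illustrative assumption, much simpler than the dice distribution on the slide, but it exercises the chance-node case.

```python
def expectiminimax(node):
    """Node is a (kind, payload) pair: a terminal utility, a MAX or MIN
    node over children, or a chance node over (probability, child) pairs."""
    kind, payload = node
    if kind == "leaf":
        return payload
    if kind == "max":
        return max(expectiminimax(c) for c in payload)
    if kind == "min":
        return min(expectiminimax(c) for c in payload)
    if kind == "chance":
        # expected value: weight each outcome by its probability
        return sum(p * expectiminimax(c) for p, c in payload)
    raise ValueError(f"unknown node kind: {kind}")

# MAX chooses between a sure 3 and a fair coin flip over 0 and 10.
tree = ("max", [
    ("leaf", 3),
    ("chance", [(0.5, ("leaf", 0)), (0.5, ("leaf", 10))]),
])
value = expectiminimax(tree)   # max(3, 0.5*0 + 0.5*10) = 5.0
```

For backgammon, the chance node's payload would list the 21 distinct rolls with the 1/36 and 1/18 probabilities noted above.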

TD-Gammon (Tesauro, 1994)

(figure not reproduced: White's turn, with a roll of 4-4)

- A world-class program based on a combination of reinforcement learning, neural networks, and alpha-beta pruning to 3 plies.
- Move analyses by TD-Gammon have led to some changes in accepted strategies.

http://www.research.ibm.com/massive/tdl.html

Summary

- Games are fun!
- They illustrate several important points about AI:
  - Perfection is (usually) unattainable -> approximation.
  - Uncertainty constrains the assignment of values to states.
