Game Playing

1) Computer game playing has achieved world-champion level in chess, checkers, and other games. Deep Blue defeated Kasparov in chess in 1997. Chinook is the world checkers champion. Computers are decent but not champions at Go or bridge yet. 2) Games can be modeled as game trees where the root node represents the current board state, branches represent legal moves, and leaf nodes are evaluated by a static evaluator function. Minimax search traverses the tree to select the best move for the current player. 3) Alpha-beta pruning improves minimax search by avoiding evaluating subtrees that cannot affect the result, allowing deeper search with the same resources. It guarantees finding the

CMSC 471

Game Playing
Chapter 6
Adapted from slides by Tim Finin and Marie desJardins.
Some material adopted from notes by Charles R. Dyer, University of
Wisconsin-Madison
Outline
• Game playing
– State of the art and resources
– Framework
• Game trees
– Minimax
– Alpha-beta pruning
– Adding randomness
Why study games?
• Clear criteria for success
• Offer an opportunity to study problems involving
{hostile, adversarial, competing} agents.
• Historical reasons
• Fun
• Interesting, hard problems which require minimal
“initial structure”
• Games often define very large search spaces
– chess: 35^100 nodes in search tree, 10^40 legal states
State of the art
• How good are computer game players?
– Chess:
• Deep Blue beat Garry Kasparov in 1997
• Garry Kasparov vs. Deep Junior (Feb 2003): tie!
• Kasparov vs. X3D Fritz (November 2003): tie!
http://www.cnn.com/2003/TECH/fun.games/11/19/kasparov.chess.ap/
– Checkers: Chinook (an AI program with a very large endgame database)
is(?) the world champion.
– Go: Computer players are decent, at best
– Bridge: “Expert-level” computer players exist (but no world champions
yet!)
• Good places to learn more:
– http://www.cs.ualberta.ca/~games/
– http://www.cs.unimass.nl/icga
Chinook
• Chinook is the World Man-Machine Checkers
Champion, developed by researchers at the University
of Alberta.
• It earned this title by competing in human tournaments,
winning the right to play for the (human) world
championship, and eventually defeating the best players
in the world.
• Visit http://www.cs.ualberta.ca/~chinook/ to play a
version of Chinook over the Internet.
• The developers have fully analyzed the game of
checkers and have the complete game tree for it.
– Perfect play on both sides results in a tie.
• “One Jump Ahead: Challenging Human Supremacy in
Checkers” Jonathan Schaeffer, University of Alberta
(496 pages, Springer. $34.95, 1998).
Ratings of human and computer chess champions
Typical case
• 2-person game
• Players alternate moves
• Zero-sum: one player’s loss is the other’s gain
• Perfect information: both players have access to complete
information about the state of the game. No information is
hidden from either player.
• No chance (e.g., using dice) involved
• Examples: Tic-Tac-Toe, Checkers, Chess, Go, Nim, Othello
• Not: Bridge, Solitaire, Backgammon, ...
How to play a game
• A way to play such a game is to:
– Consider all the legal moves you can make
– Compute the new position resulting from each move
– Evaluate each resulting position and determine which is best
– Make that move
– Wait for your opponent to move and repeat
• Key problems are:
– Representing the “board”
– Generating all legal next boards
– Evaluating a position
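The loop above can be sketched in a few lines of Python. This is a minimal one-move lookahead, not a full game player. The names `choose_move`, `toy_moves`, `toy_apply` and `toy_evaluate` are hypothetical, and the "game" is a stand-in invented for illustration: the board is a single integer, a move adds 1, 2 or 3 to it, and positions closer to 10 are rated higher.

```python
from typing import Callable, Iterable, TypeVar

State = TypeVar("State")
Move = TypeVar("Move")


def choose_move(
    state: State,
    legal_moves: Callable[[State], Iterable[Move]],
    apply_move: Callable[[State, Move], State],
    evaluate: Callable[[State], float],
) -> Move:
    """Return the legal move whose resulting position scores highest."""
    return max(legal_moves(state), key=lambda m: evaluate(apply_move(state, m)))


# Toy stand-in for a real game (hypothetical, for illustration only).
def toy_moves(n: int) -> list:
    return [1, 2, 3]          # generate all legal moves


def toy_apply(n: int, step: int) -> int:
    return n + step           # compute the new position


def toy_evaluate(n: int) -> float:
    return -abs(10 - n)       # rate a position: closer to 10 is better


best = choose_move(8, toy_moves, toy_apply, toy_evaluate)
print(best)
```

The three callables correspond to the three key problems listed: representing the board, generating legal next boards, and evaluating a position.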
Evaluation function
• Evaluation function or static evaluator is used to evaluate
the “goodness” of a game position.
– Contrast with heuristic search, where the evaluation function was a
non-negative estimate of the cost of a path from the start node to a goal
that passes through the given node
• The zero-sum assumption allows us to use a single
evaluation function to describe the goodness of a board
with respect to both players.
– f(n) >> 0: position n good for me and bad for you
– f(n) << 0: position n bad for me and good for you
– f(n) near 0: position n is a neutral position
– f(n) = +infinity: win for me
– f(n) = -infinity: win for you
Evaluation function examples
• Example of an evaluation function for Tic-Tac-Toe:
f(n) = [# of 3-lengths open for me] - [# of 3-lengths open for you]
where a 3-length is a complete row, column, or diagonal
• Alan Turing’s function for chess
– f(n) = w(n)/b(n) where w(n) = sum of the point value of white’s pieces and b(n) =
sum of black’s
• Most evaluation functions are specified as a weighted sum of position
features:
f(n) = w1*feat1(n) + w2*feat2(n) + ... + wk*featk(n)
• Example features for chess are piece count, piece placement, squares
controlled, etc.
• Deep Blue had over 8000 features in its evaluation function
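As a rough illustration of the Tic-Tac-Toe evaluation function, here is a small Python sketch. It assumes that a 3-length is "open" for a player when it contains none of the opponent's pieces. The board encoding (a list of 9 cells in row-major order) and the names `LINES`, `open_lines` and `evaluate` are choices made for this example only.

```python
# Board: a list of 9 cells, each "X", "O", or " " (row-major order).
LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]


def open_lines(board, player):
    """Count the 3-lengths that contain no piece of the other player."""
    opponent = "O" if player == "X" else "X"
    return sum(all(board[i] != opponent for i in line) for line in LINES)


def evaluate(board, me="X"):
    """f(n) = [# of 3-lengths open for me] - [# of 3-lengths open for you]."""
    you = "O" if me == "X" else "X"
    return open_lines(board, me) - open_lines(board, you)


board = list("    X    ")  # X in the centre, every other cell empty
score = evaluate(board)
print(score)
```

With X alone in the centre, all 8 lines remain open for X while only the 4 lines that avoid the centre remain open for O, so the position is rated in X's favor.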
Game trees
• Problem spaces for typical games are
represented as trees
• Root node represents the current
board configuration; player must decide
the best single move to make next
• Static evaluator function rates a board
position: f(board) = real number, with
f > 0 good for “white” (me), f < 0 good for “black” (you)
• Arcs represent the possible legal moves for a player
• If it is my turn to move, then the root is labeled a "MAX" node;
otherwise it is labeled a "MIN" node, indicating my opponent's turn.
• Each level of the tree has nodes that are all MAX or all MIN; nodes at
level i are of the opposite kind from those at level i+1
Minimax procedure
• Create start node as a MAX node with current board configuration
• Expand nodes down to some depth (a.k.a. ply) of lookahead in the
game
• Apply the evaluation function at each of the leaf nodes
• “Back up” values for each of the non-leaf nodes until a value is
computed for the root node
– At MIN nodes, the backed-up value is the minimum of the values associated
with its children.
– At MAX nodes, the backed-up value is the maximum of the values associated
with its children.
• Pick the operator associated with the child node whose backed-up
value determined the value at the root
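The procedure can be sketched in Python as follows. This is a minimal version under simplifying assumptions: the tree is already expanded and given as nested lists, a leaf is just its static evaluator value, and the names `minimax` and `best_move` are chosen for this example. The leaf values 2, 7, 1, 8 are the ones used in the accompanying example.

```python
def minimax(node, maximizing=True):
    """Back up values from the leaves; a leaf is a number, an internal node a list."""
    if not isinstance(node, list):
        return node  # static evaluator value at a leaf
    values = [minimax(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)


def best_move(root):
    """Index of the root's child whose backed-up value determines the root value."""
    values = [minimax(child, maximizing=False) for child in root]
    return values.index(max(values))


tree = [[2, 7], [1, 8]]  # MAX root, two MIN children, four leaves
print(minimax(tree), best_move(tree))
```

The left MIN node backs up min(2, 7) = 2 and the right one min(1, 8) = 1, so the root takes max(2, 1) = 2 and the first move is selected.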
Minimax Algorithm
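[Figure: a two-ply minimax tree. The MAX root has backed-up value 2. Its two
MIN children have backed-up values 2 and 1, computed from the static
evaluator values 2, 7 and 1, 8 at the leaves. The move leading to the MIN
node with value 2 is the move selected by minimax.]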
Partial Game Tree for Tic-Tac-Toe

• f(n) = +1 if the position is a win for X.
• f(n) = -1 if the position is a win for O.
• f(n) = 0 if the position is a draw.
Minimax Tree
[Figure: minimax tree. Legend: MAX node, MIN node, f value, value computed
by minimax.]
Alpha-beta pruning
• We can improve on the performance of the minimax
algorithm through alpha-beta pruning
• Basic idea: “If you have an idea that is surely bad, don't
take the time to see how truly awful it is.” -- Pat Winston

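• We don’t need to compute the value at the node marked “?”.
• No matter what it is, it can’t affect the value of the root node.
[Figure: MAX root with value >= 2. Its left MIN child has value = 2 (leaves
2, 7); its right MIN child has value <= 1 (leaves 1, ?).]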
Alpha-beta pruning
• Traverse the search tree in depth-first order
• At each MAX node n, alpha(n) = maximum value found so
far
• At each MIN node n, beta(n) = minimum value found so far
– Note: The alpha values start at -infinity and only increase, while beta
values start at +infinity and only decrease.
• Beta cutoff: Given a MAX node n, cut off the search below n
(i.e., don’t generate or examine any more of n’s children) if
alpha(n) >= beta(i) for some MIN node ancestor i of n.
• Alpha cutoff: stop searching below MIN node n if beta(n) <=
alpha(i) for some MAX node ancestor i of n.
Alpha-beta example
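[Figure: MAX root with value 3 over three MIN nodes. The first MIN node has
leaves 3, 12, 8 and value 3. The second MIN node is pruned after its first
leaf, 2. The third MIN node is pruned after its leaves 14 and 1.]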
Alpha-beta algorithm
function MAX-VALUE (state, α, β)
  ;; α = best MAX so far; β = best MIN
  if TERMINAL-TEST (state) then return UTILITY(state)
  v := -∞
  for each s in SUCCESSORS (state) do
    v := MAX (v, MIN-VALUE (s, α, β))
    if v >= β then return v
    α := MAX (α, v)
  end
  return v

function MIN-VALUE (state, α, β)
  if TERMINAL-TEST (state) then return UTILITY(state)
  v := ∞
  for each s in SUCCESSORS (state) do
    v := MIN (v, MAX-VALUE (s, α, β))
    if v <= α then return v
    β := MIN (β, v)
  end
  return v
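A runnable Python sketch of the same pseudocode is given below. It folds MAX-VALUE and MIN-VALUE into one function and works on a tree given as nested lists. The `visited` argument is an addition for this example that records which leaves actually get evaluated. The leaf values 4, 6 and 9 are fillers assumed here for positions that end up pruned; the remaining values follow the alpha-beta example.

```python
import math


def alphabeta(node, alpha=-math.inf, beta=math.inf, maximizing=True, visited=None):
    """Return the minimax value of node, skipping subtrees that cannot matter.

    A leaf is a number; an internal node is a list of children. If a list is
    passed as `visited`, every leaf that gets evaluated is appended to it.
    """
    if not isinstance(node, list):
        if visited is not None:
            visited.append(node)
        return node
    if maximizing:
        v = -math.inf
        for child in node:
            v = max(v, alphabeta(child, alpha, beta, False, visited))
            if v >= beta:       # beta cutoff
                return v
            alpha = max(alpha, v)
        return v
    v = math.inf
    for child in node:
        v = min(v, alphabeta(child, alpha, beta, True, visited))
        if v <= alpha:          # alpha cutoff
            return v
        beta = min(beta, v)
    return v


tree = [[3, 12, 8], [2, 4, 6], [14, 1, 9]]  # 4, 6 and 9 are assumed fillers
seen = []
value = alphabeta(tree, visited=seen)
print(value, seen)
```

On this tree the root value is 3 and only six of the nine leaves are evaluated: the second MIN node is cut off after 2, and the third after 14 and 1.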
Effectiveness of alpha-beta
• Alpha-beta is guaranteed to compute the same value for the root
node as computed by minimax, with less or equal computation
• Worst case: no pruning, examining b^d leaf nodes, where each
node has b children and a d-ply search is performed
• Best case: examine only about 2b^(d/2) leaf nodes.
– Result is you can search twice as deep as minimax!
• Best case is when each player’s best move is the first alternative
generated
• In Deep Blue, they found empirically that alpha-beta pruning
meant that the average branching factor at each node was about 6
instead of about 35!
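To get a feel for the numbers, the short Python sketch below compares the worst-case leaf count b^d with the best-case count from Knuth and Moore's analysis, b^ceil(d/2) + b^floor(d/2) - 1, which is roughly 2b^(d/2). The function names are chosen for this example, and b = 35 is the chess branching factor quoted above.

```python
def worst_case_leaves(b, d):
    """Leaves examined with no pruning (plain minimax)."""
    return b ** d


def best_case_leaves(b, d):
    """Leaves examined with perfect move ordering (Knuth and Moore)."""
    return b ** ((d + 1) // 2) + b ** (d // 2) - 1


b = 35
for d in (2, 4, 8):
    print(d, worst_case_leaves(b, d), best_case_leaves(b, d))
```

With b = 35, a perfectly ordered 8-ply search examines 3,001,249 leaves, the same order of magnitude as the 1,500,625 leaves of an unpruned 4-ply search, which is the sense in which alpha-beta can search about twice as deep.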
Games of chance
• Backgammon is a two-player game with uncertainty.
• Players roll dice to determine what moves to make.
• White has just rolled 5 and 6 and has four legal moves:
– 5-10, 5-11
– 5-11, 19-24
– 5-10, 10-16
– 5-11, 11-16
• Such games are good for exploring decision making in
adversarial problems involving skill and luck.
Game trees with chance nodes
• Chance nodes (shown as circles) represent random events
• For a random event with N outcomes, each chance node has
N distinct children; a probability is associated with each
• (For 2 dice, there are 21 distinct outcomes)
• Use minimax to compute values for MAX and MIN nodes
• Use expected values for chance nodes
• For chance nodes over a max node, as in C:
expectimax(C) = Σ_i P(d_i) * maxvalue(i)
• For chance nodes over a min node:
expectimin(C) = Σ_i P(d_i) * minvalue(i)
[Figure: game tree with alternating levels labeled Min, Rolls, Max, Rolls.]
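The rules above can be sketched in Python as follows. The tree encoding is an assumption made for this example: a leaf is a number, and an internal node is a tuple `(kind, children)` where `kind` is `"max"`, `"min"` or `"chance"`, with chance children given as `(probability, subtree)` pairs. The helper `two_dice_outcomes` enumerates the 21 distinct rolls of two dice. The small example tree uses made-up values and a fair coin in place of dice to keep it short.

```python
from fractions import Fraction
from itertools import combinations_with_replacement


def two_dice_outcomes():
    """The 21 distinct rolls of two dice, mapped to their probabilities."""
    return {
        (a, b): Fraction(1 if a == b else 2, 36)  # doubles are half as likely
        for a, b in combinations_with_replacement(range(1, 7), 2)
    }


def value(node):
    """Minimax at MAX/MIN nodes, expected value at chance nodes."""
    if not isinstance(node, tuple):
        return node                       # leaf: static evaluator value
    kind, children = node
    if kind == "max":
        return max(value(c) for c in children)
    if kind == "min":
        return min(value(c) for c in children)
    # chance node: children are (probability, subtree) pairs
    return sum(p * value(c) for p, c in children)


half = Fraction(1, 2)
tree = ("max", [
    ("chance", [(half, ("min", [3, 5])), (half, ("min", [1, 9]))]),
    ("chance", [(half, ("min", [2, 2])), (half, ("min", [4, 6]))]),
])
print(len(two_dice_outcomes()), value(tree))
```

Here the first chance node is worth 1/2 * 3 + 1/2 * 1 = 2 and the second 1/2 * 2 + 1/2 * 4 = 3, so MAX prefers the second move.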

Meaning of the evaluation function

[Figure: two versions of the same tree, each with 2 outcomes with
probabilities {.9, .1}. A1 is the best move in one version and A2 is the
best move in the other.]
• Dealing with probabilities and expected values means we have to be careful
about the “meaning” of values returned by the static evaluator.
• Note that a “relative-order preserving” change of the values would not change
the decision of minimax, but could change the decision with chance nodes.
• Positive linear transformations are OK
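A small numeric check makes the point concrete. The sketch below uses illustrative leaf values that are assumptions for this example: each move leads to a chance node with two outcomes of probability 0.9 and 0.1. It compares the chosen move under the original values, under an order-preserving but non-linear rescaling, and under a positive linear rescaling.

```python
def expected(outcomes, probs=(0.9, 0.1)):
    """Expected value of a chance node with the given outcome values."""
    return sum(p * v for p, v in zip(probs, outcomes))


def best_action(actions):
    """Name of the move whose chance node has the highest expected value."""
    return max(actions, key=lambda name: expected(actions[name]))


# Illustrative leaf values (assumed for this example).
original = {"A1": (2, 3), "A2": (1, 4)}

# Order-preserving but non-linear: 1 < 2 < 3 < 4 becomes 1 < 20 < 30 < 400.
rescale = {1: 1, 2: 20, 3: 30, 4: 400}
transformed = {a: tuple(rescale[v] for v in vs) for a, vs in original.items()}

# Positive linear transformation: v -> 10 * v + 5.
linear = {a: tuple(10 * v + 5 for v in vs) for a, vs in original.items()}

print(best_action(original), best_action(transformed), best_action(linear))
```

Under the original values A1 has the higher expected value (2.1 against 1.3). After the non-linear rescaling A2 wins (40.9 against 21), even though the ordering of the leaf values is unchanged, while the linear rescaling leaves A1 as the choice.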
