0% found this document useful (0 votes)

8 views45 pages

Week 13

The document discusses the significance of game playing in AI research, highlighting its role as a reasoning problem and a means for comparison with human performance. It details game characteristics, the minimax algorithm, and its properties, including the alpha-beta pruning technique to optimize search efficiency. Additionally, it touches on deterministic and chance-based games, emphasizing the complexity and practical applications of AI in gaming.

Uploaded by

S Lekshmi Nair

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views45 pages

Week 13

Uploaded by

S Lekshmi Nair

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 45

` Introduction to

Artificial
Intelligence
Dr Lekshmi S Nair
Live 13
Game Playing

Why do AI researchers study game playing?

1. It’s a good reasoning problem, formal and nontrivial.

2. Direct comparison with humans and other computer

programs is easy.

2
What Kinds of Games?
Mainly games of strategy with the following
characteristics:

1. Sequence of moves to play

2. Rules that specify possible moves
3. Rules that specify a payment for each
move
4. Objective is to maximize your payment
3
Games vs. Search Problems

• Unpredictable opponent  specifying a

move for every possible opponent reply

• Time limits  unlikely to find goal, must

approximate

4
Two-Player Game
Opponent’s Move

Generate New Position

Game yes
Over?
no
Generate Successors

Evaluate Successors

Move to Highest-Valued Successor

no Game yes
Over?
5
Game Tree (2-player,
Deterministic, Turns)
computer’s
turn

opponent’s
turn

computer’s The computer is Max.

turn
The opponent is Min.
opponent’s
turn

leaf nodes At the leaf nodes, the

are evaluated utility function
is employed. Big value
6
means good, small is bad.
Mini-Max Terminology
• utility function: the function applied to leaf nodes
• backed-up value
– of a max-position: the value of its largest successor
– of a min-position: the value of its smallest successor
• minimax procedure: search down several levels;
at the bottom level apply the utility function,
back-up values all the way up to the root node,
and that node selects the move.

7
Minimax
• Perfect play for deterministic games
• Idea: choose move to position with highest minimax
value
= best achievable payoff against best play
• E.g., 2-ply game:

8
Example 1
Use the Minimax algorithm to compute the minimax value at each node for the game
tree below

9
Example 1
Use the Minimax algorithm to compute the minimax value at each node for the game
tree below

10
Minimax Strategy
• Why do we take the min value every other
level of the tree?

• These nodes represent the opponent’s

choice of move.

• The computer assumes that the human

will choose that move that is of least value
to the computer.
11
Minimax algorithm

12
Tic Tac Toe
• Let p be a position in the game
• Define the utility function f(p) by
– f(p) =
• largest positive number if p is a win for computer
• smallest negative number if p is a win for opponent
• RCDC – RCDO
– where RCDC is number of rows, columns and
diagonals in which computer could still win
– and RCDO is number of rows, columns and diagonals
in which opponent could still win.

13
Sample Evaluations
• X = Computer; O = Opponent

O O O X
X X X

X O X O
rows rows
cols cols
diags diags
14
Minimax is done depth-first
max

min

max

leaf
2 5 1

15
Properties of Minimax
• Complete? Yes (if tree is finite)
• Optimal? Yes (against an optimal opponent)
• Time complexity? O(bm)
• Space complexity? O(bm) (depth-first exploration)

• For chess, b ≈ 35, m ≈100 for "reasonable" games

 exact solution completely infeasible

Need to speed it up.

16
Alpha-Beta Procedure
• The alpha-beta procedure can speed up a
depth-first minimax search.
• Alpha: a lower bound on the value that a
max node may ultimately be assigned
v>

• Beta: an upper bound on the value that a

minimizing node may ultimately be
assigned
v<
17
α-β pruning example

18
α-β pruning example
=3

alpha cutoff

19
α-β pruning example

20
α-β pruning example

21
α-β pruning example

22
Alpha Cutoff
>3 =3

8 10

What happens here? Is there an alpha cutoff?

23
Beta Cutoff

=4 <4

4 >8

8  cutoff

24
Alpha-Beta Pruning
max

min

max

eval 5 2 10 11 1 2 2 8 6 5 12 4 3 25 2

25
Properties of α-β
• Pruning does not affect final result. This means that it
gets the exact same result as does full minimax.

• Good move ordering improves effectiveness of pruning

• With "perfect ordering," time complexity = O(bm/2)

 doubles depth of search

• A simple example of the value of reasoning about which

computations are relevant (a form of metareasoning)

26
The α-β algorithm

cutoff

27
The α-β algorithm

cutoff

28
When do we get alpha cutoffs?

100 ...

< 100 < 100

29
Shallow Search Techniques
1. limited search for a few levels

2. reorder the level-1 sucessors

3. proceed with - minimax search

30
Additional Refinements
• Waiting for Quiescence: continue the search
until no drastic change occurs from one level to
the next.

• Secondary Search: after choosing a move,

search a few more levels beneath it to be sure it
still looks good.

• Book Moves: for some parts of the game

(especially initial and end moves), keep a
catalog of best moves to make.
31
Evaluation functions
• For chess/checkers, typically linear weighted sum of
features
Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)

e.g., w1 = 9 with
f1(s) = (number of white queens) – (number of black
queens), etc.

32
Example: Samuel’s Checker-
Playing Program
• It uses a linear evaluation function
f(n) = a1x1(n) + a2x2(n) + ... + amxm(n)

For example: f = 6K + 4M + U
– K = King Advantage
– M = Man Advantage
– U = Undenied Mobility Advantage (number of
moves that Max has that Min can’t jump after)
33
Samuel’s Checker Player
• In learning mode

– Computer acts as 2 players: A and B

– A adjusts its coefficients after every move
– B uses the static utility function
– If A wins, its function is given to B

34
Samuel’s Checker Player
• How does A change its function?
1. Coefficent replacement
(node ) = backed-up value(node) – initial value(node)
if > 0 then terms that contributed
positively are given more weight and terms that
contributed negatively get less weight
if < 0 then terms that contributed
negatively are given more weight and terms that
contributed positively get less weight

35
Samuel’s Checker Player
• How does A change its function?
2. Term Replacement
38 terms altogether
16 used in the utility function at any one time

Terms that consistently correlate low with the

function value are removed and added to the end of
the term queue.

They are replaced by terms from the front of the

term queue.

36
Kalah
P’s holes

6 6 6 6 6 6
KP Kp
0 counterclockwise 0

6 6 6 6 6 6

p’s holes
To move, pick up all the stones in one of your holes, and
put one stone in each hole, starting at the next one,
including your Kalah and skipping the opponent’s Kalah. 37
Kalah
• If the last stone lands in your Kalah, you get
another turn.
• If the last stone lands in your empty hole, take
all the stones from your opponent’s hole directly
across from it and put them in your Kalah.
• If all of your holes become empty, the opponent
keeps the rest of the stones.
• The winner is the player who has the most
stones in his Kalah at the end of the game.

38
Cutting off Search
MinimaxCutoff is identical to MinimaxValue except
1. Terminal? is replaced by Cutoff?
2. Utility is replaced by Eval

Does it work in practice?

bm = 106, b=35  m=4
4-ply lookahead is a hopeless chess player!
– 4-ply ≈ human novice
– 8-ply ≈ typical PC, human master
– 12-ply ≈ Deep Blue, Kasparov

39
Deterministic Games in Practice
• Checkers: Chinook ended 40-year-reign of human world champion
Marion Tinsley in 1994. Used a precomputed endgame database
defining perfect play for all positions involving 8 or fewer pieces on
the board, a total of 444 billion positions.

• Chess: Deep Blue defeated human world champion Garry Kasparov

in a six-game match in 1997. Deep Blue searches 200 million
positions per second, uses very sophisticated evaluation, and
undisclosed methods for extending some lines of search up to 40
ply.

• Othello: human champions refuse to compete against computers,

who are too good.

• Go: human champions refuse to compete against computers, who

are too bad. In Go, b > 300, so most programs use pattern
knowledge bases to suggest plausible moves.

40
Games of Chance
• What about games that involve chance,
such as
– rolling dice
– picking a card
• Use three kinds of nodes:
– max nodes    min
– min nodes chance
– chance nodes max

41
Games of Chance

c chance node with

max children

d1 di dk

S(c,di)

expectimax(c) = ∑P(di) max(backed-up-value(s))

i s in S(c,di)

expectimin(c’) = ∑P(di) min(backed-up-value(s))

i s in S(c,di) 42
Example Tree with Chance
max

chance
.4 .6
min   1.2
chance
.4 .6 .4 .6
max

leaf 3 5 1 4 1 2 4 5

43
Complexity
• Instead of O(bm), it is O(bmnm) where n is
the number of chance outcomes.

• Since the complexity is higher (both time

and space), we cannot search as deeply.

• Pruning algorithms may be applied.

44
Summary
• Games are fun to work on!

• They illustrate several important points about AI.

• Perfection is unattainable  must approximate.

• Game playing programs have shown the world

what AI can do.

Adversarial Search
No ratings yet
Adversarial Search
49 pages
Uninformed and Informed Search Algorithms
No ratings yet
Uninformed and Informed Search Algorithms
33 pages
4 Adversel Search Game Tree
No ratings yet
4 Adversel Search Game Tree
51 pages
18CS753 Ai Module 4
No ratings yet
18CS753 Ai Module 4
43 pages
The Shikshak Tyit Sem 5 Artificial Intelligence Question Papers N18 A19 N19 N22 A23
100% (1)
The Shikshak Tyit Sem 5 Artificial Intelligence Question Papers N18 A19 N19 N22 A23
9 pages
Optimal Decision in Games
No ratings yet
Optimal Decision in Games
68 pages
Adversial Search
No ratings yet
Adversial Search
39 pages
AI Unit 2
No ratings yet
AI Unit 2
132 pages
Lecture05 AdversarialSearch
No ratings yet
Lecture05 AdversarialSearch
51 pages
All in One
No ratings yet
All in One
164 pages
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
No ratings yet
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
71 pages
Adversarial Search
No ratings yet
Adversarial Search
37 pages
Lecture11 AdversarialSearch
No ratings yet
Lecture11 AdversarialSearch
74 pages
AAI Lecture 7 SP 25
No ratings yet
AAI Lecture 7 SP 25
51 pages
Adversial Search
No ratings yet
Adversial Search
101 pages
l3 Part2
No ratings yet
l3 Part2
65 pages
2025 Lecture03 AdversarialSearch
No ratings yet
2025 Lecture03 AdversarialSearch
51 pages
4 UNIT IV Part 1
No ratings yet
4 UNIT IV Part 1
43 pages
Chapter. 06 - Adversarial Search and Games - No Embedded Videos
No ratings yet
Chapter. 06 - Adversarial Search and Games - No Embedded Videos
51 pages
Lecture 6 - Minmax Alpha Beta
No ratings yet
Lecture 6 - Minmax Alpha Beta
41 pages
Unit 2 Adversial Search
No ratings yet
Unit 2 Adversial Search
36 pages
Part4.Game Playing
No ratings yet
Part4.Game Playing
35 pages
Game Playing. Updated
No ratings yet
Game Playing. Updated
44 pages
Ch5 Game
No ratings yet
Ch5 Game
79 pages
3 GamePlaying - Minimax
No ratings yet
3 GamePlaying - Minimax
75 pages
ITSC6121 Lecture 4 - Game Trees I
No ratings yet
ITSC6121 Lecture 4 - Game Trees I
34 pages
ITSC6121 Lecture 4 - Game Trees I
No ratings yet
ITSC6121 Lecture 4 - Game Trees I
38 pages
Games
No ratings yet
Games
41 pages
Lecture 7
No ratings yet
Lecture 7
62 pages
Lecture 09 10+Game+Playing+++MinMax-AlphaBeta
No ratings yet
Lecture 09 10+Game+Playing+++MinMax-AlphaBeta
54 pages
AI Unit-3
No ratings yet
AI Unit-3
109 pages
Game Playing
No ratings yet
Game Playing
60 pages
8.AI17game Final
No ratings yet
8.AI17game Final
30 pages
Introduction To Game Play
No ratings yet
Introduction To Game Play
90 pages
Chapter3 - Search4
No ratings yet
Chapter3 - Search4
37 pages
Adversarial Search
No ratings yet
Adversarial Search
42 pages
1 GamePlaying
No ratings yet
1 GamePlaying
30 pages
6 Game
No ratings yet
6 Game
42 pages
04 Games PDF
No ratings yet
04 Games PDF
77 pages
Game Playing
No ratings yet
Game Playing
32 pages
Artificial Intelligence For Gam Es: Shashwat Shukla
No ratings yet
Artificial Intelligence For Gam Es: Shashwat Shukla
56 pages
1 1 4GamePlaying
No ratings yet
1 1 4GamePlaying
23 pages
18cs753 Ai Module 4
No ratings yet
18cs753 Ai Module 4
44 pages
Game Playing
No ratings yet
Game Playing
48 pages
AI Lec07 Adversarial Search
No ratings yet
AI Lec07 Adversarial Search
29 pages
Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
58 pages
Adversarial Search
No ratings yet
Adversarial Search
20 pages
Oradea: Bucharest Arad Craiova
No ratings yet
Oradea: Bucharest Arad Craiova
53 pages
Basic 05 Games
No ratings yet
Basic 05 Games
74 pages
SET394 - AI - Lecture 06 - Adversarial Search
No ratings yet
SET394 - AI - Lecture 06 - Adversarial Search
27 pages
Game Playing
No ratings yet
Game Playing
24 pages
MCS 3201-Intelligent Systems: Gihan Seneviratne Gps@ucsc - LK
No ratings yet
MCS 3201-Intelligent Systems: Gihan Seneviratne Gps@ucsc - LK
88 pages
Why Do AI Researchers Study Game Playing?
No ratings yet
Why Do AI Researchers Study Game Playing?
42 pages
Game Playing: Games. Why? Minimax Search Alpha-Beta Pruning
No ratings yet
Game Playing: Games. Why? Minimax Search Alpha-Beta Pruning
31 pages
Adversarial Search
No ratings yet
Adversarial Search
78 pages
AAI - Intro Lec 9 10
No ratings yet
AAI - Intro Lec 9 10
22 pages
Gameplaying Group 1
No ratings yet
Gameplaying Group 1
13 pages
Game Playing MINMAX Search, Alpha-Beta Pruning, PDF
No ratings yet
Game Playing MINMAX Search, Alpha-Beta Pruning, PDF
4 pages
Ai Unit I and Unit 2 Notes
No ratings yet
Ai Unit I and Unit 2 Notes
10 pages
AI - Unit - 2
No ratings yet
AI - Unit - 2
30 pages
Search in Complex Environments
No ratings yet
Search in Complex Environments
28 pages
Adversarial Search and Game Playing: Games
No ratings yet
Adversarial Search and Game Playing: Games
8 pages
Artificial Inteligence
No ratings yet
Artificial Inteligence
4 pages
Artificial Intelligence Unit 3
No ratings yet
Artificial Intelligence Unit 3
69 pages
Ai Unit 2 Digital Notes
No ratings yet
Ai Unit 2 Digital Notes
82 pages
BCA 4th Sem. AI Paper
No ratings yet
BCA 4th Sem. AI Paper
12 pages
Cs607 Quiz 1 & 2 Merged - Merged
No ratings yet
Cs607 Quiz 1 & 2 Merged - Merged
386 pages
Chap 4 Games
No ratings yet
Chap 4 Games
31 pages
A Mini Project Report Tic Tac Toe
No ratings yet
A Mini Project Report Tic Tac Toe
11 pages
Unit 3 Problem Solving and Search Algorithms
No ratings yet
Unit 3 Problem Solving and Search Algorithms
73 pages
CST401 Artificial Intelligence, May 2024
No ratings yet
CST401 Artificial Intelligence, May 2024
4 pages
Mini Max Algorithm
No ratings yet
Mini Max Algorithm
31 pages
Othello Game Player: D C S J ' C
No ratings yet
Othello Game Player: D C S J ' C
26 pages
Development of Tic-Tac-Toe Game Using Heuristic Se
No ratings yet
Development of Tic-Tac-Toe Game Using Heuristic Se
7 pages
IAI Unit2
No ratings yet
IAI Unit2
81 pages
Ai2022 End Merged
No ratings yet
Ai2022 End Merged
28 pages
Updated PPT
No ratings yet
Updated PPT
14 pages
Unit-3 Problem Solving by Searching
No ratings yet
Unit-3 Problem Solving by Searching
31 pages
Module 3 Ai Viva Questions
No ratings yet
Module 3 Ai Viva Questions
7 pages
AI GTU Question Bank
No ratings yet
AI GTU Question Bank
5 pages
AL3391 AI Answer Key AU
No ratings yet
AL3391 AI Answer Key AU
15 pages
AI ENDSEM Sample Debjyoti
No ratings yet
AI ENDSEM Sample Debjyoti
8 pages
Alpha Beta Pruning in AI
No ratings yet
Alpha Beta Pruning in AI
10 pages
279817393aams Vol 216 April 2022 A25 p3303-3313 Kavita Sheoran, Et Al.
No ratings yet
279817393aams Vol 216 April 2022 A25 p3303-3313 Kavita Sheoran, Et Al.
11 pages
Anwesha Chinese Checker
No ratings yet
Anwesha Chinese Checker
34 pages
AI SMPS 2023 Week 07 Judea Pearl Figure 8.2 Alpha Beta
No ratings yet
AI SMPS 2023 Week 07 Judea Pearl Figure 8.2 Alpha Beta
4 pages

Week 13

Uploaded by

Week 13

Uploaded by

` Introduction to

Why do AI researchers study game playing?

1. It’s a good reasoning problem, formal and nontrivial.

2. Direct comparison with humans and other computer

1. Sequence of moves to play

• Unpredictable opponent  specifying a

• Time limits  unlikely to find goal, must

Generate New Position

Move to Highest-Valued Successor

computer’s The computer is Max.

leaf nodes At the leaf nodes, the

• These nodes represent the opponent’s

• The computer assumes that the human

• For chess, b ≈ 35, m ≈100 for "reasonable" games

Need to speed it up.

• Beta: an upper bound on the value that a

What happens here? Is there an alpha cutoff?

• Good move ordering improves effectiveness of pruning

• With "perfect ordering," time complexity = O(bm/2)

• A simple example of the value of reasoning about which

< 100 < 100

2. reorder the level-1 sucessors

3. proceed with - minimax search

• Secondary Search: after choosing a move,

• Book Moves: for some parts of the game

– Computer acts as 2 players: A and B

Terms that consistently correlate low with the

They are replaced by terms from the front of the

Does it work in practice?

• Chess: Deep Blue defeated human world champion Garry Kasparov

• Othello: human champions refuse to compete against computers,

• Go: human champions refuse to compete against computers, who

c chance node with

expectimax(c) = ∑P(di) max(backed-up-value(s))

expectimin(c’) = ∑P(di) min(backed-up-value(s))

• Since the complexity is higher (both time

• Pruning algorithms may be applied.

• They illustrate several important points about AI.

• Perfection is unattainable  must approximate.

• Game playing programs have shown the world

You might also like