
Introduction to Artificial Intelligence
Lecture 06
Amna Iftikhar, Spring 2021
Today’s Agenda
• Minimax algorithm
• Alpha-beta pruning



Game theory

• Competitive environments, in which the agents’ goals are in conflict, give rise to adversarial search problems, often known as games.



Game Playing State-of-the-Art
• Checkers: 1950: First computer player. 1994: First computer champion: Chinook
ended the 40-year reign of human champion Marion Tinsley, using a complete 8-piece
endgame database. 2007: Checkers solved!

• Chess: 1997: Deep Blue defeats human champion Garry Kasparov in a six-game
match. Deep Blue examined 200M positions per second, used very sophisticated
evaluation and undisclosed methods for extending some lines of search up to 40
ply. Current programs are even better, if less historic.

• Go: 2017: AlphaGo beats the world's number one Go player, Ke Jie. In Go, b > 300!
Classic programs use pattern knowledge bases, but big recent advances use Monte
Carlo (randomized) expansion methods.
– AlphaGo Zero achieved a 100-0 victory against the champion-defeating AlphaGo, while its
successor, the self-taught AlphaZero, is currently perceived as the world's top player in Go as
well as possibly in chess.



The Number of Possible Legal Board Positions for Go

• That number, which is greater than the number of atoms in the universe, was only determined in early 2016. Because there are so many directions any given game can move in, Go is a notoriously difficult game for computers to play. It has often been called the “Holy Grail” of artificial intelligence.



Types of Games
• Many different kinds of games!

• Axes:
– Deterministic or stochastic?
– One, two, or more players?
– Zero sum?
– Perfect information (can you see the state)?

• Want algorithms for calculating a strategy (policy) that recommends a move from each state
Zero-Sum Games

• Zero-Sum Games
– Agents have opposite utilities (values on outcomes)
– Lets us think of a single value that one maximizes and the other minimizes
– Adversarial, pure competition

• General Games
– Agents have independent utilities (values on outcomes)
– Cooperation, indifference, competition, and more are all possible



Adversarial Search



Game Tree
• Many two-player games can be efficiently represented using
trees, called game trees.
• A game tree is an instance of a tree in which the root node
represents the state before any moves have been made, the
nodes in the tree represent possible states of the game (or
positions), and arcs in the tree represent moves.
• It is usual to represent the two players’ moves on alternate
levels of the game tree, so that all edges leading from the root
node to the first level represent possible moves for the first
player, and edges from the first level to the second represent
moves for the second player, and so on.
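For illustration, a game tree can be stored with a very small node structure. The Python sketch below is illustrative only (the class name and fields are not from the slides): each node holds a position, the player to move, and the child nodes reachable by one move.

    # One node of a game tree: a position plus the moves (arcs) out of it.
    class GameTreeNode:
        def __init__(self, state, player, children=None):
            self.state = state              # the game position this node represents
            self.player = player            # whose turn it is at this node
            self.children = children or []  # nodes reachable by a single move

    # Example: a root position for the first player with two replies by the second.
    root = GameTreeNode("start", "player 1", [
        GameTreeNode("after move a", "player 2"),
        GameTreeNode("after move b", "player 2"),
    ])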



Ply and Move
• In discussing game trees, we use the concept of ply, which
refers to the depth of the tree.
• When a computer evaluates a game tree to ply 5, it is
examining the tree to a depth of 5. The 4th ply in a game tree
is the level at depth 4 below the root node.
• Because the games we are talking about involve two players,
sequential plies in the tree will alternately represent the two
players. Hence, a game tree with a ply of 8 will represent a
total of eight choices in the game, which corresponds to four
moves for each player.
• It is usual to use the word ply to represent a single level of choice in the game tree, and the word move to represent two such choices, one for each player. (Source: Wikipedia)



Tic-Tac-Toe Game Tree



Minimax Algorithm for Tic-Tac-Toe



Minimax Algorithm
• The minimax algorithm is a useful method for simple two-
player games. It is a method for selecting the best move
given an alternating game where each player opposes the
other working toward a mutually exclusive goal.

• In a two-person game, you must assume that your opponent has the same knowledge that you do and applies it as well as you do. So at each stage of the game you must assume your opponent makes the best available move. This is the basis of the minimax procedure.



Minimax Algorithm (Cont’d)
• It is assumed that a suitable static evaluation function is
available, which is able to give an overall score to a given
position.

• In applying Minimax, the static evaluator will only be used on leaf nodes, and the values of the leaf nodes will be filtered up through the tree, to pick out the best path that the computer can achieve.



Minimax Algorithm (Cont’d)
• The principle behind Minimax is that a path through the tree
is chosen by assuming that at its turn (a max node), the
computer will choose the move that will give the highest
eventual static evaluation, and that at the human opponent’s
turn (a min node), he or she will choose the move that will
give the lowest static evaluation.

• In other words, we assume that each player makes the next move that benefits them the most.
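As a concrete sketch of this procedure, the short Python example below (illustrative, not from the slides) runs minimax over a small game tree in which a leaf is just its static-evaluation score and an internal node is a list of successor subtrees.

    # Minimal minimax sketch. A tree is either a number (a leaf's static
    # evaluation) or a list of successor subtrees. Representation is assumed.
    def minimax(tree, maximizing):
        if isinstance(tree, (int, float)):   # leaf: return its static evaluation
            return tree
        values = [minimax(child, not maximizing) for child in tree]
        return max(values) if maximizing else min(values)

    # The max player picks the branch whose worst (min) reply is best.
    example_tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
    print(minimax(example_tree, maximizing=True))   # -> 3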



Minimax Algorithm for Tic-Tac-Toe



Example of Minimax Algorithm



Example I



Example II



Example III



Minimax Efficiency
• How efficient is minimax?
– Just like (exhaustive) DFS
– Time: O(b^d)
– Space: O(bd)

• Example: For chess, b ≈ 35, d ≈ 100


– Exact solution is completely infeasible
– But, do we need to explore the whole tree?
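A quick back-of-the-envelope check (using the rough figures above) shows why the exact solution is out of reach:

    # Roughly how many nodes would exhaustive minimax visit for chess?
    import math
    b, d = 35, 100                      # rough branching factor and game length
    digits = int(d * math.log10(b)) + 1
    print(digits)                       # -> 155, i.e. on the order of 10**154 nodes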



Minimax Properties
[Figure: a depth-2 game tree with a MAX root, two MIN children, and leaf values 10, 10, 9, 100; minimax selects the left branch, which is worth 10.]

Optimal against a perfect player. Otherwise?



Lookahead and Horizon Effect
• Minimax is a very simple algorithm
and is unsuitable for use in many
games, such as chess, where the
game tree is extremely large.
• In such cases, bounded lookahead is
very commonly used and can be
combined with Minimax.
• The idea of bounded lookahead is
that the search tree is only examined
to a particular depth. All nodes at this
depth are considered to be leaf
nodes and are evaluated using a
static evaluation function.



Lookahead and Horizon Effect
• When we employ
lookahead strategy, we
suffer from what is called
the horizon effect.

• When we can’t see beyond the horizon, it becomes easier to make a move that looks good now, but leads to problems later as we move further into this subtree.



Evaluation Function
• For a computer to use this tree to make decisions
about moves in a game of tic-tac-toe, it needs to use
an evaluation function, which enables it to decide
whether a given position in the game is good or bad.
• If we use exhaustive search, then we only need a
function that can recognize a win, a loss, and a draw.
• Then, the computer can treat “win” states as goal
nodes and carry out search in the normal way.
• But for limited-depth search, terminal utilities would have to be replaced by an evaluation function for non-terminal positions.
Static Evaluation Function for Tic-Tac-Toe
• The static evaluation function is defined as the number of possible win positions not blocked by the opponent, minus the number of possible win positions (row, column, and diagonal) for the opponent not blocked by the current player:

• f(n) = win_positions - lose_positions
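A minimal sketch of this heuristic in Python is shown below; the board encoding and function name are assumptions, not from the slides. A line counts as a possible win for a player if the other player does not occupy any of its three squares.

    # Sketch of f(n) = win_positions - lose_positions for tic-tac-toe.
    # Board: 3x3 list of 'X', 'O', or None (illustrative encoding).
    def static_eval(board, player, opponent):
        lines = list(board)                                            # rows
        lines += [[board[r][c] for r in range(3)] for c in range(3)]   # columns
        lines += [[board[i][i] for i in range(3)],
                  [board[i][2 - i] for i in range(3)]]                 # diagonals
        open_lines = lambda blocker: sum(blocker not in line for line in lines)
        # win_positions: lines the opponent has not blocked; lose_positions: vice versa
        return open_lines(opponent) - open_lines(player)

    # On the empty board both players have 8 open lines, so f(n) = 0.
    empty = [[None] * 3 for _ in range(3)]
    print(static_eval(empty, 'X', 'O'))   # -> 0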



Alpha-Beta Pruning
• Using alpha–beta pruning, it is
possible to remove sections of the
game tree that are not worth
examining, to make searching for a
good move more efficient.
• The principle behind alpha–beta
pruning is that if a move is
determined to be worse than
another move that has already been
examined, then further examining
the possible consequences of that
worse move is pointless.



Alpha-Beta Pruning Working
• The algorithm maintains two values, alpha and beta, which represent the minimum score that the maximizing player is assured of and the maximum score that the minimizing player is assured of, respectively.
• Initially alpha is negative infinity and beta is positive infinity.
• Together, alpha and beta provide a window of possible scores. We will never choose to make moves that score less than alpha, and our opponent will never let us make moves scoring more than beta. The score we finally achieve must lie between the two.
• As the recursion progresses the "window" becomes smaller. When
beta becomes less than alpha, it means that the current position
cannot be the result of best play by both players and hence need not
be explored further.





Example I (Revisited)

[Figure: the Example I game tree with two pruned subtrees marked X.]

Only 7 nodes (out of 12) explored with alpha-beta pruning.



Working of Alpha-Beta for Example I



Working of Alpha-Beta for Example I (Cont’d)



Minimax Example

[Figure: a depth-2 minimax tree with leaf values 3, 12, 8, 2, 4, 6, 14, 5, 2; the root’s minimax value is 3.]



Pruning

[Figure: the same tree searched with alpha-beta pruning; only the leaves 3, 12, 8, 2, 14, 5, 2 are examined, and the leaves 4 and 6 are pruned.]



Alpha-Beta Implementation
α: MAX’s best option on path to root
β: MIN’s best option on path to root

def max-value(state, α, β):
    initialize v = -∞
    for each successor of state:
        v = max(v, value(successor, α, β))
        if v ≥ β: return v
        α = max(α, v)
    return v

def min-value(state, α, β):
    initialize v = +∞
    for each successor of state:
        v = min(v, value(successor, α, β))
        if v ≤ α: return v
        β = min(β, v)
    return v
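The pseudocode above leaves the value dispatcher and the tree representation unspecified. The sketch below is one runnable Python reading of it; the nested-list tree and the leaf test are assumptions, not part of the slides.

    import math

    # A tree is either a number (a leaf's static evaluation) or a list of
    # successor subtrees. Illustrative representation only.
    def value(tree, alpha, beta, maximizing):
        if isinstance(tree, (int, float)):        # leaf node: static evaluation
            return tree
        return (max_value if maximizing else min_value)(tree, alpha, beta)

    def max_value(tree, alpha, beta):
        v = -math.inf
        for successor in tree:
            v = max(v, value(successor, alpha, beta, maximizing=False))
            if v >= beta:                         # MIN would never allow this branch
                return v
            alpha = max(alpha, v)
        return v

    def min_value(tree, alpha, beta):
        v = math.inf
        for successor in tree:
            v = min(v, value(successor, alpha, beta, maximizing=True))
            if v <= alpha:                        # MAX already has a better option
                return v
            beta = min(beta, v)
        return v

    # Same tree as the minimax example; alpha-beta returns the minimax value 3
    # while never examining the leaves 4 and 6.
    print(value([[3, 12, 8], [2, 4, 6], [14, 5, 2]], -math.inf, math.inf, True))   # -> 3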



Example II



Example III



Java Applet

• https://www.yosenspace.com/posts/computer-science-game-trees.html



Advantages of Alpha-Beta Pruning
• The alpha–beta pruning method provides its best
performance when the game tree is arranged such that the
best choice at each level is the first one (i.e., the left-most
choice) to be examined by the algorithm.
• With such a game tree, a Minimax algorithm using alpha–beta
cut-off will examine a game tree to double the depth that a
Minimax algorithm without alpha–beta pruning would
examine in the same number of steps.
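In practice, one common way to get close to this best case (an illustrative technique, not stated on the slides) is to sort successors with a cheap heuristic before recursing, so that the likely-best move is examined first:

    # Illustrative move-ordering helper: try the most promising successors first
    # so alpha-beta cut-offs happen earlier. quick_eval is an assumed cheap heuristic.
    def ordered_successors(successors, quick_eval, maximizing):
        return sorted(successors, key=quick_eval, reverse=maximizing)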

