0% found this document useful (0 votes)

16 views7 pages

Game AI

Uploaded by

Kyaw Soe Linn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views7 pages

Game AI

Uploaded by

Kyaw Soe Linn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

15-112: Fundamentals of Programming and Computer Science

Fundamentals of Game AI

1. Why use game AI?

In term projects, Game AI is used to create a computer player for specific types of games. These algorithms
usually need to meet most or all of the criteria below:

• Turn-based (these algorithms all require this)

• 2-player
• Zero-sum (i.e. actions that help player 1 hurt player 2)
• Deterministic (i.e. no randomness, like dice rolls)
• Perfect information (i.e. no information is hidden from either player)

The main game AI algorithm is called minimax and only really works when all of those criteria are met.
Modifications and/or similar algorithms also work well when some of those criteria are not met (i.e. some
games of chance).

Page 1
15-112: Fundamentals of Programming and Computer Science

2. Key Utilities

The game AI algorithms usually require you to write several key utility functions. The ones listed below
are the ones required for minimax, but the other ones are quite similar:

• You must have some consistent way of representing the current game state (i.e. the current “board”,
whose turn it is, etc.)
• A way to obtain all of the moves that the current player can make from the current state
• A way to obtain the new state caused by making a certain move (this should almost always be
nondestructive)
• A way to know if the game is over, and if so, who won (or if there is a tie)
• A heuristic function that assigns a score to a state (more on this later)
You should try to keep app out of these functions, because they need to be able to work on arbitrary
game states, not just the current game state of the animation.

Examples of each of these parts will be shown in the case studies in subsequent pages.

3. Heuristics

The problem is that no algorithm can figure out what move is truly the best without exploring all or
almost all of the future game outcomes, potentially dozens or hundreds of moves into the future. This is
computationally impractical, so most of these algorithms instead only look a few moves into the future
and then use a heuristic function to assign a score to the state.

The heuristic function h(s) should usually have the following properties:

• If s is a state where the player has won, h(s) = +∞ (or is a really big positive number)
• If s is a state where the player has won, h(s) = −∞ (or is a really big negative number)
• If s is a state where the game is tied, h(s) = 0
• If state s1 is better for the player than state s2 , then h(s1 ) should ideally be > h(s2 )

The first 3 can be done by checking if the game is over (and if so, who won). As for the rest, that is often
done by using several scoring metrics and taking a weighted average.

For example, if we are playing a game of Othello, the metrics could be:
f1 (s) = the number of pieces the player has on the board
f2 (s) = the number of moves the player can make
f3 (s) = the number of corner pieces the player controls
And the following is one possible heuristic function:


 +∞ won

0 tied
h(s) =


 −∞ lost
f1 (s) + 3f2 (s) + 20f3 (s) otherwise


Design of the heuristic function is often important to the speed and performance of the game AI. If it
is too simple, the AI may not play well, and if it is too complex, then the AI may be too slow to be of
use. For any given game, there could be hundreds of possible scoring metric functions you could write.
Figuring these out is up to you. Research on the theory of your particular game may provide some insights.

Also, MVP definitions often require you to have at least one scoring metric in the heuristic that is somewhat
complex.

Page 2
15-112: Fundamentals of Programming and Computer Science

4. Algorithms

There are three major game AI algorithms worth considering (though there are other hybrids):

Minimax works best for games that meet the criteria mentioned earlier quite closely. The core idea of
minimax is that it uses recursion to simulate future game states. In the base case (game over, or the
target depth is reched), the heuristic function is called. In the recursive case, if it is your turn it picks the
move/score that resulted in the highest score from the recursive call, and if its your opponent’s turn it
picks the one with the lowest score.

Expectimax is quite similar to minimax, but also allows for some elements of chance. In the recursive
case, it has a third option instead of just your turn vs opponent’s turn: a chance node. This is where the
moves are all the possible outcomes of a random event (i.e. the roll of some dice). Instead of picking the
best/worst recursive call outcome, you take a weighted average of all the recursive calls (weighted by the
probability).

Monte Carlo is often used when there are too many moves and/or too much randomness for minimax/-
expectimax to handle well. The Monte Carlo algorithm is based on making many random simulations
and trying to make an informed decision based on those many simulations. Often it includes algorithmic
tricks to maximize how much of the game tree is “explored”, i.e. increasing the probability that a move
will be randomly chosen if the resulting game state has not been simulated yet (these versions are called
Monte Carlo Tree Search).

5. Case Study: Chess

Chess is the classical example of a game that can have a minimax AI. It meets all of the criteria (2 person,
turn-based, deterministic, perfect information, zero-sum). The key utilities can be written as follows
(there are other ways to write these):
(a) State: the board as a 2D list of strings, and a string saying whose turn it is (maybe also a list of
captured pieces).
(b) Possibles moves: returns a list of (startRow, startCol, endRow, endCol) tuples indicating each move
that each piece belonging to the current player can make.
(c) New state: nondestructively updates the board to reflect the moved piece and switches the turn.
(d) Game over: a function to detect checkmate
(e) Heuristic: could include metrics such as the number of pieces you have (weighted by type), whether
you are in check, the number of “pins” you have, etc.

Page 3
15-112: Fundamentals of Programming and Computer Science

6. Case Study: Backgammon

Backgammon is a game that is more suited to expectimax. It meets all of the criteria for minimax, except
that there is an element of chance; the legal moves for the player at each turn depend on the outcome of
rolling two dice.

This can be handled by having a chance node before each player’s turn, where the chance node’s possible
outcomes are all the possible dice rolls. The outcome of the dice roll is then stored in the state, allowing
the actual player/opponent turn to make decisions based on the dice roll.

7. Case Study: Battleship

Battleship is an example of a game that is more suited to Monte Carlo approach (not necessarily the
Monte carlo Tree Search). The criteria it violates is that it does not have perfect information; you do not
know where your enemy’s ships are and they do not know where yours are.

One approach to this would be to use some kind of recursive backtracker to simulate where the enemy’s
ships could be located based on the red/white pins pins, repeat several times, and then choose a location
to fire on based on which location contained a ship in most of the simulations.

Page 4
15-112: Fundamentals of Programming and Computer Science

8. Case Study: Risk

Risk is an example of a game that is more suited to the Monte Carlo Tree Search. It has a great deal of
chance and many possible moves at each turn. The likely aproach for writing an AI would be having it
do many random simulations of the game and picking whichever of the current moves resulted in the best
outcome on average.

The advantage to this approach is that its possible to write the random simulations in a way that they
explore the potential long term outcomes of a move on the game by playing out the game perhaps dozens
of moves into the future.

9. Minimax Pseudocode

Below is the pseudocode for minimax, which uses the utilities mentioned earlier. This version returns the
best move and the associated score.

1 define minimax ( state , depth , maximizing ) :

2 If depth = 0 or the state is a game - over state :
3 Return the score of the state
4 If maximizing :
5 Best score = −∞
6 Best move = dummy value
7 For each move from the current state :
8 Obtain the new state caused by making the move
9 Get the score of the new state by calling minimax :
10 Use the new state as the state
11 Decrease the depth by 1
12 Maximizing boolean is now false
13 If the score is better than the best score :
14 Update the best score / move
15 Return best score and best move
16 Otherwise :
17 Best score = +∞
18 Best move = dummy value
19 For each move from the current state :
20 Obtain the new state caused by making the move
21 Get the score of the new state by calling minimax :
22 Use the new state as the state
23 Decrease the depth by 1
24 Maximizing boolean is now true
25 If the score is worse than the best score :
26 Update the best score / move
27 Return best score and best move
To modify for expectimax, before checking if its maximizing or not you would check if it is a chance node.
If so, you would loop over all the moves, obtain hte new state and its score from the recursive call, then
take the average of all those scores instead of the min or max.

Page 5
15-112: Fundamentals of Programming and Computer Science

10. Minimax Example

Below is an example of Minimax choosing the best move in a game. The game states are represented as a
“tree” where the “root” (at the top) is the current game state and “leaves” (at the bottom) are the game
states that we call the heuristic function on.

This example has 3 layers of recursive calls before reaching the base case, and thus goes through 2 turns
of maximizing and 1 turns of minimizing the score.

For the purposes of this visual, recursive calls are evaluated simultaneously, and it starts by looking at
the base case and working upwards. Also, each state is labeled with a name and the calculated score.

11. Note: Alpha Beta Pruning

After MVP, it is quite common for students who wrote minimax to add Alpha Beta Pruning (ABP for
short) as an additional feature. ABP is a minor extension to the minimax algorithm that causes it to
speed up its calculation (sometimes significantly) by “pruning” stupid parts of the game tree that it knows
it won’t have to explore.

Page 6
15-112: Fundamentals of Programming and Computer Science

12. Where to start looking

Gry to avoid looking at sources that actually show code. This includes the GeeksForGeeks articles on
Minimax and Alpha Beta Pruning (the ones for Expectimax and Monte Carlo are less useful code, so
looking at those pages is less problematic). If a page does have code, try to avoid looking at it too much.

You must cite all sources, even the ones below!

• The Wikipedia pages for the algorithms ( [Minimax], [Expectimax], [Monte Carlo Tree Search],
[Alpha Beta Pruning]) are useful, but some of them contain a lot of other information you don’t
need to worry about. Some of them also have some decent images / GIFs, which show you how the
process works step-by-step.

• Some articles on Minimax / ABP: [LevelUp Coding] [Baeldung] [Medium] [Educative]

And some videos: [Computerphile] [Berkeley CS188]

• Some articles on Expectimax: [Baeldung] [GeeksForGeeks]

• Some articles on Monte Carlo Tree Search: [GeeksForGeeks] [TowardsDataScience] [Baeldung]

• If possible, avoid the GeeksForGeeks pages for Minimax or ABP, especially the parts with code or
“pseudocode” (their minimax “pseudocode” section isn’t very “pseudo”).

• Plenty of universities (including CMU) have course notes you can find online explaining some or all
of these algorithms. Some of these courses at CMU are 15-150 and 15-281.

Page 7

Lecture13 - Adversial Search Algorithms
No ratings yet
Lecture13 - Adversial Search Algorithms
23 pages
4 Adversel Search Game Tree
No ratings yet
4 Adversel Search Game Tree
51 pages
18CS753 Ai Module 4
No ratings yet
18CS753 Ai Module 4
43 pages
18cs753 Ai Module 4
No ratings yet
18cs753 Ai Module 4
44 pages
Introducing The Min-Max Algorithm: Paulo Pinto 28 July 2002
No ratings yet
Introducing The Min-Max Algorithm: Paulo Pinto 28 July 2002
11 pages
Game Search Algorithms in AI
No ratings yet
Game Search Algorithms in AI
5 pages
18cs753 Ai Module 4
No ratings yet
18cs753 Ai Module 4
44 pages
Artificial Intelligence For Gam Es: Shashwat Shukla
No ratings yet
Artificial Intelligence For Gam Es: Shashwat Shukla
56 pages
Why Do AI Researchers Study Game Playing?
No ratings yet
Why Do AI Researchers Study Game Playing?
42 pages
Ai Unit 2
No ratings yet
Ai Unit 2
88 pages
Adversial Search
No ratings yet
Adversial Search
38 pages
Game Playing
No ratings yet
Game Playing
32 pages
Lecture05 AdversarialSearch
No ratings yet
Lecture05 AdversarialSearch
51 pages
Unit Ii
No ratings yet
Unit Ii
56 pages
Optimal Decision in Games
No ratings yet
Optimal Decision in Games
68 pages
Local Adversarial Search
No ratings yet
Local Adversarial Search
44 pages
Week 13
No ratings yet
Week 13
45 pages
8.AI17game Final
No ratings yet
8.AI17game Final
30 pages
Adversarial Search
No ratings yet
Adversarial Search
37 pages
Lecture 04 - AI For Game Playing
No ratings yet
Lecture 04 - AI For Game Playing
55 pages
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
No ratings yet
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
71 pages
Lecture11 AdversarialSearch
No ratings yet
Lecture11 AdversarialSearch
74 pages
2025 Lecture03 AdversarialSearch
No ratings yet
2025 Lecture03 AdversarialSearch
51 pages
Game Playing AI: (Based On Earlier Lecture From Stephen Gould)
No ratings yet
Game Playing AI: (Based On Earlier Lecture From Stephen Gould)
28 pages
AI Unit-3
No ratings yet
AI Unit-3
109 pages
Oradea: Bucharest Arad Craiova
No ratings yet
Oradea: Bucharest Arad Craiova
53 pages
Paiunit 4
No ratings yet
Paiunit 4
36 pages
18CS753 AI Module 4
No ratings yet
18CS753 AI Module 4
44 pages
Adversial Search
No ratings yet
Adversial Search
101 pages
AI Lec07 Adversarial Search
No ratings yet
AI Lec07 Adversarial Search
29 pages
Artificial Intelligence 5. Game Playing: Course V231 Department of Computing Imperial College © Simon Colton
No ratings yet
Artificial Intelligence 5. Game Playing: Course V231 Department of Computing Imperial College © Simon Colton
28 pages
Game Playing MINMAX Search, Alpha-Beta Pruning, PDF
No ratings yet
Game Playing MINMAX Search, Alpha-Beta Pruning, PDF
4 pages
Adversarial Search
No ratings yet
Adversarial Search
78 pages
Adversarial Search
No ratings yet
Adversarial Search
42 pages
06 Adversarialsearch
No ratings yet
06 Adversarialsearch
36 pages
Lecture Notes Adversarial Search
No ratings yet
Lecture Notes Adversarial Search
15 pages
Ai Chap 2
No ratings yet
Ai Chap 2
48 pages
Chapter3 - Search4
No ratings yet
Chapter3 - Search4
37 pages
Lecture Notes Adversarial Search
No ratings yet
Lecture Notes Adversarial Search
13 pages
Games
No ratings yet
Games
41 pages
1 1 4GamePlaying
No ratings yet
1 1 4GamePlaying
23 pages
3 GamePlaying - Minimax
No ratings yet
3 GamePlaying - Minimax
75 pages
Aiml Unit-2
No ratings yet
Aiml Unit-2
61 pages
Game Playing
No ratings yet
Game Playing
48 pages
Game Playing
No ratings yet
Game Playing
63 pages
Adversarial Search - Game Trees and Minimax Evaluation
No ratings yet
Adversarial Search - Game Trees and Minimax Evaluation
50 pages
Adversarial Search and Game Playing: Games
No ratings yet
Adversarial Search and Game Playing: Games
8 pages
Game Playing
No ratings yet
Game Playing
24 pages
Chap-4 Adversarial Search
No ratings yet
Chap-4 Adversarial Search
43 pages
Artificial Inteligence
No ratings yet
Artificial Inteligence
4 pages
Game Playing, Knowledge Representation and Reasoning
No ratings yet
Game Playing, Knowledge Representation and Reasoning
18 pages
Adversarial Search and Game Playing
No ratings yet
Adversarial Search and Game Playing
77 pages
Lecture Adversarial Searches
No ratings yet
Lecture Adversarial Searches
25 pages
Game Playing Algorithm
No ratings yet
Game Playing Algorithm
27 pages
6 Min Max
No ratings yet
6 Min Max
11 pages
Gameplaying Group 1
No ratings yet
Gameplaying Group 1
13 pages
Game Playing
No ratings yet
Game Playing
8 pages
Swapping Pump - Shellogg.P
No ratings yet
Swapping Pump - Shellogg.P
13 pages
Downloads - Mastering Node - JS, Part 1 - Introduction
100% (1)
Downloads - Mastering Node - JS, Part 1 - Introduction
41 pages
Cybersecurity Industry: Research Analysis
No ratings yet
Cybersecurity Industry: Research Analysis
16 pages
BP0273950-Customer Experience Strategy CW2
No ratings yet
BP0273950-Customer Experience Strategy CW2
7 pages
Optimized GAN-Based Pipeline For High-Quality Face Restoration From CCTV Images
No ratings yet
Optimized GAN-Based Pipeline For High-Quality Face Restoration From CCTV Images
29 pages
Test Redmi 10 Selene
No ratings yet
Test Redmi 10 Selene
61 pages
Csab 32
No ratings yet
Csab 32
3 pages
Wi-Fi Module - WRG1 - Tuya Smart - Docs
No ratings yet
Wi-Fi Module - WRG1 - Tuya Smart - Docs
15 pages
BGC Form - Idb Check
No ratings yet
BGC Form - Idb Check
2 pages
3 Day-Fundamental of Mathematics
No ratings yet
3 Day-Fundamental of Mathematics
3 pages
Instagram User Analytics
No ratings yet
Instagram User Analytics
12 pages
An Introduction To Formal Language Theory That Integrates Experimentation and Proof - Allen Stoughton
No ratings yet
An Introduction To Formal Language Theory That Integrates Experimentation and Proof - Allen Stoughton
288 pages
Youtu-Be-F9e - BRLy
No ratings yet
Youtu-Be-F9e - BRLy
4 pages
Smart India Hackathon 2024
No ratings yet
Smart India Hackathon 2024
7 pages
Exploring Water Waves With The Ripple Tank Gizmo - Course Sidekick
No ratings yet
Exploring Water Waves With The Ripple Tank Gizmo - Course Sidekick
1 page
Booster Pump, 5 HP, 80 GPM X 180 FT
No ratings yet
Booster Pump, 5 HP, 80 GPM X 180 FT
4 pages
Com Solved Proble4ms I
No ratings yet
Com Solved Proble4ms I
6 pages
Employees' Perceived Benefits and Drawbacks From "Work From Home" During Covid-19
No ratings yet
Employees' Perceived Benefits and Drawbacks From "Work From Home" During Covid-19
15 pages
Health Monitoring of Aero Engine Components by Automated Eddy Current Inspection
No ratings yet
Health Monitoring of Aero Engine Components by Automated Eddy Current Inspection
8 pages
1.A-DQ BN SR 2Y 1.200N CT-General-2021.04.07-03
No ratings yet
1.A-DQ BN SR 2Y 1.200N CT-General-2021.04.07-03
7 pages
Itilv3 Foundation Study Guide
No ratings yet
Itilv3 Foundation Study Guide
22 pages
BAE - Digital - USL - Landscape - AMPV CDR - Medical Evacuation
No ratings yet
BAE - Digital - USL - Landscape - AMPV CDR - Medical Evacuation
2 pages
Removal of Mercury From Flue Gas Using Activated Carbon
No ratings yet
Removal of Mercury From Flue Gas Using Activated Carbon
1 page
(03-07) - Fuel Injection Nozzle - 4-390 4T-390 Emissions
No ratings yet
(03-07) - Fuel Injection Nozzle - 4-390 4T-390 Emissions
3 pages
Catalogue
No ratings yet
Catalogue
1 page
Pushing The Limits of Reengineering: Development Program For F-Class Turbine Parts
No ratings yet
Pushing The Limits of Reengineering: Development Program For F-Class Turbine Parts
4 pages
Beijing's Latest Answer To Pollution: The Smog Free Tower: Clean Air Creative Thinking
No ratings yet
Beijing's Latest Answer To Pollution: The Smog Free Tower: Clean Air Creative Thinking
2 pages
Iwerkz Keyboard Manual
No ratings yet
Iwerkz Keyboard Manual
2 pages
V. Spring Escape Chute: For Indoor Type
No ratings yet
V. Spring Escape Chute: For Indoor Type
2 pages
Jmu 3001 - e 1
No ratings yet
Jmu 3001 - e 1
1 page