Overview: This document covers games and optimal decisions in games: perfect vs. imperfect information, minimax, alpha-beta pruning (which improves on minimax by pruning parts of the search tree), heuristics that speed up search by replacing minimax utility values with heuristic evaluations below a cutoff depth, and feature-based evaluation functions together with their linearity and independence limitations.

CS335 Introduction to AI

Francisco Iacobelli
June 25, 2015
Games
Competitive and Perfect Information

- Competitive: commonly zero-sum (whatever one player wins, the other loses)
- Perfect information: players know the results of all previous moves; there is one best way to play for each player
- Imperfect information: players do not know all of the previous moves (they may play simultaneously)
- Simple states for representation (not Robo Soccer, but to be fair...)
Games
Language/Functions

S0 //initial state
player(s) // who’s the player in state s
actions(s) // possible moves from state s
result(s,a) // the state after action a is taken on state s
terminal(s) // returns true if s is a terminal state
utility(s,p) // the objective function in state s for player p
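As a concrete illustration of this interface, here is a sketch in Python for the game of Nim (take 1-3 stones; whoever takes the last stone wins). The game choice, the rules variant, and all names are assumptions for illustration, not from the slides.

```python
# Illustrative Nim instance of the game-description interface.
S0 = (7, "MAX")  # initial state: 7 stones, MAX to move

def player(s):            # whose turn it is in state s
    return s[1]

def actions(s):           # possible moves: take 1, 2 or 3 stones
    stones, _ = s
    return [n for n in (1, 2, 3) if n <= stones]

def result(s, a):         # the state after action a is taken in state s
    stones, p = s
    return (stones - a, "MIN" if p == "MAX" else "MAX")

def terminal(s):          # true if s is a terminal state (no stones left)
    return s[0] == 0

def utility(s, p):        # objective function in terminal state s for player p
    # The player to move in a terminal state did NOT take the last stone,
    # so they lost: -1 for them, +1 for the other player.
    return -1 if player(s) == p else 1
```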
Optimal Decisions
Optimal Players

- A game is not only about finding the best path to a goal
- The other player has a say
- Two players: MAX and MIN
- actions(s) and result(s,a) define a game tree
- Tic Tac Toe: fewer than 9! (362,880) terminal nodes
- Chess: over 10^40 states
- Search tree as theory
Game Tree
Tic Tac Toe

[Figure: partial tic-tac-toe game tree]

MIN plays MAX: a small 2-ply game

[Figure: 2-ply game tree; upward triangles are MAX nodes, downward triangles are MIN nodes]
Minimax
Picking my best move against your best move

minimax(s) =
  utility(s)                                   if terminal(s)
  max_{a ∈ actions(s)} minimax(result(s, a))  if player(s) = MAX
  min_{a ∈ actions(s)} minimax(result(s, a))  if player(s) = MIN

Minimax Algorithm
Recursive

function minimax-decision(state)
  v = max-value(state)
  return the action a in actions(state) whose result has value v
//
function max-value(state)
  if terminal(state) return utility(state)
  v = -infinity
  for a in actions(state) do
    v = max(v, min-value(result(state,a)))
  return v
//
function min-value(state)
  if terminal(state) return utility(state)
  v = +infinity
  for a in actions(state) do
    v = min(v, max-value(result(state,a)))
  return v
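The recursion above can be sketched as runnable Python over an explicit game tree given as nested lists (an int is a terminal utility; a list is a node whose children are the results of the available moves). The tree representation is an illustrative assumption.

```python
# Minimax over a nested-list game tree. MAX and MIN alternate by depth.
def minimax(node, is_max=True):
    if isinstance(node, int):      # terminal(s): return utility(s)
        return node
    values = [minimax(child, not is_max) for child in node]
    return max(values) if is_max else min(values)

# Two-ply example: root is MAX, its three children are MIN nodes.
tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(minimax(tree))  # MAX picks the branch whose minimum is largest -> 3
```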
Minimax
Discussion

- Complete depth-first exploration
- Depth m with b legal moves per state: time O(b^m)
- Space complexity (memory): O(bm)
- Chess: b ≈ 35; on average 50 ≤ m ≤ 100
- Impractical for most games, but the basis of other algorithms
Minimax
Multiplayer

- Utility vectors instead of single values

Alpha-Beta Pruning
Intuition

Do we need to expand all nodes?

minimax(root) = max(min(3, 12, 8), min(2, x, y), min(14, 5, 2))
              = max(3, min(2, x, y), 2)
              = max(3, z, 2)
              = 3

Do we need z? No: z = min(2, x, y) ≤ 2 < 3, so the values of x and y cannot change the result.
Alpha-Beta Pruning

Two values:
- α = value of the best (i.e., highest-value) choice found so far for MAX
- β = value of the best (i.e., lowest-value) choice found so far for MIN
- Each node keeps track of its [α, β] values
Alpha-Beta Pruning
Example

Trace on a 2-ply tree: root A is a MAX node; its children B, C, D are MIN nodes with leaf values B: (3, 12, 8), C: (2, x, y), D: (14, 5, 2).

1. Start at A with α = −∞, β = +∞ and descend into B.
2. Leaf 3: not ≤ α, so continue; B's bound becomes β = 3.
3. Leaves 12 and 8 are both ≥ 3, so B's value is 3.
4. Back at A: 3 ≥ α, so update α = 3.
5. Descend into C with α = 3. Leaf 2: 2 ≤ α, so C's value can be at most 2 and MAX will never choose C. Prune the remaining leaves x and y.
6. Descend into D with α = 3. Leaf 14: β = 14; leaf 5: β = 5; leaf 2: 2 ≤ α, so D's value 2 cannot beat α (it is the last leaf anyway).
7. Neither C (≤ 2) nor D (= 2) beats B's 3, so minimax(A) = 3.
Alpha-Beta Pruning
Algorithm

function alpha-beta-search(state)
  v = max-value(state, −∞, +∞)
  return the action in actions(state) with value v
//
function max-value(state, α, β)
  if terminal(state) return utility(state)
  v = -infinity
  for a in actions(state) do
    v = max(v, min-value(result(state,a), α, β))
    if v >= β return v    // prune: MIN will never allow this
    α = max(α, v)
  return v
//
function min-value(state, α, β)
  if terminal(state) return utility(state)
  v = +infinity
  for a in actions(state) do
    v = min(v, max-value(result(state,a), α, β))
    if v <= α return v    // prune: MAX will never allow this
    β = min(β, v)
  return v
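The algorithm can be sketched as runnable Python over a game tree given as nested lists (an int is a terminal utility, a list is an internal node). The tree is the one from the intuition slide, with illustrative values x = 4, y = 6 for the unknown leaves; the code records which leaves were actually evaluated, to show the pruning.

```python
# Alpha-beta pruning over a nested-list game tree.
def alphabeta(node, alpha=float("-inf"), beta=float("inf"),
              is_max=True, visited=None):
    if visited is None:
        visited = []
    if isinstance(node, int):          # terminal: record and return utility
        visited.append(node)
        return node
    if is_max:
        v = float("-inf")
        for child in node:
            v = max(v, alphabeta(child, alpha, beta, False, visited))
            if v >= beta:              # prune: MIN will never allow this
                return v
            alpha = max(alpha, v)
        return v
    v = float("inf")
    for child in node:
        v = min(v, alphabeta(child, alpha, beta, True, visited))
        if v <= alpha:                 # prune: MAX will never allow this
            return v
        beta = min(beta, v)
    return v

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
seen = []
print(alphabeta(tree, visited=seen))   # 3, same answer as plain minimax
print(seen)                            # the x, y leaves (4 and 6) never appear
```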
Alpha-Beta Pruning
Properties

- Pruning does not affect the final outcome
- Sorting moves by expected result improves alpha-beta performance
- Perfect ordering: O(b^(m/2))
- An exercise in metareasoning
Real Time Decisions
Heuristics, welcome back.

What if we change minimax as follows:

- replace utility(s) with eval(s), a heuristic evaluation function
- replace terminal(s) with cutoff(s,d), which decides when to apply eval(s)
- therefore h-minimax(s,d) is now a function of the state s and the depth d explored
Heuristics
Search Faster

h-minimax(s, d) =
  eval(s)                                                if cutoff(s, d)
  max_{a ∈ actions(s)} h-minimax(result(s, a), d + 1)    if player(s) = MAX
  min_{a ∈ actions(s)} h-minimax(result(s, a), d + 1)    if player(s) = MIN
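This depth-limited scheme can be sketched in runnable Python on a tree given as nested lists. The heuristic eval used here (the average of a subtree's leaves) is an illustrative assumption, not an evaluation function from the slides.

```python
# Depth-limited h-minimax: below the cutoff depth, use eval instead of search.
def h_minimax(node, depth, limit, is_max=True):
    if isinstance(node, int):              # a true terminal state
        return node
    if depth >= limit:                     # cutoff(s, d): apply eval(s)
        return eval_avg(node)
    vals = [h_minimax(c, depth + 1, limit, not is_max) for c in node]
    return max(vals) if is_max else min(vals)

def eval_avg(node):                        # toy heuristic: mean of all leaves
    leaves, stack = [], [node]
    while stack:
        n = stack.pop()
        if isinstance(n, int):
            leaves.append(n)
        else:
            stack.extend(n)
    return sum(leaves) / len(leaves)

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(h_minimax(tree, 0, 5))  # deep enough to reach terminals: plain minimax
print(h_minimax(tree, 0, 1))  # cutoff at depth 1: max of the leaf averages
```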

Heuristics
Good Evaluation Functions

- A bad evaluation function may result in a loss
- eval(s_w) ≥ eval(s_d) ≥ eval(s_l), where w = win, d = draw, l = loss
- eval(s) should be fast to compute
- eval(s) on non-terminal states should be highly correlated with winning
Heuristics
Combination of Features

eval(s) = w_1·f_1(s) + w_2·f_2(s) + ... + w_n·f_n(s) = Σ_{i=1}^{n} w_i·f_i(s)

Assumption: Each feature is independent of other features

What are good features and weights, say for chess? for
checkers?
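One answer for chess can be sketched in Python: material-difference features with the classic piece weights. Both the features and the weights here are illustrative assumptions (real engines tune many more features).

```python
# Weighted linear evaluation: eval(s) = sum of w_i * f_i(s).
PIECES = ["pawn", "knight", "bishop", "rook", "queen"]
WEIGHTS = [1, 3, 3, 5, 9]          # classic material values (an assumption)

def material_features(ours, theirs):
    # f_i(s): our piece count minus theirs, per piece type
    return [ours[p] - theirs[p] for p in PIECES]

def eval_state(ours, theirs):
    feats = material_features(ours, theirs)
    return sum(w * f for w, f in zip(WEIGHTS, feats))

ours = {"pawn": 8, "knight": 2, "bishop": 2, "rook": 2, "queen": 1}
theirs = {"pawn": 8, "knight": 2, "bishop": 1, "rook": 2, "queen": 1}
print(eval_state(ours, theirs))    # up one bishop -> +3
```

Note that this function scores positions by material only, which is exactly the independence assumption the next slide criticizes.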
Heuristics
Linearity and the independence assumption

[Figure: two chess positions with identical material but very different prospects; White to move]

Both boards would receive the same heuristic value, but they shouldn't!
Heuristics
Cutoff for alpha-beta pruning

The idea is to replace the termination condition in alpha-beta with:

if cutoff(s,d) then return eval(s)

- fixed depth d
- iterative deepening on d
- if s is terminal, cutoff(s,d) returns true
- add quiescence search: apply eval(s) only to quiescent ("quiet") states
- try to prevent the horizon effect: pushing inevitable consequences beyond the search depth
Stochastic Games
Chance plays a part

Backgammon: states and moves depend on a dice roll, which the opponent cannot foresee.
Stochastic Games
Incorporate Chance in the Tree

Chance is represented as chance nodes in the game tree.
Stochastic Games
Expected Value

expectiminimax(s) =
  utility(s)                                           if terminal(s)
  max_{a ∈ actions(s)} expectiminimax(result(s, a))    if player(s) = MAX
  min_{a ∈ actions(s)} expectiminimax(result(s, a))    if player(s) = MIN
  Σ_r P(r) · expectiminimax(result(s, r))              if player(s) = CHANCE
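The chance case can be sketched in runnable Python on a tiny hand-built tree. Here internal nodes are dicts with a "type" and "children"; chance children carry their probabilities. The tree and its probabilities are illustrative assumptions.

```python
# Expectiminimax: like minimax, plus probability-weighted chance nodes.
def expectiminimax(node):
    if isinstance(node, (int, float)):    # terminal(s): return utility(s)
        return node
    kind = node["type"]
    if kind == "max":
        return max(expectiminimax(c) for c in node["children"])
    if kind == "min":
        return min(expectiminimax(c) for c in node["children"])
    # chance node: sum over outcomes r of P(r) * value(result(s, r))
    return sum(p * expectiminimax(c) for p, c in node["children"])

# MAX chooses between a sure 3 and a fair coin flip between 10 and -2.
tree = {"type": "max", "children": [
    3,
    {"type": "chance", "children": [(0.5, 10), (0.5, -2)]},
]}
print(expectiminimax(tree))  # 4.0: the flip is worth 0.5*10 + 0.5*(-2) > 3
```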

Playing Games
Complexity

- Chess: a beginner plans 4-6 ply; Kasparov ≈ 12
- Chess: say O(35^5) for a 5-ply search
- Backgammon: O(b^m · n^m), where n is the number of distinct dice rolls
- Backgammon: b ≈ 20 and n = 21
- Mario Bros: a 22 × 22 area around Mario and 16 possible actions every 40 milliseconds (Togelius, Shaker, Karakovskiy and Yannakakis, 2013)
- Monte Carlo simulations... stay tuned.
Playing Games
Current Status

- Checkers: Chinook ended the 40-year reign of human world champion Marion Tinsley in 1994. It used a precomputed endgame database defining perfect play for all positions involving 8 or fewer pieces on the board, a total of 444 billion positions.
- Chess: Deep Blue defeated human world champion Garry Kasparov in a six-game match in 1997. Deep Blue searched 200 million positions per second, used approximately 8,000 evaluation features and a database of 700,000 grandmaster games, and undisclosed methods for extending some lines of search up to 40 ply.
- Othello: human champions refuse to compete against computers, which are too good.
- Go: human champions refuse to compete against computers, which are too bad. In Go, b > 300, so most programs use pattern knowledge bases to suggest plausible moves.
Exercise

Describe and implement:

- state descriptions
- move generators
- terminal tests
- utility functions
- evaluation functions (heuristics)

For: Monopoly, Scrabble, Texas Hold'em
