Local and Adversarial Search
Local search algorithms
Sometimes the path to the goal is irrelevant:
8-queens problem, job-shop scheduling
circuit design, computer configuration
automatic programming, automatic graph drawing
Optimization problems may have no obvious
“goal test” or “path cost”.
Local search algorithms can solve such
problems by keeping in memory just one
current state (or perhaps a few).
Advantages of local search
1. Very simple to implement.
2. Very little memory is needed.
3. Can often find reasonable solutions in
very large state spaces for which
systematic algorithms are not suitable.
Hill-climbing search
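The pseudocode figure for this slide did not survive extraction; below is a minimal Python sketch of steepest-ascent hill-climbing, with neighbors(state) and value(state) as assumed problem-supplied callbacks (hypothetical names).

def hill_climbing(state, neighbors, value):
    # Steepest-ascent hill-climbing: move to the best neighbor,
    # stopping when no neighbor improves on the current state.
    while True:
        best = max(neighbors(state), key=value, default=None)
        if best is None or value(best) <= value(state):
            return state  # local maximum (or flat plateau)
        state = best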
Problems with hill-climbing
Can get stuck at a local maximum.
Cannot climb along a narrow ridge when each
possible step goes down.
Unable to find its way off a plateau.
Solutions:
Stochastic hill-climbing – select among uphill
moves with probability weighted by steepness
First-choice hill-climbing – randomly generate
successors until one improves on the current state
Simulated annealing search
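The annealing pseudocode figure is missing here; the sketch below follows the standard scheme (always accept uphill moves, accept downhill moves with probability exp(delta/T) while the temperature cools). The random_neighbor and value callbacks and the cooling schedule are assumptions.

import math
import random

def simulated_annealing(state, random_neighbor, value,
                        t0=1.0, cooling=0.995, t_min=1e-4):
    # Accept every uphill move; accept a downhill move with
    # probability exp(delta / T), so the search explores while
    # hot and behaves like hill-climbing as T approaches zero.
    t = t0
    while t > t_min:
        candidate = random_neighbor(state)
        delta = value(candidate) - value(state)
        if delta > 0 or random.random() < math.exp(delta / t):
            state = candidate
        t *= cooling
    return state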
Beam Search
Like hill-climbing but instead of tracking just
one best state, it tracks k best states.
Start with k states and generate successors
If a solution is among the successors, return it.
Otherwise, keep the k best states from among
all the successors.
Like hill-climbing, there are stochastic forms
of beam search.
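A minimal Python sketch of local beam search under the same assumptions as above (successors, value, and is_solution are hypothetical problem callbacks):

import heapq

def beam_search(starts, successors, value, is_solution, k):
    # Keep the k best states; each round, pool every successor
    # of every kept state and retain the k highest-valued ones.
    states = list(starts)
    while True:
        pool = [s for st in states for s in successors(st)]
        for s in pool:
            if is_solution(s):
                return s
        if not pool:
            return max(states, key=value)  # dead end: best so far
        states = heapq.nlargest(k, pool, key=value)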
Genetic Algorithms
Similar to stochastic beam search,
except that successors are drawn from
two parents instead of one.
The general idea is to find a solution by
iteratively selecting the fittest individuals
from a population and breeding them
until a threshold on iterations or
fitness is hit.
Genetic algorithms cont.
An individual state is represented by a
sequence of “genes”.
The selection strategy is randomized
with probability of selection
proportional to “fitness”.
Individuals selected for reproduction
are randomly paired, certain genes are
crossed over, and some are mutated.
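Combining the last two slides, a compact Python sketch with fitness-proportional selection, one-point crossover, and per-gene mutation; fitness, gene_pool, and the list-of-genes encoding are assumptions, not a fixed recipe.

import random

def genetic_algorithm(population, fitness, gene_pool,
                      generations=100, p_mutate=0.01):
    # population: list of individuals, each a list of genes.
    for _ in range(generations):
        weights = [fitness(ind) for ind in population]  # must be >= 0
        next_gen = []
        for _ in range(len(population)):
            # Selection: probability proportional to fitness.
            mom, dad = random.choices(population, weights=weights, k=2)
            # Crossover: splice the two parents at a random point.
            cut = random.randrange(1, len(mom))
            child = mom[:cut] + dad[cut:]
            # Mutation: occasionally replace a gene at random.
            child = [random.choice(gene_pool) if random.random() < p_mutate
                     else g for g in child]
            next_gen.append(child)
        population = next_gen
    return max(population, key=fitness)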
Genetic algorithms cont.
Genetic algorithms have been applied to a
wide range of problems.
Results are sometimes very good and
sometimes very poor.
The technique is relatively easy to apply, and
in many cases it is worth seeing whether it
works before turning to another approach.
Adversarial Search
The minimax algorithm
Alpha-Beta pruning
Games with chance nodes
Games versus real-world competitive
situations
Adversarial Search
An AI favorite
Competitive multi-agent environments
modeled as games
From single agent to two players
Actions no longer have predictable
outcomes
Uncertainty regarding opponent and/or
outcome of actions
Competitive situation
Much larger state-space
Time limits
Still assume perfect information
Formalizing the search problem
Initial state = initial game/board position
and player
Successors = operators = all legal moves
Terminal state test (not “goal”-test) = a
state in which the game ends
Utility function = payoff function = reward
Game tree = a graph representing all the
possible game scenarios
[Figure: partial game tree for Tic-Tac-Toe]
What are we searching for?
Construct a “strategy” or “contingent
plan” rather than a “path”
Must take into account all possible
moves by the opponent
Representation of a strategy
Optimal strategy = leads to the highest
possible guaranteed payoff
The minimax algorithm
Generate the whole tree
Label the terminal states with the payoff
function
Work backwards from the leaves,
labeling each state with the best
outcome possible for that player
Construct a strategy by selecting the
best moves for “Max”
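The labeling process above translates directly into a Python sketch; terminal, utility, and successors are assumed game-supplied callbacks.

def minimax(state, maximizing, terminal, utility, successors):
    # Label leaves with the payoff, then back values up the tree:
    # Max takes the largest child value, Min the smallest.
    if terminal(state):
        return utility(state)
    children = [minimax(s, not maximizing, terminal, utility, successors)
                for s in successors(state)]
    return max(children) if maximizing else min(children)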
Minimax algorithm cont.
Labeling process leads to the “minimax
decision” that guarantees maximum
payoff, assuming that the opponent is
rational
Labeling can be implemented as a
depth-first search using linear space
Illustration of minimax
[Figure: two-ply minimax tree. Leaf values (3 12 8), (2 4 6), (14 5 2) back up to MIN values 3, 2, 2; the MAX root takes their maximum, 3.]
But seriously...
Can’t search all the way to leaves
Use Cutoff-Test function;
generate a partial tree whose leaves
meet the cutoff-test
Apply heuristic to each leaf
Assume that the heuristic represents
payoffs, and back up using minimax
What’s in an evaluation function?
Evaluation function assigns each state
to a category, and imposes an ordering
on the categories
Some claim that the evaluation function
should measure P(winning)...
Evaluating states in chess
“material” evaluation
Count the pieces for each side, giving
each a weight (queen=9, rook=5,
knight/bishop=3, pawn=1)
What properties do we care about in the
evaluation function?
Only the ordering matters
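A minimal sketch of the material evaluation described above; the (piece, owner) board encoding is hypothetical.

# Hypothetical board encoding: an iterable of (piece, owner) pairs.
WEIGHTS = {'Q': 9, 'R': 5, 'B': 3, 'N': 3, 'P': 1}

def material(board, side):
    # Weighted piece count: our material minus the opponent's.
    return sum(WEIGHTS.get(piece, 0) * (1 if owner == side else -1)
               for piece, owner in board)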
Evaluating states in backgammon
Possible goals (features):
Hit your opponent's blots
Reduce the number of blots that are in danger
Build points to block your opponent
Remove men from board
Get out of opponent's home
Don't build high points
Spread the men at home positions
Learning evaluation functions
Learning the weights of chess pieces...
can use anything from linear regression
to hill-climbing.
The harder question is picking the
primitive features to use.
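As one concrete instance of the linear-regression option, a least-squares fit of piece weights; the data below is an illustrative toy, not real game data.

import numpy as np

# Illustrative toy data: each row holds (queen, rook, bishop,
# knight, pawn) count differences for one position; y holds the
# eventual outcome from the side to move's point of view.
X = np.array([[1, 0, 0, 0, 2],
              [0, 1, -1, 0, 0],
              [0, 0, 0, 1, -3]], dtype=float)
y = np.array([1.0, 0.5, -1.0])

weights, *_ = np.linalg.lstsq(X, y, rcond=None)
print(weights)  # fitted piece weights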
Problems with minimax
Uniform depth limit
Horizon problem:
over-rates sequences of moves that merely
“stall” a bad outcome, pushing it beyond the
search horizon
Does not take into account possible
“deviations” from guaranteed value
Does not factor search cost into the
process
Minimax may be inappropriate…
[Figure: a MAX node choosing between MIN subtrees with backed-up values 99 and 100; the guaranteed values say nothing about how the branches differ in risk.]
Reducing search cost
In chess, can only search
full-width tree to about 4 levels
The trick is to “prune” certain subtrees
Fortunately, best move is provably
insensitive to certain subtrees
Alpha-Beta pruning
Goal: compute the minimax value of a
game tree with minimal exploration.
Along current search path, record best
choice for Max (alpha), and best choice
for Min (beta).
If any new state is known to be worse
than alpha or beta, it can be pruned.
Simple example of “meta-reasoning”
Illustration of Alpha-Beta
[Figure: four-ply alpha-beta trace. Leaf values 41 11 9 37 52 48 20 30 10 27 10 37 50 36 25 3, with pruned leaves marked X; the backed-up root MAX value is 11.]
Implementation of Alpha-Beta
function Alpha (state, α, β)
  if Cutoff (state) then return Value(state)
  for each s in Successors(state) do
    α ← Max(α, Beta (s, α, β))
    if α ≥ β then return β
  end
  return α
Implementation cont.
function Beta (state, α, β)
  if Cutoff (state) then return Value(state)
  for each s in Successors(state) do
    β ← Min(β, Alpha (s, α, β))
    if β ≤ α then return α
  end
  return β
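The same procedure as a runnable fail-hard Python sketch; cutoff, value, and successors are assumed callbacks. Call it as alpha_beta(root, float('-inf'), float('inf'), True, ...).

def alpha_beta(state, alpha, beta, maximizing,
               cutoff, value, successors):
    # Fail-hard alpha-beta: stop expanding a node as soon as its
    # value can no longer affect the decision at an ancestor.
    if cutoff(state):
        return value(state)
    if maximizing:
        for s in successors(state):
            alpha = max(alpha, alpha_beta(s, alpha, beta, False,
                                          cutoff, value, successors))
            if alpha >= beta:
                return beta  # Min above will never allow this node
        return alpha
    for s in successors(state):
        beta = min(beta, alpha_beta(s, alpha, beta, True,
                                    cutoff, value, successors))
        if beta <= alpha:
            return alpha  # Max above will never allow this node
    return beta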
Effectiveness of Alpha-Beta
Depends on ordering of successors.
With perfect ordering, alpha-beta examines
O(b^(m/2)) nodes rather than O(b^m), so the
effective branching factor is SQRT(b) and the
same time budget buys roughly twice the depth.
While perfect ordering cannot be
achieved, simple heuristics are very
effective.
What about time limits?
Iterative deepening
(minimax to depths 1, 2, 3, ...)
Can even use iterative deepening
results to improve top-level ordering
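A sketch of the anytime driver, assuming a depth-limited search best_move(state, depth) (hypothetical) and a wall-clock budget; a real engine would also abort the search in progress at the deadline.

import time

def iterative_deepening(state, best_move, budget_seconds):
    # Deepen one ply at a time; keep the last completed result so
    # a move is always available when the time budget runs out.
    deadline = time.monotonic() + budget_seconds
    move, depth = None, 1
    while time.monotonic() < deadline:
        move = best_move(state, depth)
        depth += 1
    return move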
Games with an element of chance
Add chance nodes to the game tree
Use the expectimax or expectiminimax
algorithm
One problem: evaluation function is now
scale dependent (not just ordering!)
There is even an alpha-beta trick for this
case
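A Python sketch of expectiminimax over a tree with explicit chance nodes; kind(state) and chance_outcomes(state) are assumed interfaces, not part of the original slides.

def expectiminimax(state, kind, terminal, utility,
                   successors, chance_outcomes):
    # kind(state) is 'max', 'min', or 'chance'; chance_outcomes
    # yields (probability, successor) pairs for chance nodes.
    if terminal(state):
        return utility(state)
    def rec(s):
        return expectiminimax(s, kind, terminal, utility,
                              successors, chance_outcomes)
    if kind(state) == 'max':
        return max(rec(s) for s in successors(state))
    if kind(state) == 'min':
        return min(rec(s) for s in successors(state))
    # Chance node: probability-weighted average of outcome values.
    return sum(p * rec(s) for p, s in chance_outcomes(state))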
[Figure: evaluation is scale dependent]
State-of-the-art programs
Chess: Deep Blue [Campbell, Hsu, and Tan; 1997]
Defeated Garry Kasparov in a 6-game match.
Used parallel computer with 32 PowerPCs
and 512 custom VLSI chess processors.
Could search 100 billion positions per move,
reaching depth 14.
Used alpha-beta with improvements,
following “interesting” lines more deeply.
Extensive use of libraries of openings and
endgames.
State-of-the-art programs
Checkers: [Samuel, 1952]
Expert-level performance using a 1 kHz CPU with
10,000 words of memory.
One of the earliest examples of machine learning.
Checkers: Chinook [Schaeffer, 1992]
Won the 1992 U.S. Open and was the first program to
challenge for a world championship.
Lost a match against Tinsley (world champion for over
40 years, who had lost only 3 games before the match).
Became world champion in 1994.
Used alpha-beta search combined with a database of
all 444 billion positions with 8 or fewer pieces on the board.
State-of-the-art programs
Backgammon: TD-Gammon [Tesauro, 1992]
Ranked among the top three players in the
world.
Combined Samuel’s RL method with neural
network techniques to develop a remarkably
good heuristic evaluator.
Used expectiminimax search to depth 2 or 3.
State-of-the-art programs
Bridge: GIB [Ginsberg, 1999]
Won computer bridge championship; finished 12th in
a field of 35 at the 1998 world championship.
Examined how each choice works for a random
sample of the (up to 10 million) possible arrangements
of the hidden cards.
Used explanation-based generalization to compute
and cache general rules for optimal play in various
classes of situations.
Lots of theoretical problems...
Minimax only valid on whole tree
P(win) is not well defined
Correlated errors
Perfect play assumption
No planning