0% found this document useful (0 votes)

81 views7 pages

Build Order Optimization in Starcraft.: January 2011

This document summarizes a research paper about optimizing build orders in real-time strategy (RTS) games like StarCraft. Build orders determine the sequence of units and structures produced to achieve certain goals in the game. The authors present a planning approach and heuristics to speed up searching for optimal build orders in StarCraft. They evaluate their method by comparing the build orders it generates to those of professional StarCraft players.

Uploaded by

Noe Muñoz Quito

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views7 pages

Build Order Optimization in Starcraft.: January 2011

Uploaded by

Noe Muñoz Quito

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/220978332

Build Order Optimization in StarCraft.

Conference Paper · January 2011

Source: DBLP

CITATIONS READS
69 1,038

2 authors, including:

David Churchill
Memorial University of Newfoundland
13 PUBLICATIONS 668 CITATIONS

SEE PROFILE

All content following this page was uploaded by David Churchill on 21 March 2016.

The user has requested enhancement of the downloaded file.

Build Order Optimization in StarCraft

David Churchill and Michael Buro

University of Alberta
Computing Science Department
Edmonton, Alberta, Canada

Abstract bases. The order in which units and structures are produced
is called a build order. RTS games are usually won by play-
In recent years, real-time strategy (RTS) games have gained
ers who destroy opponents’ structures first. This goal can be
interest in the AI research community for their multitude of
challenging subproblems — such as collaborative pathfind- accomplished in various ways. For instance, one could try
ing, effective resource allocation and unit targeting, to name to surprise (“rush”) the opponent by investing resources into
a few. In this paper we consider the build order problem in attack forces early in the game at the cost of delaying the
RTS games in which we need to find concurrent action se- construction of structures that are important in later game
quences that, constrained by unit dependencies and resource stages. If the opponent, on the other hand, invests in tech-
availability, create a certain number of units and structures in nological development and disregards defense, the rushing
the shortest possible time span. We present abstractions and player will win easily. Thus, at the highest adversarial strat-
heuristics that speed up the search for approximative solu- egy level, the choice of initial build orders often decides the
tions considerably in the game of StarCraft, and show the ef- game outcome. Therefore, like in chess, aspiring players
ficacy of our method by comparing its real-time performance
need to study and practice executing build orders and tai-
with that of professional StarCraft players.
lor them to specific opponents. Avoiding the interesting and
ambitious task of selecting good build order goals, in this
Introduction paper we assume that they are given to us. Because act-
Automated planning, i.e. finding a sequence of actions lead- ing fast is very important in RTS games due to the fact that
ing from a start to a goal state, is a central problem in artifi- players act asynchronously, what remains is finding action
cial intelligence research with many real-world applications. sequences that accomplish the given goal while minimizing
For instance, the satisfiability problem can be considered a the plan duration (makespan). This process is called build
planning problem (we need to assign values to n Boolean order optimization.
variables so that a given formula evaluates to true) with ap- The research on this subject that is reported here was mo-
plications to circuit verification, solving Rubik’s cube from a tivated by the goal of creating strong AI systems for the
given start state is a challenging pastime, and building sub- popular RTS game of StarCraft and the frustration of hard-
marines when done inefficiently can easily squander hun- coding build orders for them. In the remainder of this paper
dreds of millions of dollars. Most interesting planning prob- we first give a brief overview of related work on build order
lems in general are hard to solve algorithmically. Some, like optimization and our application domain StarCraft. Then we
the halting problem for Turing machines, are even undecid- describe our search algorithm, the underlying assumptions,
able. and the abstractions we use. In the following experimental
In this paper we consider a class of planning problems that section we gauge the performance of our planner by com-
arises in a popular video game genre called real-time strat- paring its build orders with those executed by professional
egy (RTS) games. In these games, which can be succinctly players. We finish the paper by conclusions and suggestions
described as war simulations, players instruct units in real- for future work.
time to gather resources, to build other units and structures,
to scout enemy locations, and to eventually destroy oppo- Background
nents’ units to win the game. In the opening phase of RTS The build order optimization problem can be described as a
games players usually don’t interact with each other because constraint resource allocation problem with makespan min-
their starting locations are spread over large maps and player imization, which features concurrent actions. Because of
visibility is limited to small regions around friendly units or their practical relevance, problems of this kind have been
structures. The main sub-goals in this game phase are to es- the subject of study for many years, predominantly in the
tablish a sufficient income flow by producing workers that area of operations research.
gather resources, to quickly build structures that are pre- (Buro and Kovarsky 2007) motivates research on build or-
requisites for other structures or can produce combat units der problems in the context of RTS games and proposes a
to build a minimal force for defense or early attack, and to way of modeling them in PDDL, the language used in the
send out scouts to explore the terrain and search for enemy automated planning competitions. In (Kovarsky and Buro
2006) the issue of concurrent execution is studied in general and gas which are consumed by the player throughout the
and efficient action ordering mechanisms are described for game. Producing additional worker units early in the game
the RTS game build order domain. increases resource income and is typically how most profes-
Existing techniques for build order planning in the RTS sional players start their build orders. Once a suitable level
game domain have focused mainly on the game Wargus (an of income has been reached, players begin the construction
open source clone of Warcraft 2), which is much simpler of additional structures and units which grow their military
than StarCraft due to the limited number of possible ac- strength. Each unit has a set of prerequisite resources and
tions and lower resource gathering complexity. Several of units which the player must obtain before beginning their
these techniques rely heavily on means-end analysis (MEA) construction. The graph obtained by listing all unit prereq-
scheduling. Given an initial state and a goal, MEA produces uisites and eliminating transitive edges is called a tech tree.
a satisficing plan which is minimal in the number of actions Due to the complex balance of resource collection and unit
taken. MEA runs in linear time w.r.t. the number of actions prerequisite construction, finding good build orders is a dif-
in a plan, so it is quite fast, but the makespans it produces ficult task, a skill often taking professional players years to
are often much longer than optimal. develop.
(Chan et al. 2007b) employ MEA to generate build or-
der plans, followed by a heuristic rescheduling phase which A Build Order Planning Model for StarCraft
attempts to shorten the overall makespan. While they pro- Build order planning in RTS games is concerned with find-
duce satisficing plans quite quickly, the plans are not op- ing a sequence of actions which satisfies a goal with the
timal due to the complex nature of the rescheduling prob- shortest makespan. It is our goal to use domain specific
lem. In some cases they are able to beat makespans gen- knowledge to limit both the branching factor as well as depth
erated by human players, but do not mention the relative of search while maintaining optimality, resulting in a search
skill level of these players. This technique is extended in algorithm which can run in real-time in a StarCraft playing
(Chan et al. 2007a) by incorporating best-first search in an agent.
attempt to reduce makespans further by solving intermedi- In StarCraft, a player is limited to a finite number of re-
ate goals. They admit that their search algorithm is lacking sources which they must both collect and produce through-
many optimizations, and their results show that this is not out the game. All consumables (minerals, gas) as well as
only slower than their previous work but still cannot produce units (workers, fighters, buildings) are considered resources
significantly better solutions. (Branquinho and Lopes 2010) for the purpose of search. An action in our search is one
extend further on these ideas by combining two new tech- which requires some type of resource, while producing an-
niques called MeaPop (MEA with partial order planning) other (combat actions are out of our scope). Resources
and Search and Learning A* (SLA*). These new results which are used by actions can be of the forms Require, Bor-
improve on the makespans generated by MEA, but require row, Consume, and Produce (Branquinho and Lopes 2010).
much more time to compute, bringing it outside the range of Required resources, which are called prerequisites, are the
real-time search. They are currently investigating ways of ones which must be present at the time of issuing an action.
improving the run-time of SLA*. A borrowed resource is one which is required, used for the
These techniques however are only being applied to War- duration of an action, and returned once the action is com-
gus, with goals consisting of at most 5 types of resources. pleted. A consumed resource is one which is required, and
Interesting plans in StarCraft may involve multiple instances used up immediately upon issue. A produced resource is one
of up to 15 different units in a single goal and requiring far which is created upon completion of the action.
more workers, increasing complexity dramatically. Each action a has the form a = (δ, r, b, c, p), with du-
ration δ (measured in game simulation frames), three sets
StarCraft of preconditions r (required), b (borrowed), c (consumed),
RTS games are interesting application domains for AI re- and one set of produced items p. For example, in the Star-
searchers, because state spaces are huge, actions are con- Craft domain, the action a = “Produce Protoss Dragoon”
current, and part of the game state is hidden from players has δ = 600, r = {Cybernetics-Core}, b = {Gateway},
— and yet, human players still play much better than ma- c = {125 minerals, 50 gas, 2 supply}, p = {1 Dragoon}.
chines. To spark researchers’ interest in this game domain, States then take the form S = (t, R, P, I), where t is the
a series of RTS game AI competitions have been organized current game time (measured in frames), vector R holds the
in the past 6 years. In 2006-2009 a free software RTS game state of each resource available (ex: 2 barracks available,
engine was used (ORTS 2010), but since the advent of the one currently borrowed until time X), vector P holds actions
BWAPI library (BWAPI 2011), the competition focus has in progress but are not yet completed (ex: supply depot will
switched to StarCraft (by Blizzard Entertainment), the most finish at time X), and vector I holds worker income data
popular RTS game in the world with over 10 million copies (ex: 8 gathering minerals, 3 gathering gas). Unlike some
sold. StarCraft has received over 50 game industry awards, implementations such as (Branquinho and Lopes 2010), I is
including over 20 “game of the year” awards. Some pro- necessary due to abstractions made to facilitate search.
fessional players have reached celebrity status, and prize
money for tournaments total in the millions of dollars an- Abstractions
nually. Without having access to the StarCraft game engine source
As in most RTS games, each player starts with a num- code, it was necessary to write a simulator to compute
ber of worker units which gather resources such as minerals state transitions. Several abstractions were made in order

2
to greatly reduce the complexity of the simulation and the Algorithm 1 Depth-First Branch & Bound
search space, while maintaining close to StarCraft-optimal Require: goal G, state S, time limit t, bound b
results. Note that any future use of the term ’optimal’ or 1: procedure DFBB(S)
’optimality’ refers to optimality within these abstractions: 2: if TimeElapsed ≥ t then
We abstract mineral and gas resource gathering by real 3: return
valued income rates of 0.045 minerals per worker per frame 4: end if
and 0.07 gas per worker per frame. These values have been 5: if S satisfies G then
determined empirically by analyzing professional games. In 6: b ← min(b, St ) . update bound
reality, resource gathering is a process in which workers 7: bestSolution ← solutionPath(S)
spend a set amount of time gathering resources before re- 8: else
turning them to a base. Although we fixed income rates in 9: while S has more children do
our experiments, they could be easily estimated during the 10: S 0 ← S.nextChild
game. This abstraction greatly increases the speed of state 11: S 0 .parent ← S
transition and resource look-ahead calculations. It also elim- 12: h ← eval(S 0 ) . heuristic evaluation
inates the need for “gather resource” type actions which typ- 13: if St0 + h < b then
ically dominate the complexity of build order optimization. 14: DFBB(S 0 )
Due to this abstraction, we now consider minerals and gas 15: end if
to be a special type of resource, whose “income level” data 16: end while
is stored in state component I. 17: end if
Once a refinery location has been built, a set number of 18: end procedure
workers (3 in our experiments) will be sent to gather gas
from it. This abstraction eliminates the need for worker
re-assignment and greatly reduces search space, but in rare
cases is not “truly” optimal for a given goal. Fast Forwarding and State Transition
Whenever a building is constructed, a constant of 4 sec- In general, RTS games allow the user to take no action at
onds (96 simulation frames) is added to the game state’s any given state, resulting in a new state which increases the
time component. This is to simulate the time required for internal game clock, possibly increasing resources and com-
a worker unit to move to a suitable building location within pleting actions. This is problematic for efficient search al-
an arbitrary environment, since individual map data is not gorithms since it means that all actions (including the null
used in our search, but again could be estimated during the action) must be taken into consideration in each state of the
game. game. This results in a search depth which is linear not in
the number of actions taken, but in the makespan of our
Algorithm solution, which is often quite high. In order to solve this
We use a depth-first branch and bound algorithm to perform problem, we have implemented a fast-forwarding simulation
build order search. The algorithm takes a starting state S technique which eliminates the need for null actions.
as input and performs a depth-first recursive search on the In StarCraft, the time-optimal build order for any goal is
descendants of S in order to find a state which satisfies a one in which actions are executed as soon as they are legal,
given goal G. This algorithm has the advantage of using since hoarding resources cannot reduce the total makespan.
a linear amount of memory with respect to the maximum Although resource hoarding can be a vital strategy in late-
search depth. Since this is an any-time algorithm we can game combat, it is outside the scope of our planner. Let us
halt the search at any point and return the best solution so define the following functions:
far, which is an important feature for real-time applications. S 0 ←Sim(S, δ) - Simulate the natural progression of a
StarCraft game from a state S through δ time steps given
Action Legality that no other actions are issued, resulting in a new state S 0 .
In order to generate the children of a state, we must deter- This simulation includes the gathering of resources (given
mine which actions are legal in this state. Intuitively, an ac- our economic abstraction) and the completion of durative
tion is legal in state S if the simulation of the game starting actions which have already been issued.
in time will eventually produce all required resources with- δ ←When(S, R) - Takes a state S and a set of resource
out issuing any further actions. Given our abstractions, an requirements R and returns the earliest time δ for which
action is therefore legal in state S if and only if the following Sim(S, δ) will contain R. This function is typically called
conditions hold: 1) The prerequisites required or resources with action prerequisites to determine when the required re-
borrowed are either currently available, or being created. sources for an action a will be ready.
Example: a Barracks is under construction, so fighter units S 0 ←Do(S, a) - Issue action a in state S assuming all re-
will be trainable without any other actions being issued. 2) quired resources are available. The issuing of the action in-
The consumed resources required by the action are either volves subtracting the consume resources, updating actions
currently available or will be available at some point in the in progress and flagging borrowed resources in use. The re-
future without any other actions being taken. Example: we sulting state S 0 is the state for which action a has just been
do not currently meet the amount of minerals required, how- issued and has its full duration remaining.
ever our workers will eventually gather the required amount
(assuming there is a worker gathering minerals). S 0 = Do(Sim(S, When(S, a)), a)

3
now defines our state transition function which returns the 500
K=1
state S 0 for which action a has been issued. 480 K=2

460
Concurrent Actions and Action Subset Selection

makespan (seconds)
440
A defining feature of RTS games is the ability to perform
concurrent actions. For example, if a player has a suffi- 420
cient amount of resources they may begin the concurrent 400
construction of several buildings as well as the training of
380
several units. In a general setting, this may cause an action-
space explosion because an super-exponential number of 360
possible actions sequences has to be considered. Even in 340
the common video game setting in which a game server se- 1 2 3 4 5 6 7 8 9
quentializes incoming concurrent player actions, it can be log(number of nodes expanded) [base 10]

co-NP hard to decide whether these actions when sequential-

ized in arbitrary order result in the same state (Buro and Ko- Figure 1: Makespan vs. nodes searched for late-game goal
varsky 2007). Fortunately, many RTS games, including Star- of two carriers, comparing optimal search (K = 1) and
Craft, have the property that simultaneously executable ac- approximate search with macro actions (K = 2). Macro
tions are independent of each other, i.e. action effects don’t actions make complex searches tractable while maintaining
invalidate prerequisites of other actions: For any two actions close to optimal makespans.
a, b to be executable concurrently in state S we must have for our search is TrivialPlan(S,G) — Given a state and a
δ = When(S, prerequisites of a and b) = 0, which means goal, we simply take a random legal action from the goal
Sim(S, δ) = S. Because function Do(S, x) returns a state and issue it when it is possible. This guarantees that our
in which precondition resources are decreased and postcon- goal is met, but does not optimize for time. The length of
dition resources are increased, we have this plan is then used as an upper bound in our search.
Do(Do(S, a), b) = Do(S, a + b)
Breadth Limiting
= Do(Do(S, b), a),
To limit the branching factor of our search, we impose up-
where ’+’ indicates the concurrent issuing of two actions, per bounds on certain actions. Ex: if our goal contains two
proving that the ordering of concurrent actions has no effect fighter units which are trained at a barracks, we know that we
on the resulting state. We can also apply this argument it- need to produce at most two barracks. Since it is difficult to
eratively for subsets larger than two actions. Based on this pre-compute the optimal number of worker and supply units
insight and the “earliest execution” property of optimal ac- for a given goal in this fashion, higher bounds are placed on
tion sequences we discussed in the previous subsection, we them to ensure optimal numbers can be produced.
can therefore impose a single ordering on simultaneous ac-
tions to to eliminate the need for iterating over all possible Macro Actions
sequences of concurrent actions from a given state. Macro actions (also called options in reinforcement learn-
ing) have proven useful in speeding up search and plan-
Heuristics ning through incorporating domain specific knowledge (Iba
Our depth-first branch and bound algorithm allows us to 1989). While these actions can be learned (Stolle and Precup
prune nodes based on heuristic evaluations of the path length 2002), we have simply hand-created several macro actions
left to our goal. Line 13 of Algorithm 1 shows that we by inspecting build orders used by professional players. Our
can prune a child node if its length so far plus its heuris- macros all take the form of doubling existing actions which
tic evaluation is less than the upper bound. If our heuristic is are commonly executed in sequence. For example: profes-
admissible, this guarantees that our computed solution will sional players often build worker or fighter units in bunches,
be optimal. We use the following admissible lower-bound rather than one at a time. By creating macro actions such
heuristics to prune our search: as these we cut the depth of search dramatically while main-
- LandmarkLowerBound(S,G) — StarCraft’s tech tree taining close to time-optimal makespans. To implement this,
imposes many prerequisites on actions. These actions are for each action we associate a repetition value K so that only
known in the search literature as landmarks. Given this se- K actions in a row of this type are allowed. The effects of
quence of non-concurrent landmark actions, we sum the in- introducing macro actions can be seen in Figure 1.
dividual durations of actions not yet created to form an ad-
missible lower bound for our search. Experiments
- ResourceGoalBound(S,G) — Summing the total con- Experiments were conducted to compare build orders used
sumed resource cost of units in a goal gives us a lower bound by professional StarCraft players to those produced by our
on the resources required to construct the goal optimally. planner. Although our planner is capable of planning for
Performing a quick search to determine the makespan of each race, we limited our tests to Protoss players in order
producing only these resources is an admissible heuristic. to avoid any discrepancies caused by using build orders of
We can then take the maximum of these three heuristics as different races. 100 replays were chosen from various repos-
our heuristic value h. The heuristic used as an upper bound itories online, 35 of which feature professional players Bisu,

4
A) CPU time statistics for search without macro actions:
Algorithm 2 Compare Build Order 1 1
0.9 0.9
Require: BuildOrder B, TimeLimit t, Increment Time i 0.8 0.8
1: procedure C OMPARE B UILD O RDER(B,t,i) 0.7 0.7
0.6 density 0.6 density
2: S ← Initial StarCraft State 0.5 distribution 0.5 distribution
3: SearchPlan ← DFBB(S,GetGoal(B, 0, ∞),t) 0.4
75th perc.: 0.08%
0.4
75th perc.: 13.06%
0.3 0.3
4: if SearchPlan.timeElapsed ≤ t then 0.2
90th perc.: 1.50%
0.2
90th perc.: 17.86%
5: return MakeSpan(SearchPlan) / MakeSpan(B) 0.1 0.1
0 0
6: else 0 2 4 6 8 10 12 14 16 18 20 22 24 0 2 4 6 8 10 12 14 16 18 20 22 24
7: inc ← i opt(120) CPU time / makespan (%) opt(120) CPU time / makespan (%)
8: SearchPlan ← ∅
9: while inc ≤ MakeSpan(B) do B) CPU time statistics for search with macro actions:
10: IncPlan ← DFBB(S,GetGoal(B,inc-i,inc),t) 1 1
11: if IncPlan.timeElapsed ≥ t then 0.9 0.9
0.8 0.8
12: return failure 0.7 0.7
13: else 0.6 density 0.6 density
0.5 distribution 0.5 distribution
14: SearchPlan.append(IncPlan) 0.4 0.4
75th perc.: 0.01% 75th perc.: 8.18%
15: S ← S.execute(IncPlan) 0.3
90th perc.: 0.02%
0.3
90th perc.: 9.99%
0.2 0.2
16: inc ← inc + i 0.1 0.1
17: end if 0 0
0 2 4 6 8 10 12 14 16 18 20 22 24 0 2 4 6 8 10 12 14 16 18 20 22 24
18: end while
app(120) CPU time / makespan (%) app(120) CPU time / makespan (%)
19: return MakeSpan(SearchPlan) / MakeSpan(B)
20: end if
Figure 2: CPU time statistics for search without (A), and
21: end procedure
with (B) macro actions at 120s increments. Shown are den-
sities and cumulative distributions of CPU time/makespan
ratios in % and percentiles for professional game data points
Stork, Kal, and White-Ra. The remaining replays were taken with player makespans 0..249s (left) and 250..500s (right).
from high level tournaments such as World Cyber Games. E.g. the top-left graph indicates that 90% of the time, the
runtime is only 1.5% of the makespan, i.e. 98.5% of the CPU
The BWAPI StarCraft programming interface was used
time in the early game can be used for other tasks.
to analyze and extract the actions performed by the profes-
sional players. Every 500 frames (21s) the build order im-
plemented by the player (from the start of the game) was have changed their mind or re-planned at various stages of
extracted and written to a file. Build orders were contin- their build order. It is however the best possible comparison
ually extracted until either 10000 frames (7m) had passed, without having access to a professional player to implement
or until one of the player’s units had died. A total of 520 build orders during the experiment.
unique build orders were extract this way. We would like to Figures 2 (time statistics) and 3 (makespan statistics) dis-
have used more data for further confidence, however the pro- play the results of these experiments, from which we can
cess of finding quality replays and manually extracting the conclude our planner produces build orders with comparable
data was quite time consuming. Though our planner is ca- makespans while consuming few CPU resources. Results
pable of planning from any state of the game, the beginning for 60s incremental search were similar to 120s (with less
stages were chosen as it was too difficult to extract meaning- CPU usage) and were omitted for space. Results grouped by
ful build orders from later points in the game due to the on- makespan to show effects of more complex searches.
going combat. To extract goals from professional build or-
ders, we construct a function GetGoal(B,ts ,te ) which given Use in StarCraft Playing Agents
a professional build order sequence B, a start time ts and Our planner (with macro actions) was incorporated into our
an end time te computes a goal which contains all resources StarCraft playing agent (name removed, written in C++ with
produced by actions issued in B between ts and te . BWAPI) which was previously a participant the 2010 AIIDE
Tests were performed on each build order with the method StarCraft AI Competition. When given expert knowledge
described in Algorithm 2 with both optimal (opt) and macro goals, the agent was capable of planning to the goal in real
action (app) search. First with t = 60s and i = 15s, sec- time, executing the build order, and subsequently defeating
ond with t = 120s and i = 30s. This incremental tactic is some amateur level players, as well as the built-in StarCraft
believed to be similar in nature to how professionals re-plan computer AI. The specific results are omitted since for this
at various stages of play, however it is impossible be certain paper we are not concerned with the strength of the overall
without access to professionally labeled data sets (for which agent, but with showing that our build order planning sys-
none exist). We claim that build orders produced by this tem works in a real world competitive setting, something no
system are “real-time” or “online” since they consume far existing method has accomplished.
less CPU time than the durations of the makespans they pro-
duce. Agents can implement the current increment while it Conclusion and Future Work
plans the next. It should be noted that this experiment is in- In this paper we have presented heuristics and abstractions
deed biased against the professional player, since they may that reduce the search effort for solving build order problems

5
A) Planning without macro actions, 120s increments, plan quality relative to professional player makespans [opt(120)]
opt(120) makespan / pro plan makespan

1.4 0.3 0.3

1 1
1.3 0.25 0.9 0.25 0.9
density density
distribution 0.8 distribution 0.8
1.2

distribution(x)

distribution(x)
0.2 0.7 0.2 0.7

density(x)

density(x)
75th perc.: 0.97 0.6 75th perc.: 1.04 0.6
1.1
0.15 90th perc.: 1.00 0.5 0.15 90th perc.: 1.09 0.5
1
0.4 0.4
0.1 0.1
0.9 0.3 0.3
0.05 0.2 0.05 0.2
0.8 0.1 0.1
0.7 0 0 0 0
0 50 100 150 200 250 300 350 400 450 500 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5
pro plan makespan (seconds) opt(120) makespan / pro makespan opt(120) makespan / pro makespan

B) Planning with macro actions, 120s increments, plan quality relative to professional player plan makespans [app(120)]
app(120) makespan / pro plan makespan

1.3 0.3 0.3

1 1
1.25
0.25 0.9 0.25 0.9
1.2 density density
distribution 0.8 distribution 0.8
1.15

distribution(x)

distribution(x)
0.2 0.7 0.2 0.7
density(x)

density(x)
1.1 75th perc.: 1.00 0.6 75th perc.: 1.03 0.6
1.05 0.15 90th perc.: 1.00 0.5 0.15 90th perc.: 1.08 0.5
1 0.4 0.4
0.1 0.1
0.95 0.3 0.3
0.9 0.05 0.2 0.05 0.2
0.85 0.1 0.1
0.8 0 0 0 0
0 50 100 150 200 250 300 350 400 450 500 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5
pro plan makespan (seconds) app(120) makespan / pro makespan app(120) makespan / pro makespan

Figure 3: Makespan statistics for search without (A) and with (B) macro actions. Goals extracted by looking ahead 120s
relative to professional player plan makespans. Shown are scatter plots of the makespan ratios (left), ratio densities, cumulative
distributions, and percentiles for early game scenarios (pro makespan 0..249s, center) and early-mid game scenarios (250..500s).
E.g. the top-middle graph indicates that 90% of the time, our planner produces makespans that match those of professionals.

in StarCraft significantly while producing near optimal plans References

in real-time. We have shown macro actions, breadth limiting Branquinho, A., and Lopes, C. 2010. Planning for resource
techniques, income abstractions, and multiple lower bound production in real-time strategy games based on partial order
heuristics which reduce search spaces exponentially. A fast planning, search and learning. In Systems Man and Cybernet-
forwarding approach was introduced which replaced the null ics (SMC), 2010 IEEE International Conference on, 4205–4211.
action, cut down on simulation time, and eliminated the need IEEE.
to solve the subset action selection problem. Buro, M., and Kovarsky, A. 2007. Concurrent action selection
We have shown that with all of these techniques, our with shared fluents. In AAAI Vancouver, Canada.
planner is capable of producing plans in real-time which BWAPI. 2011. BWAPI: An API for interacting with StarCraft:
are comparable to professional StarCraft players, many of Broodwar. http://code.google.com/p/bwapi/.
which have played the game for more than 10 years. We Chan, H.; Fern, A.; Ray, S.; Wilson, N.; and Ventura, C. 2007a.
have also incorporated our solution into an actual game- Extending online planning for resource production in real-time
playing agent which is capable of defeating non-trivial op- strategy games with search.
ponents, eliminating the need for the tedious hard-coding of Chan, H.; Fern, A.; Ray, S.; Wilson, N.; and Ventura, C. 2007b.
build orders often present in similar agents. Online planning for resource production in real-time strategy
In the future we plan to improve our techniques for deal- games. In Proceedings of the International Conference on Au-
tomated Planning and Scheduling, Providence, Rhode Island.
ing with more complex search goals by learning macro ac-
tions, and adding analysis of income data to further restrict Iba, G. 1989. A heuristic approach to the discovery of macro-
search. Our abstractions (such as income) can also be im- operators. Machine Learning 3(4):285–317.
proved by adjusting various parameters to reflect game con- Kovarsky, A., and Buro, M. 2006. A first look at buildorder
text or specific environments. Our ultimate goal for the fu- optimization in real-time strategy games. In Proceedings of the
GameOn Conference, 18–22. Citeseer.
ture of our planning system is the incorporation of strategic
building placement and adversarial search. This would al- ORTS. 2010. ORTS - A Free Software RTS Game Engine.
low for goals such as defending a base or defeating enemy http://skatgame.net/mburo/orts/.
forces, eliminating the need for expert knowledge goals to Stolle, M., and Precup, D. 2002. Learning options in reinforce-
be given to our planner, greatly improving the strength and ment learning. Abstraction, Reformulation, and Approximation
adaptability of a StarCraft playing agent. 212–223.

View publication stats

500-201 Alteon TrainingManual 30.5.x
No ratings yet
500-201 Alteon TrainingManual 30.5.x
126 pages
Deploying and Managing Exchange Server 2013 HA
No ratings yet
Deploying and Managing Exchange Server 2013 HA
265 pages
Image Analysis Classification and Change Detection in Remote Sensing With Algorithms For Python 4th Edition Morton John Canty Instant Download
100% (1)
Image Analysis Classification and Change Detection in Remote Sensing With Algorithms For Python 4th Edition Morton John Canty Instant Download
90 pages
StarCraft 2 Protoss Build Orders and Guides
100% (1)
StarCraft 2 Protoss Build Orders and Guides
34 pages
(Ebook) Game AI Pro 360: Guide To Tactics and Strategy by Steve Rabin (Author) ISBN 9780367150945, 0367150948install Download
No ratings yet
(Ebook) Game AI Pro 360: Guide To Tactics and Strategy by Steve Rabin (Author) ISBN 9780367150945, 0367150948install Download
59 pages
ADI - Reference Manual
No ratings yet
ADI - Reference Manual
690 pages
Layout Barranquilla - DWG v2
No ratings yet
Layout Barranquilla - DWG v2
1 page
Playing Tetris With Reinforcement Learning
No ratings yet
Playing Tetris With Reinforcement Learning
10 pages
Ecgg15 Chapter-Rts Ai
No ratings yet
Ecgg15 Chapter-Rts Ai
16 pages
RTS AI Problems and Techniques
No ratings yet
RTS AI Problems and Techniques
12 pages
GDC22 AgeIV ML Trials and Tribulations
No ratings yet
GDC22 AgeIV ML Trials and Tribulations
69 pages
2777-Article Text-5552-1-10-20180316
No ratings yet
2777-Article Text-5552-1-10-20180316
9 pages
Cig 2009 Mario
No ratings yet
Cig 2009 Mario
6 pages
RTS and AI - Article
No ratings yet
RTS and AI - Article
2 pages
Abbott Plum A+ PDF
No ratings yet
Abbott Plum A+ PDF
85 pages
2021 SharePoint QMS Guidebook
No ratings yet
2021 SharePoint QMS Guidebook
29 pages
Journal Pone 0264550
No ratings yet
Journal Pone 0264550
18 pages
TStarBot-X An Open-Sourced and Comprehensive Study
No ratings yet
TStarBot-X An Open-Sourced and Comprehensive Study
26 pages
Space Empires - Close Encounters Players Handbook
No ratings yet
Space Empires - Close Encounters Players Handbook
30 pages
Starcraft
No ratings yet
Starcraft
21 pages
StarCraft II A New Challenge For Reinforcement Learning
100% (1)
StarCraft II A New Challenge For Reinforcement Learning
20 pages
PH DThesis Lim Yew Jin
No ratings yet
PH DThesis Lim Yew Jin
192 pages
MP MSG 184 07
No ratings yet
MP MSG 184 07
8 pages
How To Document Infrastructure - Part 1 Fibre Networks v1 - 1022016
No ratings yet
How To Document Infrastructure - Part 1 Fibre Networks v1 - 1022016
16 pages
A Survey of Real-Time Strategy Game AI Research and Competition in
No ratings yet
A Survey of Real-Time Strategy Game AI Research and Competition in
19 pages
Brims 04
No ratings yet
Brims 04
8 pages
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning
No ratings yet
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning
12 pages
Civ II Official Strategy Guide
No ratings yet
Civ II Official Strategy Guide
388 pages
SNR Sons College (Autonomous) Department of Computer Applications (Ug) BCA (Academic Year 2015 - 2016 and Onwards) Scheme of Examination
No ratings yet
SNR Sons College (Autonomous) Department of Computer Applications (Ug) BCA (Academic Year 2015 - 2016 and Onwards) Scheme of Examination
72 pages
Three-Dimensional Attention Transformer For State Evaluation in Real-Time Strategy Games
No ratings yet
Three-Dimensional Attention Transformer For State Evaluation in Real-Time Strategy Games
9 pages
Classical Planning
No ratings yet
Classical Planning
19 pages
Strategy Guide To StarCraft The Boardgame
No ratings yet
Strategy Guide To StarCraft The Boardgame
42 pages
Starcraft Ii: A New Challenge For Reinforcement Learning
No ratings yet
Starcraft Ii: A New Challenge For Reinforcement Learning
19 pages
Aerospace 10 00133
No ratings yet
Aerospace 10 00133
21 pages
Gadgets
No ratings yet
Gadgets
4 pages
Abbott Hospira Plum A 116 Manual PDF
No ratings yet
Abbott Hospira Plum A 116 Manual PDF
146 pages
B125 & B105 - Brochure
No ratings yet
B125 & B105 - Brochure
8 pages
IS VII & VIII 2018 SYLLABUS-compressed
No ratings yet
IS VII & VIII 2018 SYLLABUS-compressed
78 pages
A Neural-Evolutionary Model For Case-Based Plannin
No ratings yet
A Neural-Evolutionary Model For Case-Based Plannin
11 pages
Niel Et Al. - 2018 - Hierarchical Reinforcement Learning For Real-Time
No ratings yet
Niel Et Al. - 2018 - Hierarchical Reinforcement Learning For Real-Time
9 pages
IC Cyber Security Risk Assessment Report 11680 WORD
No ratings yet
IC Cyber Security Risk Assessment Report 11680 WORD
12 pages
AI StefanWeijers
No ratings yet
AI StefanWeijers
8 pages
UNIT Pegasus-Spyware (TV)
No ratings yet
UNIT Pegasus-Spyware (TV)
10 pages
A Data Mining Approach To Strategy Prediction
No ratings yet
A Data Mining Approach To Strategy Prediction
8 pages
AAMI FDA Summit Report PDF
No ratings yet
AAMI FDA Summit Report PDF
48 pages
Avery 2011 Computational
No ratings yet
Avery 2011 Computational
8 pages
Behaviour Oriented Design For Real-Time PDF
No ratings yet
Behaviour Oriented Design For Real-Time PDF
8 pages
Pedido 41 - 3
No ratings yet
Pedido 41 - 3
6 pages
NNFQ and CNNFQ
No ratings yet
NNFQ and CNNFQ
6 pages
Deep RTS
No ratings yet
Deep RTS
8 pages
Szehr 2021 B
No ratings yet
Szehr 2021 B
10 pages
DS Lab 2
No ratings yet
DS Lab 2
8 pages
Oberon 2
No ratings yet
Oberon 2
327 pages
MEMORIZACIÓN Métodos para Piano Recopilación de Chuan C. Chuang 1
No ratings yet
MEMORIZACIÓN Métodos para Piano Recopilación de Chuan C. Chuang 1
22 pages
Starcraft Presentation PDF
No ratings yet
Starcraft Presentation PDF
33 pages
Influence Maps
No ratings yet
Influence Maps
111 pages
Portfolio Greedy Search and Simulation For Large-Scale Combat in Starcraft
No ratings yet
Portfolio Greedy Search and Simulation For Large-Scale Combat in Starcraft
8 pages
Object Oriented SE
No ratings yet
Object Oriented SE
73 pages
Single Phase Smart Meter Using DLMS/COSEM Application Data
No ratings yet
Single Phase Smart Meter Using DLMS/COSEM Application Data
2 pages
M Cts Vs Minimax
No ratings yet
M Cts Vs Minimax
58 pages
Course Outline CSCD 607 Advanced Computer Networks
No ratings yet
Course Outline CSCD 607 Advanced Computer Networks
9 pages
Predicting Player Strategies in Real Time Strategy Games
No ratings yet
Predicting Player Strategies in Real Time Strategy Games
54 pages
MID 1 Assignment
No ratings yet
MID 1 Assignment
2 pages
SIP5 - 7SA SD 82 84 86 7SL 82 86 SJ 86 - V08.40 - Manual - C010 E - en
0% (1)
SIP5 - 7SA SD 82 84 86 7SL 82 86 SJ 86 - V08.40 - Manual - C010 E - en
2,200 pages
Examen Final Ccna1 v5.0
No ratings yet
Examen Final Ccna1 v5.0
23 pages
Turbo VPN For PC
No ratings yet
Turbo VPN For PC
7 pages
Enrollment Guidelines For Session 2024 25-1-8-3
No ratings yet
Enrollment Guidelines For Session 2024 25-1-8-3
1 page
MOBA A New Arena For Game AI
No ratings yet
MOBA A New Arena For Game AI
8 pages
Numerical Methods For Engineers
80% (10)
Numerical Methods For Engineers
179 pages
Juegos de Estrategia
No ratings yet
Juegos de Estrategia
11 pages
Adaptivity Challenges in Games and Simulations A Survey
No ratings yet
Adaptivity Challenges in Games and Simulations A Survey
15 pages
Lara 13 Review
No ratings yet
Lara 13 Review
8 pages
Group 4 Tower Defnse
No ratings yet
Group 4 Tower Defnse
16 pages
ImprovinImproving Adaptive Game AI With Evolutionary Learningg Adaptive Game AI With Evolutionary Learning
No ratings yet
ImprovinImproving Adaptive Game AI With Evolutionary Learningg Adaptive Game AI With Evolutionary Learning
8 pages
A. Introduction
No ratings yet
A. Introduction
14 pages
B125 B105 Clinical Reference Guide - DOC2204111
No ratings yet
B125 B105 Clinical Reference Guide - DOC2204111
52 pages
Finite Difference Method
No ratings yet
Finite Difference Method
7 pages
A Is Tar Craft
No ratings yet
A Is Tar Craft
6 pages
A Bayesian Model For Plan Recognition in Rts Games Applied To Starcraft
No ratings yet
A Bayesian Model For Plan Recognition in Rts Games Applied To Starcraft
6 pages
Research On Operational Model of PUBG
No ratings yet
Research On Operational Model of PUBG
5 pages
Senior Trujillo 2020
No ratings yet
Senior Trujillo 2020
17 pages
Learning Micro-Management Skills in RTS Games by Imitating Experts
No ratings yet
Learning Micro-Management Skills in RTS Games by Imitating Experts
7 pages
1) Building A Player Strategy Model by Analyzing Replays of Real-Time Strategy Games - 2008
No ratings yet
1) Building A Player Strategy Model by Analyzing Replays of Real-Time Strategy Games - 2008
6 pages
ZF Arıza Kodları PDF
No ratings yet
ZF Arıza Kodları PDF
54 pages
Game Playing in Artificial Intelligence
No ratings yet
Game Playing in Artificial Intelligence
17 pages
Cross-Entropy Method For Reinforcement Learning
No ratings yet
Cross-Entropy Method For Reinforcement Learning
24 pages
Coevolving Build-Orders For RTS Games: Christopher A. Ballinger, Member, IEEE, Sushil J. Louis, Member, IEEE
No ratings yet
Coevolving Build-Orders For RTS Games: Christopher A. Ballinger, Member, IEEE, Sushil J. Louis, Member, IEEE
4 pages
Basics of Video Editing
No ratings yet
Basics of Video Editing
7 pages
B105 Patient Monitor: Care With Confidence
No ratings yet
B105 Patient Monitor: Care With Confidence
5 pages
Real
No ratings yet
Real
1 page
Deep Imitation Learning For Playing Real Time Strategy Games
No ratings yet
Deep Imitation Learning For Playing Real Time Strategy Games
6 pages
Technical Specifications of Medical Devices For The Case Management of COVID-19 in Healthcare Settings
No ratings yet
Technical Specifications of Medical Devices For The Case Management of COVID-19 in Healthcare Settings
8 pages
Plum A+ Infusion System Product Overview
No ratings yet
Plum A+ Infusion System Product Overview
1 page
FIRST SUMMATIVE TEST-Q2 (Week 1 and Week 2)
No ratings yet
FIRST SUMMATIVE TEST-Q2 (Week 1 and Week 2)
3 pages
Instituto Tecnol Ogico y de Estudios Superiores de Monterrey
No ratings yet
Instituto Tecnol Ogico y de Estudios Superiores de Monterrey
37 pages
Web Vent PCV A
No ratings yet
Web Vent PCV A
2 pages
ADF Certification - Study Guide - Topic Notes
No ratings yet
ADF Certification - Study Guide - Topic Notes
9 pages
Entity-Component System Design Patterns: Definitive Reference for Developers and Engineers
From Everand
Entity-Component System Design Patterns: Definitive Reference for Developers and Engineers
Richard Johnson
1/5 (1)
Mastering the Art of x86 Assembly Programming: Unlocking the Secrets of Expert-Level Skills
From Everand
Mastering the Art of x86 Assembly Programming: Unlocking the Secrets of Expert-Level Skills
Steve Jones
No ratings yet
Cisco Packet Tracer Implementation: Building and Configuring Networks: 1, #1
From Everand
Cisco Packet Tracer Implementation: Building and Configuring Networks: 1, #1
S. R. Jena
No ratings yet

Build Order Optimization in Starcraft.: January 2011

Uploaded by

Build Order Optimization in Starcraft.: January 2011

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Build Order Optimization in StarCraft.

Conference Paper · January 2011

The user has requested enhancement of the downloaded file.

David Churchill and Michael Buro

co-NP hard to decide whether these actions when sequential-

1.4 0.3 0.3

1.3 0.3 0.3

in StarCraft significantly while producing near optimal plans References

View publication stats

You might also like