0% found this document useful (0 votes)

69 views14 pages

Classification of Player Roles in The Team-Based Multi-Player Game Dota 2

This document discusses classifying player roles in Dota 2 using machine learning. It aims to classify players into specific roles like carry, support, etc. based on their in-game behavior, rather than individual performance. It explores using attributes extracted from replay files and different classification algorithms. Logistic regression performed best, classifying roles with over 74% accuracy on a dataset of 708 players. This research could help understand player behavior and team strategies in Dota 2.

Uploaded by

Jair Polanco

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

69 views14 pages

Classification of Player Roles in The Team-Based Multi-Player Game Dota 2

Uploaded by

Jair Polanco

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Classification of Player Roles in the Team-Based

Multi-player Game Dota 2

Christoph Eggert, Marc Herrlich, Jan Smeddinck, and Rainer Malaka

Digital Media Lab, TZI, University of Bremen, Germany

Abstract. Computer games are big business, which is also reﬂected

in the growing interest in competitive gaming, the so-called electronic
sports. Multi-player online battle arena games are among the most suc-
cessful games in this regard. In order to execute complex team-based
strategies, players take on very specific roles within a team. This paper
investigates the applicability of supervised machine learning to classify-
ing player behavior in terms of specific and commonly accepted but not
formally well-defined roles within a team of players of the game Dota 2.
We provide an in-depth discussion and novel approaches for constructing
complex attributes from low-level data extracted from replay files. Using
attribute evaluation techniques, we are able to reduce a larger set of can-
didate attributes down to a manageable number. Based on this resulting
set of attributes, we compare and discuss the performance of a variety of
supervised classification algorithms. Our results with a data set of 708
labeled players see logistic regression as the overall most stable and best
performing classifier.

Keywords: multi-player games, player roles, classiﬁcation.

1 Introduction
Digital games have become an important social, cultural, and economical factor.
Online multi-player games attract especially large player bases and big audi-
ences. Computer games have also matched many traditional media in terms of
total revenue [14]. This is also reﬂected in the growing interest in competitive
gaming, the so-called electronic sports (eSports). Stemming from it’s early roots
in the 1990s, it has only been in recent years that eSports has been showing signs
of becoming a mainstream phenomenon. Parallel developments, e.g. the success
of game-related online videos in the form of so-called Let’s Plays and live broad-
casting, also play an important role as an indicator and multiplier for societal
impact. Game tournaments award signiﬁcant prize money and there are players
and teams that can make a living from playing games. Multi-player online bat-
tle arena (MOBA) games are among the most popular and successful games in
this regard. Due to their popularity, their competitive nature, as well as their
complex team-based strategies and tactics, they share many similarities with
traditional physical team sports, and akin to the recent rise of data analysis in
physical sports, data analysis and machine learning begin to play an important
role for the development and analysis of digital games.

c IFIP International Federation for Information Processing 2015
K. Chorianopoulos et al. (Eds.): ICEC 2015, LNCS 9353, pp. 112–125, 2015.
DOI: 10.1007/978-3-319-24589-8_9
Classiﬁcation of Player Roles in Dota 2 113

In this paper we investigate the applicability and performance of supervised

machine learning (ML) to classify player behavior in terms of specific roles within
a team of players of the game Dota 2, a popular contemporary MOBA game.
Such information could be useful for game designers to better understand how
their game design influences emergent gameplay and player behavior but also
for players, both casual and professional, who want to analyze their own per-
formance or who want to learn from others. It could also support casters and
moderators in commentating and presenting matches. Furthermore, this research
might hold implications for social and other research concerned with (human be-
havior in) games. While ML has been applied to games and traditional sports,
most works are either interested in questions like spatial behavior, trying to pre-
dict the match outcome, or otherwise trying to correlate performance to certain
events or behaviors. In contrast, we aim at building a classifier that is largely
independent of individual player performance and that is also not tied to the
overall match outcome but that is able to identify a player’s role in terms of the
non-formally defined roles established as common grounds within the Dota 2 or
MOBA community.
This paper contributes to the state of the art in several ways: We provide an
in-depth discussion and novel approaches regarding the construction of complex
attributes from low-level data extracted from Dota 2 replay files, together with
an evaluation of these attributes with respect to different classifiers. Based on
the resulting reduced set of attributes, we compare and discuss the performance
of a range of supervised classification algorithms, including logistic regression,
random forest decision trees, support vector machines (in combination with Se-
quential Minimal Optimization), naive Bayes and Bayesian networks, classifying
both with a newly established larger set of player roles, as well as with a reduced
set inspired by related work [5].

2 Related Work

We restrict the discussion to three main areas: traditional physical sports, com-
parable works that focus on different games or genres, and works that also focus
on Dota 2, yet have different classification goals.
Recognizing behaviors in traditional sports typically requires some form of
image processing or other recognition techniques to extract usable data. While
there is no need for image processing in our case since positional information
is directly available, there are similarities that might be applicable to MOBA
games. Tovinkere et al. [16] make use of the trajectory of the ball in soccer
games to detect events. Combined with player positions and a rule-based-system,
which was built with domain-specific knowledge, this leads to a large number
of detectable events. A very similar approach is presented by Li et al. [10] for
ice hockey games. A notable domain difference compared to works for soccer is
that not only the position of the goal is considered, but also the moment when
the blue line is crossed. In basketball games, as presented by Fu et al. [4], the
actual tracking of the ball is less important for certain tactics. In order to detect
114 C. Eggert et al.

offensive strategies they make use of the fact that defenders are closer to their
basket than the offensive team to predict ball possession. The strategy is then
recognized by comparing player positions relative to each other with expected
patterns. While in MOBA games there is no ball or puck that could be tracked,
using relative player positions might be applied to certain events (e.g. team fights;
see section 6). In addition to positional information, Zhu et al. [21] also utilize
information like score boards and game time that are typically on display during
TV broadcasts to improve their predictions for soccer games. Dota 2 replays (see
section 5) contain similar information, e.g. amounts of damage, kills, healing etc.
that could be combined with positional information.
An approach to ML in computer games in general was proposed by Drachen et
al. [2]. They suggest using unsupervised learning algorithms, specifically k-means
and Simplex Volume Maximization, to cluster player behavioral data. They use
two very different games for a proof of concept. Knowledge of the game design is
used with both titles to define the attributes used for the algorithms. This also
means that the chosen attributes strongly differ. In contrast to our work, their
goal is specifically to aid developers in terms of general game design, for example
by finding underused mechanics. Because of that a few game specific attributes
are selected and unsupervised learning is applicable. Our approach targets be-
havior that is not tied as directly to just one or two mechanics. Therefore we
expect that we need a larger set of attributes and employ supervised learning
methods. Other notable works are based on the real-time strategy game series
Starcraft. Liu et al. [11] target the identification of a specific player from Star-
craft 2 replays by his or her personal play style. In contrast, we are looking
for players behaving according to a certain common role, which can be seen
as trying to remove personal play style and performance as noise. Synnaeve et
al. [15] present a method for an adaptive artificial intelligence (AI) in Starcraft:
Brood War, which uses similar features. Instead of units, their method collects
data on the produced buildings to recognize build orders. The prediction is then
made with a Bayesian model. Their work in turn is partly based on the works of
Weber et al. [17], who use produced buildings, units and upgrades as attributes.
In MOBA games we cannot rely on such attributes alone. The most similar at-
tributes to the production in Starcraft would be the items players are buying for
their hero. However, unlike Starcraft, players almost never have a fixed income,
which has a big influence on the items players are buying. Items are also often
more connected to specific heroes than to player roles.
A notable work on Dota 2 is presented by Gao et al. [5]. They target the
identification of both the heroes that players are playing, and the role they are
taking. They define a basic model with three roles a player can fulfill that are
predicted with an accuracy of about 74%. For comparison in addition to our
more complex set of classes, we also applied our attributes to the reduced set of
roles by Gao et al. (see section 7) with signification improvements in terms of
accuracy over their results. It must be noted, though, that we had no access to
their test data. Other works about Dota 2 are mostly based on skill-related ques-
tions or social studies. For example, Pobiedina et al. [13] come to the conclusion
Classification of Player Roles in Dota 2 115

that the national diversity of players as well as the number of friends playing
together has a significant influence on team success. Nuangjumnonga et al. [12]
research correlations between the leadership behavior (such as authoritarian and
democratic) and the roles the players are fulfilling in the game. Notably they use
the roles Carry, Support and Ganker, which we will also cover in section 6. Yang
et al. [20] identify combat patterns to predict game outcomes with an accuracy
of 80%. More recently, another contribution by Drachen et al. [3] investigated
skill-based differences in the spatio-temporal team behavior of Dota 2 matches.
They find higher-skilled players to move more actively and closer to their team-
mates around the map. For collecting positional information they make use of
a spatial division of the Dota 2 map into zones, looking at zone changes, which
shares some similarities with our method to detect early game movement (see
section 6) that focuses on the number of entered zones.

3 Background: Dota 2

Based on the popular modification DotA (Defense of the Ancients) for the game
Warcraft 3, Dota 2 is is a typical example of the MOBA genre. Most popular
MOBAs (including e.g. League of Legends, Heroes of Newerth, Smite, etc.) are
identical in terms of the basic gameplay but differ in specific details, e.g. heroes,
skills, additional mechanics, graphics, maps, etc.. Dota 2 is played in teams of
five. Each player controls a hero character with specific strengths and weak-
nesses, abilities, matching items and so on that is picked from a large pool at
the beginning of each round. The choice of a specific hero is an important aspect
of the game. Teams need a balance of heroes with different abilities that are able
to fulfill certain roles with respect to the team tactics and strategies, resembling
traditional team-based sports. Although the core setup appears simple, it can
lead to a large variety of complex team-based behaviors, roles, and strategies.
Heroes develop their abilities in a heterogeneous manner and become stronger
throughout the game by collecting experience points and gold, which the players
can invest into items that support the heros abilities or provide other advantages

Fig. 1. Left: Dota 2 map layout with lane annotations Mid: Areas used for determining
the player lane Right: Area masks to reduce false positives for early gank detection
116 C. Eggert et al.

to the team. As shown in figure 1 (left), the map is split into three lanes. Each
lane has three defensive towers (green / red squares in figure 1) (e.g. left) that
constantly attack enemies within their range. The goal of the game is to destroy
the enemys main building (the Ancient ) after destroying all towers leading to
it on a lane. Along each of the lanes, a wave of non-player characters (NPC),
called creeps, runs from the base of each team to the base of the enemy. Creeps
are important sources for gold and experience and can be utilized for attacks.
Finishing blows to them are called last hits and are often used as an efficiency
benchmark. Additionally, there are camps of neutral creeps (hostile towards both
factions), in the map areas marked as Dire Jungle and Radiant Jungle in figure 1
(left). Another possibility to earn gold and experience is killing enemy heroes.
Players lose some gold with each death and have to wait a certain amount of
time until their hero is revived at their base. Surprise attacks on enemy players
from behind while they are dealing with creeps are a common tactic. In Dota 2
this behavior is called ganking. Each match is separated into three phases, called
the early game, mid game and late game. There are no exact time thresholds
for these phases and transitions (based on the behavior of the players and the
strength of their heroes) can be subtle. The early game typically lasts for about
10-15 minutes. In this phase players mostly stay on their side of the map and
collect experience points and gold by killing creeps. The mid game is the game
phase that differs the most in every match. It depends heavily on the heroes
chosen by each team. At some point in the game, heroes get so strong that even
towers are no real threat to them anymore and heroes will be fully developed
and equipped to their maximum abilities. This phase is called the late game.

4 Player Roles

There are recurring roles (or play styles) that players choose and try to follow
for a specific game. These roles are not formally defined but have established
themselves informally among players. It is important to note that these roles de-
scribe a different facet of play than classic player type classifications (e.g. after
Bartle [1]) which aim to classify expressed character traits of the players and
were designed to match role-play style games. Our selection and characterization
of roles is based on a comparison of online guides, videos and commentary of
professional players and commentators. Definitions and naming conventions of
player roles will in any case differ slightly among the player base and shift over
time as the game evolves, constituting another challenge for ML applications in
this area. We do not view this mutability as a limiting factor but rather as a
realistic constraint. We provide characterizations and in-depth explanations on
the attribute calculation and selection (section 6) in order to facilitate repro-
ducibility and comparability with other classification schemes.
In the end, we isolated nine player roles for the main ML task, which strike a
balance between covering common play styles in great detail while leaving out
some exotic styles which are rarely observed or are minor variations of other
styles. Additionally, we employ a set containing only three rather general roles
Classification of Player Roles in Dota 2 117

that have been used in other work for comparison. In general, we tried to avoid
performance-based characterizations or attributes as much as possible as we
were not interested in distinguishing bad from good players. The isolated roles
were: Carries - who are usually weak and need protection early on, but are very
strong in later stages, often deciding games. Carries typically end up with a high
amount of last hits, gold per minute and overall kills, but they can get them
in quite different ways. Therefore, we define two kinds of carries called active
carries and farming carries. Active carries engage enemy players and participate
in team fights to gain experience and gold, while farming carries focus on utilizing
enemy or neutral creeps for character development. Gankers try to waylay enemy
heroes with surprise attacks, sometimes very early in the game. Support players
in different ways try to help other players, sometimes even sacrificing themselves.
We define three kinds of support players to cover different strategies. Babysitter
support players protect teammates (usually a carry), staying very close to them.
In contrast, roaming supports are active around the map and even waylay other
players similar to gankers. However, they still let other team members take the
greater share. Farming supports also take their share of experience and gold.
However, they spend their gold on support items and they avoid interfering with
the carry. Pushers continuously try to clear out enemy towers, thereby pushing
their lane. Feeders are players that somehow get taken advantage of or show very
bad performance during the whole game. This is a special role we added to be
able to separate such players that do not show any useful observable behavior.
Inactive players represent another special class of players that - due to technical
difficulties or other reasons - do not actively participate.

5 Data Collection

Dota 2 games are stored as replay files (replays). Replays contain all low-level
game events that occurred during a game and allow the engine to re-simulate
whole games. This approach has different advantages and disadvantages. It is
very flexible because watching a replay is not limited to watching only one
player’s perspective or watching at the same speed as the original game. How-
ever, this comes with the cost that it is not simply playing back a recording, but
the full game logic simulation has to be processed. For our goal this also means
that, while low-level events and some additional data needed for attribute con-
struction can be read directly from the replays, constructing some attributes will
require significant additional processing (see section 6). We built our attribute
construction processing on top of the Java-based replay parser Clarity by Martin
Schrodt.
Adequately labeling replays requires watching the whole game, sometimes
several times for different players. This is a time-consuming task. Therefore we
designed and implemented a tool to crowd-source the labeling (of the play style)
to the Dota 2 player community. The tool allows anyone to quickly upload
labeled match summaries, based on local replay files, to an online database.
The tool was advertised by calls to the player community through established
118 C. Eggert et al.

community websites and available for download from our website. Players were
free to label whatever games they liked, which could be either their own games
or games taken from other sources like online replay archives. For the second
community call we also asked users to label a specific, seemingly problematic
game (in terms of ambiguous player roles) to gain more insight into the issues
as discussed in section 7. The labeled replays contain a large variety of players
from different skill levels. In addition, we manually labeled a set of replays from
the tournament The International 2014, which contains only replays of highly
professional players. Overall our final data set contained 708 labeled players.

6 Attribute Construction and Evaluation

The full set of attributes we considered for evaluation is presented in table 1.

While some attributes correspond directly to low-level events or summary data,
attributes that capture positional information and fighting behavior require more
complex processing of the replay data. We experimented with different attribute
filters implemented in the Java library WEKA [7] to determine the best set
of attributes using our labeled examples. These include algorithm-independent
subset selections, for example the CfsSubsetEval class based on the works of
Mark Hall [6]; algorithm-dependent subset selections, most notably the Wrap-
perSubsetEval class based on works from Kohavi et al. [9]; and several classes,
such as InfoGainAttributeEval, as presented in the works of Witten et al. [19,
p. 487-492]. Our results with the WrapperSubsetEval class with best-first search
are presented in table 1. We have chosen this algorithm for our final attribute
selection because it resulted in the highest accuracy with our labeled data set.
For classification we selected all attributes that were present in at least four
folds, excluding assists, as this resulted in the overall highest accuracy. In the
following sections we describe the algorithms and heuristics we developed to cal-
culate the attributes that cannot be directly obtained from replay files. They can
be grouped into five rough categories: space and movement, early ganks, team
fights, support items, and damage types. Not all of the attributes we describe
in this section were finally chosen to be used for classification within this work
but they might prove valuable for future works.

6.1 Space and Movement

Player Lane. Many roles depend on the lane (see figure 1) a player is most
active in during the early game. This information also provides the foundation
for other attributes, e.g., the lane partners. Players typically have three main
positions at the beginning of the game: top lane, mid lane, or bottom lane. Ad-
ditionally, the jungle areas can be used, or players can have a roaming position,
meaning that they move around the map instead of staying in a certain area.
The typical areas in which players are positioned most of the time for each lane
according to our observations can be seen in figure 1 (middle). During the very
early game (0:30-6:00) the position of each player is checked every two seconds
Classification of Player Roles in Dota 2 119

Table 1. Attributes evaluated for classification. Attributes marked with + are not di-
rectly available from replays. Attributes marked with * were finally selected for classifi-
cation. Number of Folds shows in how many folds an attribute was selected by WEKA’s
WrapperSubsetEval class using 10-fold cross-validation for logistic regression.

Attribute Number of Folds

KDA Ratio* = (Kills + Assists) / (Deaths + 1) 10
Last Hits* 10
Early Ganks*+ 10
Number of Support Items*+ 10
Damage to Neutral Creeps*+ 10
Damage to Regular Creeps*+ 10
Lane Partners*+ 10
Kills* 9
Experience* 5
Deaths* 5
Assists 5
Team Fight Participation*+ 4
Early Movement (Visited Cells)*+ 4
Damage to Heroes*+ 4
Solo Lane+ 2
Damage to Towers+ 1
Chosen Hero 0
Gold 0

and for each player a region counter is increased if the player is present. Players
that are positioned in one of the three lanes at least half of the time are assigned
to the corresponding lane. If this is not the case, they are ﬂagged as roaming.
Unfortunately, trying to detect the jungle position with the same approach does
not work as these areas would necessarily overlap with the areas of the lanes. In-
stead, the damage that players do to neutral creeps is tracked (see damage types
below). If an empirically determined damage threshold of 6000 is surpassed, the
position is set to jungle regardless of any other positional information.

Lane Partners and Solo Lane. The number of lane partners is determined
by comparing the lane attribute calculated as described above between players.
Players in the roaming or jungle position are always assigned zero lane partners,
while players sharing the top, mid or bottom lane are assigned the correspond-
ing number of their teammates in the same lane. We also included a solo lane
attribute as an alternative to the lane partners. This attribute is always true if
a player is assigned to one of the three main lanes without any teammate and
otherwise false.

Early Movement. Some roles are characterized by how active players are on
the map during the early game. Although this attribute might depend on skill as
120 C. Eggert et al.

is indicated by Drachen et al. [3], this could also be caused by active play styles
being more prevalent in higher-skilled matches. Unfortunately, the movement
activity cannot simply be determined by tracking the total movement of players,
as all characters are usually running back and forth even when they do not
change their general position. For this reason, we make use of the structure of
positional information in Dota 2 replays. Positions in replays are specified by a
128x128 grid and additional offsets. As we do not need the accurate coordinates,
which might also introduce a lot of noise into the classification, we divide grid
positions by ten, effectively resulting in a grid consisting of 13x13 cells. During
the early game, we count the total number of cells that each player visits and
assign it to the respective attribute.

6.2 Early Ganks

Early ganks are a key indicator for aggressive player roles, such as roaming
supports and gankers. We collect every fight that is not considered a team fight
(see below) first. Players are considered to be within the fighting area if they are
within 10% of the map size. We use a time threshold of 5 seconds to find the
end of a fight and an empirically determined damage threshold of 100 to avoid
false positives. Within the collection of fights we detect early gankers based on
the lane attribute described in section 6.1. Based on the assigned lane, we define
extended areas in which players are expected to fight, which can be seen in
figure 1 (right). If a player is participating in a fight that is not taking place
in the expected area for their lane, we increase the corresponding early gank
attribute by one. For players in the roaming position, every fight participation
therefore increases the attribute. Due to expected false positives this attribute
does not necessarily reflect the exact number of early ganks but a large value
should reliably provide a strong indication for them.

6.3 Team Fights

Team fights are typically considered to involve most players of both teams and
often decide the outcome of a match. In a standard match there is usually at
least one player of each team assigned to each of the three main lanes. If both
teams decide to assign both of the remaining two players to the same lane, this
leads to a situation where fights involving six players might happen early on.
However, these types of fights should not be considered team fights according
to the description above. Therefore, we define a minimum of seven players to
participate in a fight to label it as a team fight. In addition, we use empirically
determined spatial, damage and time thresholds to extract team fights from the
attack events contained in the replays files. For each such event a fight entry is
instantiated that at first contains only the two players directly involved in the
event. Additional players are added to the fight if they either take or receive
damage within a radius of 20% of the map size (the team fight zone). Fights are
ended if no corresponding attack events occur for 5 seconds. The total damage
dealt or received within the team fight must surpass a threshold of 2000 to
Classification of Player Roles in Dota 2 121

further reduce false positives. After all team ﬁghts have been counted, each
player is assigned the percentage of team ﬁghts they were involved in as the
corresponding attribute.

6.4 Support Items

The right items are key to support roles. However, many items that are useful for
support players also have their uses for other roles. Therefore, based on available
game guides we manually compiled a list of items that are exclusively used by
support players: Courier, Flying Courier, Observer Ward, and Sentry Ward.
Item purchases are not directly reflected in the available data from the replay
files. Therefore, we periodically check the inventory of each player and increment
a counter for support items if a new item from our list is found. The resulting
count is finally assigned to a corresponding attribute for each player.

6.5 Damage Types

We determined the following damage categories based on player roles and re-
quirements of other attributes: damage to heroes, damage to towers, damage to
neutral creeps, and damage to regular creeps. We extract this information from
the replay ﬁles by utilizing a categorized database of all units in the game and
comparing identiﬁers of attacker and victim for each damage event.

7 Classification Results and Discussion

Based on the attribute selection described in section 6, several different classifiers
were trained and evaluated using 10-fold cross-validation on our data set. The
choice of candidate classifiers was based on existing works and complemented by
commonly used classification approaches. It included: Logistic regression (LR),
random forest decision trees (RF), support vector machines with sequential min-
imal optimization (SMO), naive Bayes classifiers (NB), and Bayesian networks.
The WEKA library and tools provided the technical platform. The classifiers
were evaluated according to several established performance metrics (accuracy,
mean absolute error (MAE) [18], and area under ROC (AUC) [8]) and we also
analyzed the confusion matrix. All performance metrics were calculated by us-
ing the WEKA default implementations, which are described in the reference
documents [19], with optimized parameters.
Table 2 lists the accuracies and MAEs of all classification approaches, as well
as the weighted AUC averages. Table 3 presents the confusion matrix of the LR
classifier. With accuracies of around 75%, MAEs around 0.08 and AUC values
around 0.95 the results are not perfect but quite promising with respect to
the complex classification task and the limited data set. Our analysis revealed
neuralgic points that might be good starting points to improve the results in
future work. For further analysis we limit ourselves to the LR case as the overall
most stable and best performing classifier of our selection. Taking a closer look
122 C. Eggert et al.

Table 2. Summary of 10-fold cross-validation accuracies, mean absolute errors and

weighted averages of the AUC for the full set and for a reduced set of classes

Classifier Accuracy Mean Absolute Error Wgt. Avg. AUC

Full set of classes
Random Forest 76.27% 0.0905 0.943
Logistic Regression 75.85% 0.0826 0.947
SMO 75.28% 0.1753 0.926
Bayesian Networks 72.03% 0.0801 0.933
NaiveBayes 70.76% 0.0769 0.933
Reduced set of classes
Bayesian Networks 96.58% 0.0322 0.995
SMO 96.15% 0.2308 0.975
Logistic Regression 96.15% 0.0381 0.993
Naive Bayes 95.58% 0.0383 0.994
Random Forest 91.17% 0.1162 0.985

Table 3. Confusion Matrix for logistic regression using 10-fold cross-validation.

a b c d e f g h i classified as
163 6 13 11 1 0 2 3 0 a = Carry - Active
12 101 0 0 0 1 9 0 0 b = Carry - Farming
27 3 28 0 3 0 0 0 0 c = Ganker
10 0 0 113 5 0 0 5 0 d = Support - Babysitter
1 0 2 8 57 0 0 3 0 e = Support - Roaming
5 0 0 0 2 8 1 0 0 f = Support - Farming
7 14 0 1 0 0 30 0 0 g = Pusher
6 0 3 7 0 0 0 31 0 h = Feeder
0 0 0 0 0 0 0 0 6 i = Inactive

at the confusion matrix (table 3) reveals that there are two frequent cases of
misclassification: Active carry versus ganker and farming carry versus pusher. In
order to gain better insight we manually analyzed a number of problematic games
and players and we could observe that in these games the early gank detection
could be problematic because of the unclear transition between game phases (esp.
early to mid game). As mentioned in section 6 we empirically determined a fixed
time threshold for early game detection that – on average – worked well but not
for exceptional cases that were present in some games; e.g. players leaving their
lanes either extremely early or extremely late. A second factor that we noticed
originated from roles looking very similar even to the human eye, although for
apparently different reasons. For example, the farming carry wants as much gold
and experience as possible and the pusher on the other hand wants to destroy
towers as fast as possible but these goals in many cases can be achieved by
the same actions. The active carry and the ganker are both involved in many
player versus player fights. As we based our initial role definitions on established
information sources for the game, our results highlight that some of the accepted
roles do in fact bear overlaps because even though they define a certain player
Classification of Player Roles in Dota 2 123

behavior, the distinction in some cases seems to be made solely by the players’
intentions, which cannot be directly observed. A third factor that might play
an important role is what we call performance noise in cases where players are
not able to demonstrate a certain role clear enough. Although the special roles
feeder and inactive were added to reduce performance effects, borderline cases
may not be detected. A fourth factor we identified are dynamic role changes
that are demanded by the situation or enforced by the competing team. This
is problematic because our attributes and classification are currently based on
large time spans or even the whole game. A solution might be detecting roles for
smaller time spans within the game, however, this would also necessarily reduce
the amount of game data available due to the shorter time span and dividing
the game into meaningful phases is in itself a very challenging task.
We conducted a small study on the issue of ambiguous roles, asking three Dota
2 experts to manually classify ten players in one of the problematic games. The
responses were highly divergent with up to four different labels being provided
for some players. This illustrates that the classification task is difficult, even for
human experts, which is an interesting insight for the MOBA community and
game designers. We classified the professional tournament data set (203 play-
ers) to look deeper into the possible influence of performance noise and achieved
an accuracy of 81% percent. Although based on a limited data set, this indi-
cates that performance might indeed be an issue. Yet, limiting the original data
set to only the players of the winning teams did not result in any notable dif-
ferences in classification performance (accuracy 75.62%, MAE 0.0842, weighted
AUC 0.936). This suggests that individual performance noise might be an issue
but not the team performance as a whole. We also classified our data set with
the same attributes but with a reduced set of classes (carry, support, and solo
lane) inspired by Gao et al. [5] to compare the power of our attributes and to
assess the influence of the number of classes. Before classification with the same
attributes as before we relabeled our data according to the following rules: all
types of carries, gankers, and pusher were relabeled as carry and all types of sup-
ports were relabeled as just support. Entries labeled as inactive were completely
removed before training and classification (cf. [5]). Players labeled as feeders
were manually looked up and relabeled by a human expert to the best fitting
class. The results of a ten-fold cross-validation are also presented in table 2.
Again, LR proved to be among the best performing and most stable classifiers.
However, for the reduced set of classes, Bayesian networks also performed well.
While the results overall improved compared to the full set of classes, there are
still some residual errors. Compared to the results of Gao et al. [5], classifica-
tion with our attributes achieved a higher accuracy for our data set. A direct
comparison is not possible since the data sets differ. Still, the results indicate
that classification with a reduced set of classes works well and could already be
employed for many applications.
Summarizing, we can state that the full set of classes shows promising results
but also highlights that, although these classes are accepted by game experts,
some are ambiguous even to humans manually labeling the data. Furthermore,
124 C. Eggert et al.

we identiﬁed the reliable distinction of game phases as a major challenge. Ad-

ditionally, looking at global game roles per match has proven to have limits,
due to dynamic roles switching within shorter time spans. Lastly, performance
noise exists, although of the mentioned factors, this one is comparatively well
controlled by our attributes (in accordance with the design goals, as diﬀerences
between winners and losers or even compared to professionals were not signiﬁ-
cantly large).

8 Conclusion and Future Work

We presented and discussed an approach to apply machine learning techniques
for the classification of player roles in the MOBA game Dota 2. Since most
MOBA games share many key game mechanics, our approach should be appli-
cable to other games of the same genre or even to similar team-based games of
other genres with slight modifications. Investigating a larger set of attributes,
we isolated a manageable set and employed that reduced set to estimate the
applicability of a range of classifiers according to established performance met-
rics. While the classification accuracy for the whole set of classes is limited, with
approx. 75%, it is still promising and for our data set logistic regression was
clearly the overall most stable and best performing classifier. Classification for
a reduced set of classes was very successful with an accuracy of 96%, which
is already suitable for many applications. Again, LR – although not the best
performing classifier – proved to be very stable and well suited to this domain.
Looking at the limitations of our results highlights several important chal-
lenges. First, the definition of classes seems to be primarily intention-defined
rather than behavior-defined for some classes, which makes them very difficult
to detect. Still, our approach could be useful as a pre-processing step for a tool
that allows game designers, players, or casters to look at games and player roles,
e.g, by highlighting problematic games. Second, the distinction of specific games
phases is an important issue that is not trivial to solve. It affects the classifica-
tion of certain roles that are characterized by behavior related to certain phases
in the game and further complications arise from the fact that roles may change
during a game. Third, performance noise is a factor, e.g., if players are not able
to act out their intended role. However, we presented indications that our at-
tributes are insensitive to performance up to a certain degree by comparing the
results of limiting classification just to the winning team or just professional
players. In the future we plan to look more closely at the mentioned problem
of identifying the game phase and transitions more reliably. It might also be
beneficial to detect roles not for a whole match but rather for phases or sections
to account for role changes during the game.
Classification of Player Roles in Dota 2 125

References
1. Bartle, R.: Hearts, clubs, diamonds, spades: Players who suit muds. Journal of
MUD Research 1(1), 19 (1996)
2. Drachen, A., Sifa, R., Bauckhage, C., Thurau, C.: Guns, swords and data: Cluster-
ing of player behavior in computer games in the wild. In: CIG, pp. 163–170. IEEE
(2012)
3. Drachen, A., Yancey, M., Maguire, J., et al.: Skill-Based Differences in Spatio-
Temporal Team Behaviour in Defence of the Ancients 2. In: Proc. of IEEE Games,
Entertainment, and Media (GEM), IEEE (2014)
4. Fu, T.S., Chen, H.T., Chou, C.L., Tsai, W.J., Lee, S.Y.: Screen-strategy analysis
in broadcast basketball video using player tracking. In: Yang, J.F., Hang, H.M.,
Tanimoto, M., Chen, T. (eds.) VCIP, pp. 1–4. IEEE (2011)
5. Gao, L., Judd, J., Wong, D., Lowder, J.: Classifying dota 2 hero characters based
on play style and performance. In: Univ. of Utah Course on ML (2013)
6. Hall, M.A.: Correlation-based Feature Subset Selection for Machine Learning.
Ph.D. thesis, University of Waikato, Hamilton, New Zealand (1998)
7. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The
weka data mining software: An update. SIGKDD Explor. Newsl. 11(1), 10–18 (2009)
8. Huang, J., Ling, C.X.: Using auc and accuracy in evaluating learning algorithms.
IEEE Transactions on Knowledge and Data Engineering 17, 299–310 (2005)
9. Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelli-
gence 97(1-2), 273–324 (1997); special issue on relevance
10. Li, F., Woodham, R.J.: Analysis of player actions in selected hockey game situa-
tions. In: CRV, pp. 152–159. IEEE (2005)
11. Liu, S., Ballinger, C., Louis, S.: Player identification from rts game replays. In:
28th Int. Conf. on Computers and their Applications, CATA (2013)
12. Nuangjumnonga, T., Mitomo, H.: Leadership development through online gaming.
In: 19th ITS Biennial Conference, ITS, Bangkok (2012)
13. Pobiedina, N., Neidhardt, J., del Moreno, M.C.C., Werthner, H.: Ranking factors
of team success. In: WWW (Companion Volume), pp. 1185–1194. WWW Conf.
Steering Committee, ACM (2013)
14. Statista: Video games revenue worldwide from 2012 to 2015.
http://statista.com/statistics/278181/ (access: August 05, 2015)
15. Synnaeve, G., Bessre, P.: A Bayesian Model for Plan Recognition in RTS Games
Applied to StarCraft. In: AIIDE. AAAI Press (2011)
16. Tovinkere, V., Qian, R.J.: Detecting Semantic Events in Soccer Games: Towards
A Complete Solution. In: Proc. of IEEE Int. Conf. on Multim. & Expo (2001)
17. Weber, B.G., Mateas, M.: A data mining approach to strategy prediction. In: CIG,
pp. 140–147. IEEE, Piscataway (2009)
18. Willmott, C.J., Matsuura, K.: Advantages of the mean absolute error (MAE) over
the root mean square error (RMSE) in assessing average model performance. Cli-
mate Research 30, 79–82 (2005)
19. Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning
Tools and Techniques, 3rd edn. Morgan Kaufmann Publ. Inc., SF (2011)
20. Yang, P., Harrison, B., Roberts, D.L.: Identifying patterns in combat that are
predictive of success in moba games. In: Proc. of Foundations of Digital Games
(2014)
21. Zhu, G., Huang, Q., Xu, C., Rui, Y., Jiang, S., Gao, W., Yao, H.: Trajectory Based
Event Tactics Analysis in Broadcast Sports Video. In: Proc. of the 15th Int. Conf.
on Multimedia, pp. 58–67. ACM, New York (2007)

Internal Structure of A Leaf
50% (2)
Internal Structure of A Leaf
25 pages
Stability of Food Emulsions (2) : David Julian Mcclements
No ratings yet
Stability of Food Emulsions (2) : David Julian Mcclements
37 pages
Complete Notes of Bme
No ratings yet
Complete Notes of Bme
250 pages
Docmine: Spare Parts Catalog
No ratings yet
Docmine: Spare Parts Catalog
83 pages
Testing MCQ
No ratings yet
Testing MCQ
59 pages
TECH - ELEC-Difference Between Capacitor and Supercapacitor
No ratings yet
TECH - ELEC-Difference Between Capacitor and Supercapacitor
24 pages
Microcontroller 8051
No ratings yet
Microcontroller 8051
72 pages
Of Tribes, Hunters and Barbarians - Forest Dwellers in The Mauryan Period
No ratings yet
Of Tribes, Hunters and Barbarians - Forest Dwellers in The Mauryan Period
20 pages
Data Driven Ghosting
No ratings yet
Data Driven Ghosting
15 pages
Hazop Ip
No ratings yet
Hazop Ip
117 pages
OOP Templates Assignment - Zip
No ratings yet
OOP Templates Assignment - Zip
31 pages
MOBA Games A Literature Review
100% (1)
MOBA Games A Literature Review
11 pages
Hots Anonymous Final Paper
No ratings yet
Hots Anonymous Final Paper
28 pages
Game Play Differences by Expertise Level in Dota 2, A Complex Multiplayer Video Game
No ratings yet
Game Play Differences by Expertise Level in Dota 2, A Complex Multiplayer Video Game
33 pages
FIFA Video Game - Players Classification
No ratings yet
FIFA Video Game - Players Classification
26 pages
Spe 201216 Ms Minifrac
No ratings yet
Spe 201216 Ms Minifrac
12 pages
Chou, Yu-Kai - Actionable Gamification - Beyond Points, Badges, and Leaderboards-Octalysis Media (2015) - 123-156
No ratings yet
Chou, Yu-Kai - Actionable Gamification - Beyond Points, Badges, and Leaderboards-Octalysis Media (2015) - 123-156
34 pages
CdS/Graphene Photocatalysts
No ratings yet
CdS/Graphene Photocatalysts
28 pages
MOBA Bridging Cultures Through Competitive Play in A Globalized World
No ratings yet
MOBA Bridging Cultures Through Competitive Play in A Globalized World
9 pages
SSRN Id4749221 Code3223260
No ratings yet
SSRN Id4749221 Code3223260
40 pages
LEaggue
No ratings yet
LEaggue
41 pages
A Qualitative Exploration of Factors Affecting Group Cohesion and Team Play in Multiplayer Online Battle Arenas (Mobas)
No ratings yet
A Qualitative Exploration of Factors Affecting Group Cohesion and Team Play in Multiplayer Online Battle Arenas (Mobas)
25 pages
From Generative To Conventional
No ratings yet
From Generative To Conventional
17 pages
2011.12692v4-Towards Playing Full MOBA Games With Deep Reinforcement Learning
No ratings yet
2011.12692v4-Towards Playing Full MOBA Games With Deep Reinforcement Learning
15 pages
Player Stats Analysis Using Machine Learning
No ratings yet
Player Stats Analysis Using Machine Learning
4 pages
Mechanics and Metagame: Exploring Binary Expertise In: League of Legends
No ratings yet
Mechanics and Metagame: Exploring Binary Expertise In: League of Legends
19 pages
2781 Sc2egset Starcraft II Esport R
No ratings yet
2781 Sc2egset Starcraft II Esport R
32 pages
Mechanics and Metagame - Exploring Binary Expertise in LOL
No ratings yet
Mechanics and Metagame - Exploring Binary Expertise in LOL
19 pages
E-Sports Player Performance Metrics For Predicting
No ratings yet
E-Sports Player Performance Metrics For Predicting
13 pages
Journal Pone 0264550
No ratings yet
Journal Pone 0264550
18 pages
Exploring The Relationship Between Offli
No ratings yet
Exploring The Relationship Between Offli
21 pages
5630 Cree
No ratings yet
5630 Cree
32 pages
Exploring Player Experience in Ranked League of Legends: Behaviour & Information Technology
No ratings yet
Exploring Player Experience in Ranked League of Legends: Behaviour & Information Technology
14 pages
Predicting Player Strategies in Real Time Strategy Games
No ratings yet
Predicting Player Strategies in Real Time Strategy Games
54 pages
Design Life Cycle
No ratings yet
Design Life Cycle
16 pages
Esports An Analysis PDF
No ratings yet
Esports An Analysis PDF
13 pages
League ML1
No ratings yet
League ML1
12 pages
Notes On Anova: Dr. Mcintyre Mcdaniel College Revised: August 2005
No ratings yet
Notes On Anova: Dr. Mcintyre Mcdaniel College Revised: August 2005
10 pages
A Qualitative Exploration of Factors Affecting Gro
No ratings yet
A Qualitative Exploration of Factors Affecting Gro
26 pages
HDI OnQ RandI Set A Closed To Arrival Control On Rate Levels V1.0
No ratings yet
HDI OnQ RandI Set A Closed To Arrival Control On Rate Levels V1.0
11 pages
League ML2
No ratings yet
League ML2
15 pages
The Influence of Cognitive Skills and Team Cohesion On Player Performance in Multiplayer Online Battle Arena
No ratings yet
The Influence of Cognitive Skills and Team Cohesion On Player Performance in Multiplayer Online Battle Arena
37 pages
Players Continuous Willingness To Play in MOBA
No ratings yet
Players Continuous Willingness To Play in MOBA
12 pages
Identifying Player Skill of Dota 2 Using Machine L
No ratings yet
Identifying Player Skill of Dota 2 Using Machine L
18 pages
Comparison of Reaction Time Between Esports Player
No ratings yet
Comparison of Reaction Time Between Esports Player
16 pages
Player Modeling: Towards A Common Taxonomy: Marlos C. Machado, Eduardo P. C. Fantini and Luiz Chaimowicz
No ratings yet
Player Modeling: Towards A Common Taxonomy: Marlos C. Machado, Eduardo P. C. Fantini and Luiz Chaimowicz
8 pages
Result Prediction by Mining Replays in Dota 2: Filip Johansson, Jesper Wikström
No ratings yet
Result Prediction by Mining Replays in Dota 2: Filip Johansson, Jesper Wikström
29 pages
Exam 2018 s1 Op2 New
No ratings yet
Exam 2018 s1 Op2 New
12 pages
Win Prediction in Multiplayer Esports Live Professional Match Prediction
No ratings yet
Win Prediction in Multiplayer Esports Live Professional Match Prediction
12 pages
Machine Learning Methods For Predicting League of Legends Game Outcome
No ratings yet
Machine Learning Methods For Predicting League of Legends Game Outcome
11 pages
12 JST-1649-2019
No ratings yet
12 JST-1649-2019
16 pages
Classification of Player Roles in The Team-Based Multi-Player Game Dota 2
No ratings yet
Classification of Player Roles in The Team-Based Multi-Player Game Dota 2
15 pages
1 - Updated - Acta 20231 Kaanarik 1180583 2
No ratings yet
1 - Updated - Acta 20231 Kaanarik 1180583 2
14 pages
MOBA Slice
No ratings yet
MOBA Slice
17 pages
متغيرات خططية Memmert - 2011 - World-level-analysis-in-top-level-football
No ratings yet
متغيرات خططية Memmert - 2011 - World-level-analysis-in-top-level-football
11 pages
League Legends Study
No ratings yet
League Legends Study
16 pages
Small Hydro Power Plant: Scenario in India - A Comparative Study
No ratings yet
Small Hydro Power Plant: Scenario in India - A Comparative Study
7 pages
8.design and Analysis of A Conformal MIMO Ingestible Bolus Sensor Antenna For Wireless Capsule Endoscopy For Animal Husbandry
No ratings yet
8.design and Analysis of A Conformal MIMO Ingestible Bolus Sensor Antenna For Wireless Capsule Endoscopy For Animal Husbandry
9 pages
Physical, Connected at A Primal Level With Blood, Sweat, Tears. Videogames Are Unlit Rooms With
No ratings yet
Physical, Connected at A Primal Level With Blood, Sweat, Tears. Videogames Are Unlit Rooms With
7 pages
MOBA Games: A Literature Review: Entertainment Computing February 2018
No ratings yet
MOBA Games: A Literature Review: Entertainment Computing February 2018
12 pages
47.creeping Systems
No ratings yet
47.creeping Systems
7 pages
MOBAPaper Camera Ready
No ratings yet
MOBAPaper Camera Ready
8 pages
Macos Mojave Compatibility 02 07
No ratings yet
Macos Mojave Compatibility 02 07
11 pages
Scientific Heroes: Multiplayer Online Battle Arenas Foster Players' Hypothetico-Deductive Reasoning
No ratings yet
Scientific Heroes: Multiplayer Online Battle Arenas Foster Players' Hypothetico-Deductive Reasoning
5 pages
SOCI1003 Assignment Cover Sheet
No ratings yet
SOCI1003 Assignment Cover Sheet
7 pages
Performanceof Machine Learning Algorithmsin Predicting Game Outcomefrom Draftsin Dota 2
No ratings yet
Performanceof Machine Learning Algorithmsin Predicting Game Outcomefrom Draftsin Dota 2
13 pages
A Machine Learning Approach To Predict The Result of League of Legends
No ratings yet
A Machine Learning Approach To Predict The Result of League of Legends
8 pages
Chan 2020 J. Phys. Conf. Ser. 1566 012041
No ratings yet
Chan 2020 J. Phys. Conf. Ser. 1566 012041
8 pages
Mahal Ang Sili Dahil SI Stands For Silicon AtomicNumber 14 LI Stands For Lithium AtomicNumber 3 143 Is ILOVEYOU in Filipino MAHAL KayaMahalAndSili MOMO TabaNgUtak Ref Peysbuk
No ratings yet
Mahal Ang Sili Dahil SI Stands For Silicon AtomicNumber 14 LI Stands For Lithium AtomicNumber 3 143 Is ILOVEYOU in Filipino MAHAL KayaMahalAndSili MOMO TabaNgUtak Ref Peysbuk
11 pages
Investigating The Impact of Game Features and Content On Champion Usage in League of Legends
No ratings yet
Investigating The Impact of Game Features and Content On Champion Usage in League of Legends
9 pages
Guidance Mandatory Competence Attainment Report (v7) Final 04072012
No ratings yet
Guidance Mandatory Competence Attainment Report (v7) Final 04072012
8 pages
List of Government Colleges Affiliated To The University of Jammu (ACADEMIC SESSION 2020-21)
No ratings yet
List of Government Colleges Affiliated To The University of Jammu (ACADEMIC SESSION 2020-21)
9 pages
fml-g12s Ds en
No ratings yet
fml-g12s Ds en
7 pages
Real-Time Esports Match Result Prediction
No ratings yet
Real-Time Esports Match Result Prediction
9 pages
MOBA A New Arena For Game AI
No ratings yet
MOBA A New Arena For Game AI
8 pages
A Business Research On SPACEX
No ratings yet
A Business Research On SPACEX
5 pages
The Influence of Team Dynamics Over A Team's Performance
No ratings yet
The Influence of Team Dynamics Over A Team's Performance
7 pages
Ranking Practices and Distinction in League of Legends: Yubo Kou Xinning Gui Yong Ming Kow
No ratings yet
Ranking Practices and Distinction in League of Legends: Yubo Kou Xinning Gui Yong Ming Kow
6 pages
Kaushik Kalyanaraman
No ratings yet
Kaushik Kalyanaraman
7 pages
Econometrics Problem Set
No ratings yet
Econometrics Problem Set
5 pages
Predicting The Winning Side of Dota2: Kuangyan Song, Tianyi Zhang, Chao Ma
No ratings yet
Predicting The Winning Side of Dota2: Kuangyan Song, Tianyi Zhang, Chao Ma
4 pages
Planning Engineer
No ratings yet
Planning Engineer
2 pages
How Does He Saw Me? A Recommendation Engine For Picking Heroes in Dota 2
No ratings yet
How Does He Saw Me? A Recommendation Engine For Picking Heroes in Dota 2
4 pages
Work Measurement Techniques Methods Types
No ratings yet
Work Measurement Techniques Methods Types
5 pages
Phy340-Tutorial 2
No ratings yet
Phy340-Tutorial 2
2 pages
MAQ TNC AC Test
No ratings yet
MAQ TNC AC Test
1 page
SIS Football Rookie Handbook 2021
From Everand
SIS Football Rookie Handbook 2021
Matt Manocherian
No ratings yet
What Happens When We Play: A Critical Approach to Games User Experience Design & Education
From Everand
What Happens When We Play: A Critical Approach to Games User Experience Design & Education
Rebecca Rouse
No ratings yet
Projecting X 2.0: How to Forecast Baseball Player Performance
From Everand
Projecting X 2.0: How to Forecast Baseball Player Performance
Mike Podhorzer
5/5 (1)
A Short Introduction to Databases
From Everand
A Short Introduction to Databases
Viji Kumar
No ratings yet

Classification of Player Roles in The Team-Based Multi-Player Game Dota 2

Uploaded by

Classification of Player Roles in The Team-Based Multi-Player Game Dota 2

Uploaded by

Classification of Player Roles in the Team-Based

Multi-player Game Dota 2

Christoph Eggert, Marc Herrlich, Jan Smeddinck, and Rainer Malaka

Digital Media Lab, TZI, University of Bremen, Germany

Abstract. Computer games are big business, which is also reﬂected

Keywords: multi-player games, player roles, classiﬁcation.

In this paper we investigate the applicability and performance of supervised

6 Attribute Construction and Evaluation

The full set of attributes we considered for evaluation is presented in table 1.

6.1 Space and Movement

Attribute Number of Folds

6.2 Early Ganks

6.3 Team Fights

6.4 Support Items

6.5 Damage Types

7 Classification Results and Discussion

Table 2. Summary of 10-fold cross-validation accuracies, mean absolute errors and

Classifier Accuracy Mean Absolute Error Wgt. Avg. AUC

Table 3. Confusion Matrix for logistic regression using 10-fold cross-validation.

we identiﬁed the reliable distinction of game phases as a major challenge. Ad-

8 Conclusion and Future Work

You might also like