Assessing Team Strategy Using Spatiotemporal Data
Assessing Team Strategy Using Spatiotemporal Data
ABSTRACT Canada
9(3) Shots (on Goal)
USA
12(4)
In CVPR, 2009.
[16] W. Lu, J. Ting, K. Murphy, and
The Moneyball revolution coincided with a shift in the way 13 Fouls 11 Players in Broadcast Sports Vide
professional sporting organizations handle and utilize data in 3 Corner Kicks 8 Random Fields. In CVPR, 2011.
1 O↵sides 4
terms of decision making processes. Due to the demand for 38% Time of Possession 62%
[17] P. Lucey, A. Bialkowski, P. Carr
I. Matthews. Characterizing Mul
better sports analytics and the improvement in sensor tech- 3 Yellow Cards 1
Behavior from Partial Team Tra
nology, there has been a plethora of ball and player tracking 0 Red Cards 0
the English Premier League. In A
4 Saves 3
information generated within professional sports for analyt- [18] L. Madden. NFL to Follow Army
ical purposes. However, due to the continuous nature of the (a) (b)Sensors in Attempt to Prevent H
data and the lack of associated high-level labels to describe8. REFERENCES
Figure 1: (a) An example of standard soccer
www.forbes.com/sites/lancema
statis-
nfl-to-follow-armys-lead-on-
it - this rich set of information has had very limited use espe-[1] tics
S. Ali andbased
M. Shah.on hand-labeled
Floor event
Fields for Tracking data which
in High describe
-attempt-to-prevent-head-inj
cially in the analysis of a team’s tactics and strategy. In this Density
what Crowd Scenes. In ECCV,
happened. 2008.
(b) Spatiotemporal data[19]has the po- Y. Chang, A. H
R. Masheswaran,
paper, we give an overview of the types of analysis currently[2] N. Allen, J. Templon, P. McNally, L. Birnbaum, and
tential to describe the where and how, but S. Danesis.
as it Destructing
is the Rebo
K. Hammond. StatsMonkey: A Data-Driven Sports Tracking Data. In MIT Sloan Sp
performed mostly with hand-labeled event data and high- a continuous
Narrative Writer. In AAAI signal which is
Fall Symposium not associated
Series, with
Conference, 2012.
a
light the problems associated with the influx of spatiotem- 2010.fixed event, using this data for analysis is V.difficult.
[20] Morariu and L. Davis. Multi-
poral data. By way of example, we present an approach[3] BBC-Sports. Footballers may trial wearing microchips Recognition in Structured Scena
which uses an entire season of ball tracking data from the to monitor health. [21] T. Moskowitz and L. Wertheim.
English Premier League (2010-2011 season) to reinforce the
1. INTRODUCTION
www.bbc.co.uk/sport/0/football/21460038, 14 Feb Hidden Influences Behind How S
2013. Games Are Won. Crown Publish
common held belief that teams should aim to “win home[4] In his 2003 book Moneyball
M. Beetz, N. von Hoyningen-Huene, B. Kirchlechner,
[14], Michael Lewis docu-
[22] NBA Shot Charts. www.nba.com/
games and draw away ones”. We do this by: i) forming a mented
S. Gedikli, how M.
F. Siles, Oakland
Durus, andA’sM.General
Lames. Manager Billy Beane was
[23] D. Oliver. Basketball on Paper: R
representation of team behavior by chunking the incoming able to Automated
ASPOGAMO: effectivelySports
use Game
metrics derived from hand-crafted
Analysis Performance Analysis. Brassey’s
Models. International
statistics Journal of
to exploit theComputer Science inin the value of indi-
inefficiencies
spatiotemporal signal into a series of quantized bins, and ii) [24] D. Oliver. Guide to the Total Qu
Sport, 8(1), 2009.
generate an expectation model of team behavior based on[5] vidual baseball players. Around
P. Carr, Y. Sheikh, and I. Matthews. Monocular
the same time, Basketball
espn.go.com/nfl/story/_/id/6
explaining-statistics-total-
a code-book of past performances. We show that home ad- on Paper [24] was published
Object Detection using 3D Geometric Primitives. which outlined methods
4 August 2011.
for
vantage in soccer is partly due to the conservative strategy 2012.valuing player performance in basketball which is a far more
[25] Opta Sports. www.optasports.co
of the away team. We also show that our approach can flag[6] K. Goldsberry.
challenging CourtVision:
problemNew Visual and
because it isSpatial
Analytics for the NBA. In MIT Sloan Sports Analytics
a continuous[26] team sport.A. Ess, K. Schindle
S. Pellegrini,
anomalous team behavior which has many potential appli- Due to the popularity and effectiveness of the tools
Conference, 2012.
that
You’ll em-
Never Walk Alone: Model
anated from these works, there for Multi-Target Tracking. In CV
cations. [7] A. Gupta, P. Srinivasan, J. Shi, and L. Davis.has been enormous interest
[27] M. Perse, M. Kristan, S. Kovacic
in the field
Understanding of sports
Videos, analytics
Constructing over the
Plots: Learning a last 10 years with
Trajectory-Based Analysis of Co
Visually Grounded Storyline Model from Annotated
many organizations (e.g. professional teams, media groups)
Categories and Subject Descriptors Videos. In CVPR, 2009.
housing their own analytics department. However,
Activity
nearly
Image
in Basketball Game. Co
all
Understanding, 2008.
[8] Hawk-Eye. www.hawkeyeinnovations.co.uk. [28]hand-labeled
Prozone. www.prozonesports.co
H.4 [Information Systems Applications]: Miscellaneous;[9] of the analytical works have
D. Henschen. IBM Serves New Tennis Analytics At dealt solely with
[29] B. Siddiquie, Y. Yacoob, and L.
I.2.6 [Learning]: General event data
Wimbledon. which describes what happened (e.g.
www.informationweek.com/software/ basketball
Plays in American Football Vide
- rebounds, points scored, assists, football - yards
business-intelligence/ per carry,
University of Maryland, 2009.
Keywords ibm-serves-new-tennis-analytics-at-wimbl/
tackles, sacks,
240002528, 23 June 2012.
soccer - passes, shots, tackles (see
[30] Figure 1(a))).
SportsVision. www.sportsvision
Sports Analytics, Spatiotemporal Data, Representation [10] Once and
A. Hervieu the P.data is in Understanding
Bouthemy. this form, most sportsapproaches just relate
[31] STATS SportsVU. www.sportvu.
videotousing
parsing
playersin the relevant
trajectories. data L.
In J. Zhang, from [32] Statsheet.
Shao,a database, then ap- www.statsheet.com.
L. Zhang, andsport-based
G. Jones, editors, Intelligent Video [33] D. Stracuzzi, A. Fern, K. Ali, R.
plying rules and standard statistical methods,
T. Konik, and D. Shapiro. An A
Event Analysis and Understanding. Springer Berlin /
including regression and optimization.
Heidelberg, 2010. to American Football: From Obs
Permission to make digital or hard copies of all or part of this work for [11] Asand
S. Intille most sporting
A. Bobick. environments
A Framework tend to be dynamic
for Recognizing Video to withControl in a Simulated
personal or classroom use is granted without fee provided that copies are multiple
Multi-Agent players
Action fromcontinuously
Visual Evidence.moving
In AAAI, and competing Magazine,
against32(2), 2011.
not made or distributed for profit or commercial advantage and that copies 1999.each other, simple event statistics do not capture [34] X. Wei, P. Lucey, S. Morgan, and
the com-
Sweet-Spot: Using Spatiotempor
bear this notice and the full citation on the first page. To copy otherwise, to [12] K. Kim,
plexM.aspects
Grundmann, of theA. Shamir,
game.I. Matthews,
To gain an advantage and over
Predict the
Shots in Tennis. In M
republish, to post on servers or to redistribute to lists, requires prior specific J. Hodgins, and I. Essa. Motion Fields to Predict Play
rest of the field, sporting
permission and/or a fee. Evolution in Dynamic Sports Scenes.organizations
In CVPR, 2010. have recently Analyticslooked
Conference, 2013.
[13] to employ
M. Lewis. Moneyball:commercial trackingan technologies
The Art of Winning Unfair [35] C. Xu,can
which Y. Zhang,
lo- G. Zhu, Y. Ru
KDD’13, August 11–14, 2013, Chicago, Illinois, USA. Q. Huang. Using Webcast Text f
Copyright 2013 ACM 978-1-4503-2174-7/13/08 ...$15.00. Game.cate the position
Norton, 2003. of the ball and players at each time instant
Detection in Broadcast. T. Multi
[14] R. Li and R. Chellappa. Group Motion Segmentation
Using a Spatio-Temporal Driving Force Model. In [36] Zonalmarking. www.zonalmarkin
CVPR, 2010.
[15] R. Li, R. Chellappa, and S. Zhou. Learning
Multi-Modal Densities on Discriminative Temporal
1366
Interaction Manifold for Group Activity Recognition.
in professional leagues [29, 32, 9, 26]1 - to determine where alizations for the television broadcasters [9]. Partial data
and how events happen. Even though there is potentially sources normally generated by human annotators such as
an enormous amount of hidden team behavioral information shot-charts in basketball and ice-hockey are often used for
to mine from such sources, due to the sheer volume as well analysis [23], as well as passing and shot charts in soccer
as the noisy and variable length of the data, methods which [26]. Recently, ESPN developed a new quarterback rating
can adequately represent team behaviors are yet to be devel- in American Football called “TotalQBR” [25] which attempts
oped. The value of this data is limited as little analysis can to assign credit or blame to the quarterback depending on
be conducted. What compounds the difficulty of this task a host of factors such as pass or catch quality, importance
is the low-scoring and/or continuous nature of many team in the match, pass thrown under pressure or not. As these
sports (e.g. soccer, hockey, basketball) which makes it very factors are quite subjective, annotators who are reliable in
hard to associate segments of play with high-level behaviors labeling such variables are used. In terms of strategic anal-
(e.g. tactics, strategy, style, system, formations). Without ysis, zonalmarking.net [37] attempts to describe a soccer
these labels, which essentially give the game context, infer- match from a tactical and formation point of view. Whilst
ring team strategy or behavior is impossible as there are no interesting, this approach is still qualitative and is based
factors to condition against (see Figure 1(b)). solely on the opinion of the analyst.
Due to these complexities, there has been no effective As the problem of fully automatic multi-agent tracking
method of utilizing spatiotemporal data in continuous sports. from vision-based systems is still an open one, most aca-
Having suitable methods which can first develop suitable demic research has centered on the tracking problem [1, 27,
representations from imperfect (e.g. noisy or impartial) data, 17, 5]. The lack of fully automated tracking approaches has
and then learn team behaviors in an unsupervised or semi- limited team behavioral research to works on limited size
supervised manner, as well as recognize and predict future datasets. The first work which looked at using spatiotem-
behaviors would greatly enhance decision making in all ar- poral data for team behavior analysis was conducted over 10
eas of the sporting landscape (e.g. coaching, broadcasting, years ago by Intille and Bobick [12]. In this seminal work,
fantasy-games, video games, betting etc.). We call this the the authors used a probabilistic model to recognize a single
emerging field of Sports Spatiotemporal Analytics - and we football play from hand annotated player trajectories. Since
show an example of this new area of analytics by compar- then, multiple approaches have centered on recognizing foot-
ing the strategies of home and away teams in the English ball plays [16, 30, 15, 34], but only on a very small number of
Premier League by using ball tracking data. plays (i.e. 50-100). For soccer, Kim et al. [13] used the global
motion of all players in a soccer match to predict where the
play will evolve in the short-term. Beetz et al. [4] proposed
2. RELATED WORK a system which aims to track player and ball positions via
The use of automatic sports analysis systems have re- a vision system for the use of automatic analysis of soccer
cently graduated from the virtual to the real-world. This matches. In basketball, Perse et al. [28] used trajectories of
is due in part to the popularity of live-sport, the amount player movement to recognize three type of team offensive
of live-sport being broadcasted, the proliferation of mobile patterns. Morariu and Davis [21] integrated interval-based
devices, the rise of second-screen viewing, the amount of temporal reasoning with probabilistic logical inference to
data/statistics being generated for sports, and demand for recognize events in one-on-one basketball. Hervieu et al. [11]
in-depth reporting and analysis of sport. Systems which use also used player trajectories to recognize low-level team ac-
match statistics to automatically generate narratives have tivities using a hierarchical parallel semi-Markov model. In
already been deployed [33, 2]. Although impressive, these addition to these works, plenty of work has centered on an-
solutions just give a low-level description of match statistics alyzing broadcast footage of sports for action, activity and
and notable individual performances without giving any tac- highlight detection [36, 8]2 . Even though notable, the lack
tical analysis about factors which contributed to the result. of tracking data to adequately train models has limited the
In tennis, IBM has created Slamtracker [10] which can pro- usefulness of the above research.
vide player analysis by finding patterns that characterize It is clear from the overview given above, that there ex-
the best chance a player has to beat another player from an ists a major disparity in resources between industry and
enormous amount of event labeled data - although no spa- academia to deal with this problem domain. Sporting orga-
tiotemporal data (i.e. player or ball tracking information) nizations that receive large volumes of spatiotemporal data
has been used in their analysis yet. from third-party vendors but often the people within these
Spatiotemporal data has been used extensively in the vi- organizations lack the computational skills or resources to
sualization of sports action. Examples include vision-based make use of it. Contrastingly, due to the proprietary nature
systems which track baseball pitches for Major League Base- of commercial tracking systems, and the cost and method of
ball [31], and ball and players in basketball and soccer [32, generating the tracking data, research groups who have the
29]. Hawk-Eye deploy vision-based systems which track the necessary skills can not access these large data repositories.
ball in tennis and cricket, and is often used to aid in the Recently however, due to the potential payoff, some industry
officiating of these matches in addition to providing visu- groups are investing in analytical people with these skill sets,
1 or have teamed up with research groups to help facilitate a
As nearly all professional leagues currently forbid the use of
wearable sensors on players, unobtrusive data capture meth- solution. The release of STATS Sports VU data [32] to some
ods such as vision-based systems or armies of human annota- research groups has enabled interesting analysis of shots and
tors are used to provide player and ball tracking information. rebounding in the NBA[7, 20]. In tennis, Wei et al. [35]
However, this restriction may change soon as monitoring the
2
health and well-being of players has attracted significant in- These works only capture a portion of the field, making
terest lately, especially for concussions in American Foot- group analysis very difficult as all active players are rarely
ball [19], as well as heart issues in soccer [3]. present in the all frames.
1367
Home Away
N Team
W D L P GF SF GA SA W D L P GF SF GA SA
1 Man Utd 18 1 0 55 49 347 12 191 5 10 4 25 29 272 25 271
2 Chelsea 14 3 2 45 39 379 13 233 7 5 7 26 30 367 20 208
3 Man City 13 4 2 43 34 306 12 216 8 4 7 28 26 240 21 298
4 Arsenal 11 4 4 37 33 350 15 154 8 7 4 31 39 305 28 251
5 Tottenham 9 9 1 36 30 383 19 228 7 5 7 26 25 274 27 339
6 Liverpool 12 4 3 40 37 319 14 220 5 3 11 18 22 266 30 270
7 Everton 9 7 3 34 31 321 23 227 4 8 7 20 20 259 22 279
8 Fulham 8 7 4 31 30 307 23 262 3 9 7 18 19 245 20 271
9 Aston Villa 8 7 4 31 26 273 19 263 4 5 10 17 22 233 40 340
10 Sunderland 7 5 7 26 25 287 27 243 5 6 8 21 20 246 29 311
11 West Brom 8 6 5 30 30 329 30 237 4 5 10 17 26 273 41 297
12 Newcastle 6 8 5 26 41 300 27 250 5 5 9 20 15 209 30 256
13 Stoke City 10 4 5 34 31 298 18 256 3 3 13 12 15 186 30 294
14 Bolton 10 5 4 35 34 311 24 256 2 5 12 11 18 261 32 346
15 Blackburn 7 7 5 28 22 254 16 259 4 3 12 15 24 200 43 360
16 Wigan 5 8 6 23 22 290 34 227 4 7 8 19 18 221 27 284
17 Wolves 8 4 7 28 30 256 30 266 3 3 13 12 16 205 36 306
18 Birmingham 6 8 5 26 19 231 22 324 2 7 10 13 18 174 36 362
19 Blackpool 5 5 9 20 30 296 37 297 5 4 10 19 25 240 41 441
20 West Ham 5 5 9 20 24 325 31 317 2 7 10 13 19 250 39 378
SUM 179 111 90 648 617 6162 446 4926 90 111 179 381 446 4926 617 6162
AVG(per game) 0.47 0.29 0.24 1.71 1.62 16.2 1.17 13.0 0.24 0.29 0.47 1.00 1.17 13.0 1.62 16.2
Table 1: Table showing the statistics for the home and away performances for each team in the 2010 EPL
season: (left columns) home matches (right columns) away columns (Key: W = wins, D = draws, L = losses,
P = points (3 for a win, 1 for a draw, 0 for a loss), GF = goals for, SF = shots for, GA = goals against,
SA = shots against).
used ball and player tracking information to predict shots cant difference between teams at home and away (10.01%
using data from the 2012 Australian Open. For soccer, re- vs 9.05% for shooting and 73.46% vs 72.99% for passing -
searchers have characterized team behaviors in the English see the bottom row in Table 1).
Premier League using ball-motion information across an en- However, there is a large difference between the amount of
tire season using OPTA data [18]. In this paper, we extend shots (16.2 vs 13.0) and goals scored (1.62 vs 1.17) at home
this method to explain that the home advantage in soccer and away. An illuminating example is the league champions
is due to the conservative strategy that away teams use (or for that season, Manchester United (see top row in Table 1).
more aggressive approach of the home team) which rein- At home, they were unbeaten (winning 18 and drawing 1),
forces the commonly held belief that teams aim to win their but away from home they only won 5 games, drew 10 and
home games and draw their away ones. lost 4. The telling statistic is that at home they scored 49
goals from 347 shots, compared to only 29 goals from 272
3. CASE STUDY: shots away from home. Comparatively, the opposition at
home games only scored a paltry 12 goals from 191 shots
HOME ADVANTAGE IN SOCCER while at away games they scored 25 goals from 271 shots.
In soccer, there is the commonly held belief that team should
3.1 “Win at Home and Draw Away” aim to win their home games and draw their away ones. If
In a recent book by Moskowitz and Wertheim [22], they you skim down Table 1, you will find that: i) all teams won
highlight that the home advantage exists in all professional more home games (except for Blackpool who won the same
sports (i.e. teams win more at home than away). The au- amount), ii) all teams score more goals at home (except Ar-
thors hypothesized that referees play a significant role by senal and Blackburn), iii) all teams had more shots at home
giving home teams favorable calls at critical moments. They compared to away, and iv) all teams gained more points at
then quantitatively showed this in baseball through the use home. These event statistics tell us what has occurred, in
of pitch tracking data. For soccer, hand-labeled event statis- the rest of the paper we use spatiotemporal data to help
tics such as the amount of injury time, number of yellow explain where and why this occurred. Before we detail the
cards and number of penalties awarded to reinforce their method, we first describe the data.
hypothesis. As soccer is a very tactical game, we hypothe-
size that the strategy of the home and away teams also plays
a role in explaining the home advantage. 3.2 Ball Tracking and Event Data
A great case study of home advantage is the 2010-2011 Due to the difficulty associated with accurately tracking
English Premier League soccer season. In that season, the players and the ball, data containing this information is still
home team earned an average 1.71 points out of a total 3 scarce. Most of the data collected is via an army of human
points per match. This is in stark contrast with the away annotators who label all actions that occur around the ball
team, which earned only 1.00 points per game: a rather - which they call ball actions. The F24 soccer data feed col-
large discrepancy, especially considering that teams play ev- lected for the English Premier League (EPL) by Opta [26]
ery other team both at home and away so that any talent is a good example of this. The F24 data is a time coded
disparities apply to both home and away averages. In terms feed that lists all player action events within the game with
of shooting and passing proficiency, there was no signifi- a player, team, event type, minute and second for each ac-
1368
ement
acking
as over fixedwindows
information windows (?; ically hardness
solve
proach,
changedand
coded Sycara
MAPR ofthe provided single proposed
dynamic
structures problems andthat team
of another
multi-agent
in
ateam fully
groups the
membership recognizing
model
observed
of plan
agents, using
to team
prune algorithm
recognition,abeing
Sukthankar first-cut
trace
thethe sizeand andthat
of ap-
aplaybook
the li-en- with ing hardness
solve ically partial
aconnection new MAPR changed of team model, single 4problems
tracings. structures
revealingand inWethe
multi-agent of the show groups
model distinction this plan 5using ofbythe agents, framing
recognition, between
a first-cut Sukthan the the and ap
a aspatio-temporal
spatio-temporal oc- hardness of single
new statistics
sue by including
representing spatial location
behavior asof events
aaovercome
spatio-temporal gen- oc- cated the between 3and multi-agent
agents, to trackplan recognition, and
and Sycara proposed aadynam-
ment over fixed 4another recognizing algorithm that en
cation
nts. despite
er
ment of
tracking
By over events
the
analyzing use being
information
fixed of oc-
windows
aamatch gen- (?; plan
and solveproach,
proach, sue
In
often In
verifying
library.
cupancysuethis this
by
MAPR
new •by provided
unreliable paper,
representing
Characterize
paper,
provided map
•humans
statistics such
Banerjee
Characterize
representing problems
we
basedwe
as that
analysis
that propose
propose
there
including team
et aateam
on fully
team
al. in
ball
are
fully behavior
(?; ano
proposed
team the
aspatial method
observed
?).
tactics
method
movement
behavior model asas
This
tactics
quantitative
observed locationas
to to
ato is team
as overcome
spatio-temporal
function
using
over
ainput despite
formalize
of ameanstrace
function
fixed
events
spatio-temporal
team a this
trace this
and
of MAPR
first-cut
windows
of
being and the
use
is- ofis-
aoc-
oc- ali-
gen- of
the ap-
li- playbook
coded solve
hardness the size.
MAPR dynamic of single problems 5team and membership in multi-agent the model to prune using
plan the
recognition, first-cut
sizeand ofand ap-
the I(X
an
ents. By analyzing match and solve
Sycaraproach,
brary
Automatically coded
plan
erated •MAPR Show
of
proposed
cupancy the
provided
library.
by full that
dynamic
problems
team
map Banerjee
analyze teams
another that plans
(?)
based team
a
anin
and have
onfully
were
et the
recognizingmembership
unseen al.
player
ball unique
model
observed
given
proposed
movement match
tracking asplaying
using
algorithm to
teamto
in prune
a styles,
(?).first-cut
trace
formalize
terms
information
over that
fixed theofand
en-
windows despite
size
tac-
(?; ap-
MAPRa of
li- ically the hav- problem
hardness solve proach,
solve
changed and coded MAPR
Sycara
MAPR as
of theprovided a5single
structures team
dynamic problems
proposed
problems identification
and
that 4of
team another
5multi-agent
a groups inin fully membership theobserved
the task.
model
recognizing
model
of agents, plan using
to using teamrecognition,
prune Sukthankar algorithm
a trace first-cut
first-cut
the size that
ap-
of aap th l
fent over fixed windows • 5 6
nt ts.
r,nts. we By
tracking
over
eventsBy analyzing
can fixed
being information
analyzing windows
characterizegen- a match
a match the (?; with
verifying brary
sue suecupancy
new
a by by of
new representing
statistics
full
representing
such map
size. team
model,
analysis based
includingplans
team team
on
revealing
(?; ball
were behavior
spatial
behavior
?). This movement
given
the location
isas as
as
a a spatio-temporal
over
input of
spatio-temporal
distinction
despite the fixed
events
(?). between
use windows
being
of oc- oc-gen-
the plan proach, library. provided Characterize
6Banerjee 6
that et a team fully
al. proposed tacticsobserved as to a team function
formalize trace of
and MAPR the a pla
li-
s, we can characterize the coded proach,
brary
proach, the
brary ?) planof
with
time,
cupancy
dynamic
being of
erated
ofing
provided
of atime,provided
full
library.
full new
which
partial
generated.
by
map size.
team
team
team
which that
model,
we
humansbased
Banerjee that
plans
team
we
call
a
membership
plans a
on
fully
call
play-segments.
(?)
were
were fully
revealing
ball
tracings.
et and al.
observed
play-segments.givenobserved
player
movement
givento theWe
proposedprune
asas By
tracking
input
show
team
input
distinction
By
overanalyzing
team
the to (?).
this
trace
size
(?).
analyzing trace
information
fixed
formalize of by
and a
windows
between the
a
match
and
framing
matcha MAPR
li- (?;a and
the li- the
solve
Sycara solve
proach,
brary
proach, codedplan • MAPR MAPR
Show
of
proposed
• provided
library. the
provided full •7 team dynamic
that
problems Characterize
6problems
another
Banerjee teams that
that plans
5 team 6
a a in have fully
were
recognizing
fully et
in
the team
membership
al.
the uniqueobserved
model
given
observed proposed tactics
model as playing
algorithm using 7 input
team as
team
tousing
to a
prune a function
trace
styles,
(?).
trace
formalize
a
first-cut
that first-cut
the en-
and and size
despiteof
MAPap-
a a th
of
li- lia
od ng s. we to
By overcome
cananalyzing
informationcharacterize this
a (?;a match is-
the newtics. cupancycupancy
of
statistics time,
erated map map
which
by
including based
humans
based we call
oncall
spatialon (?)
ball ball
play-segments.
and movement
movement
location player of we By over
tracking
over
events analyzing
fixed fixed
being information windows
windows
gen- a match (?; • withAutomatically brary adynamic of
new full size. model, team analyze 7
plans revealing an
7awere unseen given the match
distinctionas input in terms
(?). between of tac-
By analyzing match hardness asofas ?) aof
beingplaybooksingle generated. of
and play-segments,
multi-agent plan can characterize
recognition, the
and a the
time, which we play-segments. By analyzing a match 6
ws s,
method we
for canto characterize
tactical
overcome analysis thistheof is- brary with of fullShow team that plans teams were have givenunique as input
playing (?). styles, despite hav- proach,
brary plan of ing a8full
library. provided partial15team size. 7 10plans
Banerjee that
team were
tracings. fully
et observed al. given observed
proposed We asdistinction input
10
show team to(?). (?).
formalize
this trace by and
framin li-ath
MA
ssws for tactical analysis of planbrary library. Inof as
English problem
full
this aa•as
Banerjee new
ateam
paper, playbook model,
as
plans
Premier we et aof team
al.
of
propose were revealing
proposedidentification
play-segments,
Leaguegiven
a tracking
method the
as
to input distinction
formalize
Ballwe task.
toinformationcan (?).
overcome
Action MAPR
characterize abetween
this Data is- theand coded the5 proach, the
brary with provided
of full new team team ×model, that
12 plans 7a 10
membership fully were
revealing given to the prune as team
input the trace
size of and
between the
10
aforspatio-temporal oc- ofof hardness
as ?)time,
as which
playbook
•thisof Show single
we that
call and teams
play-segments.
play-segments, multi-agent have unique
we By plan
can analyzing recognition,
playing
characterize styles,
match the despite × tics. 4 hav-
10 × Reference
ewe can characterize
tactical analysis the
of is- • erated time,by beinghumanswhich generated.
we (?) call and player
play-segments. By analyzing a (?;
match hardness of single 10 and multi-agent plan recognition, and
eethod to overcome this behavior of each team which allows for tactical analysis of
as as a playbook of play-segments, we can characterize the 15
wsior can
for
efficacy
as acharacterize
tactical
and analysis
usability
spatio-temporal the ofoc-
ofwith?) abeing
solve
Show sue new
as English
MAPR
English
hardness bythe
behavior
amodel,
In utilization
ing
representing ofof Premier
problemspaper,
Premier
partial
of single
each
revealing of
team
we
team
team and in
this League
proposethe
League approach
tracings.
which
the
behavior multi-agentmodela
allows
distinction Ball
method
Ball
asBall aWe using
on
for Action
to
an
Action
show
plantactical
between
spatio-temporal
overcome
aiPad, first-cut
this Data
recognition, which
Data
analysisbythe
the
this
oc- ap-
framing is-
1 plan
of and brary brary
library.
thehardness with ofEnglish
ofproblem
full a
Banerjee
• full Show
new
15 team
•ofproblems
team
Show Premier model,
as
single that
15
plans eta10 plans
that team
al. teams were
and revealing
proposed
teams
were
League have
identificationgiven
multi-agent
given
have unique to the
as Ball unique
as
input
formalize distinction
plan
input
playing
task. (?).
Action playing
(?).
recognition, styles,
MAPR between
J.;Data styles, despi and
ohe efficacy and usability solve •as
English playbook
MAPR Premier of
problems play-segments, League in the model we can
aAction
using characterize Data
aanalysis
first-cut the ap- Allen, N.; Templon, McNally,
ent
slish forover
refficacy
as
overcome fixed
atacticaland windows
analysis
usability
spatio-temporal
this is-windows ofoc- as behavior
aas team’s
behaviorIn
agenerated.
sue playbook
Automatically thisby ing
each
paper,
performance.
of each
representing ofpartial team
we
play-segments,
team propose
aaanalyze We team
whichwhich
team show tracings.
allows
aallows
an
behavior method
we
theunseen for
can
efficacy
for
as We
tactical
totactical show
overcome
characterize
match and
spatio-temporal in
usability
analysisthis this
terms aby
of of is-
oc- of framing
5•× tac- solve
Show ×English the
English
MAPR
the utilization20
15
Premier ×20 of
15
in
this League the approach model Ball using
on
20
Actionan aiPad, first-cut Datawhich ap-
for
vement tactical
efficacy
Premierover andanalysis
usability
League
fixed ofsoc- of hardness proach,
shows cupancy
solve English
behavior of ait’s provided
team’s
single MAPRviability
problem
mapof Premier
performance.
eachbased
and that
Problem
problems to
teamas on be
multi-agent fully
used
team
ball
which League
We observed
for
Formulation
identification
show
movement
in the
allows amateur
plan the
model Ball team
efficacy
over
recognition,
for level
Action
task.
fixed
using
tactical trace
and use. and
usability
windows
a
analysis Data
first-cut
and li-
2
of with ap-
5 5× 4a× 545× hardness
new
4 510
solve ×45
English
10
4 × ×
545×
10 8model,
410 510×55
MAPR4
8 ×
10 ×
4×
8 ing
5
15 ×10 4× ×
48 4of
10
15 8× ×84
15
10
ing Premier
partial
Premier
20 10 ×
12 single
8revealing
1015
10 15 8
12
15 ×
10
problems 8×
12
partial
×15 8 ×
15 × 8×
815
12 15team
12 820 and
15×12 15×15 12
teamLeague
15
League
×the
12 × ×
in tracings.
12multi-agent
×12 12
the distinction
12
tracings. Hammond, model Ball Ball We 25We show
plan
Action
usingAction
between
K. 2010.
show this
recognition,
a Data
References byRefere
Referenc
Referenc
the
StatsMonkey
Data
first-cut
this frami
Refer
Refe
by Ref ap R fR
ement
glish
ments.
glish
.efficacy
ish ByPremierPremier
analyzing
patio-temporal
ficacy over
and
Premier
By and League
League
fixed
usability
analyzing
aoc-
usability
League match
windows
aof
soc-
soc-of
soc-
match brary
In English
behavior a
proach,
this
our team’s
sue a team’s
of
paper,
method
tics.
cupancy
full
by ofprovided Premier
performance.
representing
team
we
each problem
on
performance.
map Problem
propose
team
thebased
Problem
plans
that as
which
2010-2011League
We
team
were amethod
aWe
on show
fully
teambehavior
Formulation
allows
show
ball
Formulation
given
the Ball
observed
English tofor
the
movement as
efficacy
identification as
overcome
input Action
a
tactical
Premier
efficacy team
overand
spatio-temporal
(?).and trace
task.
this
analysis
League
fixed Data
usabilityis-windows
usability and ofsoc- of 13a1hardness
oc-
of li-
1 111 proach,
shows 1 1solve
English
11 • 11it’s
English
of
Automatically
1provided
single MAPR viability 25
problem Premier 25 Premier
and
25
that
Problem
problems to as20be
multi-agent
analyze
aa25fully team
League used League in observed
for
Formulation
identification an
the
Allen, amateur
Narrative
plan Allen,
Ball
unseen
Allen,model Ball
Allen,
N.; team
recognition,
Allen, N.;
Allen,
N.;
Action
Writer.
Templon, match
level
Allen,
Allen,
N.; Action
task.
using
Templon,
Templon,
N.;
N.; trace
Allen,
Allen,
N.; In use.
Allen,
Templon,
Allen, in
AAAI
Templon,
Templon,
N.;
N.;
J.;a
Templon,
Data
and
N.; N.; terms
Data
Templon,
J.;
McNally,
Templon,
J.; To
N.;
N.; Fall
first-cut
and aDo
Templo
McNally
Templon
J.;
McNally
J.; li-
Sym
Temp
Temp
J.;McN
J.; Mc MoJ.PTJM
Problem Formulation
our method on the 2010-2011 English Premier League soc- proach, provided that a fully observed 30 team trace and To a D l
solve sueMAPR of our time,
aproach,
team’s
cupancy method which
performance.
provided
problems map on we the
based callin that play-segments.
the
2010-2011 We
on,show show
aanalyzefully
model
ball theaobserved
English
movement using By
efficacy
asPremieranalyzing
aBy
over and
team
first-cut League
fixed a
usability
trace match
ap-
1windows inof242solve
and
soc- a2of 25li- 2 2tics. 30 problem
30Problem Problem
30 as a team Formulation identification task.
we
ents.
hnts, ver can
shPremier Bycharacterize
Premier
fixed League
windows
analyzing
League a the
soc- soc-
match The by
tatrest brary
team’s
,,cer
tas representing
of
our oft2•the
data.
,,data.
cer
of time,
data.
full
performance. paper
, ttAutomatically
method 3,, ton
team
whichon team
tProblem
tAutomatically
4,, tthe
describes
the twe
5,,t2010-2011
behavior
plans
We
2010-2011
6,,ttcall
were the
7, ttplay-segments. t9asthe
Formulation
8, ,tanalyze
method
given an
English spatio-temporal
efficacy unseen
an
of
input
Premier and
unseen
doing
match(?).
usability
analyzing League oc-
match
this,
in aof terms
soc-
match
in353terms 3 The
22
333 3MAPR
brary tac- 2 2 222
of
rest proach,
brary tac-
of
of
full
9
the
of problems team
provided
full paper team
plans
Problem
25
in
describes
13 30
that
plans the were aanalyze model
were
Formulation
given
Formulation
fully the
17Hammond,
given
Hammond, asHammond,
Hammond,
Avrahami-Zilberbrand,
observed
method using input
Hammond,
Hammond,
as2010.
Hammond, K.Hammond,
aof
input
Hammond,
K. (?).
K.
2010.
team
first-cut
doing
Hammond,
Hammond,
2010.
K. Hammond,
Hammond,
2010.
K.
(?).
K.K. StatsMonkey:
2010.
Intrace
2010.
2010.
K.
K.
StatsMonke
2010.
StatsMonk
D.;
ap-
this,
2010.
K. K.
2010.
in
K.
StatsMo
StatsM
K.Stats
Banerje
and Stat
20120120
Fa
2SS
for
we can
tactical
characterize
aanalysis of the
the proach, cupancy
addition t our as
0our
0,provided
cer
braryt t
of
• 1method
method
1to
cer
,
atime,
Show
map
as
t t 2 playbook
of ,
experiments
data.
, as t •full
based
tics.
3 awhich
,ont the
that Problem
team
the
playbook
4 , t a
on
5of
utilization
we
,
play-segments,
fully
ball
2010-2011
t 6 plans
call
which
, of
t 7 movementFormulation
observed were of
play-segments.
play-segments,
, t 8 , t English
English
show 9 this
given we
team
over
the
can
Premier
approachas
Premierfixed
we By
suitability
characterize
trace
input
can windows
analyzing
League League
on
and (?). of
characterize ana li-
a
soc-
these
the
soc-
iPad,
match the 4proach,
which t 3
0
33
,
provided
t 33
1 , 3t2• , t Automatically
•3 , t
Problem
Automatically
4 , t Problem
5 , 30
t 6 , t 7 , t
Formulation
8Table , Formulation
t
analyze 9 Narrative Lyle,
2: an
Narrative
Narrative
Table unseen
Narrative
J.Narrative
Narrative
an
Writer.
Narrative
showingunseen
Writer.
Narrative
Writer.
Narrative match
Writer.
Narrative
In
Multi-AgentNarrative
Narrative
Writer.
Writer.
Narrative
AAAI
Writer.
In
the Writer.
AAAI
Writer.
AAAI
match
In in
In
Fall
In
identificatio terms
Writer.
Writer.
AAAI
InAAAIWriter.
Writer.
AAAI
Fall
Fall
PlanSympo
AAAI
In
InFall
in
AA
Sym
AA
Sym
Fa
ReIn
te
IF
s, analyzing
ows we cantactical
for match
characterize analysis of t0behavior ,We tas
English
0cer 11 , trefer
data. 22 , tof 3 tto
, each 4 , team
Premier
tics. t55team t66of,behaviors,
,play-segments. twhich
77, t88, tallows
League 99 By B, Ball aswe
for short,
tactical
Action observable
analysis Data of 46seg- 444a t4team
4 addition Table t040,brary 4
Table
2:
,t4t•1441Table ,t4Show
over
,to Table of
t22:2experiments
,,,ttaTable t2: 3full
tics. ,,,that
showing
match. tthe
Table
ttto 44,,showing
,team tt55a5utilization , fully
showing
Given
,time tt666plans
the
,which identification
, ttthe 7that
,,observed the 88were
ttidentification
,,ashow of
tidentification
t99team’s this given
and
rate team
the approach
possession
Algorithms.
by
rate as
suitability trace
breaking
rate by input by
breaking and
In on (?).
breaking
string of
AAAI. ana D.; li-
is
these iPad, Btw
ork
7 Avrahami-Zilberbrand,Avrahami-Zilberbrand,
Avrahami-Zilberbrand,Avrahami-Zilberbrand,
Avrahami-Zilberbrand,
Avrahami-Zilberbrand,
Avrahami-Zilberbrand, Avrahami-Zilberbrand,
Avrahami-Zilberbrand, Avrahami-Zilberbr
Avrahami-Zilberb
Avrahami-Zilberbra
Avrahami-Zilberb
D.; D.;
Banerjee,D.;
Banerj
Baner
D.;
D.;D.;
Ba
brary of time,
of cer full
We
tdata. which team
refer
shows
as a weplans
to
3playbook 4
tit’scall
team were
viability behaviors, given ttto
play-segments, , be as B, input
used as analyzing
for (?).
short, foramateur
can a match
observable
characterize level seg-
use. the t 5English
athe , We t
team , t refer over Table
3
Premier team
2: t Table
, t , behaviors,
t showing,
League t the
, t 9plans the B, Ball into
identification as short,
different
Action observable
rate time by
Dataseries
breaking and seg
,ato match.
tGiven ,,that aof team’s possession string is
the plans into different series and the field into differ-
Work fficacy for and usability of t0 ,tas
tasks t0awe1We
,We ,behavior
,English
refer
trefer
2mentioned. 34,to
23,,ttcoordinated t45ofteam tteach
,,PremierWe ,,team
7firstRelated
6behaviors,
, tbehaviors, ttmovement
78,,describewhich
League t B, Workallows asBall
our short,
dataset. tactical
Actionobservable analysis
Data 575of
seg- brary oflength 5full
,We t5t515a2the team
refer plans ttics.
into
plans
,to ateam
different
awere behaviors, time
given series as input and as theshort, (?). field into
observable differ- seg-
0 1 5shows 2 3 4 7 8
an wscharacterize the ments tof to team B, as
and short, action observable
executed seg- 5 555of5
a2Table ,tinto the 2: it’s resulting ,Table t5viability
showing
,number ttto the be used
identification
B, play-segments for J.amateur rate by
for level
breaking
each use.
6by 6a
plans
ork the tactical
efficacy and analysis
usability of of as a, behavior
team’st1playbook • performance.
Show ,of of the5 6 , tteam
play-segments, We
utilization show
Related
89 9we the
of efficacy
this
Work
can approach
characterize and usability
on the an ofiPad, tasks
6which ,6tments t06we ,We ,English
6refer
mentioned.trefer
,team T
tof ,team
athe over
tcoordinated tdifferent
over
plans team tmatch.
Premier We 6into
,areas.
,match. time
6behaviors,
,different
first ttmovement
Given series
Given
describeLeague
ent ,that ttime and
Lyle,
that
quantization Banerjee,
aB, Lyle,
team’s
seriesthe
Lyle, J.
asteam’s
aand
our Lyle,
Ballfield
2010.
Lyle,
J. J.
short, Lyle,
2010.
dataset.
and 2010.
B.;
possession
action areas. into
Lyle,
J.
Lyle,
Multi-Agent
2010.J.
2010.
2010.
Action
possession
the Lyle,
Kraemer, differ-
Lyle,
Multi-Agent
Multi-Agent
2010.
J.Lyle,
J.
The
fieldLyle,
observable Multi-Agent
2010.
2010.
Multi-Agent
J.
executed
J.intoJ.
2010.
Multi-Agent
L.;
results
string 2010.
Multi-Agent
J.
2010.Plan
2010.
Multi-Ag
Multi-Ag
Data
string Plan
andPlan
is Mul
Multi
Recog
are
differ- seg
by
Mu
Mu
Pla
is
LylPl
RRP
th
ork We ,to 45team tbehaviors, as short, observable seg-
6t60ent 3 ,T ,into
1
ments of coordinated each Related
movement which allows
Work and for
action tactical executed analysis by of aiPad, quantization
of 6length66ent •quantization Show3 areas. the the
resulting The results
utilization 7
Thenumber 8areresults the
9
of
of B, trace this
play-segments
are the aof approach the
trace team of for
the on each
team an iPad,
h
tactical Premier
rkefficacy League
analysis of soc- ments
We
ments
team ourWithWe
aThe
referEnglish
method
a refer
of
ofthe
(e.g.
team’s
rest coordinated
to • to
coordinated
pass
on ofShow
team Problem
the Premier
performance.
team
the
from the
paper
behaviors,
2010-2011behaviors,
Related
movement
movement
agent
We
utilization League
Formulation
describes
Ashow
show
to
English Work
B,
as and
agent
the
of
and asthe
short,Ball
efficacy
thisshort,
action
action
B)
Premier method Action
approach
observable
.analysis
A
and
observable
executed
executed
plan
League ofusability
can on
doing
soc- Data
seg-
by an
by
bede-
10 6of
seg-
athis,
de- a
7 777 in
6 possession
ments ments which
1 6
ent
We of quantization
English7of
of
refer
length
the
is length
ent
coordinated
14team
plans
coordinated therefore: to quantization
Show , team the
over
areas.
Premier , the
resulting
the
a7The
different match.
behaviors,movement
resulting
8
areas.
movement
Nutilization
results time
9(T Given
League
recognition.
number The −T
number areseries
and Plan )+1and
results
B, thethat
and
Algorithms.
and
ofand ofshort,
and
and
Algorithms.
Algorithms.
and
trace
Recognition:
as
. of Ball
If
play-segmentsthis
team’s
and
action
short, the
Algorithms.
play-segments
are
action the Algorithms.
Algorithms.
and
of and
the
field
Algorithms.
the
approach
Inand
Action
possession
possession
and
trace and
Algorithms. into
Algorithms.
and
AAAI.
In
In
team
executed
Formalization
observable
executed of
for
differ-
Algorithms.
Algorithms.
Algorithms.
AAAI.
AAAI.
Algorithms.
In In
InAAAI.
In
AAAI.
for
the
each
on
AAAI.
Data
string
AAAI.
In
In
by
each
team by
an
AAA
AAA
InaInI
an
sethiaa
kort and military domains,
he and Leagueusability of behavior shows it’s viability to B, be used for amateur 2
level 7use. recognition. 10 T T 14 N = 1
1369
7
66
(1) Manchester United (2) Chelsea (3) Manchester City (4) Arsenal (5) Tottenham
55
44
(6) Liverpool (7) Everton (8) Fulham (9) Aston Villa (10) Sunderland
33
22
(11) West Bromwich Albion (12) Newcastle (13) Stoke City (14) Bolton (15) Blackburn
11
0
0
(16) Wigan (17) Wolverhamption (18) Birmingham City (19) Blackpool (20) West Ham United
Figure 4: The mean entropy maps for each of the twenty English Premier League teams characterizing their
ball movement patterns. The maps have been normalized for teams attacking from left to right. The bright
red refers to high entropy scores (i.e. high variability) and the blue areas refer to low entropy scores (i.e very
predictable behavior).
counts, means the ball is moving quicker), we used the to- % Teams Correctly Identified
tal count of entires as the entropy measure normalizes this 50
information. 40
30
20
5. STRATEGY ANALYSIS 10
0
5.1 Discriminating Team Behavior Statistics Occ Maps Combined
Evaluating team strategy is a very difficult task. The ma-
jor hurdle to overcome is the absence of strategy labels. But Figure 5: The identification rate for correctly iden-
given we know the team identity, and assuming that teams tifying home performances using event-labeled data
exhibit similar behaviors over time, we can treat the task as (i.e. no location information), occupancy maps (i.e.
an identification problem. We can do this by answering using no event information, just location), and a combina-
only ball movement information, can we accurately identify tion of the two.
the most likely team?
We model team behavior using a codebook of past per-
formances. If a team’s behavior is consistent, then previous knowledge of where teams operate could boost the discrim-
matches will be a good predictor of future performances. inating power.
For the experiments, we used 380 games of the season and To conduct the experiments, we compared our occupancy
used a leave-one-match-out cross validation strategy to max- representation, to twenty-three match statistics currently
imize training and testing data. Before we investigate the used in analysis (e.g. passes, shots, tackles, fouls, aerials,
difference between home and away performances, we first possession, time- in-play etc.). We also combined the two
need to obtain the best possible representation. To evalu- inputs by concatenating the feature vectors. For classifica-
ate this, we wanted to see how effective event-labeled data tion, we used a k-Nearest Neighbor approach (with k = 30)
was in discriminating between different teams, and if having and all experiments were conducted using D = 10 × 8
1370
1 18
Exp Event-Labeled Occupancy Maps
HvH 19.26 38.79
HvA 16.09 30.08
Actual Team Ranking AvA 13.98 36.41
AvH 16.36 30.34
1371
100
(1) Man United (2) Chelsea (3) Man City (4) Arsenal (5) Tottenham
W=3.8%, H=7.6%, T=10.7% W=1.1%, H=2.5%, T=-0.5% W=1.5%, H=3.1%, T=8.7% W=3.9, H=4.2%, T=9.5% W=0.1%, H=3.0%, T=8.9% 50
(6) Liverpool (7) Everton (8) Fulham (9) Aston Villa (10) Sunderland
W=1.8%, H=3.5%, T=9.5% W=-2.9%, H=-0.3%, T=5.6% W=-0.2%, H=1.6%, T=4.5% W=0.1%, H=0.6%, T=0.2% W=2.6%, H=7.1%, T=8.4% 0
(11) West Bromwich Albion (12) Newcastle (13) Stoke City (14) Bolton (15) Blackburn -50
W=0.6%, H=3.1%, T=9.0% W=5.2%, H=6.7%, T=0.5% W=-3.7%, H=2.1%, T=1.9% W=6.2%, H=5.4%, T=8.0% W=8.1%, H=3.6%, T=7.3%
-100
(16) Wigan (17) Wolverhamption (18) Birmingham City (19) Blackpool (20) West Ham United
W=8.1%, H=12.4%, T=8.1% W=-4.3%, H=0.4%, T=4.1% W=5.0%, H=8.3%, T=13.8% W=-1.0, H=3.4%, T=11.9% W=-2.2%, H=-0.7%, T=3.6%
Figure 7: The normalized difference maps between home and away performances for all the 20 EPL teams. In
all maps, teams are attacking from left-to-right and a positive value refers to a team having more occupancy
in home games, while a negative value refers to more occupancy in away games. Percentages underneath
each team give a value on this difference (Key: W is whole field, H refers to forward half and T refers to the
attacking third.)
120
home and away performance example, we can show the vari-
Distortion Variance
1372
Distance from Mean 60
40
20
(a) (b)
0
0 5 10 15 20 Figure 10: Occupancy maps of the: (top) mean away
Opponent Number performance for Fulham, and (bottom) their perfor-
mance against Manchester United - in this match
Figure 9: Example of the distortion for each away they lost 2-0 and conceded early in the match.
performance of Fulham in the 2010-2011 season. In
match 16 they played Manchester United, and the
performance on the day was far different from their 8. REFERENCES
other away performances (they lost 2-0 on this oc- [1] S. Ali and M. Shah. Floor Fields for Tracking in High
casion). Density Crowd Scenes. In ECCV, 2008.
[2] N. Allen, J. Templon, P. McNally, L. Birnbaum, and
K. Hammond. StatsMonkey: A Data-Driven Sports
7. SUMMARY AND FUTURE WORK Narrative Writer. In AAAI Fall Symposium Series,
Most sports analytics approaches still only use event-labeled 2010.
statistics to drive analysis and decision-making despite the [3] BBC-Sports. Footballers may trial wearing microchips
influx of ball and player tracking data becoming available. to monitor health.
The reason why this new and rich source of information is www.bbc.co.uk/sport/0/football/21460038, 14 Feb
being neglected stems for the fact that it is continuous and 2013.
is extremely difficult to segment into categories which would [4] M. Beetz, N. von Hoyningen-Huene, B. Kirchlechner,
enable high-level analysis (e.g. team strategy labels). The S. Gedikli, F. Siles, M. Durus, and M. Lames.
emerging field of sports spatiotemporal analytics attempts ASPOGAMO: Automated Sports Game Analysis
to use spatiotemporal data such as ball and player track- Models. International Journal of Computer Science in
ing data to drive automatic team behavior/strategy analysis Sport, 8(1), 2009.
which would be extremely useful in all facets of the sports [5] P. Carr, Y. Sheikh, and I. Matthews. Monocular
industry (e.g. coaching, broadcasting, fantasy-games, video Object Detection using 3D Geometric Primitives. In
games, betting etc.). In this paper, we gave an overview ECCV, 2012.
of the types of sports analytics work being done both in [6] ESPNFC. http://espnfc.com/us/en/report/
industry as well as academia. Additionally, we gave a case- 292849/report.html?soccernet=true&cc=5901.
study which investigated possible reasons for why the home [7] K. Goldsberry. CourtVision: New Visual and Spatial
advantage exists in continuous sports like soccer. Using spa- Analytics for the NBA. In MIT Sloan Sports Analytics
tiotemporal data, we were able to show that teams at home Conference, 2012.
play have more possession in the attacking third. Coupled
[8] A. Gupta, P. Srinivasan, J. Shi, and L. Davis.
with the fact that the shooting and passing proficiencies are
Understanding Videos, Constructing Plots: Learning a
not significantly different, this observation can partially ex-
Visually Grounded Storyline Model from Annotated
plain why home teams have more shots and score more, and
Videos. In CVPR, 2009.
in-turn win more at home compared to away matches. Using
[9] Hawk-Eye. www.hawkeyeinnovations.co.uk.
our feature representation, we also showed examples where
pre and post game analysis can be performed. Specifically, [10] D. Henschen. IBM Serves New Tennis Analytics At
we were able to show the variation in home and away perfor- Wimbledon. www.informationweek.com/software/
mances for each team, as well as the ability to flag anomalous business-intelligence/
performances. ibm-serves-new-tennis-analytics-at-wimbl/
Our work also highlighted the importance of match con- 240002528, 23 June 2012.
text and the limiting factor it could have on training ex- [11] A. Hervieu and P. Bouthemy. Understanding sports
amples. In our future work, we will look at unsupervised video using players trajectories. In J. Zhang, L. Shao,
methods which cluster playing similar playing styles which L. Zhang, and G. Jones, editors, Intelligent Video
can enrich our training data set, without effecting its dis- Event Analysis and Understanding. Springer Berlin /
criminating power. Additionally, we are looking to extend Heidelberg, 2010.
this approach to focus on using player tracking information [12] S. Intille and A. Bobick. A Framework for Recognizing
to discover team formations and plays. Predicting team in- Multi-Agent Action from Visual Evidence. In AAAI,
teractions and subsequent performances and outcomes, es- 1999.
pecially when they have not played each other is another [13] K. Kim, M. Grundmann, A. Shamir, I. Matthews,
area focus of our research. As reliable high-level labels are J. Hodgins, and I. Essa. Motion Fields to Predict Play
almost impossible to obtain, predicting match outcomes as Evolution in Dynamic Sports Scenes. In CVPR, 2010.
our evaluation tool seems to be the best indicator of im- [14] M. Lewis. Moneyball: The Art of Winning an Unfair
proved team modeling. Game. Norton, 2003.
1373
[15] R. Li and R. Chellappa. Group Motion Segmentation [26] Opta Sports. www.optasports.com.
Using a Spatio-Temporal Driving Force Model. In [27] S. Pellegrini, A. Ess, K. Schindler, and L. van Gool.
CVPR, 2010. You’ll Never Walk Alone: Modeling Social Behavior
[16] R. Li, R. Chellappa, and S. Zhou. Learning for Multi-Target Tracking. In CVPR, 2009.
Multi-Modal Densities on Discriminative Temporal [28] M. Perse, M. Kristan, S. Kovacic, and J. Pers. A
Interaction Manifold for Group Activity Recognition. Trajectory-Based Analysis of Coordinated Team
In CVPR, 2009. Activity in Basketball Game. Computer Vision and
[17] W. Lu, J. Ting, K. Murphy, and J. Little. Identifying Image Understanding, 2008.
Players in Broadcast Sports Videos using Conditional [29] Prozone. www.prozonesports.com.
Random Fields. In CVPR, 2011. [30] B. Siddiquie, Y. Yacoob, and L. Davis. Recognizing
[18] P. Lucey, A. Bialkowski, P. Carr, E. Foote, and Plays in American Football Videos. Technical report,
I. Matthews. Characterizing Multi-Agent Team University of Maryland, 2009.
Behavior from Partial Team Tracings: Evidence from [31] SportsVision. www.sportsvision.com.
the English Premier League. In AAAI, 2012. [32] STATS SportsVU. www.sportvu.com.
[19] L. Madden. NFL to Follow Army’s Lead on Helmet [33] Statsheet. www.statsheet.com.
Sensors in Attempt to Prevent Head Injury. [34] D. Stracuzzi, A. Fern, K. Ali, R. Hess, J. Pinto, N. Li,
www.forbes.com/sites/lancemadden/2012/07/16/ T. Konik, and D. Shapiro. An Application of Transfer
nfl-to-follow-armys-lead-on-helmet-sensors-in/ to American Football: From Observation of Raw
-attempt-to-prevent-head-injury/, 16 July 2012. Video to Control in a Simulated Environment. AI
[20] R. Masheswaran, Y. Chang, A. Henehan, and Magazine, 32(2), 2011.
S. Danesis. Destructing the Rebound with Optical [35] X. Wei, P. Lucey, S. Morgan, and S. Sridharan.
Tracking Data. In MIT Sloan Sports Analytics Sweet-Spot: Using Spatiotemporal Data to Discover
Conference, 2012. and Predict Shots in Tennis. In MIT Sloan Sports
[21] V. Morariu and L. Davis. Multi-Agent Event Analytics Conference, 2013.
Recognition in Structured Scenarios. In CVPR, 2011. [36] C. Xu, Y. Zhang, G. Zhu, Y. Rui, H. Lu, and
[22] T. Moskowitz and L. Wertheim. Scorecasting: The Q. Huang. Using Webcast Text for Semantic Event
Hidden Influences Behind How Sports Are Played and Detection in Broadcast. T. Multimedia, 10(7), 2008.
Games Are Won. Crown Publishing Group, 2011. [37] Zonalmarking. www.zonalmarking.net.
[23] NBA Shot Charts. www.nba.com/hotspots.
[24] D. Oliver. Basketball on Paper: Rules and Tools for
Performance Analysis. Brassey’s, Incorporated, 2004.
[25] D. Oliver. Guide to the Total Quarterback Rating.
espn.go.com/nfl/story/_/id/6833215/
explaining-statistics-total-quarterback-rating,
4 August 2011.
1374