Network Centrality Measures and Their Correlation
Network Centrality Measures and Their Correlation
A P REPRINT
∗
Gareth D. Simons
arXiv:2106.14040v2 [physics.soc-ph] 1 Jul 2021
A BSTRACT
Street network analysis holds appeal as a tool for the assessment of pedestrian connectivity and its
relation to the intensity and mix of land-uses; however, application within urban-design triggers a
range of questions on implementary specifics due to a variety of theories, methods, and considerations
and it is not immediately clear which of these might be the most applicable at the pedestrian scale
in relation to land-uses. It is, furthermore, difficult to directly evaluate differing approaches on
a like-for-like basis without recourse to the underlying algorithms and computational workflows.
To this end, the cityseer-api Python package is here used to develop, compute, and compare a
range of centrality methods which are then applied to the Ordnance Survey Open Roads dataset for
Greater London. The centralities are correlated to high-resolution land-use and mixed-used measures
computed from the Ordnance Survey Points of Interest dataset for the same points of analysis using a
spatially precise methodology based on network distances to premise locations.
The comparisons show that mixed-uses correlate more strongly against closeness than between-
ness centralities; segmented measures tend to offer slightly stronger correlations than node-based
equivalents; weighted variants offer correlations similar to unweighted versions, but with a greater
degree of spatial specificity; simplest-path methods confer an advantage in the context of local
high-street mixed-uses but not necessarily for district-wide mixed-uses or land-use accessibilities;
and the application of centrality measures to the dual network does not offer tangible benefits over
the primal network.
Keywords computation · data-science · land-use analysis · morphometrics · network analysis · spatial analysis · urban
analytics · urban planning · urban morphology · urbanism
Streets interconnect assortments of people and places. Whereas their purpose seems obvious, their emergent properties
are less so. We intuitively sense that particular sidewalks are busier than others, that certain land-uses may have a
proclivity towards distinct locations, or that highly successful streets can become public destinations in their own right.
Although architects, urban designers, and planners may argue an instinctive appreciation for creating and nurturing such
streets, history has often proved otherwise. Planned streets — whether intentionally so or not — have in many cases
come to function antithetically to their historical purposes by prioritising connections to distant places at the expense of
pedestrians and local connectivity. Streets and interstitial spaces can thus cease functioning as spatially connective tissue,
leading to the implosion of public space while contributing to cities’ social and functional fragmentation, a symptom
of the broader paradigm of lower-density car-centric development characterised by a paucity of pedestrian-accessible
land-uses. Nevertheless, even where the intention is to create more dense and walkable forms of urban development, it
is not easy, without recourse to the incremental and evolutionary development typical of historical towns and cities, to
anticipate the emergent characteristics of planned street networks and how these might relate to the potential vibrancy
of land-uses.
Historical forms of urbanism evolved as an accretion of exchanges competing for access to the heart of villages, towns,
and cities. An agglomerative dynamic, these processes feed — and are fed by — flows of people, goods, capital, and
∗
Benchmark Urbanism [email protected]
Centrality methods at the pedestrian scale A P REPRINT
information cascading through the innumerable tendrils of urban networks reaching near or far. In congealed form, this
logic is fine-grained and porous, with characteristically small urban blocks enmeshed by street networks interleaving
numerous assortments of land-uses. Jacobs (1969) is quick to assure that any semblance of inefficiency is, actually, a
hallmark of complex systems and the particular reason why evolutionary forms of urbanism can be persistently vibrant
and resilient (Jane Jacobs 1961; Marshall 2009). Nevertheless, buoyed by the spirit of modernity, early 20th century
architects, planners, and officials set out to tear these locally integrative networks asunder, mandating the rearrangement
of cities around motor vehicles and the separation of land-uses (Hilberseimer 1955; Garnier 1989; Corbusier 1967).
Though rationalised as ordered, efficient, and hygienic, the tree-like network structures deployed by modernists
amounted to reductionism and a “compulsive desire for neatness and order” (Alexander 1967, p.11). By order of
magnitude, these networks precluded the local combinatorial possibilities available on the densely interconnected
semi-lattice structures of historical urban networks (Alexander 1964). A key ingredient had thus gone missing: planned
street networks which prioritised motor vehicles seldom facilitated locally complex and evolving arrangements of
interactions, the requisite lifeblood of healthy neighbourhoods and vibrant street-fronts (Harvey 1989; Jane Jacobs
1961).
The utopian cities envisioned by modernists were motivated by the zeitgeist of the times: belief in the power of rational
thought and the invincibility of technological progress. New forms of mobility and communications technologies
would compress space and time and (were supposed to) unify and connect the disparate communities and functionally
segregated regions of planned cities. In reality, these networks marginalised pedestrians and, as epitomised by Robert
Moses’ heavy-handed slum-clearances in New York City, went so far as to lay waste to entire communities (Flint
2011; Harvey 1989; Lyon 1999; Ellin 1999). The salient issue is that networks optimising space-time compression for
certain citizens frequently create barriers for others. Problematically, an inordinate amount of power is wielded by those
deciding or influencing the formation of such infrastructures and their consequent flows. These patterns represent an
exclusionary model of urban development premised on the dissolution of public space, a state of urban disembodiment
engendered through the network mechanisms of spatial bypass, selective access, and prohibitive pedestrian distances —
patterns we have come to associate with suburbia. (Graham and Marvin 2001; Pope 1996). These forms of planning
not only remain prevalent but have found new life in the more recent Smart Cities hype (Greenfield 2013; Hill 2013;
Townsend 2013; Sterling 2014).
If planned street networks are brittle and biased or exhibit unpredictable emergent properties, then how should planned
development proceed to preserve the cohesiveness of the public realm and enhance access for pedestrians? It is here
where network analytic methods hold appeal. By allowing the opportunity to evaluate both existing and proposed
street configurations, it becomes easier to make observations about their potential emergent properties as systems of
streets. Thus personal intuition can be bridged to demonstrable, rigorous, and scalable forms of analysis while retaining
integrity in light of complex systems interpretations of urbanism.
Early forms of network analysis arose within the context of the mathematical exploration of telecommunication
networks (Shimbel 1953) and have since gained widespread use and maturity in fields such as social network analysis
(Wasserman and Faust 1994). The intuition is that networks (graphs) consisting of nodes (vertices) are connected by
links (edges) to other nodes. For example, a social network may consist of people connected by relationships to other
people. Network analysis can then be used to answer how connected or ‘central’ a particular person is, thus inferring
how important or influential they might be within the network’s structure. Two of the more common measures of
importance are closeness centrality (Sabidussi 1966), how closely a node is located to other nodes, and betweenness
centrality (Freeman 1977), how often a node provides the link between other nodes. The computation of these measures
involves the use of algorithms calculating shortest-paths through the network: in the basic case, distance is topological
— the number of steps linking from node to node — but it is also possible to represent distance by weighting the links
between the nodes. For example, in street networks, these weights can correspond to the lengths of streets (links)
between road intersections (nodes), and measures such as closeness and betweenness can then be used to infer how
vital a particular location might be within the network.
To understand the broader context of street-network analysis in urban-design, it becomes necessary to distinguish
between global and local forms of analysis; primal and dual street network representations; and shortest-path as
opposed to simplest-path heuristics.
Network centrality methods can be applied to either the ‘global’ or the ‘local’ network. Global analysis scrutinises the
entirety of the given network in contrast to localised forms of analysis which instead iteratively visit each node using a
‘windowing’ methodology. This intuition is conveyed in Figure 1: each node is visited, all reachable nodes within a
specified distance are then identified and isolated from the network at large, and the analysis can then proceed on the
2
Centrality methods at the pedestrian scale A P REPRINT
Figure 1: For localised network analysis, metrics are calculated for each node in the network using a moving-window,
which takes into account all other nodes within the specified distance threshold. In the simple case, these are based on
crow-flies euclidean distances; in the more accurate case (shown), these are based on distances calculated over the street
network.
isolated locally accessible network. The algorithm then steps to the next node and repeats the process until the entirety
of the network is visited. Localised forms of analysis can be completed for a range of different distance thresholds, and
these can be selected to concur with considerations such as pedestrian walking tolerances.
There are important implications for global as opposed to local methods: the computational outputs of global measures
are directly coupled to the delineation of the overall network, which leads to the issue of how city boundaries are
consistently defined and whether related metrics can be normalised in a sufficiently representative manner for accurate
comparisons between different neighbourhoods, towns, and cities. A further drawback to global methods is that the
assessment of different local planning configurations can be unnecessarily encumbered by distant regions of the network,
which may have little impact on a pedestrian’s perception of the walkability of local streets and neighbourhoods. From
this point of view, whereas global properties of city networks can provide a wellspring of information about urban
agglomerations and their evolutionary dynamics over time, they can also be abstruse and needlessly complex for
the needs of urbanists and urban designers who are affected by more immediate and locally specific concerns: How
walkable might a new development be? How many commercial or retail locations may be appropriate based on a
15-minute walk to a particular location? For these purposes, windowed network analysis methods hold more appeal
because they remain contextually anchored; are capable of illustrating how properties of the pedestrian network are
tangibly affected by day-to-day planning decisions and allow for the resultant measures to be consistently interpreted
and compared from location to location or from city to city.
The primal representation of street networks is when intersections are represented by nodes and streets by links, thus
corresponding directly to conceptions of streets embedded in euclidean space: intersections adopt specified coordinates
connected by streets to other intersections. However, it is also possible to adopt the dual representation where, inversely,
nodes correspond to streets and links correspond to the connections between them (Rosvall et al. 2005; Marshall et al.
2018). In simpler cases, per Figure 2, the dual network can be a direct inversion of the primal. However, there are
also other more complex procedures for generating dual-network topologies, such as using the angular straightness of
streets or considerations such as street names as a basis for ‘collapsing’ adjacent street segments into a single (dual
representation) node which may then be individually linked to a potentially large number of other adjacent (dual
representation) nodes. These types of procedure can be used to extricate the dual representation from its euclidean
embedding, thus sacrificing spatial proximities in favour of a more abstract informational representation consisting
of topological links (Michael Batty 2004). Such approaches can be used to analyse properties such as the global
connectivity of the network and its small-world and scale-free characteristics, which are emblematic of complex systems
and network analysis more generally (Albert and Barabási 2002; Porta, Crucitti, and Latora 2006a).
A prevalent form of urban network analysis — space syntax (Bill Hillier and Hanson 1984) — emerged around the use
of the dual representation in order to quantify changes in direction from one street to another. Euclidean distances are
3
Centrality methods at the pedestrian scale A P REPRINT
0°
150m
80°
10° 100° 0°
0°
100m 45m 80m 0°
45m 45m
90° 90° 90°
45m 0°
90° 0°
Figure 2: The primal street network representation (left) and a corresponding example of a dual representation (right).
Note that different forms of dual representations exist: For example, traditional space syntax methods make use of axial
lines that collapse adjacent straight segments into a single node (not shown).
thereby waived in favour of the number of topological turns or, per more recent interpretations, the cumulative angular
changes in direction from turn to turn (see Figure 2). Straighter routes with better lines-of-sight are, therefore, considered
‘shorter’ than more convoluted routes even if the euclidean distances are greater, thus transforming the emphasis from
shortest paths to simplest paths. This strategy reflects the space syntax premise that cognitive way-finding in relation to
lines-of-sight take precedence to the distances traversed between locations and that routes requiring minimal geometric
complexity are favoured to routes requiring minimal physical effort (Hillier and Iida 2005; Serra and Bill Hillier 2019);
accordingly, geometric and topological properties of the street network are deemed primary determinants in the general
evolution of land-uses and the wider patterns of activities in cities (Penn et al. 1998; Bill Hillier, Turner, et al. 2007).
Space syntax concepts, history, and terminology is similar but not necessarily the same as that used more generally by
the network-analysis scientific community. This can lead to some ambiguity on points such as terminology, e.g. mean
depth instead of normalised farness, integration instead of normalised closeness, or choice instead of betweenness.
Some of the methods and interpretations used by space syntax may consequently appear esoteric to outsiders: by way of
example, where techniques on the extrapolation of axial lines (uninterrupted lines of sight) from the street network led
to drawn-out deliberations on how exactly this should be done and whether it is possible to do so in an algorithmically
rigorous manner (Turner, Alan Penn, and Bill Hillier 2005; Porta, Crucitti, and Latora 2006b; Ratti 2004). Newer and
more tractable methods have alleviated this particular debate — fractional analysis and the now prevalent angular
segment analysis — compute total angular change along road centrelines instead of counting topological turns from
axial line to axial line (Turner 2000; Dalton 2001; Turner and Dalton 2005; Turner 2007). Nevertheless, this has
introduced a new algorithmic challenge in that simplest-path algorithms will ‘side-step’ sharp angular turns in cases
where smaller combinations of adjacent angles can be combined instead (Figure 3), thus making it necessary to enforce
in and out directions for algorithms as they pass-through nodes during graph traversals.
Simplest vs shortest-path heuristics should not be construed as an either-or situation because the inter-relation is varied
(Barthelemy 2015) and assertions to the contrary should be treated with caution. Pedestrian route choices are complex
and can be based on a combination of shortest distances and cognitively simplest routes, and it can be reasoned that
this may vary from place to place and from person to person. Whereas simplest-path centralities do not directly take
distances into account, and shortest-path centralities do not directly take angular deviation into account, the reason why
either of these may work in isolation is that shortest routes are frequently the same as the simplest (Viana et al. 2013;
Omer and Kaplan 2018). Further, strong correlations are observed between shortest and simplest path heuristics at
certain distances (Bill Hillier, Turner, et al. 2007; Serra and Bill Hillier 2019). Where multiple compelling route options
exist, the mechanisms behind route choices are far from clear-cut. For example, a tourist might rely on lines-of-sight
to explore a city via boulevards and significant landmarks, but may, in other cases, follow shortest-paths through
the use of a map or a mobile device. Similarly, a resident might rely on sight to find a pub or deli in an unfamiliar
neighbourhood but may otherwise have access to an assortment of locally familiar locations, potentially hidden around
corners off-the-beaten-path to be fluidly connected in manifold non-simple route permutations influenced by varied
4
Centrality methods at the pedestrian scale A P REPRINT
80° 100°
0°
Figure 3: On dual networks using angular distances, shortest path algorithms can sidestep sharp angles by combining
adjacent shallower angles (left). This necessitates enforced in and out directions for each node (right) to ensure that the
algorithm continues-on in the same direction as which it arrived.
daily considerations. Modal choice presents an additional nuance, wherein it could be argued that pedestrians — in
contrast to motor vehicles — have a lower resistance to turns but a higher resistance to farther distances and may thus
actively choose routes that are shorter or less simple than the more axial routes frequented by vehicles (Serra and
Bill Hillier 2019). In yet other cases, pedestrians may opt for routes that are neither the shortest nor the simplest but
offer a more pleasant alternative free from traffic, noise, or fumes; or greater access to greenery, sunlight, shade, or
shallower gradients. In short, pedestrian route-choice considerations are varied, and simplest or shortest-path methods
can be thought of as proxies to some of the more prevalent patterns of spatial flows.
Research has shown support for simplest path measures within the context of vehicular travel, particularly at distances
larger than 2000m. However, there are hints that shortest path measures retain relevancy for smaller distances and
non-vehicular forms of transport such as cycling (Serra and Bill Hillier 2019), thus prompting broader questions
pertinent to urbanists and urban designers: specifically, which methods work well at the pedestrian scale regarding the
intensity and mix of walkable land-uses?
Whereas shortest-path routes are ordinarily applied to primal representations and simplest-path routes are most often
applied to dual representations, these concepts should not necessarily be conflated. The primal representation can
be used with simplest-path angular-segment-analysis methods if the algorithms are designed accordingly; likewise,
dual representations can be used with shortest-path methods if the dual network has not been removed from its spatial
embedding. There may, further, be potential synergies to be gained through the simultaneous application of both primal
and dual representations and the respective methods of inquiry suited to either approach (Masucci and Molinero 2016).
Localised measures are similar to the global equivalents from which they derive but only take nodes up to selected
distance thresholds into account. The procedure entails a moving-window approach (Figure 1), wherein the algorithm
iteratively visits each node from which it then proceeds to isolate all other nodes within the maximum distance
threshold. A contradiction is introduced for simplest-path analysis (Cooper 2015): the euclidean distance metric used
for the threshold differs from the angular distance metric used for the analysis, thus introducing an implicit euclidean
component. Further, when using simple radial (as opposed to network) distance cutoffs, this combination can lead
simplest-path algorithms to behave in unexpected ways, such as selecting routes traversing network distances that
substantially exceed the implied distance of a given radial cutoff: in some cases, even overshooting the threshold then
turning-back to end inside the radial boundary. Various strategies can be considered to mitigate such behaviour, and
the position taken in this analysis is that — in keeping with the implied distance of a stated threshold — the distances
5
Centrality methods at the pedestrian scale A P REPRINT
i
ld
ho
es
thr
al
di
ax
ra red route blue route
m 1500m precluded 1600m max exceeded
m
16
00 90° 270° 270°
650m
1600m 1300m
i
360° 270°
1200m 2450m
90° 90°
200m 400m
blue route start node
2750m 1600m network cutoff
Figure 4: Crude radial distance thresholds (left) can permit distances significantly exceeding the threshold, in some
cases also overshooting the threshold boundary. Conversely, the use of strict network distance thresholds (right) will in
some cases occlude nodes that may otherwise have been within the maximum distance threshold if using shortest-paths.
traversed by an algorithm should be strictly enforced based on actual network distances. Therefore, intuition and
behaviour remain consistent regardless of the heuristic and correspond more realistically to available routes for a given
pedestrian walking tolerance. In some cases, nodes that may have been within maximum network distance thresholds
as measured by shortest-paths will be occluded by the sometimes greater distances encountered along simplest-paths,
which is here accepted as an inherent and representative property of the routes preferred by the simplest-path heuristic
(Figure 4).
Network analysis algorithms are, of necessity, sensitive to the topological structure of the network, and poor quality
network datasets will lead to spurious results. For example, broken links will throw shortest-path algorithms off course,
and unnecessarily complex representations of features such as complex road intersections, private roadways in housing
estates, or individually mapped aisles in large parking lots will introduce incorrect centralities to the wider network.
This situation arises because network analysis methods are susceptible to topological distortions arising from variations
in the intensity of nodes for equivalent lengths of road segments. For example, a straight street segment may be
represented as a single link between two nodes, whereas an arced street of equivalent length is often represented with
additional nodes to approximate the curvature of the roadway. Each additional node results in more summations for all
other nodes within windowed distance thresholds, and this effect is further magnified when such nodes are densely
co-located, consequently skewing the outcome of centrality measures. The Ordnance Survey Open Roads dataset used
in the following analysis is of particularly high quality: it is already cleaned and simplified. It keeps the geometric
representation of roadways distinct from the topological structure of the network, thus minimising the stated issues. If
using topologically ‘messier’ networks from a source such as Open Street Map then it is generally necessary to first
clean and simplify the network, which should be done while preserving the geometry of the original street segments
so that accurate distances or angular changes in direction can be recovered. These techniques are facilitated by the
cityseer-api package and are discussed more comprehensively in Simons (2021c). Note that automated cleaning
routines which merge nodes and street geometries can result in some geometrical ‘quirks’ in the form of small twists or
turns. These tend to be sufficiently minor that shortest-path methods are not substantially degraded; however, it may be
necessary to manually check for downstream artefacts affecting simplest-path calculations (when using simplest-path
methods).
Another manifestation of this phenomenon is the topological divergence between primal and dual representations (see
Figure 5). Even though both derive from the same underlying structure, the outputs for the same network measures using
the same weights will yield different results; the dual representation generates higher equivalent metrics due to larger
quantities of nodes and links, which becomes more pronounced as the complexity of the network increases. Comparisons
between shortest-path measures on primal networks and simplest-path measures on dual networks must therefore
6
Centrality methods at the pedestrian scale A P REPRINT
Figure 5: Even if the underlying structure is the same, primal and dual representations are not directly comparable
because they have distinct topological characteristics. Measures computed on the dual network will demonstrate higher
effective centralities due to greater quantities of nodes and links, and this effect is amplified for densely interconnected
networks.
be avoided. The cityseer-api package computes angular weights using a dynamic methodology that accumulates
changes from road segment to road segment, thereby allowing simplest-path methods to be computed on the primal
network and facilitating the direct comparison of shortest and simplest-path (angular-segment-analysis) methods on the
same topological networks.
Note that whereas it may be tempting to resolve poor-quality networks or topological divergences through normalisation
by the number of nodes, this would prevent accurate comparisons from location to location (discussed at length below).
Another strategy is to weight the nodes by factors such as street lengths (Turner 2007) or the number of adjacent
buildings (Sevtsuk and Mekonnen 2012). The following analysis considers two such strategies: use of segmented
(continuous) forms of ‘length weighted’ centralities and the optional use of network decomposition to distribute nodes
more evenly across networks as an implicit form of length normalisation, with the benefit that variations can then be
captured at a higher resolution of analysis within road segments (Simons 2021c).
where farness is the sum of distances d from node i to all reachable nodes j (Sabidussi 1966). Closeness conveys
how proximate node i is to surrounding nodes, with the implication that high closeness centralities afford increased
likelihood of access and interaction. Closeness can be normalised by the number of nodes in the graph N (not including
node i, which gives N − 1) divided by the sum of the distances
N −1
N ormalised C(i) = P , (3)
j6=i d(i,j)
7
Centrality methods at the pedestrian scale A P REPRINT
The normalised form of Closeness tends to be the most commonly applied derivation in urban analysis; however, closer
inspection reveals that a series of challenges arise when applied to localised (windowed) network analysis. Per Figure 6
and Table 1, normalised closeness does not behave as intended for disconnected or ‘windowed’ graphs, in which cases
non-normalised harmonic closeness centrality (Marchiori and Latora 2000; Rochat 2009) of the form
X 1
Harmonic C(i) = (5)
d(i,j)
j6=i
behaves more suitably, with the difference being that the division happens prior to the summation. For context, the
normalised form of harmonic closeness centrality is the inverse of the harmonic mean of farness (as opposed to the
arithmetic mean in 4)
1
P
j6=i d(i,j)
N ormalised HC(i) = . (6)
N −1
Another option is to use improved closeness centrality proposed by Wasserman and Faust (1994)
(N − 1)i /(g − 1)
Improved C(i) = P , (7)
j6=i d(i,j) /(N − 1)i
which is intended for situations where a limited subset of nodes is reachable, and is defined as the ratio of the fraction
of reachable nodes (N − 1)i to the average distance to those nodes. The global number of nodes g is unknown, but
since this is effectively the worldwide street network shared by all subgraphs it can be argued that the reachable nodes
(N − 1)i can forego the normalisation by (g − 1), thus giving:
(N − 1)i (N − 1)2i
C(i) = P =P . (8)
j6=i d(i,j) /(N − 1)i j6=i d(i,j)
Note that if global nodes g were instead defined as the ‘parent’ city, then results would not be comparable between
locations, and this would otherwise become problematic when arbitrary boundaries delineate networks spanning
adjacent towns or cities. The intuition of the resultant form can be contrasted to normalised Closeness (Eq: 3) with the
number of nodes (N − 1)i being divided by the average instead of the total sum of distances to those nodes. Improved
Closeness, therefore, behaves more intuitively: it increases either for a greater number of locally accessible nodes or if
the average distance to those nodes decreases.
i i 100m i 150m
Figure 6 illustrates these formulations more intuitively. Scenario B should be expected to have a greater closeness
within an urban context than A because it is equivalently close to a greater number of nodes. Scenario C contains a more
distant node and should therefore have a lower closeness centrality than Scenario B but should still be greater than A.
As shown in Table 1, Closeness counter-intuitively decreases from Scenario 1 to 2, whereas normalised Closeness
is also problematic because it neutralises variations between differently sized windowed subgraphs, rendering them
8
Centrality methods at the pedestrian scale A P REPRINT
incomparable. Harmonic closeness centrality and the modified form of improved closeness centrality variants behave
as anticipated. Out of these, harmonic Closeness is technically more precise because it considers the inverse of the
distances independently in contrast to improved Closeness, which first averages the distances; yet, for the same reason,
the averaging implicit with improved Closeness may be advantageous when working with poorer quality or un-simplified
representations of street networks. This is because nodes placed particularly close to other nodes would otherwise
have an unexpectedly large impact on the summed closeness (see Figure 7 and Table 2). The issue is exacerbated by
simplest-path centralities because there are situations where angular distances approach zero for small local distance
thresholds, even when averaged, thus causing infinitely large values through division by zero.
F arness(i) 100 + 100 + 100 = 300 100 + 100 + 100 + 100 = 400 100 + 100 + 150 + 100 = 450
100+100+100 100+100+100+100 100+100+150+100
N orm. F(i) 3
= 100 4
= 100 4
= 112.5
1 1 1
Closeness(i) 100+100+100
= 0.003 100+100+100+100
= 0.0025 100+100+150+100
= 0.002
3 4 4
N orm. C(i) 100+100+100
= 0.01 100+100+100+100
= 0.01 100+100+150+100
= 0.008
1 1 1 1 1 1 1 1 1 1 1
Harmonic C(i) 100
+ 100 + 100 = 0.03 100
+ 100 + 100 + 100 = 0.04 100
+ 100 + 150 + 100 = 0.036
1 1 1 1 1 1 1 1 1 1 1
N orm. HC(i) ( 100 + 100 + 100 )/3 = 0.01 ( 100 + 100 + 100 + 100 )/4 = 0.01 ( 100 + 100 + 150 + 100 )/4 = 0.0092
3 4 4
Improved C(i) (100+100+100)/3
= 0.03 (100+100+100+100)/4
= 0.04 (100+100+150+100)/4
= 0.035
100m 100m
100m 100m
5m
i 100m i
100m 100m
Scenario D Scenario E
Scenario D Scenario E
1 1 1 1 1 1 1 1
Harmonic C(i) 100
+ 100
+
100
+ 200 = 0.035 100
+ 100
+ 5
+ 100
= 0.23
4 4
Improved C(i) (100+100+100+200)/4
= 0.032 (100+100+5+100)/4
= 0.052
Table 2: Comparative closeness scenarios showing indirect segments and very short segments. Improved Closeness is
less susceptible to large numeric variations introduced by very short street segments.
The effects of particularly nearby nodes segue into a broader issue: topological distortions due to varying intensities of
nodes can distort the outcome of centrality measures. For this reason, it can be beneficial to consider the measures in
a street-length-weighted form, particularly if dealing with ‘messy’ sources of graph topology. In this form, a node’s
9
Centrality methods at the pedestrian scale A P REPRINT
contribution to centrality is weighted by the length of the adjacent street segments because greater exposure to street
lengths offers a greater potential for interaction. Accordingly, the contribution of unusually high concentrations of
nodes would be tempered by correspondingly shorter street segments and vice-versa. In practical terms, half of the
length of a street segment is assigned to the node on either end, with the aggregated lengths per street node l then used
to weight the measure in the numerator:
X l
length wt. HC(i) = . (9)
d(i,j)
j6=i
Although the approach seems relatively straightforward, a drawback is that continuously decaying spatial impedances
(e.g. 1/d) are first discretised at node locations and are then, in effect, mapped back onto the adjacent street segments
through the process of street-length weighting, resulting in step-wise (rather than continuous) increases in spatial
impedances as distances increase over the network. A further implication is that the half-segments adjacent to the origin
node must be ignored; otherwise, infinitely large Closeness arises through division by zero distance.
An alternative is to use the integral of harmonic closeness. The integral for f (x) = 1/d takes the form
Z b
f (x) dx = ln(b) − ln(a) (10)
a
and can be applied to sum the ‘area under the (spatial impedance) curve’ for the respective lower and upper segment
bounds a and b for all reachable segments S:
XS Z b XS
continuous HC(i) = f (x) dx = ln(b) − ln(a) . (11)
(a,b) a (a,b)
This allows spatial impedances to increase continuously. For example, a 10m street segment adjacent to the origin
is now found as ln(10) = 2.303 and a segment from 10m to 20m distant is found as ln(20) − ln(10) = 0.693. The
continuous form means that aggregations remain consistent regardless of how many times street lengths are split at
intervening nodes, i.e. ln(20) = ln(10) + (ln(20) − ln(10)) = 2.996, and this form is therefore relatively robust if
applied to ‘messy’ graph topologies.
50m 50m
i i
50m 50m
50m 50m 50m
150m
Scenario F Scenario G
The length-weighted (9) and continuous form (11) of harmonic closeness are compared in Table 3 across all above
scenarios, including two additional scenarios detailed in Figure 8. In length-weighted harmonic Closeness, the street
segments are divided and assigned to the respective nodes on either side, meaning that the portions adjacent to the
origin nodes are not considered. The measure generally performs as intended, but the lack of accommodation for
origin segments presents some difficulties; for example, Scenario C yields an equivalent value to Scenario B, and
Scenario D counter-intuitively yields a higher value than B or C. Further, differences in node densities per F and G are
not neutralised despite weighting by segment lengths. The continuous form resolves the issues mentioned above and
10
Centrality methods at the pedestrian scale A P REPRINT
P l
Rx
length wt. HC(i) = j6=i d(i,j) Harmonic C(i) (0, x) = 0
ln(x) dx
50 50 50
Scenario A 100
+ 100 + 100 = 1.5 ln(100) + ln(100) + ln(100) = 13.82
50 50 50 50
Scenario B 100
+ 100
+ 100
+ 100
= 2.0 ln(100) + ln(100) + ln(100) + ln(100) = 18.42
50 50 75 50
Scenario C 100
+ 100 + 150 + 100 = 2.0 ln(100) + ln(100) + ln(150) + ln(100) = 18.83
50
Scenario D 100
+ 100
100
50
+ 100 50
+ 200 = 2.25 ln(100) + ln(100) + ln(100) + (ln(200) − ln(100)) = 14.51
50 50 2.5 50
Scenario E 100
+ 100
+ 50
+ 100
=2 ln(100) + ln(100) + ln(100) + ln(5) = 15.42
50 50 50 25
Scenario F ( 50 + 100 + 150 + 200 ) ∗ 2 = 3.92 (simplif ied) ln(200) ∗ 2 = 10.60
Scenario G ( 100
50
+ 20075
) ∗ 2 = 4.75 (simplif ied) ln(200) ∗ 2 = 10.60
Table 3: Length weighted closeness comparisons comparing street-length weighted harmonic closeness and a continuous
form of harmonic closeness.
behaves intuitively: Scenario D is greater than A but less than B or C, Scenario E is greater than A but less than B,
Scenario C is greater than B, and different node densities are neutralised per F and G.
Additional nuances need consideration in the case of angular centralities. Whereas forms of harmonic or improved
Closeness are suitable, a decision must be made about how angular distances are scaled. Turner (2007) and Hillier
and Iida (2005) have suggested scaling to 0 for no turns and 2 for 180◦ turns; however, in the case of harmonic
Closeness, this would leave the division through zero conundrum unresolved and would thus imply that pedestrians
would walk infinitely far on streets with 0◦ angular impedance. A scaling of 1 through 3 may be preferable to cap a
node’s contribution to the unit length l instead of ∞. In the case of street-length weighted variants, the continuous
form suggested per Equation 11 cannot be used because simplest-path impedances do not increase continuously. An
alternative strategy can be considered wherein segment lengths in metres are divided by angular impedances, e.g. a
100m segment encountered at 100m distance from the origin at 0◦ of angular change would contribute 100 1 whereas the
equivalent segment encountered around a 90◦ corner would be divided by 2; by 3 in the case of 180◦ ; etc.
Note that segment-based methods should not be computed on the dual graph because the resultant proliferation of
overlapping edges would result in duplicitous summations compared to the originating primal street edges.
The use of spatial impedance as an accessibility measure, which is often referred to as the gravity index, offers a more
elegant approach to the weighting of distances. In contrast to street network analysis, which emphasises the physicalism
of the road network structure as an artefactual manifestation of flows, ‘gravity’ based methods based on impedances
shift the emphasis to the potential flow of interactions as they might unfold through space from the nodes across such
structures (Michael. Batty 2013; Hansen 1959; Rutherford 1979). The distinction may seem pedantic, and whereas the
spatial impedance function bears some resemblance to harmonic closeness centrality, it introduces a more sophisticated
approach in that it more explicitly models spatial impedances. It accordingly reflects the potential for spatial interaction
from nodes j to node i and — like gravity — how the potential for this interaction decays with distance. This typically
takes the form of the negative exponential
X
Gravity(i) = exp(−β · d) (12)
j6=i
where the rate of decay β (in the negative exponential) can be set to model specific trip-purposes or transportation
modes to reflect people’s willingness to travel a given distance d to particular types of locations (Sevtsuk and Mekonnen
2012; S. L. Handy and Niemeier 1997; Iacono et al. 2008).
Note that the term gravity index can be somewhat misleading: full-fledged gravity models form the basis of broader
land-use and transportation modelling where the attraction of the origins and destinations are, as per gravity, taken
into account. However, the gravity index generally assumes equal attractions from each node to every other node and,
as such, is simply a means to provide distance-weighted counts of accessible locations j proximate to i at the given
impedance β. This approach works well for quantifying access to specific land-uses or, in the case of the street network
structure, quantifying physical access from nodes j to node i.
Gravity measures are inherently localised and, unlike windowed network measures, do not strictly require distance
cutoffs. It is nevertheless computationally advantageous to retain thresholds commensurate with distances at which
11
Centrality methods at the pedestrian scale A P REPRINT
w = exp( d) = 4/dmax
1.0
dmax = 100 = 0.04 = 35m
dmax = 200 = 0.02 = 70m
dmax = 400 = 0.01 = 140m
0.8
dmax = 800 = 0.005 = 281m
dmax = 1600 = 0.0025 = 562m
effective weight w
0.6
0.4
0.2
wmin
0.0
100 200 dmax = 400 dmax = 800 dmax = 1600
distance in metres d
Figure 9: Spatial impedance curves for different β parameters. Nearer locations can be weighted more heavily than
farther locations through use of the negative exponential decay function.
the decay renders additional computation sufficiently negligible; for the proceeding discussion, a range of impedances
(Figure 9) is applied with the respective β parameters anchored to distance thresholds dmax through β = 4/dmax . For
example, a 100m distance threshold dmax corresponds to β = 0.04 and gives an average trip distance of 35m. The
distances and the corresponding impedances are selected with pedestrians in mind, where, for example, β = 0.005
(800m dmax ) may represent relatively typical walking distances to bus stops (Sevtsuk, Kalvo, and Ekmekci 2016;
S. L. Handy and Niemeier 1997). Realistically, these values vary significantly based on the purpose of the trip because
pedestrians may be willing to walk much farther for purposes such as recreation and fitness than for purposes such as
shopping (Iacono et al. 2008). These parameters also vary based on location: it can be argued that North American
contexts are more walking suppressive (Reyer et al. 2014) than European equivalents. Regardless, pedestrians tend to
be unwilling to walk distances greater than a mile (1600m, β ≈ 0.0025) for non-recreational purposes and, per the
exponential decay function, are more likely to walk to nearer locations than those farther away (Baradaran and Ramjerdi
2001; Scheurer and Curtis 2007; Duncan et al. 2011; S. Handy and Clifton 2001; Harris 2001; Bates 2007).
As with closeness centrality, it may be preferable to use the gravity index in a continuous form when addressing varied
intensities of nodes, poor quality network representations, or non-simplified networks. In this case, gravity is computed
as the area under the curve for all reachable segments S for the respective lower and upper segment bounds a and b at
the specified impedance β:
S Z b S
X X exp(−β · b) − exp(−β · a)
continuous G(i) = f (x) dx = . (13)
a −β
(a,b) (a,b)
The shortest path from any node j to any other node k will pass through an assortment of nodes i, that is, unless j and k
are directly adjacent. Betweenness
X X
Betweenness(i) = n(j, k) i (14)
j6=i k6=j6=i
is the summation of shortest paths between all (j, k) pairs of nodes passing through a given node i (Freeman 1977), and
in the case of street networks conveys how likely a street is to be traversed by people travelling between other locations.
Topological networks, which count the number of topological steps instead of summing distance weights, will frequently
have multiple equally shortest paths for a particular (j, k) pair. In these cases, the proportion of routes passing through
12
Centrality methods at the pedestrian scale A P REPRINT
node i is used instead, such that if there were two shortest paths, then the sum is incremented by 12 for the two respective
routes, and likewise by 13 if there were three. In the case of street networks weighted by distance or angular change,
it is unlikely for two exactly equidistant shortest paths to exist, and this step can often be disregarded. An exception
would be cities with heavily gridded layouts such as Manhattan or Barcelona, in which case routes within a tolerance of
similarity could be considered equidistant, and therefore with proportional weightings used instead.
Localised betweenness, similar to localised Closeness, only considers other (j, k) node pairs within the threshold cutoff
distances. Unlike Closeness, global formulations applied to windowed graph analysis do not introduce unexpected
behaviour.
For global betweenness measures, normalisation is beneficial unless there is a consistent methodological process for
how the network boundaries were derived from the topological structure of the network, such as the network percolation
based methods described in Arcaute et al. (2016). The boundaries would otherwise be arbitrary, and the number of
shortest paths passing through a node would depend significantly on the extents of the chosen boundaries. However, for
windowed betweenness, the distance thresholds are, in effect, already a form of implicit normalisation, and further use
of explicit normalisation based on the parent-graph would prevent comparisons across locations or distance thresholds.
Similar to windowed closeness measures, it would also lead to counter-intuitive results due to differing quantities of
nodes.
Whereas Closeness and the gravity accessibility index directly take distances into account, betweenness only counts the
number of shortest paths passing through a location. Betweenness can, however, be weighted by distances:
X X
Betweenness(i) = n(j, k) i · exp(−β · d) , (15)
j6=i k6=j6=i
in which case the negative exponential (see 12) reflects the notion that trips between closely located (j, k) node-pairs
are more likely to occur than those located farther apart, with d in this case representing the corresponding trip distance
for a given (j, k) pair of nodes passing through node i.
The previously mentioned measures are useful for exploring the role of shortest and simplest (angular) paths in network
structures, but the richness of potential route choice options remains implicit. It is therefore interesting to consider the
use of network cycles, simply a count of reachable cycles
cycles
X
N etwork Cycles(i) = 1 (16)
j6=i
to explicitly explore the potential complexity of available route options in a sense articulated by Jane Jacobs (1961), who
argued that street configurations should permit ‘pools of cross-use’ by providing a variety of overlapping route options
that can be ‘mixed and mingled’ within pedestrian walking thresholds. Cycles are analogous to a route encircling
a block: setting off in one direction permits circling back towards the same location or mixing and matching many
potential route configurations at intersections. Cycles reflect the potential complexity of interleaving route options as
may be typified by gridded layouts in contrast to suburban cul-de-sacs designed to minimise such cycles. One drawback
to the network cycles measure is that it does not work well for smaller distance thresholds at which cycles are precluded.
The following analysis is an exploratory comparison of local network centralities, which are correlated to local mixed-
uses and local land-use accessibilities with the intention of understanding which centralities relate most intuitively to
notions of pedestrian-accessible land-uses. The network dataset consists of the full street network for Greater London
as derived from the UK Ordnance Survey Open Roads dataset in both the primal and dual representations. The primal
network consists of 192,843 nodes, with the street network buffered to the maximum distance threshold of 8km to avoid
perturbations due to edge-effects (Gil 2017). Land-uses are derived from the UK Ordnance Survey Points of Interest
(POI) dataset consisting of 388,323 points with Eastings and Northings for each location. These are classed according
to a schema consisting of 617 categories and are assigned to the network nodes on either side of the closest adjacent
street edge. Land-use accessibility and mixed-use measures are computed with the cityseer-api package, affording
the use of dynamic land-use aggregation workflows based on distances computed directly over the street network while
taking the direction of approach into account. These workflows avoid larger-scaled gridded or zonal aggregations of
land-uses, which otherwise incur a loss of resolution while inducing a disassociation between the observed centralities
13
Centrality methods at the pedestrian scale A P REPRINT
and land-uses for a given street-front location. These methods are described at length in Simons (2021b) and Simons
(2021c)
The correlation plots make use of Spearman Rank ρ correlation coefficients (step-wise monotonicity of the data)
as opposed to Pearson’s r (linearity of the data) because heavily skewed datasets, as is the case with betweenness
measures, would otherwise require aggressive preprocessing steps (e.g. max-log optimised boxcox transformations).
The observations must be framed with the caveat that no network measure should be expected to correlate perfectly to
land-uses because there are broader considerations at work, including population densities, the nature of the building
fabric, affordability, land-use regulations, etc. It should also be noted that correlations tend to be boosted at larger
distance thresholds because increasing aggregation levels tend to smooth the variance of the data. This phenomenon
is a characteristic of the Modifiable Areal Unit Problem (Fotheringham and Wong 1991) with the implication that
correlations are not directly equivalent across different scales of aggregation.
80
60
40
20
0
5000
0
0 1000 2000 3000 4000 5000 6000 103 104 105
segment weighted closeness dmax = 1000m log segment weighted betweenness dmax = 1000m
Figure 10: Example distributions and hexbin plots for gravity and weighted betweenness against mixed-uses. Closeness
related measures (such as gravity and node densities) show heavier distributions at smaller values with higher values
reflecting a relatively linear increase in relation to mixed-uses. Betweenness measures assume a more ambiguous
relationship.
A range of centralities is computed from 50m to 8km with a visual survey of the 400m and 1600m cases plotted for a
selection of measures in the full page plots from Figures 12 through 16.
Figure 10 shows that closeness-based measures assume a heavier distribution at smaller values with higher values
assuming a relatively linear increase in relation to the presence of mixed-uses. Betweenness is more uncertain, with
upper outliers associated with both lower and higher values of mixed-uses, perhaps reflecting the intuition that large
volumes of through-traffic may in some cases be accompanied by a high quantity and variety of land-uses, such
as high-streets, whereas being devoid thereof in other cases, such as motorways. This observation is reinforced in
Figure 11, where the most intensive levels of mixed-uses tend to increase with closeness while being more loosely
associated with betweenness.
The subsequent correlation plots are computed against distance weighted mixed-use richness (Hillq=0 ) and Food
Retail accessibility at the smaller distance threshold of β = 0.013̄ / dmax = 300m as a proxy for high streets (see
Figure 17 and Figure 18), with distance-weighted mixed-use richness (Hillq=0 ) and Commercial accessibilities used at
the larger distance threshold of β = 0.004 / dmax = 1000m as a proxy for wider mixed-use districts (see Figure 19 and
Figure 20). Note that the use of weighting based on spatial impedances means that these measures are more locally
focussed than the dmax values may otherwise imply. Each of these figures is shown as two correlation gridplots: the
first representing correlations of centralities to raw mixed-use and land-use calculations while the second shows the
same measures normalised by street lengths. The latter controls for network effects, which can be illustrated with the
use of randomised land-use locations generated by randomly ‘scrambling’ existing land-uses within the extents of
14
Centrality methods at the pedestrian scale A P REPRINT
101
mixed-use richness
103
Figure 11: Hexbin plot showing gravity and weighted betweenness, with higher degrees of mixed-uses represented by
increasing intensities of colour. Higher mixed-uses associate with a higher degree of closeness, with the relationship
less well defined for betweenness.
Greater London; so doing, land-uses become more evenly distributed through space instead of the typically clustered
land-use hot-spots observed for real-world scenarios. The measurement of randomised mixed-uses at the smaller
distance threshold (Figure 21) would therefore invoke some peculiar results: streets ordinarily expected to have higher
access to mixed-uses would instead have lower access and vice-versa, and the correlations are accordingly mostly
negligible. The situation changes, however, once expanding the land-use aggregation thresholds to larger distances
(Figure 22) which now gain higher access to mixed-uses purely as a function of higher levels of network centrality. This
effect of the network on the measured mixed-uses can be mitigated through street length normalisation. However, it
should be noted that the non-normalised correlations remain a valid form of observation, albeit with the proviso that
access to land-uses can increase both as a function of the intensity of land-uses per unit street-frontage or as a function
of greater network centrality, which increases the amount of accessible street-frontage.
Generally: closeness-like measures (including density and β variants) offer stronger correlations to mixed-uses for both
the smaller high-street and larger neighbourhood-scale mixed-use scenarios; global variants of closeness are wholly
ineffective and behave counter-intuitively; segmented versions offer stronger correlations than node-based versions,
more so at smaller distances; and weighted measures offer similar or slightly better correlations than unweighted
measures, though at equivalently larger distance thresholds. The β weighted measures are distinguished by an explicit
spatial impedance that weights nearer locations substantially more heavily than distant locations (Figure 9). The
aggressive β used at smaller distance thresholds causes a ‘lag’ for correlations when compared to non β measures
at equivalent dmax , but may ultimately achieve similar or higher correlations at larger thresholds while retaining a
greater degree of local specificity than the distance thresholds might otherwise imply. Simplest-path angular methods
confer an advantage for smaller distance centralities in the context of mixed-uses, though this advantage tends to
dissipate when compared to retail and commercial accessibilities, particularly at the larger scale. This observation
may correspond to the intuition that angular methods are suited to the accentuation of high-streets, which are often
linear and located along thoroughfares through historic town or city centres. In contrast, more expansive mixed-use
districts demonstrate a more agglomerative and fractal (space-filling) dynamic which is better suited to shortest paths.
Interestingly, the comparatively crude node-density and segment-density measures perform similarly or better than the
more sophisticated measures at the smallest distance thresholds. This observation changes at larger thresholds where
the more refined measures take the advantage: it could be surmised that brute access to as many locations as possible
remains an overriding consideration, particularly at smaller walking tolerances.
Note that two issues arise when comparing correlations at successively larger distance thresholds. The first is related to
the Modifiable Areal Unit Problem (Fotheringham and Wong 1991): the larger the threshold, the larger the aggregation
and the more likely it is that smoother distributions with reduced variances will boost correlations. The second is that
properties of smaller distance thresholds become subsumed by larger thresholds, with the implication that correlations
15
Centrality methods at the pedestrian scale A P REPRINT
Figure 12: Comparative plots of selected primal centrality measures for inner London (part 1). Distance thresholds as
indicated. Normalised Closeness is included for the purpose of demonstrating the difficulty with using this measure for
localised forms of network analysis.
16
Centrality methods at the pedestrian scale A P REPRINT
Node Harmonic Angular dmax = 400m Node Harmonic Angular dmax = 1600m
Figure 13: Comparative plots of selected primal centrality measures for inner London (part 2). Distance thresholds as
indicated.
17
Centrality methods at the pedestrian scale A P REPRINT
Segment Harmonic Hybrid dmax = 400m Segment Harmonic Hybrid dmax = 1600m
Figure 14: Comparative plots of selected primal centrality measures for inner London (part 3). Distance thresholds as
indicated.
18
Centrality methods at the pedestrian scale A P REPRINT
Node Betweenness Angular dmax = 400m Node Betweenness Angular dmax = 1600m
Figure 15: Comparative plots of selected primal centrality measures for inner London (part 4). Distance thresholds as
indicated.
19
Centrality methods at the pedestrian scale A P REPRINT
Segment Betweenness Hybrid dmax = 400m Segment Betweenness Hybrid dmax = 1600m
Figure 16: Comparative plots of selected primal centrality measures for inner London (part 5). Distance thresholds as
indicated.
20
Centrality methods at the pedestrian scale A P REPRINT
1600
3200
6400
8000
1600
3200
6400
8000
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.28 0.37 0.42 0.48 0.48 0.45 0.41 0.38 0.37 0.14 0.16 0.16 0.2 0.26 0.29 0.29 0.28 0.28
Nd. Cycles 0.09 0.16 0.3 0.43 0.47 0.45 0.41 0.38 0.37 0.05 0.08 0.14 0.18 0.25 0.29 0.29 0.28 0.28
Nd. Harmonic 0.29 0.36 0.4 0.46 0.5 0.51 0.48 0.43 0.42 0.15 0.17 0.17 0.2 0.25 0.29 0.3 0.29 0.29
Nd. Harm. Ang. 0.28 0.38 0.45 0.51 0.54 0.53 0.5 0.46 0.45 0.14 0.17 0.18 0.22 0.29 0.33 0.33 0.32 0.32
Nd. 0.29 0.35 0.39 0.45 0.5 0.5 0.47 0.42 0.41 0.15 0.16 0.17 0.19 0.23 0.27 0.3 0.29 0.29
Nd. Improved 0.29 0.36 0.41 0.47 0.49 0.47 0.43 0.39 0.38 0.15 0.16 0.17 0.19 0.25 0.29 0.29 0.28 0.28
Closeness -0.25 -0.34 -0.4 -0.46 -0.46 -0.43 -0.39 -0.36 -0.36 -0.12 -0.14 -0.15 -0.2 -0.27 -0.29 -0.28 -0.27 -0.27
Closen. Norm. 0.28 0.19 0.09 0.03 0.08 0.15 0.14 0.09 0.09 0.15 0.1 0.07 -0.02 -0.07 -0.0 0.04 0.03 0.04
Seg. Density 0.39 0.45 0.5 0.54 0.53 0.49 0.43 0.39 0.38 0.17 0.18 0.19 0.22 0.29 0.32 0.31 0.29 0.28
Seg. Harmonic 0.39 0.42 0.47 0.52 0.57 0.56 0.51 0.45 0.44 0.16 0.17 0.19 0.22 0.28 0.32 0.32 0.31 0.31
Seg. Harm. Ang. 0.4 0.45 0.49 0.54 0.56 0.55 0.51 0.47 0.46 0.17 0.19 0.2 0.23 0.3 0.34 0.34 0.33 0.33
Seg. 0.38 0.41 0.46 0.52 0.56 0.55 0.5 0.44 0.43 0.16 0.17 0.19 0.21 0.26 0.31 0.32 0.31 0.31
Nd. Betw. 0.13 0.27 0.4 0.47 0.5 0.51 0.5 0.49 0.49 0.07 0.14 0.19 0.22 0.25 0.27 0.28 0.28 0.28
Nd. Betw. Ang. 0.13 0.27 0.4 0.46 0.49 0.49 0.48 0.47 0.47 0.07 0.14 0.19 0.22 0.25 0.27 0.27 0.26 0.26
Nd. Betw. 0.13 0.27 0.39 0.44 0.49 0.51 0.51 0.5 0.49 0.07 0.14 0.19 0.21 0.24 0.26 0.28 0.28 0.28
Seg. Betw. 0.36 0.38 0.42 0.46 0.49 0.5 0.5 0.49 0.49 0.16 0.17 0.19 0.21 0.24 0.27 0.28 0.28 0.28
Seg. Betw. Ang. 0.39 0.42 0.45 0.48 0.49 0.49 0.47 0.47 0.47 0.18 0.19 0.21 0.23 0.25 0.27 0.27 0.26 0.26
Mixed-uses as measured Mixed-uses normalised by street lengths
-1 Spearman correlations 1
Figure 17: Correlation grids comparing network centralities to distance weighted mixed-uses β = 0.013̄ / dmax = 300m
Hillq=0 as a proxy for high streets. Length normalised correlations normalise the number of mixed-uses by street
lengths.
3200
6400
8000
1600
3200
6400
8000
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.23 0.32 0.4 0.45 0.44 0.4 0.36 0.33 0.32 0.22 0.3 0.38 0.43 0.43 0.39 0.35 0.32 0.31
Nd. Cycles 0.07 0.15 0.29 0.41 0.44 0.4 0.36 0.33 0.32 0.07 0.14 0.28 0.39 0.42 0.39 0.35 0.32 0.31
Nd. Harmonic 0.23 0.29 0.35 0.41 0.45 0.45 0.43 0.38 0.37 0.22 0.28 0.33 0.39 0.44 0.44 0.41 0.37 0.36
Nd. Harm. Ang. 0.23 0.32 0.4 0.45 0.47 0.45 0.42 0.39 0.38 0.22 0.3 0.38 0.43 0.45 0.44 0.41 0.38 0.37
Nd. 0.23 0.28 0.34 0.42 0.47 0.46 0.42 0.37 0.36 0.22 0.27 0.32 0.4 0.45 0.44 0.41 0.36 0.35
Nd. Improved 0.23 0.3 0.38 0.45 0.46 0.42 0.38 0.34 0.33 0.22 0.28 0.36 0.43 0.44 0.41 0.37 0.33 0.32
Closeness -0.21 -0.3 -0.39 -0.44 -0.42 -0.38 -0.34 -0.32 -0.31 -0.2 -0.29 -0.37 -0.42 -0.41 -0.37 -0.33 -0.31 -0.3
Closen. Norm. 0.21 0.13 0.03 0.01 0.1 0.15 0.14 0.09 0.09 0.2 0.12 0.03 0.01 0.09 0.14 0.13 0.08 0.09
Seg. Density 0.27 0.34 0.42 0.49 0.49 0.43 0.38 0.34 0.33 0.25 0.31 0.39 0.46 0.47 0.42 0.37 0.33 0.32
Seg. Harmonic 0.26 0.3 0.36 0.43 0.49 0.48 0.44 0.39 0.38 0.24 0.28 0.34 0.41 0.46 0.47 0.43 0.38 0.37
Seg. Harm. Ang. 0.28 0.33 0.38 0.44 0.47 0.46 0.43 0.4 0.39 0.26 0.3 0.36 0.42 0.45 0.45 0.42 0.39 0.38
Seg. 0.26 0.29 0.36 0.44 0.5 0.49 0.45 0.39 0.38 0.24 0.27 0.33 0.41 0.47 0.48 0.43 0.38 0.37
Nd. Betw. 0.12 0.23 0.32 0.37 0.38 0.37 0.36 0.35 0.35 0.11 0.22 0.3 0.34 0.36 0.35 0.34 0.33 0.33
Nd. Betw. Ang. 0.11 0.23 0.32 0.36 0.37 0.36 0.34 0.33 0.33 0.11 0.22 0.3 0.34 0.35 0.34 0.32 0.31 0.31
Nd. Betw. 0.12 0.23 0.3 0.35 0.38 0.38 0.37 0.36 0.36 0.11 0.21 0.29 0.33 0.36 0.36 0.35 0.34 0.34
Seg. Betw. 0.23 0.25 0.3 0.35 0.37 0.37 0.36 0.35 0.35 0.21 0.23 0.28 0.33 0.35 0.35 0.34 0.33 0.33
Seg. Betw. Ang. 0.26 0.29 0.33 0.36 0.36 0.35 0.34 0.33 0.33 0.24 0.27 0.31 0.34 0.34 0.33 0.32 0.31 0.31
Mixed-uses as measured Mixed-uses normalised by street lengths
-1 Spearman correlations 1
Figure 18: Correlation grids comparing network centralities to weighted local food retail accessibility dmax = 300m as
a proxy for high streets. Length normalised correlations normalise the number of land-uses by street lengths.
21
Centrality methods at the pedestrian scale A P REPRINT
1600
3200
6400
8000
1600
3200
6400
8000
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.26 0.37 0.48 0.63 0.71 0.68 0.61 0.55 0.54 0.17 0.23 0.29 0.34 0.31 0.29 0.29 0.31 0.31
Nd. Cycles 0.07 0.14 0.32 0.56 0.69 0.67 0.6 0.55 0.53 0.06 0.1 0.21 0.33 0.32 0.29 0.3 0.32 0.32
Nd. Harmonic 0.27 0.35 0.43 0.54 0.66 0.71 0.69 0.63 0.61 0.18 0.22 0.26 0.31 0.33 0.33 0.33 0.33 0.33
Nd. Harm. Ang. 0.27 0.39 0.5 0.62 0.71 0.72 0.68 0.62 0.61 0.17 0.24 0.3 0.34 0.33 0.33 0.34 0.35 0.35
Nd. 0.27 0.33 0.41 0.53 0.66 0.73 0.7 0.63 0.6 0.18 0.21 0.25 0.31 0.34 0.32 0.31 0.31 0.31
Nd. Improved 0.27 0.36 0.45 0.6 0.71 0.71 0.64 0.57 0.56 0.18 0.22 0.28 0.34 0.32 0.3 0.3 0.31 0.31
Closeness -0.24 -0.36 -0.49 -0.64 -0.7 -0.65 -0.57 -0.53 -0.52 -0.15 -0.22 -0.29 -0.33 -0.28 -0.27 -0.29 -0.31 -0.31
Closen. Norm. 0.25 0.17 0.01 -0.1 -0.0 0.19 0.23 0.14 0.13 0.17 0.11 0.03 0.01 0.13 0.07 0.03 -0.01 0.01
Seg. Density 0.36 0.43 0.53 0.67 0.75 0.72 0.63 0.57 0.55 0.21 0.25 0.3 0.34 0.32 0.31 0.31 0.32 0.32
Seg. Harmonic 0.35 0.4 0.47 0.58 0.7 0.76 0.71 0.65 0.63 0.2 0.23 0.27 0.32 0.34 0.34 0.34 0.34 0.34
Seg. Harm. Ang. 0.38 0.43 0.51 0.61 0.7 0.72 0.68 0.63 0.62 0.22 0.25 0.29 0.32 0.32 0.33 0.35 0.35 0.36
Seg. 0.34 0.38 0.46 0.57 0.7 0.77 0.73 0.65 0.62 0.2 0.23 0.27 0.32 0.34 0.33 0.33 0.33 0.33
Nd. Betw. 0.12 0.26 0.39 0.47 0.52 0.51 0.49 0.47 0.47 0.08 0.17 0.25 0.29 0.29 0.28 0.27 0.27 0.27
Nd. Betw. Ang. 0.12 0.26 0.39 0.46 0.49 0.48 0.45 0.43 0.43 0.08 0.17 0.25 0.28 0.28 0.27 0.26 0.26 0.26
Nd. Betw. 0.12 0.25 0.37 0.44 0.5 0.52 0.5 0.48 0.47 0.08 0.17 0.24 0.28 0.29 0.28 0.28 0.28 0.28
Seg. Betw. 0.32 0.34 0.39 0.45 0.5 0.51 0.49 0.47 0.46 0.19 0.2 0.23 0.27 0.28 0.28 0.27 0.27 0.27
Seg. Betw. Ang. 0.36 0.39 0.43 0.47 0.48 0.47 0.44 0.43 0.43 0.21 0.23 0.26 0.27 0.27 0.26 0.26 0.26 0.26
Mixed-uses as measured Mixed-uses normalised by street lengths
-1 Spearman correlations 1
Figure 19: Correlation grids comparing network centralities to mixed-uses β = 0.004 / dmax = 1000m Hillq=0 as a
proxy for vibrant districts. Length normalised correlations normalise the number of mixed-uses by street lengths.
3200
6400
8000
1600
3200
6400
8000
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.23 0.34 0.45 0.62 0.72 0.7 0.62 0.55 0.54 0.17 0.25 0.33 0.44 0.48 0.47 0.44 0.42 0.41
Nd. Cycles 0.05 0.11 0.29 0.55 0.71 0.69 0.61 0.55 0.54 0.04 0.08 0.22 0.41 0.49 0.47 0.44 0.42 0.41
Nd. Harmonic 0.24 0.32 0.39 0.51 0.65 0.71 0.69 0.63 0.61 0.18 0.23 0.29 0.37 0.45 0.49 0.48 0.46 0.45
Nd. Harm. Ang. 0.24 0.35 0.47 0.6 0.7 0.72 0.67 0.62 0.6 0.17 0.26 0.34 0.43 0.48 0.49 0.48 0.46 0.45
Nd. 0.23 0.3 0.37 0.5 0.65 0.73 0.71 0.64 0.61 0.17 0.22 0.27 0.36 0.46 0.5 0.49 0.45 0.44
Nd. Improved 0.24 0.32 0.42 0.58 0.71 0.72 0.65 0.58 0.56 0.17 0.24 0.31 0.42 0.48 0.48 0.45 0.43 0.42
Closeness -0.21 -0.33 -0.46 -0.63 -0.71 -0.67 -0.58 -0.53 -0.52 -0.16 -0.24 -0.34 -0.45 -0.47 -0.45 -0.42 -0.41 -0.4
Closen. Norm. 0.22 0.14 -0.01 -0.14 -0.04 0.18 0.24 0.15 0.15 0.16 0.11 -0.01 -0.08 0.03 0.1 0.12 0.07 0.08
Seg. Density 0.31 0.39 0.5 0.65 0.76 0.73 0.64 0.57 0.55 0.22 0.27 0.35 0.45 0.5 0.49 0.46 0.43 0.42
Seg. Harmonic 0.31 0.35 0.43 0.54 0.68 0.75 0.72 0.65 0.63 0.21 0.25 0.3 0.38 0.46 0.51 0.5 0.47 0.46
Seg. Harm. Ang. 0.34 0.39 0.47 0.58 0.68 0.72 0.68 0.62 0.61 0.24 0.28 0.33 0.4 0.46 0.48 0.48 0.46 0.46
Seg. 0.3 0.34 0.41 0.53 0.69 0.77 0.74 0.66 0.63 0.21 0.24 0.29 0.38 0.47 0.51 0.5 0.47 0.46
Nd. Betw. 0.11 0.23 0.35 0.43 0.48 0.47 0.44 0.42 0.42 0.08 0.17 0.26 0.32 0.34 0.33 0.31 0.29 0.29
Nd. Betw. Ang. 0.11 0.23 0.35 0.42 0.45 0.43 0.4 0.38 0.38 0.08 0.17 0.26 0.31 0.32 0.3 0.28 0.27 0.27
Nd. Betw. 0.11 0.23 0.33 0.4 0.46 0.48 0.46 0.43 0.43 0.08 0.17 0.25 0.3 0.33 0.33 0.32 0.3 0.3
Seg. Betw. 0.28 0.3 0.35 0.41 0.46 0.47 0.44 0.42 0.41 0.19 0.21 0.25 0.29 0.32 0.32 0.3 0.29 0.29
Seg. Betw. Ang. 0.32 0.35 0.39 0.43 0.44 0.43 0.39 0.38 0.38 0.22 0.25 0.28 0.3 0.31 0.3 0.27 0.26 0.26
Mixed-uses as measured Mixed-uses normalised by street lengths
-1 Spearman correlations 1
Figure 20: Correlation grids comparing network centralities to commercial land-use accessibility dmax = 1000m as a
proxy for vibrant districts. Length normalised correlations normalise the number of land-uses by street lengths.
22
Centrality methods at the pedestrian scale A P REPRINT
1600
3200
6400
8000
1600
3200
6400
8000
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.14 0.15 0.12 0.08 0.03 0.01 0.0 0.0 -0.0 -0.15 -0.25 -0.38 -0.46 -0.41 -0.33 -0.27 -0.23 -0.22
Nd. Cycles 0.06 0.09 0.11 0.11 0.06 0.02 0.01 0.01 0.0 -0.02 -0.08 -0.22 -0.38 -0.38 -0.32 -0.26 -0.22 -0.21
Nd. Harmonic 0.16 0.18 0.16 0.14 0.1 0.06 0.03 0.02 0.01 -0.14 -0.21 -0.29 -0.38 -0.41 -0.39 -0.35 -0.29 -0.28
Nd. Harm. Ang. 0.15 0.19 0.18 0.17 0.13 0.1 0.08 0.06 0.06 -0.15 -0.24 -0.34 -0.4 -0.37 -0.32 -0.28 -0.25 -0.24
Nd. 0.16 0.17 0.16 0.13 0.08 0.04 0.01 0.0 0.0 -0.13 -0.19 -0.27 -0.39 -0.45 -0.42 -0.35 -0.29 -0.27
Nd. Improved 0.16 0.17 0.14 0.09 0.04 0.01 0.0 0.0 0.0 -0.14 -0.22 -0.34 -0.45 -0.44 -0.36 -0.3 -0.24 -0.23
Closeness -0.11 -0.13 -0.09 -0.06 -0.02 0.0 -0.0 -0.0 0.0 0.15 0.26 0.4 0.46 0.38 0.3 0.25 0.22 0.21
Closen. Norm. 0.15 0.12 0.13 0.11 0.1 0.05 0.01 -0.0 0.0 -0.11 -0.05 0.08 0.02 -0.16 -0.22 -0.17 -0.11 -0.1
Seg. Density 0.33 0.31 0.26 0.17 0.08 0.04 0.02 0.01 0.01 -0.12 -0.21 -0.34 -0.44 -0.4 -0.33 -0.27 -0.23 -0.22
Seg. Harmonic 0.33 0.32 0.31 0.28 0.21 0.13 0.08 0.04 0.03 -0.11 -0.17 -0.24 -0.32 -0.37 -0.36 -0.32 -0.28 -0.26
Seg. Harm. Ang. 0.34 0.33 0.31 0.26 0.2 0.14 0.11 0.08 0.07 -0.13 -0.19 -0.27 -0.33 -0.33 -0.3 -0.26 -0.24 -0.23
Seg. 0.33 0.32 0.31 0.26 0.18 0.09 0.05 0.02 0.02 -0.11 -0.15 -0.23 -0.35 -0.42 -0.4 -0.34 -0.28 -0.26
Nd. Betw. 0.07 0.15 0.23 0.27 0.28 0.3 0.31 0.32 0.32 -0.07 -0.14 -0.19 -0.23 -0.21 -0.17 -0.14 -0.13 -0.12
Nd. Betw. Ang. 0.07 0.15 0.24 0.27 0.29 0.31 0.32 0.32 0.32 -0.07 -0.14 -0.19 -0.21 -0.19 -0.15 -0.12 -0.1 -0.1
Nd. Betw. 0.07 0.15 0.23 0.26 0.28 0.29 0.31 0.32 0.32 -0.07 -0.13 -0.17 -0.21 -0.22 -0.2 -0.16 -0.14 -0.13
Seg. Betw. 0.33 0.32 0.32 0.3 0.3 0.3 0.32 0.32 0.32 -0.08 -0.1 -0.15 -0.2 -0.21 -0.18 -0.14 -0.12 -0.12
Seg. Betw. Ang. 0.33 0.32 0.32 0.31 0.31 0.32 0.33 0.33 0.33 -0.1 -0.14 -0.18 -0.19 -0.16 -0.14 -0.11 -0.1 -0.1
Randomised mixed-uses Randomised & length-normalised mixed-uses
-1 Spearman correlations 1
Figure 21: Correlation grids comparing network centralities to randomised local mixed-uses β = 0.013̄ / dmax = 300m
Hillq=0 . This plot is included for the purpose of demonstrating how access to mixed-uses can increase purely as a
function of the network (left) and how this can be controlled for by normalising by street lengths (right).
3200
6400
8000
1600
3200
6400
8000
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.34 0.45 0.51 0.53 0.45 0.32 0.25 0.21 0.2 -0.02 -0.06 -0.16 -0.36 -0.64 -0.73 -0.63 -0.54 -0.53
Nd. Cycles 0.13 0.21 0.35 0.5 0.47 0.34 0.26 0.21 0.2 0.04 0.02 -0.1 -0.28 -0.58 -0.7 -0.61 -0.53 -0.52
Nd. Harmonic 0.36 0.45 0.5 0.55 0.53 0.45 0.36 0.29 0.27 -0.01 -0.04 -0.09 -0.21 -0.44 -0.62 -0.65 -0.6 -0.59
Nd. Harm. Ang. 0.35 0.49 0.57 0.63 0.59 0.5 0.42 0.35 0.33 -0.02 -0.05 -0.13 -0.27 -0.47 -0.59 -0.57 -0.52 -0.51
Nd. 0.36 0.43 0.49 0.54 0.54 0.45 0.34 0.27 0.25 -0.01 -0.03 -0.08 -0.19 -0.42 -0.65 -0.71 -0.64 -0.62
Nd. Improved 0.36 0.45 0.5 0.54 0.49 0.36 0.27 0.22 0.21 -0.01 -0.04 -0.12 -0.3 -0.57 -0.73 -0.67 -0.57 -0.55
Closeness -0.3 -0.42 -0.48 -0.51 -0.4 -0.28 -0.22 -0.2 -0.19 0.03 0.07 0.19 0.4 0.69 0.72 0.59 0.51 0.51
Closen. Norm. 0.34 0.24 0.13 0.11 0.27 0.32 0.19 0.09 0.09 0.0 0.03 0.13 0.31 0.41 -0.02 -0.24 -0.2 -0.17
Seg. Density 0.61 0.66 0.71 0.7 0.56 0.38 0.28 0.23 0.22 0.06 0.01 -0.09 -0.3 -0.62 -0.72 -0.63 -0.54 -0.53
Seg. Harmonic 0.6 0.64 0.7 0.74 0.71 0.57 0.43 0.33 0.3 0.06 0.04 -0.02 -0.14 -0.38 -0.59 -0.63 -0.59 -0.58
Seg. Harm. Ang. 0.63 0.68 0.74 0.76 0.69 0.57 0.46 0.38 0.35 0.04 0.01 -0.05 -0.18 -0.41 -0.54 -0.54 -0.51 -0.5
Seg. 0.6 0.63 0.69 0.73 0.7 0.56 0.41 0.3 0.28 0.07 0.04 -0.01 -0.13 -0.37 -0.63 -0.69 -0.63 -0.61
Nd. Betw. 0.16 0.33 0.52 0.62 0.66 0.67 0.67 0.67 0.67 -0.02 -0.03 -0.03 -0.07 -0.12 -0.12 -0.07 -0.04 -0.03
Nd. Betw. Ang. 0.16 0.33 0.52 0.61 0.65 0.66 0.65 0.65 0.65 -0.02 -0.03 -0.03 -0.05 -0.08 -0.07 -0.02 0.0 0.01
Nd. Betw. 0.16 0.33 0.5 0.59 0.64 0.67 0.67 0.68 0.68 -0.02 -0.03 -0.02 -0.04 -0.09 -0.13 -0.1 -0.06 -0.05
Seg. Betw. 0.57 0.59 0.62 0.66 0.68 0.68 0.68 0.68 0.68 0.07 0.06 0.03 -0.02 -0.08 -0.11 -0.07 -0.03 -0.03
Seg. Betw. Ang. 0.6 0.63 0.66 0.68 0.68 0.67 0.66 0.66 0.66 0.05 0.03 0.01 -0.03 -0.06 -0.06 -0.01 0.01 0.02
Randomised mixed-uses Randomised & length-normalised mixed-uses
-1 Spearman correlations 1
Figure 22: Correlation grids comparing network centralities to randomised neighbourhood scale mixed-uses β = 0.004
/ dmax = 1000m Hillq=0 . This plot is included for the purpose of demonstrating how access to mixed-uses can increase
purely as a function of the network (left) and how this can be controlled for by normalising by street lengths (right).
23
Centrality methods at the pedestrian scale A P REPRINT
already present at smaller scales may mask the weakening of correlations at the peripheries of larger thresholds. Against
this backdrop, the majority of the measures achieve near to maximal correlations in the vicinity of 400m–1200m at
which point the correlations tend to stagnate or begin to decrease as thresholds further increase, indicating that street
segments sufficiently far away may start to muddy the relevancy of the centrality measure relative to the local land-use
context.
Nd. Harmonic 0.22 0.3 0.37 0.42 0.44 0.46 0.2 0.28 0.36 0.4 0.43 0.45
Nd. Harm. Ang. 0.23 0.34 0.43 0.46 0.49 0.5 0.21 0.3 0.4 0.43 0.45 0.46
Nd. 0.22 0.28 0.35 0.39 0.43 0.45 0.2 0.26 0.32 0.38 0.41 0.44
Nd. Betw. 0.1 0.24 0.36 0.4 0.41 0.42 0.09 0.21 0.3 0.33 0.34 0.35
Nd. Betw. Ang. 0.1 0.23 0.35 0.38 0.4 0.4 0.09 0.21 0.29 0.32 0.33 0.33
Nd. Betw. 0.1 0.24 0.34 0.38 0.39 0.41 0.09 0.21 0.29 0.31 0.33 0.34
mixed-uses dmax = 300m food retail dmax = 300m
1600
1600
100
200
400
800
100
200
400
800
50
50
Nd. Density 0.2 0.32 0.47 0.57 0.63 0.67 0.18 0.29 0.45 0.55 0.62 0.67
Nd. Harmonic 0.2 0.3 0.41 0.49 0.55 0.6 0.18 0.27 0.38 0.46 0.53 0.58
Nd. Harm. Ang. 0.21 0.34 0.48 0.56 0.62 0.65 0.18 0.3 0.45 0.54 0.6 0.64
Nd. 0.2 0.27 0.36 0.44 0.51 0.56 0.18 0.24 0.33 0.41 0.48 0.54
Nd. Betw. 0.09 0.21 0.35 0.41 0.44 0.45 0.07 0.19 0.32 0.37 0.4 0.42
Nd. Betw. Ang. 0.09 0.21 0.34 0.39 0.41 0.42 0.07 0.19 0.31 0.35 0.38 0.39
Nd. Betw. 0.09 0.21 0.33 0.37 0.4 0.42 0.07 0.19 0.3 0.34 0.37 0.39
mixed-uses dmax = 1000m commercial dmax = 1000m
-1 Spearman correlations 1
Figure 23: Correlation grids comparing dual network centralities to mixed-uses and land-use accessibilities on the
dual network. Weighted mixed-use richness (Hillq=0 ) and Food Retail at the smaller distance thresholds of β = 0.013̄
/ dmax = 300m serve as a proxy for high streets, whereas weighted mixed-use richness (Hillq=0 ) and Commercial
accessibilities at the larger distance thresholds of β = 0.004 / dmax = 1000m serve as a proxy for wider mixed-use
districts.
Use of the dual network appears to infer no tangible advantage (see Figure 23): correlations for the respective measures
are similar though generally weaker for betweenness measures. A further drawback in using the dual topology is that
segment-based methods cannot be used because this would incur repeated summations over duplicated segment lengths.
The preceding observations are based on the full topological network as derived from Ordnance Survey Open Roads;
however, it is sometimes beneficial or necessary to sample centralities at more finely spaced intervals on the road
network. The following considers the use of ‘decomposition’ as a strategy (Simons 2021c) whereby the road network
is subdivided into smaller links (edges) based on a maximum distance. Decomposition is here applied at the 100m,
50m, and 20m levels on the primal network, with the latter case ultimately yielding 1,097,510 nodes. Decomposition
of the network has a mixed impact on the strength of correlations, varying based on the degree of decomposition, the
distance threshold, shortest or simplest path heuristic, and the chosen measures (here plotted for selection of measures in
Figure 24). Gentler decomposition at 100m or 50m can boost correlations, whereas aggressive decomposition at 20m
is less advantageous. Visual clues to this dynamic can be found in the distributions of the measures at small thresholds:
correlations soften across-the-board for thresholds below 400m because the topology of any given network causes
shortest-path algorithms to terminate increasingly abruptly as the search distance thresholds decrease, thus leading to
artefacts in the distributions of the measures (see Figure 25). Accordingly, algorithms may terminate within one or two
steps on the network with the implication that centrality measures for nodes of degree = 2 would tend to cluster at
approximately twice the value of nodes of degree = 1, etc. Intermediate levels of decomposition (100m and 50m) can
reduce these artefacts, thus explaining the boost to correlations; contrarily, aggressive levels of decomposition (20m)
can have the opposite effect because the increasing proliferation of degree = 2 nodes can exacerbate peaks in the
distributions for the smallest distance thresholds. These issues tend to resolve once distance thresholds exceed 400m.
24
Centrality methods at the pedestrian scale A P REPRINT
0.8
0.6
spearman
0.8
0.6
spearman
Figure 24: Correlations for shortest-path network centrality measures to mixed-uses at varying levels of network
decomposition. The respective network centrality measures are compared to mixed-use richness (Hillq=0 ) at both
smaller (β = 0.013̄ / dmax = 300m) and greater (β = 0.004 / dmax = 1000m) distance thresholds.
5 Summary
Network analysis affords robust methods for assessing pedestrian-scale connectivity, a topic of paramount importance
given that many forms of contemporary urban development continue to enforce street networks hindering pedestrianism
and local access to rich assortments of land-uses. However, a confusing array of theories, methods, and considerations
exist on the analysis of street networks, and it is not immediately clear which of these might be the most applicable at
the pedestrian scale concerning the intensity and mix of land-uses. The preceding analysis compares correlations to
mixed-uses and land-use accessibilities for a range of network centralities: weighted and unweighted, on the primal and
dual network, and using shortest-path and simplest-path (angular) heuristics.
For pedestrian distances, closeness centralities offer stronger correlations to the general intensity and mix of local
land-uses, both at smaller (i.e. mixed-use high-streets) and larger (i.e. wider mixed-use districts) pedestrian distances.
Betweenness measures play a diminished role, though it could be hypothesised that betweenness is likely advantageous
in contexts with sufficiently high degrees of closeness centrality, as in the case of high-streets, but is relatively
meaningless in contexts with low closeness centralities, such as motorways. This observation contradicts the gist of
other studies, for example, Turner (2007) and Porta, Strano, et al. (2009), which have tended to favour betweenness
measures, and which may be attributable to any one of numerous factors that are present to varying degrees: comparisons
25
Centrality methods at the pedestrian scale A P REPRINT
80000
60000
40000
20000
0
no decomp. dmax = 50m no decomp. dmax = 100m no decomp. dmax = 200m no decomp. dmax = 400m
= 0.07 = 0.12 = 0.2 = 0.37
2 = 0.0 2 = 0.01 2 = 0.01 2 = 0.03
150000
100000
50000
0
50m decomp. dmax = 50m 50m decomp. dmax = 100m 50m decomp. dmax = 200m 50m decomp. dmax = 400m
= 0.2 = 0.34 = 0.54 = 0.93
2 = 0.01 2 = 0.01 2 = 0.04 2 = 0.12
200000
150000
100000
50000
0
20m decomp. dmax = 50m 20m decomp. dmax = 100m 20m decomp. dmax = 200m 20m decomp. dmax = 400m
Figure 25: Comparative distributions of node-based harmonic closeness at small distance thresholds thresholds,
compared across different levels of decomposition.
against traffic-counts as opposed to mixed-uses and land-use accessibilities; use of normalised closeness instead of
harmonic closeness; and smaller geographic extents of analysis.
Explicitly weighted β measures are arguably preferable to unweighted variants because they retain robust correlations
against mixed-uses and land-use accessibilities, but with an explicitly configurable spatial impedance offering a
substantial degree of local emphasis. The use of simplest-path measures can infer a slight advantage for correlations
against mixed-uses as typified by high-streets, though these advantages are not universal and tend to dissipate for
neighbourhood-scale mixed-uses and land-use accessibilities. Thus, the simplest-path premise that visual properties of
the street network are core determinants in the evolution of land-uses and the broader patterns of activities in cities
appears more suited to high-streets than expansive mixed-use districts where a more agglomerative and space-filling
(fractal) logic is at work.
Regardless of the measure, near-maximal correlations for ‘high-street’ mixed-uses are encountered by distance thresh-
olds in the vicinity of 800m and for district-wide mixed-uses by ≈ 1200m, thus indicating little benefit to the use of
larger thresholds given their increasing computational demands and the increasing effects of the Modifiable Area Unit
Problem, which tends to boost correlations at larger thresholds. Conversely, centrality measures start breaking down at
distance thresholds smaller than 400m due to increasingly exposed artefacts arising in the distributions of the measures.
Within the context of pedestrian-scale mixed-uses and land-use accessibilities for Greater London, there appears to
be no tangible benefit to the use of dual topologies. The greater resolution afforded by decomposition is beneficial
for fine-scaled urban analysis, and moderate decomposition levels (50m–100m) may also tend to boost correlations.
26
Centrality methods at the pedestrian scale A P REPRINT
However, this requires consideration of the trade-off between the level of local precision required as opposed to greater
computational demands.
It should be recognised that the various approaches: whether primal or dual, weighted or not, shortest or simplest,
node-based or segment-based, closeness-like, betweenness-like, or otherwise more esoteric, all have some degree of
relevancy, and the differences in the correlations are infrequently drastic. Therefore, either-or arguments should be
avoided in recognition that multiple valid approaches exist and these may differ based on the context or the intended
form of subsequent modelling. For the same reasons, models considering the impact of network centralities will benefit
from dimensionality reduction to disentangle the contributions and collinearities of different methods at different scales,
or else the use of non-linear machine learning methods to exploit potential nuances such as the interplay between
smaller and larger thresholds, or synergies arising from the overlap of closeness, betweenness, and other forms of
centrality measures. The gist of this approach is further explored in associated papers on dimensionality reduction
applied to urban morphological data (Simons 2021d) and the detection of urban archetypes using pedestrian scale
morphological information (Simons 2021a).
6 Acknowledgements
6.1 PhD
This paper derives from the author’s PhD research at the Centre for Advanced Spatial Analysis, University College
London. The author wishes to acknowledge their PhD supervisors, Dr. Elsa Arcaute and Prof. Michael Batty, for their
gracious support and feedback throughout the development of this work. The author takes sole responsibility for any
oversights or shortcomings contained within this paper.
6.2 Data
The geographical plots and statistical figures in this document have been prepared with use of the following sources of
data:
7 Citations
Albert, Réka and Albert-László Barabási (2002). “Statistical mechanics of complex networks”. In: REVIEWS OF
MODERN PHYSICS 74.January, pp. 47–97.
Alexander, Christopher (1964). Notes on the Synthesis of Form. London: Harvard University Press. ISBN: 0674627512.
– (1967). “A City is Not a Tree”. In: Ekistics 23.139, pp. 344–348.
Arcaute, Elsa et al. (2016). “Cities and regions in Britain through hierarchical percolation”. In: Royal Society Open
Science 3. DOI: 10.1098/rsos.150691.
Baradaran, Siamak and Farideh Ramjerdi (2001). “Performance of Accessibility Measures in Europe”. In: Journal of
Transportation and Statistics 4.2/3, pp. 31–48.
Barthelemy, Marc (Aug. 2015). “From paths to blocks: New measures for street patterns”. In: Environment and Planning
B: Urban Analytics and City Science 44.2, pp. 256–271. ISSN: 2399-8083. DOI: 10.1177/0265813515599982.
Bates, John (2007). “History of Demand Modelling”. In: Handbook of Transport Modelling. Ed. by David Hensher and
Kenneth Button. 2nd ed. Elsevier Science. Chap. 2, pp. 11–34.
Batty, Michael (2004). “Distance in Space Syntax”. In: Working Paper Series 04.04, p. 32.
– (2013). The New Science of Cities. Cambridge, MA: MIT Press.
27
Centrality methods at the pedestrian scale A P REPRINT
Cooper, Crispin H V (2015). “Spatial localization of closeness and betweenness measures: a self-contradictory but
useful form of network analysis”. In: International Journal of Geographical Information Science 29.8, pp. 1293–1309.
DOI : 10.1080/13658816.2015.1018834.
Corbusier, Le (1967). The radiant city : elements of a doctrine of urbanism... New York: Orion Press.
Dalton, Nick (2001). “Fractional Configurational Analysis And a solution to the Manhattan problem”. In: 3rd Interna-
tional Space Syntax Symposium. Atlanta.
Duncan, Dustin T et al. (2011). “Validation of Walk Score ® for Estimating Neighborhood Walkability: An Analysis
of Four US Metropolitan Areas”. In: Int. J. Environ. Res. Public Health 8, pp. 4160–4179. DOI: 10 . 3390 /
ijerph8114160.
Ellin, Nan (1999). Postmodern urbanism. eng. Rev. ed. New York: New York : Princeton Architectural Press. ISBN:
156898135x.
Flint, Anthony (2011). Wrestling with Moses: How Jane Jacobs Took on New York’s Master Builder and Transformed
the American City. New York: Random House.
Fotheringham, A S and D W S Wong (July 1991). “The Modifiable Areal Unit Problem in Multivariate Statistical
Analysis”. In: Environment and Planning A: Economy and Space 23.7, pp. 1025–1044. ISSN: 0308-518X. DOI:
10.1068/a231025.
Freeman, Linton C (1977). “A Set of Measures of Centrality Based on Betweenness”. In: Sociometry 40.1, pp. 35–41.
Garnier, Tony (1989). Une cité industrielle : études pour la construction des villes<span class="frbrsuppress"> / .
fre. New York: New York : Princeton University Press. ISBN: 0910413479.
Gil, Jorge (2017). “Street network analysis ’edge effects’: Examining the sensitivity of centrality measures to boundary
conditions”. In: Environment and Planning B: Urban Analytics and City Science 44.5. ISSN: 23998091. DOI:
10.1177/0265813516650678.
Graham, Stephen and Simon Marvin (2001). Splintering Urbanism: Networked Infrastructures, Technological Mobilities
and the Urban Condition. New York: Routledge.
Greenfield, Adam (2013). Against the smart city. 1.2. Author.
Handy, S. L. and D. A. Niemeier (1997). “Measuring accessibility: An exploration of issues and alternatives”. In:
Environment and Planning A. ISSN: 0308518X. DOI: 10.1068/a291175.
Handy, Susan and Kelly Clifton (2001). “Evaluating Neighborhood Accessibility: Possibilities and Practicalities”. In:
Journal of Transportation and Statistics 4.2/3, pp. 67–78.
Hansen, Walter G (1959). “How Accessibility Shapes Land Use”. In: JOURNAL OF THE AMERICAN INSTITUTE OF
PLANNERS 25.2, pp. 73–76. DOI: 10.1080/01944365908978307.
Harris, Britton (2001). “Accessibility: Concepts and Applications”. In: Journal of Transportation and Statistics 4.2/3,
pp. 15–30.
Harvey, David (1989). The condition of postmodernity : an enquiry into the origins of cultural change. eng. Oxford:
Oxford : Basil Blackwell. ISBN: 0631162941.
Hilberseimer, Ludwig (1955). The nature of cities : origins, growth and decline, pattern and form, planning problems.
Hill, Dan (2013). On the smart city; Or, a ’manifesto’ for smart citizens instead.
Hillier, B and S Iida (2005). “Network and psychological effects in urban movement”. English. In: Spatial Information
Theory, Proceedings 3693, pp. 475–490. ISSN: 0302-9743.
Hillier, Bill and Julienne Hanson (1984). The Social Logic of Space. Cambridge: Cambridge University Press.
Hillier, Bill, Alasdair Turner, et al. (2007). “METRIC AND TOPO-GEOMETRIC PROPERTIES OF URBAN STREET
NETWORKS”. In: Proceedings, 6th International Space Syntax Symposium. Istanbul.
Iacono, Michael et al. (2008). “Access to Destinations: How Close is Close Enough? Estimating Accurate Distance
Decay Functions for Multiple Modes and Different Purposes”. In.
Jacobs, J (1969). The Economy of Cities. Vintage Bo. New York: Random House.
Jacobs, Jane (1961). The Death and Life of Great American Cities. Vintage Bo. New York: Random House.
Lyon, David (1999). Postmodernity. Second Edi. Minnesota: University of Minnesota Press.
Marchiori, Massimo and Vito Latora (2000). “Harmony in the small-world”. In: Physica A 285.89, pp. 539–546.
Marshall, Stephen (2009). Cities Design and Evolution. New York: Routledge.
Marshall, Stephen et al. (2018). “Street Network Studies: from Networks to Models and their Representations”. In:
Networks and Spatial Economics. ISSN: 1572-9427. DOI: 10.1007/s11067-018-9427-9.
Masucci, A Paolo and Carlos Molinero (2016). “Robustness and closeness centrality for self-organized and planned
cities”. In: Eur. Phys. J. B 89.53. DOI: 10.1140/epjb/e2016-60431-2.
Omer, Itzhak and Nir Kaplan (Feb. 2018). “Structural properties of the angular and metric street network’s centralities
and their implications for movement flows”. In: Environment and Planning B: Urban Analytics and City Science
46.6, pp. 1182–1200. ISSN: 2399-8083. DOI: 10.1177/2399808318760571.
Penn, A et al. (1998). “Configurational modelling of urban movement networks”. In: Environment and Planning B:
Planning and Design 25, pp. 59–84.
Pope, Albert (1996). Ladders. Princeton: Princeton Architectural Press, p. 208.
28
Centrality methods at the pedestrian scale A P REPRINT
Porta, Sergio, Paolo Crucitti, and Vito Latora (2006a). “The network analysis of urban streets: A dual approach”. In:
Physica A 369, pp. 853–866. DOI: 10.1016/j.physa.2005.12.063.
– (2006b). “The Network Analysis of Urban Streets: A Primal Approach”. In: Environment and Planning B: Planning
and Design 33, pp. 705–725. DOI: 10.1068/b32045.
Porta, Sergio, Emanuele Strano, et al. (Jan. 2009). “Street Centrality and Densities of Retail and Services in Bologna,
Italy”. In: Environment and Planning B: Planning and Design 36.3, pp. 450–465. ISSN: 0265-8135. DOI: 10.1068/
b34098.
Ratti, Carlo (2004). “Space syntax: some inconsistencies”. In: Environment and Planning B: Planning and Design 31,
pp. 487–499. DOI: 10.1068/b3019.
Reyer, Maren et al. (2014). “Walkability is only part of the story: Walking for transportation in Stuttgart, Ger-
many”. In: International Journal of Environmental Research and Public Health. ISSN: 16604601. DOI: 10.3390/
ijerph110605849.
Rochat, Yannick (2009). “Closeness Centrality Extended To Unconnected Graphs : The Harmonic Centrality Index”.
Zurich.
Rosvall, M et al. (2005). “Networks and Cities: An Information Perspective”. In: Physical Review Letters 94. DOI:
10.1103/PhysRevLett.94.028701.
Rutherford, Scott (1979). “Use of the Gravity Model for Pedestrian Travel Distribution”. In: Transportation Research
Record 728, pp. 53–59.
Sabidussi, Gert (1966). “The Centrality Index of a Graph”. In: Psychometrika 31.4, pp. 581–603.
Scheurer, Jan and Carey Curtis (2007). “Accessibility Measures: Overview and Practical Applications”.
Serra, Miguel and Bill Hillier (2019). “Angular and Metric Distance in Road Network Analysis: A nationwide correlation
study”. In: Computers, Environment and Urban Systems. ISSN: 01989715. DOI: 10.1016/j.compenvurbsys.
2018.11.003.
Sevtsuk, Andres, Raul Kalvo, and Onur Ekmekci (2016). “Pedestrian accessibility in grid layouts: The role of block,
plot and street dimensions”. In: Urban Morphology. ISSN: 10274278. DOI: 10.1080/10464883.2012.714912.
Sevtsuk, Andres and Michael Mekonnen (2012). “Urban network analysis: A new toolbox for ArcGIS”. In: RIG 22,
pp. 287–305.
Shimbel, Alfonso (1953). “Structural Parameters of Communication Networks”. In: Bulletin of Mathematical Physics
15, pp. 501–507.
Simons, Gareth D. (June 2021a). “Prediction of ’artificial’ urban archetypes at the pedestrian-scale through a synthesis
of domain expertise with machine learning methods”. In: arXiv: 2106.15364.
– (June 2021b). “The application of mixed-use measures at the pedestrian-scale”. In: arXiv: 2106.14048.
– (June 2021c). “The cityseer Python package for pedestrian-scale network-based urban analysis”. In: arXiv: 2106.
15314.
– (June 2021d). “Untangling urban data signatures: unsupervised machine learning methods for the detection of urban
archetypes at the pedestrian scale”. In: arXiv: 2106.15363.
Sterling, Bruce (2014). The Epic Struggle of the Internet of Things. First Edit. Strelka Institute for Media, Architecture
and Design.
Townsend, Anthony M. (2013). Smart Cities: Big Data, Civic Hackers, and the Quest for a New Utopia. New York: W.
W. Norton & Company.
Turner, Alasdair (2000). “Angular Analysis: A Method for the Quantification of Space”. London.
– (2007). “From axial to road-centre lines: a new representation for space syntax and a new model of route choice
for transport network analysis”. In: Environment and Planning B: Planning and Design 34, pp. 539–555. DOI:
10.1068/b32067.
Turner, Alasdair and Nick Dalton (2005). “A Simplified Route Choice Model Using the Shortest Angular Path
Assumption”. London.
Turner, Alasdair, Alan Penn, and Bill Hillier (2005). “An algorithmic definition of the axial map”. In: Environment and
Planning B: Planning and Design 32, pp. 425–444. DOI: 10.1068/b31097.
Viana, Matheus P et al. (2013). “The simplicity of planar networks”. In: Scientific Reports 3.3495. DOI: 10.1038/
srep03495.
Wasserman, Stanley and Katherine Faust (1994). Social Network Analysis. Cambridge: Cambridge University Press.
29