Kernel Density Estimation of Traffic Accidents in A Network Space
Kernel Density Estimation of Traffic Accidents in A Network Space
Kernel Density Estimation of Traffic Accidents in A Network Space
TopSCHOLAR
Geography/Geology Faculty Publications
9-2008
Jun Yan
Western Kentucky University, [email protected]
This Article is brought to you for free and open access by TopSCHOLAR. It has been accepted for inclusion in Geography/Geology Faculty
Publications by an authorized administrator of TopSCHOLAR. For more information, please contact [email protected].
Abstract: A standard planar Kernel Density Estimation (KDE) aims to produce a smooth
density surface of spatial point events over a 2-D geographic space. However the planar
KDE may not be suited for characterizing certain point events, such as traffic accidents,
which usually occur inside a 1-D linear space, the roadway network. This paper presents
a novel network KDE approach to estimating the density of such spatial point events.
One key feature of the new approach is that the network space is represented with basic
linear units of equal network length, termed lixel (linear pixel), and related network
topology. The use of lixel not only facilitates the systematic selection of a set of regularly
spaced locations along a network for density estimation, but also makes the practical
application of the network KDE feasible by significantly improving the computation
efficiency. The approach is implemented in the ESRI ArcGIS environment and tested
with the year 2005 traffic accident data and a road network in the Bowling Green,
Kentucky area. The test results indicate that the new network KDE is more appropriate
than standard planar KDE for density estimation of traffic accidents, since the latter
covers space beyond the event context (network space) and is likely to overestimate the
density values. The study also investigates the impacts on density calculation from two
kernel functions, lixel lengths, and search bandwidths. It is found that the kernel function
is least important in structuring the density pattern over network space, whereas the lixel
length critically impacts the local variation details of the spatial density pattern. The
search bandwidth imposes the highest influence by controlling the smoothness of the
spatial pattern, showing local effects at a narrow bandwidth and revealing hot spots at
larger or global scales with a wider bandwidth. More significantly, the idea of
representing a linear network by a network system of equal-length lixels may potentially
lead the way to developing a suite of other network related spatial analysis and modeling
methods.
Keywords: Network Space, Lixel, Kernel Density Estimation, Traffic Accidents, Hot
Spots
1. Introduction
To reduce traffic accidents and improve road safety, it is crucial to understand
how, where and when traffic accidents occurred. An improved understanding of spatial
patterns of traffic accidents can make accident reduction efforts more effective. For
instance, by knowing where and when traffic accidents usually occur, law enforcement
can conduct more efficient patrols and highway departments can disseminate more
effectively to drivers the critical information about roadway conditions. In reality, the
occurrences of traffic accidents are seldom random in space and time. In most cases,
traffic accidents form clusters (known as hot spots) in geographic space. This is
because the occurrence of traffic accidents along a certain roadway segment is largely
determined by its traffic volume, which is well-known to exhibit distinct spatial and
temporal patterns (Black, 1991). There are some other important factors that may impact
the distribution of traffic accidents, including natural and environmental characteristics
such as physical environment (steep slope, sharp turn), weather (rain, snow, wind, and
fog), configuration of highway networks such as locations of access and egress points,
deficient design and maintenance of highways, etc. All of these factors more or less are
associated with distinct spatial patterns as well.
Spatial analysis of point events, known as point pattern analysis (PPA), has been
widely examined by spatial scientists and a variety of methods have been developed for
detecting hot spots of point events. The PPA methods can be classified into two broad
categories (Bailey & Gatrell, 1995; OSullivan & Unwin, 2002): (1) Methods examining
the first-order effects of a spatial process, (2) Methods examining the second-order
effects of a spatial process. The first group focuses on the underlying properties of point
events and measures the variation in the mean value of the process. It includes methods
such as quadrat count analysis, kernel density estimation and etc. The second group
mainly examines the spatial interaction (dependency) structure of point events for spatial
patterns, and includes methods such as nearest neighbor statistics, G function, F function,
K function and etc. Out of these two categories of methods, Kernel Density Estimation
(KDE) is one of the most popular methods for analyzing the first order properties of a
point event distribution (Silverman, 1986; Bailey & Gatrell, 1995) partially because it is
easy to understand and implement. Some KDE tools are already made available in some
leading commercial GIS software, e.g. the Spatial Analyst Extension of ESRIs ArcGIS,
as well as some popular spatial statistical analysis software, such as CrimeStat (Levine,
2004). The planar KDE has been used widely for traffic accidents hot spots analysis
and detection. The recent examples include study of urban cyclists traffic hazard intensity
(Delmelle & Thill, in press), pedestrian crash zones detection (Pulugurtha, Krishnakumar,
& Nambisan, 2007), wildlife---vehicle accident analysis (Krisp & Durot, 2007), highway
accident hot spot analysis (Erdogan et al., 2008) and etc. The purpose of KDE is to
produce a smooth density surface of point events over space by computing event intensity
as density estimation. In planar KDE, the space is characterized as a 2-D homogeneous
Euclidian space and density is usually estimated at a large number of locations that are
regularly spaced (a grid). However, in analyzing the spatial pattern of traffic accidents,
which usually occur on roadways and inside a network, the assumption of homogeneity
of 2-D space does not hold and the relevant KDE methods are not readily applicable.
Many other types of human-induced point events also exhibit similar property in that
their distributions are constrained to only the network portion (network space) of the 2-D
Euclidean space, such as residential houses, commercial sites, street lights, moving
vehicles, and etc. In short, the uniformity of 2-D space is basically too strong an
assumption for the analysis of point events occurring in 1-D infinite space (Miller, 1999a).
Special considerations are thus needed for measuring such point events occurring in
network spaces1.
In the early 1990s, spatial scientists started to realize the limitations of spatial
methods originated in the 2-D Euclidean space when applying them directly to networkconstrained phenomena. A large number of studies have been conducted since then in a
variety of application domains, attempting to extend the conventional 2-D spatial
methods to network spaces, including network autocorrelation (Black, 1992; Black &
Thomas, 1998), network Huff model of market area analysis (Miller, 1994; Okabe &
Kitamura, 1996; Okabe & Okunuki, 2001), network distance-decay (Kent et al., 2005),
space-time accessibility measures (Kwan, 1998; Miller, 1999b), space-time clustering
(Black, 1991) and etc. In particular, increased attentions are recently paid to the
applications of standard spatial statistical methods in analyzing spatial point events in a
network space (Okabe et al., 1995; Okunuki & Okabe, 1998; Okabe & Yamada, 2001;
Yamada & Thill, 2004; Lu & Chen, 2007; Yamada & Thill, 2007). For instance, the Kfunction (another popular PPA method), has been extended to network spaces. A network
version of K-function and its computational implementation are described in Okabe and
Yamada (2001). To examine the advantages of network K-function, Yamada & Thill
(2004) compared three versions of K-functions in their ability of analyzing traffic
In this paper, a network space is defined as a simplified and abstracted representation of the real world
road network, which occupies a portion of the actual 2-D geographic space. It is represented as 1-D lines
and line-intersections. The roadway width, traffic direction, and multi-lane properties are not considered for
simplicity of concept demonstration.
SANET software tools can be obtained free of charge upon written request.
See Yamada & Thill (2007) for a typology of situations involving planar and network
spaces in spatial analysis. In addition, the spatial pattern of such kind of point events is
better measured with density values per linear unit over a network instead of per area unit
over a 2-D space. Indeed, in real-world applications, the density of traffic accidents is
often reported as the number of accidents over a defined linear unit (e.g. per mile) rather
than per area unit (e.g. per square mile).
This paper presents a novel network KDE approach to estimating the density of
traffic accidents strictly over a network space. As secondary objectives, the study also
investigates the impacts on density calculation from two different kernel functions, lixel
lengths, and search bandwidths. Although developed initially for traffic accidents, this
new approach could be used to examine the spatial pattern of any point events, as far as
their distribution is limited within network spaces. The remainder of the paper is
organized as follows. The basic concepts of network KDE are discussed in Section 2. The
computational algorithm is detailed in Section 3 along with discussions of some
implementation issues. In Section 4, we present a case study with a real road network and
traffic accident data. Discussions are made and some conclusions are drawn in Section 5.
1 d is
k( )
2
r
r
i =1
( s) =
(1)
d
k ( is ) =
r
k(
1
d
exp( is 2 ) ,
2r
2
d is
) = 0 , when dis > r
r
d
d
k ( is ) = K (1 is2 ) ,
r
r
k(
d is
) = 0 , when dis > r
r
Where K is a scaling factor and its purpose is to ensure the total volume under
Quartic curve is 1. The common values used for K include
and
3
, i.e.
4
d
3
d
d is
d is 2
3
k ( ) = (1 2 ) or k ( is ) = (1 is2 )
r
r
r
4
r
(3) Minimum variance function
2
k(
d is
3
d
) = ( 3 5 is2 ) ,
r
8
r
k(
d is
) = 0 , when dis > r
r
A wealth of literature has examined the effects of the two key parameters of
planar KDE, i.e. kernel function k and search bandwidth r, on the resultant density pattern.
There exists a consensus that the choice of the kernel function k is less important than the
choice of search bandwidth r (Silverman, 1986; Bailey & Gatrell, 1995; OSullivan &
Unwin, 2002; Schabenberger & Gotway, 2005; O' Sullivan & Wang, 2007). It is also
agreed that the value of search bandwidth r usually determines the smoothness of the
estimated density -- the larger the r the smoother is the estimation.
10
With the linear nature of network spaces in mind, this paper proposes to use the
following form of kernel density estimator for the density estimation of networkconstrained point events, such as traffic accidents, in a network space:
n
1 d is
)
r
r
i =1
(s) = k (
Instead of calculating the density over an area unit, the equation estimates the
density over a linear unit. Any of the three forms of kernel functions discussed previously
may be used. As shown in previous sub-sections, many earlier studies in planar KDE
have concluded that the choice of kernel function is not as important as the choice of
bandwidth r. We conjecture that the relative significance of the two parameters on
density estimation might not change much in the new network KDE, since there is no
particular reason for them to perform differently. To confirm our conjecture, we do
implement two kernel functions, Gaussian and Quartic functions, and compare their
impacts on the resultant density pattern in our case study. To verify the role of search
bandwidth in network KDE, we also examine how the density pattern will be impacted
when different search bandwidth are chosen.
It is necessary to emphasize that the network KDE differs from the planar KDE in
several aspects: (1) the network space is used as the point event context, (2) both search
bandwidth and kernel function are based on network distance (calculated as the shortestpath distance in a network) instead of straight-line Euclidean distance, and (3) density is
measured per linear unit. These differences are illustrated in a graphic form in Figure 1. It
can be noticed that network KDE is a 1-D measurement while planar KDE is a 2-D one.
As a result, the actual density values estimated by them would be very different for the
11
same point events dataset. This illustrative example also suggests that a Planar KDE
could possibly over-detect clustered pattern , with four traffic accidents falling within the
search bandwidth and hence included in density estimation for the focal point (x) in the
case of Planar KDE, while only two in the case of Network KDE.
Create a segment-based linear reference system out of the original road network,
with each segment being a line segment between two neighboring road
intersections. A dangling line segment between a road intersection and a
neighboring road end point is also a segment. If there are multiple links between
two intersections, each link is treated as a separate segment.
(2)
Divide each segment into basic linear units of a defined network length l. A basic
linear unit is equivalent to a cell in a 2-D raster grid. For description simplicity,
we call it lixel (a contraction of the words linear and pixel) in the paper. The use
of lixel not only facilitates the systematic selection of a set of regularly spaced
locations along a network for density estimation, but also makes the practical
application of the network KDE feasible by significantly improving the
computation efficiency. The residual of the division (if there is one), i.e. the last
12
lixel with length shorter than l, is a partial lixel, the processing of which will be
detailed later. The intersection point of two lixels is called lxnode.
(3)
(4)
Create the center points of all the lixels. They are termed lxcenters.
(5)
Select a point process (traffic accidents in this study), which has to be a type of
point events occurring within the network space.
(6)
For each point event, find its nearest lixel. The total number of events nearest to a
lixel is counted and assigned to the lixel as a property. Those lixels with one or
more accidents assigned to them are used as source lixels. Aggregating events to
lixels is one important step for improving the computational efficiency of the
algorithm. The possible impacts and future research plans are further described in
the conclusions and discussions section.
(7)
(8)
Calculate the shortest-path network distance from the lxcenter of each source lixel
to lxcenters of all its neighboring lixels within the search bandwidth r. It should
be noted not only the first order nearest neighbor lixels, but all the neighbor lixels
with network distance not farther than r are taken into consideration.
(9)
At the lxcenter of each source lixel and all its neighboring lixels, calculate a
density value based on a selected kernel function, the network distance, and the
number of events on the source lixel.
13
(10)
At the lxcenter of each lixel within the search bandwidth of any source lixels, sum
the density values from different source lixels and assign the total density to the
lixel; for all other lixels, the density value is zero by default.
14
One issue to notice in the segmentation process is that the length of a segment (e.g.
51 meters) may not be exactly an integer number times of a defined lixel length (e.g. 10
meters). Hence, a residual lixel with length, here 1 m, shorter than the defined lixel length
often results for this kind of segment. These residual lixels do not actually present any
issues in implementation. The density values at the lxcenters of these residual lixels is
calculated in the same way as for regular lixels, based on the same kernel functions and
the actual network distance from their lxcenters to the centers of the source lixels,
although the network distance will not be integer number times of the defined lixel
length. A minor caveat is that the resolutions (length) are different between the regular
and residual lixels, however, the overall density pattern should not be affected because
residual lixels generally only amounts to a very small proportion of the entire inventory
of lixels. Further, as far as the lixel length is sufficiently short (e.g. 10 m), including or
excluding a single residual lixel in a hot spot may have trivial effects on the hot spots
detection even at local scales.
Another related issue is how to determine whether a lixel lies within the search
bandwidth from a source lixel. There are at least two different options when a lixel could
be labeled within a search bandwidth: (1) if the network distance from the lxcenter of a
source lixel to the farthest end point of the lixel is shorter than or equal to the search
bandwidth; or (2) if the network distance from the lxcenter of a source lixel to the
lxcenter of the lixel is shorter than or equal to the search bandwidth. The two options may
perform a little differently for the marginal neighbor lixels, but the impacts should be
trivial. In this paper, we adopted the second option.
15
16
calculation, including kernel functions, lixel length, and search bandwidth. Two kernel
functions, Gaussian and Quartic, are compared at a fixed search bandwidth of 100 m for
local details and a search bandwidth of 1000 m for the overall pattern, both with a lixel
length of 10 m. Four versions of segmentation scenarios, with the lixel length at 5 m, 10
m, 50 m, and 100 m respectively, are tested at a fixed search bandwidth of 100 m and
with a Gaussian kernel function. We also examine the impacts of search bandwidth at
local and larger spatial extents. Total six search bandwidths are used, including 20 m, 100
m, 250 m, 500 m, 1000 m, and 2000 m, all with the same lixel length (10 m) and a
Gaussian kernel function.
17
much informative pattern of accident likelihood. However the density maps of the planar
KDE and network KDE are rather different, although some common hot (high value)
regions can be identified in both maps. The density surface by the planar KDE
exhaustively fills out the entire study area (although some areas with value of zero) while
that by the network KDE only covers the space occupied by the street network. In
addition, the maps indicate that the planar KDE is likely to overestimate the density
values in comparison to the network KDE. In Figures 4-c and 4-d, the density value for
the planar KDE can reach as high as over 2000 (per km2) (Figure 4-c), in comparison, the
highest value for the network KDE is only about 140 (per km) (Figure 4-d).
18
present the impacts of the search bandwidth (20 m, 100 m, 250m, 500 m, 1000 m, and
2000 m respectively) on the overall density pattern for the whole study area. It appears
that the narrow bandwidths (20 m, 100 m, and 250 m) may produce patterns suitable for
presenting local effects or hot spots at smaller scales. As the search bandwidth
increases from 20 m to 2000 m, the local hot spots are gradually combined with their
neighbors, and larger clusters appear. The maps at wider search bandwidths (500 m, 1000
m, 2000 m) seemingly give better sense of locations of the hot spots at larger spatial
scales.
20
relatively higher estimated density values of traffic accidents. And several major
intersections can be visually identified as hot spots of traffic accidents.
21
pattern. A narrower search bandwidth could reveal local effects, whereas a wider
bandwidth makes hot spots much more obvious at large spatial scales.
It should be pointed out that the proposed algorithm only calculates the density
value at the center point of each lixel and uses that value to represent the whole lixel.
Although we can certainly compute density values for as many points along a linear
segment as possible, we argue that it is appropriate to use the center point density value
to approximate the density value over the lixel of a reasonable length, mainly because
there are infinite points on a lixel of any length and a certain degree of simplification and
abstraction is always needed for a practical application. Similar simplification and
abstraction are not uncommon. For example, in the 2-D space raster representation, the
center value of a grid cell has often been used to approximate the value of the whole cell.
In addition, an equal-length segmentation approach is used in the current algorithm. In
the future study, it may be interesting to examine other segmentation scenarios with a set
of varied lixel lengths, e.g. shorter length for central urban, longer for rural areas, and so
on.
It is very tempting to recommend an optimal lixel length. However, we avoid
doing so since a universally applicable lixel length may not exist, like the cell resolution
in a 2-D raster representation. In remote sensing, imagery of a particular resolution is
chosen to suit specific application context, and it should be so in the network KDE. The
length of lixels must be carefully chosen based on application context. The inherent data
quality of the road network and accident locations is also an important factor to take into
consideration. In traffic accident, a 10-m lixel length should be sufficiently short if not
optimal given the length of a vehicle, the direct impact area of an accident, and the
22
location accuracy in measurement process. This lixel length may not suit other point
events. We also refrain from recommending an optimal search bandwidth for similar
reasons and believe search bandwidth selection should also be application dependent. It
may be wise to use a set of search bandwidths to reveal hot spots from very local
effects to global scale. In essence, this would lead to a multi-scale exploration, which is
actually needed for many systems with hierarchical structure.
In the implementation, we aggregate all the accident points associated with a focal
source lixel into an accident count of the focal lixel, and their locations on the focal lixel
are not further distinguished. It is from the center point of focal source lixel that the
network distance to each neighboring lixel center is calculated. The aggregation has the
advantage of computational efficiency and more importantly it is closer to the reality in
the sense that a real accident seldom occurs precisely at a dimensionless point location
but occupying a certain length along a roadway. The location accuracy is also affected by
measurement process and contributes uncertainties to the event position. Regardless, the
impacts of aggregation on the measured distance and subsequently the density value
should be minor. For comparison purpose, we will implement the network KDE with unaggregated events in the future study.
In the current algorithm, road segments as well as lixels are also not further
distinguished and are treated equally. In reality, road segments are different in their
traffic flow capacity (e.g. the number of lanes and speed limit), directionality (one way or
two ways, left turn or right turn) and etc. The same accidents will have different
implications depending on the type of road segments they are on, and may have different
impacts on the traffics ahead of or behind the accident vehicles. Such information are not
23
accessible for the study area, however in areas where they are available, it may be
beneficial to incorporate them into the density estimation. The network KDE also has one
of the same fundamental drawbacks as the planar KDE for hot spots detection. Because
no statistical significance is employed in the process, it is ad hoc and there is no
indication of a density threshold above which hot spots can be confidently declared.
Experiments with different density thresholds may be needed in real applications.
Nevertheless, we believe the network KDE method presented in this paper could be
useful and are readily applicable in real world setting for traffic accident decision making
by different agencies. More significantly, although designed for KDE estimation, the idea
of segmenting a linear road network into a network system of equal-length lixels may
potentially lead the way to developing a suite of other network related spatial analysis
and modeling methods and have a larger impact than what is shown in this initial
application.
Acknowledgement
The authors greatly appreciate the helpful comments of three anonymous
reviewers and the detailed comments by the Editor, Dr. Jean-Claude Thill.
24
References
Bailey, T. C., & Gatrell, A. C. (1995). Interactive Spatial Data Analysis. Essex: Longman.
Black, W. R. (1991). Highway Accidents: A Spatial and Temporal Analysis.
Transportation Research Record, 1318, 7582.
Black, W. R. (1992). Network Autocorrelation in Transport Network and Flow Systems.
Geographical Analysis, 24, 207222.
Borruso, G. (2005). Network Density Estimation: Analysis of Point Patterns over a
Network. Lecture Notes in Computer Science: Computational Science and Its
Applications, 3482, 126132. Erdogan, S., Yilmaz, I., Baybura, T., & Gullu, M.
(2008). Geographical information systems aided traffic accident analysis system case
study: city of Afyonkarahisar. Accident Analysis and Prevention, 40(1), 174181.
Delmelle, E. C., & Thill, J.-C. (2008). Urban Bicyclists A Spatial Analysis of Adult and
Youth Traffic Hazard Intensity. Transportation Research Record, in press.
Gibin, M., Longley, P., & Atkinson, P. (2007). Kernel Density Estimation and Percent
Volume Contours in General Practice Catchment Area Analysis in Urban Areas. In
the Proceedings of the GIScience Research UK Conference (GISRUK) 2007.
Maynooth - Ireland.
Kent, J., Leitner, M., & Curtis, A. (2006). Evaluating the Usefulness of Functional
Distance Measures When Calibrating Journey-to-Crime Distance Decay Algorithms.
Computers, Environment and Urban Systems, 30(2), 181200.
Krisp, J.M., & Durot, S. (2007). Segmentation of lines based on point densities An
optimisation of wildlife warning sign placement in southern Finland. Accident
Analysis and Prevention, 39(1), 3846.
25
26
Okabe, A., & Okunuki, K. (2001). A Computational Method for Estimating the Demand
of Retail Stores on a Street Network Using GIS. Transactions in GIS, 5(3), 209220.
Okabe, A., & Yamada, I. (2001). The K-Function Method on a Network and its
Computational Implementation. Geographical Analysis, 33, 271290.
Okabe, A., Okunuki, K., & Shiode, S. (2006). SANET: A Toolbox for Spatial Analysis
on a Network. Geographical Analysis, 38, 5766.
OSullivan, D., & Unwin, D. J. (2002). Geographic Information Analysis. John Wiley,
Hoboken, New Jersey.
O' Sullivan, D., & Wong, D. W. S. (2007). A Surface-Based Approach to Measuring
Spatial Segregation. Geographic Analysis, 39(2), 147168.
Pulugurtha, S.S., Krishnakumar, V. K., & Nambisan, S. S. (2007). New methods to
identify and rank high pedestrian crash zones: An illustration. Accident Analysis and
Prevention, 39(4,) 800811.Schabenberger, O., & Gotway, C. A. (2005). Statistical
Methods for Spatial Data Analysis. Chapman & Hall/CRC, Boca Raton, Florida.
Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis. Chapman
Hall, London.
Yamada, I., & Thill, J.-C. (2004). Comparison of Planar and Network K-Functions in
Traffic Accident Analysis. Journal of Transport Geography, 12, 149158.
Yamada, I., & Thill, J.-C. (2007). Local Indicators of Network-Constrained Clusters in
Spatial Point Patterns. Geographical Analysis, 39(3), 268292.
27
List of Figures
Figure 1. Illustration of the basic differences between the Planar KDE and Network KDE
for the same point event dataset. To estimate the density value at a focal point x, the
planar KDE treats the whole 2-D space as the context and finds 4 accident points (solid
dots) within a search bandwidth r, whereas the Network KDE only finds 2 accidents
within the same bandwidth in the network space based on network
28
distance.
Figure 2. Illustration of the basic terms used in the proposed network KDE algorithm.
The line segment between the two road intersections A and B is called a segment, so is
the dangling line segment AC between a road intersection A and a road end point C. Each
segment is divided into lixels of a defined network length l. Here, five lixels are created
for segment AB, with lixels 1, 2, 3 and 4 being regular lixels with length l, and lixel 5
being the residual lixel with length less than l. The dark dots are the lxnodes, the
intersection points of lixels. Note the original road intersections (A, B) and end points (C)
are always lxnodes.
29
Figure 3. The study area, Bowling Green, Kentucky. Test dataset include a road
transportation network system and traffic accident data for year 2005.
30
31
Figure 5. Illustration of different overall patterns produced by the network KDE and
planar KDE in the study area: (a) the proposed network KDE (10-m lixel), and (b) a
32
standard planar KDE (10-m raster cell). A Gaussian kernel and a 1000-m search
bandwidth are used.
Figure 6. Illustration of the local impacts of kernel functions in the network KDE: (a) a
Gaussian kernel based network density (per km), and (b) a Quartic kernel based network
density (per km). For both (a) and (b), a 100-m search bandwidth and a 10-m lixel length
are used. Both kernel functions result in similar local details.
33
Figure 7. Illustration of the impacts of kernel functions on the overall density pattern in
the network KDE: (a) a Gaussian kernel based network density (per km), and (b) a
34
Quartic kernel based network density (per km). It appears that the kernel functions make
less difference in the overall pattern.
Figure 8. Illustration of the impacts of t lixel length on the calculated density values for
the network KDE. (a)-(d) are the results for the lixel length of 5 m,10 m, 50 m and 100 m
respectively, with a Gaussian kernel and a 100-m search bandwidth. The shorter the lixel
length, the more detailed location variation can be revealed, as shown in the segment
from A to B.
35
36
Figure 10. Illustration of the impacts of different search bandwidths on the overall
density pattern. (a)-(f) are the results for the search bandwidth of 20 m, 100 m, 250 m,
500 m, 1000 m, and 2000 m respectively, with a Gaussian kernel and a 10-m lixel length.
37
As the search bandwidth increases from 20 m to 2000 m, the local hot spots are
gradually combined with their neighbors and larger clusters appear.
38