Crimestat Iii: Susan C. Smith Christopher W. Bruce
Crimestat Iii: Susan C. Smith Christopher W. Bruce
Susan C. Smith
Christopher W. Bruce
About CrimeStat
About CrimeStat
• Spatial Statistics Program
• Analyzes Crime Incident Locations
• Developed by Ned Levine & Associates
– Grant 1997-U-CX-0040
– Grant 1999-U-CX-0044
– Grant 2002-U-CX-0007
– Grant 2005-U-CX-K037
• Provides supplemental statistical tools for
crime mapping
About CrimeStat
• Newest version is CrimeStat III (3.0)
• Program inputs incident locations (e.g.
robbery locations) in .dbf, .shp, ASCII or
ODBC-compliant formats using either
spherical or projected coordinates
• Program calculate various spatial statistics
and writes graphical objects to several GIS
programs (ArcMap for the purpose of this
workbook)
About CrimeStat
• The workbook provides copyright
information
• The workbook provides information on
how to correctly cite the program in
publications/reports
• The workbook provides a link to obtain
more information on CrimeStat, including
the complete manual
• Dr. Ned Levine’s contact information is
provided in the workbook
Chapter One
Introduction and Overview
In Chapter One….
• Purpose of CrimeStat III
• Uses of spatial statistics in crime analysis
• CrimeStat III as a tool for analysts
• Statistical Routines
• Hardware and Software requirements
• Downloading sample data
• Chapter Layout and Design
Introduction
• Nearly all crimes have a location that can
be analyzed
• In crime analysis, we can identify patterns
by looking at the geography of the
incidents
• Analyzing crime location is a major part of
policing – from determining police districts
to response times to determining a tactical
deployment to an active crime series
Geographic Information Systems
• “GIS” is often synonymous with ‘crime
mapping’
• Crime mapping
– Geocoding incidents or other police-related
data and displaying them on a paper or
computerized map
• Geocoding
– The process of assigning geographic
coordinates to data records, usually based on
the street address
Geographic Information Systems
• When incidents are geocoded, a list or
database of crimes is turned into a map of
those crimes
• This map can now tell a story about the
police data
• Thematic maps are created
– Point Symbol maps
– Choropleth maps
– Graduated Symbol maps
Geographic Information Systems
• Why map crime?
– Identify patterns and problems
– Identify hot spots
– Use as a visual aid
– Shows relationship between geography & other
factors
– Look at direction of movement
– Query data
– Track changes in crime
– Make maps for police deployment
…And many other reasons
Geographic Information Systems
• After you create the map, then analyze
• Why?
– To answer questions about data
• Historically, analysts relied heavily on
visual interpretation of the map to answer
the questions
– To identify hot spots
– To draw conclusions
– To recommend responses
Geographic Information Systems
• Why is visual interpretation not always possible?
– Can’t easily pick out hot spots among 1000s of data
points
– Can’t detect subtle shifts in the geography of a crime
pattern over time
– Can’t calculate correlations between two (or more)
geographic variables
– Can’t analyze travel times among complex road
networks
– Can’t apply complicated journey-to-crime calculations
across tens of 1000s of grid cells
• Spatial Statistics…a need filled by CrimeStat
CrimeStat III
• First released in August, 1999
• Current version, 3.1, released March 2007
• Not a GIS & does not create or display
maps
• It reads the files geocoded by a GIS and
then exports the results into formats the
GIS can read
• Effective use of CrimeStat requires a GIS
and knowledge of its use
CrimeStat III
• With geocoded crime data, CrimeStat can
perform calculations and output map layers
including (but not limited to):
– Mean/center of minimum distance of a group of
incidents
– An area representing the standard deviation of
a group of incidents or the entire geographical
extent of a group of incidents
– Statistics measuring the spatial relationship
between points (con’t next slide)
CrimeStat III
• (con’t)
– Statistics that measure the level of clustering
or dispersion within a group of incidents
– Distance measurements between points
– Identification of hot spots based on spatial
proximity
– Estimation of density across a geographic
area through “kernel smoothing”
– Statistics that analyze the relationship
between space and time (con’t next slide)
CrimeStat III
• (con’t)
– Statistics that analyzed the movement of a
serial offender
– Routines that estimate the likelihood that a
serial offender lives at any location in the
region, based on journey-to-crime research
– …And much, much more….
CrimeStat III
• Using CrimeStat statistical routines, an analyst is
able to
– Identify crime patterns & series
– Identify the ‘target area’ in which a serial offender is
most likely to strike next
– Identify and triage hot spots
– Conduct a risk analysis across a jurisdiction based on
known crime locations
– Create a ‘geographic profile’ to assist in investigating
suspected offenders
– Optimize patrol routes and response times
CrimeStat III
• CrimeStat is valuable for
– Tactical Crime Analysis
• Crime Patterns, Crime Series, Forecasting
– Strategic Crime Analysis
• Hot Spots, Problem Solving, Geographic Profiling
– Operations Analysis
• Patrol Routes, Patrol Districts, Response Times
Spatial Statistics in Crime
Analysis
• Some maps are simplistic and require only
a simple scanning and a limited amount of
human perception
– Hot Spot Identification, Spatial Forecasting
Spatial Statistics in Crime
Analysis
• Some map interpretation are impossible
without spatial statistics
– Geographic Profiling, Density Mapping
Spatial Statistics in Crime
Analysis
• It would be difficult to see subtle shifts in
crime incidents (within a series or pattern
or over years of changes in geography
within a jurisdiction)
Residential 1 mile Moderately dispersed: quartic or uniform Some burglars choose particular houses, but most cruise
burglaries neighborhoods looking for likely targets. A housebreak in any
part of a neighborhood transfers risk to the rest of the
neighborhood.
Domestic 0.1 mile Tightly focused: negative exponential Domestic violence occurs among specific individuals and
violence families. Incidents at one location do not have much chance of
being contagious in the surrounding area.
Commercial 2 miles Focused: triangular or negative exponential A commercial robber probably chooses to strike in a
robberies commercial area, and then looks for preferred targets (banks,
convenience stores) within that area. The wide area may thus
be at some risk, but the brunt of the weight should remain with
the particular target that has already been struck.
Thefts from 0.25 mile Dispersed: uniform If a parking lot experiences a lot of thefts from vehicles, your
vehicles GIS will probably geocode them at the center of the parcel.
This method ensures that the risk disperses evenly across the
parcel and part of the surrounding area (which probably makes
sense)—but not too far, since we know that parking lots tend to
be hot spots for specific reasons.
Creating a KDE
• Data setup; add ArcView SHP file
theftfromautos;
• Create reference grid on Reference File tab
• On Spatial modeling tab, Interpolation sub-tab,
chose Single KDE; adjust bandwidth and select
interpolation method
• Save result to; compute
• Open KLFA shapefile in ArcView and create a
choropleth map
• Experiment with different settings
Dual KDE
• KDE based on two files
– Primary & Secondary
– Primary use is to normalize for risk
• In single KDE, hot spots are based on volume
• In dual KDE, hot spots are based on risk
– Four things to keep in mind
• Sometimes you just want volume
• Data for secondary file is hard to come by
• The point data in the secondary file is interpolated just
like the primary file
• You cannot use a different interpolation method for
numerator and denominator (but you can use an
adaptive bandwidth)
Dual KDE
• Set up Secondary File like Primary File except
– Ratio of Densities
• Divides the density in the primary file with the density in
secondary file
– Log ratio of densities
• Helps control extreme highs and lows
– Valuable in strongly skewed distributions
– Absolute difference in densities
• Subtracts the secondary file densities from the primary
file densities
– Valuable in analyzing one time period to the next
Dual KDE
• Set up Secondary File like Primary File
except (con’t)
– Relative difference in densities
• Option divides primary and second files densities by
area of the cells before subtracting them (just like
absolute difference)
– Sum of densities
• Adds two densities together
– Useful to show combined effects of two types of crime
– Relative sum of densities
• Divides primary and second files by the area of the
cells before adding them
Dual KDE
• On Data Setup, remove larcey from autos and
add resburglaries.shp file
• On Secondary File, select censusblocks.dbf,
set variables, including Z (Intensity)
• On Spatial Modeling, Interpolation tabs, select
“Dual” box (check weighting variable option)
• Save Result to (.shp)
• Open ArcView, add layer, create choropleth
map
Dual KDE Uses and
Cautions
• KDE is a hot spot technique, but it is part
theoretical
• KDE maps are interpolations
– Meaning incidents did not occur at all of the
locations within the hottest color
• Creates a uniform risk surface (which is rare)
• You can only have bank robberies where there are
banks
– Hence, interpret a KDE in reference to where
suitable targets may exist within the risk surface
Chapter Seven
Spatial Temporal Moving
Average
In Chapter Seven...
• Understanding the Spatial Temporal
Moving Average
• Using a time variable in CrimeStat
Introduction
• Spatial-Temporal Moving Average (STMA)
• Set of points in robbery series
– But mean, SD, SDE doesn’t represent the
series
– Something is “off”
• Recall two types of crime patterns (Chpt 3)
– Those that cluster
– Those that walk
Introduction