CRIMECAST: A Crime Prediction and Strategy Direction Service
CRIMECAST: A Crime Prediction and Strategy Direction Service
CRIMECAST: A Crime Prediction and Strategy Direction Service
Abstract— Various researches on criminology provides coldspots focus should be more on individual occasional
us with a key piece of information about criminal offender. This calls for a much optimized probabilistic model
psychology that, a criminal doesn't hover around unknown and massive data analysis which is required to develop real
territory rather they commit crimes when opportunity time crime prediction with precise time and location.
provides in a concentrated or familiar area i.e. hotspots.
So, a crime predicting model can be simulated using crime We present here CRIMECAST, a mathematical simulation
pattern theory which can analyze verified past crime data process that analyses crime rate, crime locations, timing,
and predict future criminal activities. The aim of this nature of crime, damages, scope of crime from past years
paper is to introduce CRIMECAST which is a crime (preferably up to 30 years) and predicts future crime. This
prediction and strategy direction service which attempts to process is a method which relates to past crime trends and
predict probable future crimes by simulating probabilistic factors and determines possible influence on future scope of
model implementation and Artificial Neural Network. crime. It can be used for identifying both frequent offender
CRIMECAST is a spatial crime analysis process that and their hotspots and also occasional offender in coldspots.
focuses on authentic crime history and predicts crime, The authors in [5] proposes Levy Flight Model to capture the
develops strategy map, provides security alert. Our dynamics of hotspots. CRIMECAST uses statistical model and
simulation on very big dataset show that CRIEMECAST ANN implementation to predict spatial behaviors of both
outperforms all other methods of making crime frequent and occasional offenders. CRIMECAST can trace
predictions. and predict criminal's mobility for crimes like - series robbery.
It is usually be done using directional movement approach and
Keywords— Crime Prediction, Hotspot Detection, Oracle data analysis [6].
DB, Probabalistic Model, , ANN
The rest of this paper is organized as follows. Section II of
I. INTRODUCTION this paper addresses related work. CRIMECAST model is
When law was introduced in social life there was an presented in Section III. Section IV describes our experimental
automatic occurrence of an act which committed in violating evaluation and results. Section V provides the conclusion and
it. These acts are known as Crime for which penalty is future scope is discussed in Section VI.
imposed. Nowadays because of the inexorable increase in
crime, it's a difficult challenge to predict future crime II. BACKGROUND
accurately and efficiently. So the necessity of crime prediction Several criminological researches have explored different
is on the rise [1], [2], [3]. Crime is naturally unpredictable. It crime prediction methods. In [7], the authors developed a
is neither random nor uniform [4], [3]. Region with higher point pattern based density model for crime prediction. This
occurrence of criminal events which are referred here as model computes the likelihood of criminal occurrence in a
hotspots has a larger share of criminal activity than region fixed location by analyzing previous incidents. In [2],
with lower concentration of criminal events referred here as Rossomo has modeled a recognized method of determining
coldspots. home location of offender by analyzing the person's crime
location. Different Collaborative Filtering methods [8] infers
Hotspots are changed frequently with development of new user's behavior from behaviors of similar users based on the
residential area or market, employment etc. In hotspots idea that user with similar behavior in the past will have
criminals find crime opportunities easily. So crime occurrence similar behavior in the future. In [5], the authors propose a
is higher. Detection of hotspots enables law enforcement lattice model for predicting residential burglary where each
agency to focus more in hotspots and prioritize the use of location is assigned with a dynamic interest/attractiveness
resources for better prevention of crime. value. All above mentioned methods proposes solution model
for different crime prediction problems to which the method
But in coldspots a criminal has to hunt for opportunities to presented here can't be compared.
commit crime and thus faces higher risks. So, crime
occurrence is lower. For a feasible crime prediction in
In every crime prediction model a data mining technique is E. ARTIFICIAL NEURAL NETWORK:
used for predicting future crime. In [9], the authors introduce
the basic data mining techniques that are used to predict crime. A neural network has a massive number of processing
element that works only on local data called neurons. In [16],
A. Multivariate time series clustering: the authors introduce a crime prediction model that focuses on
areas with high crime occurrence. Then, a crime incidence-
Multivariate time series is a flow of data points computed in scanning algorithm is employed to identify clusters i.e.
similar time intervals. This technique is efficient for hotspots. This provides enough data to train ANN capable of
determining similar crime trends. In [10], the authors discuss determining crime trends within them. This approach is
the approach used for this technique. Because some crimes enhanced by applying the Gamma test otherwise the ANN
have more importance i.e. weight than other crimes example- takes long training time. The results of crime prediction using
murder has more weight than burglary this approach considers ANN is satisfactory and prediction accuracy is generally high.
the weightage scheme in clustering algorithm. The distance
function is defined by a weighted version of Minkowski
distance measurement. The effectiveness of this technique has III. PROPOSED METHODOLOGY
been tested on Indian crime dataset provided by Indian In this Section we present CRIMECAST model
national crime records. components, starting with dataset preparation.
Fuzzy time series was first introduced to predict student ii) Because the crime that has been occurring at a larger scale
enrolments at the University of Alabama [13]. It had complex in recent times has a higher probability of happening again
matrix operations which caused heavy overheads. In [14], the than the crime that used to occur at a larger scale in previous
authors proposed a simplified process. In [15], the authors times. So, we take into calculation a Time Impact Factor.
explored and developed the application of Fuzzy time series Expressed as f(i, y, s) = Time Impact Factor of crime i in
technique for crime prediction. Seventeen years of historic location s at a period of time y.
crime data of Delhi city have been used in this research. The
technique works even if some data is not available. iii) Then, we take into consideration the number of occurrence
of a specific crime in a time period y. Because the crime
occurring more has a bigger probability of occurring again
than the crime with less number of occurrence.
Expressed as b(i, y, s) = Number of occurrence of crime i in for calculation. But to obtain real-time feedback and precise
location s at a period of time y. prediction these factors need to be adjusted because of many
variable of crime like weather, season etc. The weights are
iv) Probability Factor: adjusted in every iteration of training algorithm and we obtain
Then we calculate a probability factor. Expressed as- more precise output. We implemented 2 layers of hidden layer
to get output. We propose the following mathematical process
𝑛
for training of Neural Network –
𝑂(𝑖, 𝑠) = ∑ 𝑝(𝑖, 𝑠) ∗ 𝑓(𝑖, 𝑦, 𝑠) ∗ 𝑑(𝑖, 𝑦, 𝑠)
𝑦=1 i) Compute β for all other nodes using –
Where, n = Amount of time period taken into consideration.
𝛽𝑗 = ∑ 𝑤𝑗→𝑘 𝑜𝑘 (1 − 𝑜𝑘 )𝛽𝑘
v) Probability of Occurrence of Crime i in Location s:
𝑘
We can determine this by following equation - ii) Compute weight changes for all weights using -
Probability of Occurrence, ∆𝑤𝑖→𝑗 = 𝑟𝑜𝑖 𝑜𝑗 (1 − 𝑜𝑗 )𝛽𝑗
where,
𝑂(𝑖, 𝑠) β = Node output
𝑂. 𝑃(𝑖, 𝑠) =
∑𝑚
𝑖=1 𝑂(𝑖, 𝑠) ∆w = Weight change
Where, m = Number of domains in the given dataset r = Rate parameter
In percentage, Probability of Occurrence = O.P(i, s) * 100%
Simulation of CRIMECAST ANN Design:
C. Hotspot Detection
which stated that crime rates peak during summer time. Table II
Increase or decrease of light has direct effect on crime in most Calculated Probability Factor in Different Locations
countries. CRIMECAST also takes into account the varying
impact of this variable using ANN. Mirpur Uttara Motijheel Baridhara Gulshan
𝐷𝑜𝑚𝑎𝑖𝑛
IV. EXPERIMENTAL EVALUATION ⁄ 𝐿𝑜𝑐𝑎𝑡𝑖𝑜𝑛
This section describes implementation of simulated dataset, Murder 10401. 8077. 11401.46 5280.17 4045.55
results and comparison. In this dataset Impact Factor (P) refers 99 46
to Factor of Precedence and Impact Factor (T) refers to Time Rape 867.30 768.7 500.56 1096.77 642.66
7
Impact Factor considered as time domain. Table I represents Theft 4056.4 2000. 5042.13 1121.33 2353.66
the simulated data table. This data was inserted into oracle 2 55
database in order to perform calculation. Robbery 1844 1142. 1753.33 1000.42 1545.66
33
Table I Kidnapping 1647 800.6 1500.47 645.17 850.96
7
Simulated Dataset