PREDPATROL (Predictive Patrolling) - IEEEFormat
PREDPATROL (Predictive Patrolling) - IEEEFormat
Abstract—Crimes exist everywhere in different forms, and community in order to live a normal and peaceful life. This
data collected regarding different crimes, would be very large greatly affects the economy of the society by introducing
in volume which can be managed through data mining impalpable costs like psychological disturbances and
techniques. This research will be very useful in crime decreased quality of life for the victims of these crimes and
detection, but this depends on the volume and correctness of
the data because more data gives more accurate prediction
substantial costs to manage the increased policing in the
results. Classification is a famous, supervised learning society, correction facilities, etc. that places a huge financial
methodology in data analytics. Meaningful sets of data that burden on taxpayers and governments. [1] The high crime
may seem relevant to any particular application, is mined from rate that is prevalent in today’s world is causing a lot of
the huge amount of data available. It is a highly efficient mayhem among people from different countries, who are in
technique used to classify or group data into appropriate fact looking for solutions for detecting crime patterns by
classes and also predict unknown classes. This research focuses studying various criminal behaviors and trying to
mainly on classification of given input text to predict the class comprehend the numerous characteristics of crimes that are
of crime that it belongs to. The dataset used in this research committed. [1] Due to the lack of open crime datasets
was collected from the internet and our tool was trained with
available for case studies, and the inconsistency and
it, for each type of crime. This paper compares the Naïve Bayes
Classifier Algorithm with the Artificial Neural Network inadequacy of the very few that are available, data mining in
Algorithm for predicting the crime type. A huge data set is the area of Crime Analysis is a major challenge faced by
generated every year on the basis of reporting crime. This data researchers today, who are looking for new ways to enhance
can prove very useful in analyzing and predicting crime. Crime Data Analysis to curb this growing societal menace.
Crime analysis is an area of indispensable importance in the One of the most commonly used and important technique in
police department. In the notion of addressing these issues and data mining is Classification. This research will focus on
assisting the police in order to meet increasing demands of the applying Naive Bayes classification algorithm on the crime
current community, a crime type prediction tool has been report and view and compare the accuracy of the results in
proposed as a strategy.
classifying in the crime categories of the Bayesian algorithm
with the Artificial Neural Network Classification algorithm.
Keywords— Crime Prediction, Classification, Machine
Learning, Naïve Bayes, Artificial Neural Networks In this research, real time crime reports are used for training
the tool. PredPatrol’s mission is simple: To aid the Police
Department in predicting crimes. The proposed service is
I. INTRODUCTION not to replace skilled crime analysts and experienced
It is often seen that, the crime rate in our city is quite high officers, but to aid them in achieving results instantly. It is a
and the Police Department find it difficult to manage their web application that takes a crime scene investigation report
huge datasets and get buried in paperwork. The service that as an input and outputs the category of crime committed. In
is proposed will make their tasks easier by managing the the recent past the crime rates here have been increasing and
dataset efficiently and help them identify the type of crime. the action taken to arrest the accused has been quite slow.
Crimes are recurring social issues that disrupt our everyday The motivation behind this research came from this plight.
lives and impact the economic growth of any society. It is a It was decided to provide the police, a system that will help
common menace that induces a sense of fear and discomfort them to expedite prediction of crimes. The end users of the
among the people leading to lack of unity and harmony proposed service would be the members of the Police
among us. It hampers our routine by making us avoid Department, as they would be aided in solving cases faster.
certain places at night or break healthy associations with our This system will be used by the Police Department
neighbors, thereby causing chaos and damage to our extensively where they can feed in simple real-time data and
communal activities. It also leads to the relocation of some
families to a less crime-ridden
get a visual and refined output. It can accelerate the process using two feature selection methods used on a real-time
of taking action against crimes. crime dataset. It is found by comparison that Neural
Networks, k-Nearest Neighbor and Naïve Bayes are better
The organization of this paper is as follows. Section 2 classifiers against Support Vector Machine and Decision
discusses the literature survey, Section 3 covers the tree.
methodology and framework such as classification methods, In this research [9], classification is applied to a real-time
crime dataset/reports and measures for performance crime dataset to predict the category of crime for different
evaluation. Section 4 highlights the experimental results of states of the USA. The real-time crime dataset used in this
the classification algorithms for classifying the type of research was acquired from the 1995 FBI UCR. This work
crime. Finally, Section 5 talks about conclusion and future compares the two classification algorithms namely,
work. Decision Tree and Naïve Bayes for predicting the category
of crime for various states in the USA. The experimental
II. LITERATURE SURVEY
results showed that, Decision Tree outperformed Naïve
A. Criminology and Crime Analysis Bayes algorithm.
Criminology is a field about the study of crimes and The following are related work regarding the two
behavior of criminals and is a practice that recognizes classification algorithms that are used in PredPatrol.
characteristics of crime [2]. It is one of the most significant
areas where data mining approaches can produce vital 1) Naïve Bayes Classifiers: This work [10] gives a
outcomes. Crime analysis, a portion of criminology, is a solution for the problem of prediction of criminals (that is
process that deals with identifying crimes and their
the problem of identifying the most likely suspect) using the
associations with criminals. The huge volume of crime
Naïve Bayes theory. Obtaining the crime dataset is a hard
datasets and also the complication of links between these
task owing to confidentiality issues. So the crime dataset is
data have made criminology a suitable area for applying
data mining methods. The knowledge obtained from data produced synthetically. The proposed methodology is tested
mining techniques is very helpful that can aid and support for the criminal prediction problem and the results show that
the police [3]. According to [4], solving of crimes is a the proposed model provides high scores in identifying most
complicated task that needs both human experience and likely suspects.
intelligence and data mining is an approach that can assist In this research [11], crime data is used on Naive Bayes
the police with crime detection. classification algorithm, with Rapid Miner tool and how
efficiently and accurately Naïve Bayes algorithm could
manage this data is demonstrated.
B. Reason for Predictability of Crime In this work [12], the crime dataset is classified into
There is strong evidence to bolster the fact that crime is vulnerable and non-vulnerable for effective crime control
predictable statistically mainly because criminals function in strategies. The classification algorithms are applied
their comfort zones [5]. That is, they commit the same kind individually on real crime data and their performance
of crimes that they have done in the past, commonly close to evaluation is analyzed using standard measures such as
the same location and time with similar patterns. accuracy, time, Receivers Operating Characteristic (ROC).
The result showed that C4.5 performed better with higher
C. Review of Classification Algorithms Used In accuracy on the three datasets against Naïve Bayes. The
Crime Prediction result also revealed that the two classifiers performed better
under percentage split approach compared to 10 fold
In this work [6], three classification algorithms, namely validation approaches.
Naïve Bayes, C4.5 and K- Nearest Neighbor algorithms
(KNN) are compared using various famous missing data
2) Artifical Neural Networks: In this study [13], a
filling algorithms used on a real time crime dataset with
hybrid crime classification model was proposed, by
numerous missing data. The results show that higher
combining Artificial Bee Colony (ABC) algorithm with the
accuracy in classification can be acquired by combining
Artificial Neural Network (ANN) algorithm. The proposal
KNN classification algorithm and GBWKNN missing data
was by using Artificial Bee Colony as a learning method for
filling algorithm.
ANN, thus producing better results. This hybrid algorithm is
applied to a real time crime dataset to predict classes of
The main objective of this research [7] is to classify crimes crime. The dataset was obtained from the repository of UCI
based on the frequency of occurrence. A theoretical model Machine Learning. The experimental results show that this
based on techniques such as clustering and classification is proposed combination of ANN and ABC outperformed
applied to a real crime dataset recorded by police in other classification algorithms and achieved a high accuracy
England. Weights are allocated to the features to improve with improvement over other algorithms.
the model’s quality.
If the given correct output is zero, i.e., ye=0, the weights will
be adjusted according to the incremental gradient descent as
follows: