0% found this document useful (0 votes)
8 views27 pages

Group 6 - SMa - Crime Data Analysis Using Data Mining - Presentation

The document presents a project on crime data analysis using data mining techniques to identify and visualize crime patterns and trends. It outlines the objectives, methodology, and implementation of various data mining algorithms such as Naïve Bayes and Decision Trees to predict crime occurrences. The study emphasizes the importance of historical crime data for developing an effective predictive model and suggests future work in criminal profiling.

Uploaded by

21pw12
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views27 pages

Group 6 - SMa - Crime Data Analysis Using Data Mining - Presentation

The document presents a project on crime data analysis using data mining techniques to identify and visualize crime patterns and trends. It outlines the objectives, methodology, and implementation of various data mining algorithms such as Naïve Bayes and Decision Trees to predict crime occurrences. The study emphasizes the importance of historical crime data for developing an effective predictive model and suggests future work in criminal profiling.

Uploaded by

21pw12
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 27

St.

Thomas’ College of Engineering &


Technology

CRIME DATA ANALYSIS USING

DATA MINING
.
By

{
Rohan Bhowmick 12200117030
Group 6 Ishika Chakrabarti
Anwesha Chakraborty
12200117045
12200118011

Under the esteemed guidance of


Prof. Subhankar Mallick
Department of Computer Science & Engineering
CONTENTS
INTRODUCTION

THHEORY & DISCUSSION

METHODOLOGY

I M P L E M E N TAT I O N

FUTURE WORK

CONCLUSION

REFERENCES

2
INTRODUCTION
C R I M E D ATA A N A LY S I S

Crime analysis and prevention is a systematic approach for


identifying and analysing patterns and trends in crime. Our
system can predict regions which have high probability for
crime occurrence and can visualize crime prone areas. With the
increasing advent of computerized systems, crime data analysts
can help the Law enforcement officers to speed up the process
of solving crimes.

3
OBJECTIVES
T H E O B J E C T I V E O F T H I S P R O J E C T I S :

 To identify the nature of crime and the crime prevention process to


extracting named entities from narrative reports.
 To explore and choose among the various data mining software that
support clustering and association rule mining technique to
experiment with crime records.
 To build and train as well as test the performance of the model.
 To interpret and analyse the results of the model that how strong is
the model to extract crime data patterns.
 To compare the clustering and association rules data mining
techniques and select the one which performs the best results.
 To compare our proposed model with some recent working model.
 Finally, to forward recommendations based on the findings of the
study.

4
THEORY & DISCUSSION

Data Mining, what is it?


 Generally, data mining (sometimes called data or knowledge
discovery) is the process of analyzing data from different
perspectives and summarizing it into useful information, that
can be used to increase revenue, cuts costs, or both.
 Data mining software is one of a number of analytical tools
for analyzing data.
APPLICATION AND
TRENDS
 Financial Data Analysis

 Retail Industry

 Telecommunication Industry

 Biological Data Analysis

 Other Scientific Applications

 Intrusion Detection

6
CRIME DATA
ANALYSIS USING
DATA MINING
LOOKING AHEAD

7
WHY ANALYSE CRIME?

 Analyse crime to inform law enforcers about general and


specific crime trends, patterns, and series in an ongoing,
timely manner.

 Analyse crime to take advantage abundance of information


existing in law enforcement agencies the criminal justice
system, and the public domain.

 Analyse crime to maximize the use of limited law


enforcement resources.

8
DATA MINING AND CRIME PATTERN

We will look at how to convert crime information into a data-mining problem. In this case it can help
the analysts to identify crimes faster and help to make faster decisions. We have seen that in crime
terminology a cluster is a group of crimes in a geographical region or a hot spot of crime.

Challenges we are facing are:

 Increase in crime information that has to be stored annualized.


 Analysis of data is difficult since data is incomplete and inconsistent.
 Accuracy of the program depends on accuracy of the training set.
 Limitation in getting crime data records from Law Enforcement
department

9
STEPS IN DOING CRIME ANALYSIS

Figure 1: Steps in doing Crime Analysis


10
METHODOLOGY
Data Collection
 Collecting data from various sources like news sites, blogs, social media, RSS feeds
etc.

 But the data we got is ‘VERY UNSTRUCTURED’!, and how do we store it?!

 The advantage of NoSQL database over SQL database is that it allows insertion of
data without a predefined schema.

 Object-oriented programming- hence is easy to use and flexible.

 Unlike SQL database it not need to know what we are storing in advance, specify its
size etc.

11
METHODOLOGY
Classification
• Naïve Bayes- a supervised learning method as well as a statistical method

• The algorithm classifies a news article into a crime type to which it fits the
best Eg. "What is the probability that a crime document D belongs to a
given class C?“

• Using Naive Bayes algorithm we create a model by training crime data


related to vandalism, murder, robbery, burglary, sex abuse, gang rape,
arson, armed robbery,

highway robbery, snatching etc.

• Test results shows that Naive Bayes shows more than 90% accuracy!!

12
METHODOLOGY
Pattern Identification
• Apriori algorithm- used to determine association rules which highlight
general trends

• The result of this phase is the crime pattern for a particular place.

• After getting a general crime pattern for a place, when a new case arrives and
if it follows the same crime pattern then we can say that the area has a
chance for crime occurrence.

• Information regarding patterns helps police officials to facilitate resources in


an effective manner.

13
METHODOLOGY
Prediction
• Decision tree- It is simple to understand and interpret!

• Its robust nature and also it works well with large datasets.

Visualization
• A heat map which indicates level of activity, usually darker colors to
indicate low activity and brighter colors to indicate high activity.

14
I M P L E M E N TAT I O N U S I N G DATA V I S U A L I Z I N G
TECHNIQUE

15
M O S T R E P O RT E D C R I M E

16
D I F F E R E N T T Y P E S O F C R I M E W I T H C R I M E RAT E

17
C R I M E S AG A I N S T C H I L D

18
C R I M E AG A I N S T W O M E N

19
D I S T R I B U T I O N O F C R I M E OV E R T H E Y E A R

20
CRIME PREDICTION

21
CRIME PREDICTION RESULTS

22
FUTURE WORK
Criminal Profiling
• Helps the crime investigators to record the characteristics of
criminals.

• The main goal of doing criminal profiling is that:

• To provide crime investigators with a social and psychological


assessment of the offender

• To evaluate belongings found in the possession of the offender

23
CONCLUSION
An acceptable model for data mining which comes up with
excellent results of analysing crime data set; it requires huge
historical data that can be used for creating and testing the model.
More than 150500 crime records that were used in this work can
give estimation and lead to an acceptable model. VS Code and
Excel software were used to pre-process and analyse the collected
crime and criminal data.

24
REFERENCES
• [1] Malathi. A and Dr. S. Santhosh Baboo. Article:an enhanced algorithm to
predict a future crime using data mining. International Journal of Computer
Applications, 21(1):1–6, May 2011. Published by Foundation of Computer
Science.

• [2] Eibe Frank and Remco R. Bouckaert. Naive bayes for text classification with
unbalanced classes. In Proceedings of the 10th European Conference on
Principle and Practice of Knowledge Discovery in Databases, PKDD’06,
pages 503–510, Berlin, Heidelberg, 2006. Springer-Verlag.

• [3] Ralf Hartmut Güting. Graphdb: Modeling and querying graphs in


databases. In Proceedings of the 20th International Conference on Very Large
Data Bases, VLDB ’94, pages 297–308, San Francisco, CA, USA, 1994. Morgan
Kaufmann Publishers Inc

25
REFERENCES
• [4] Lior Rokach and Oded Maimon. Decision trees. In Oded Maimon and Lior
Rokach, editors, The Data Mining and Knowledge Discovery Handbook, pages
165–192. Springer, 2005.

• [5] Tong Wang, Cynthia Rudin, Daniel Wagner, and Rich Sevieri. Detecting patterns
of crime with series finder. In Proceedings of the European Conference on
Machine Learning and Principles and Practice of Knowledge Discovery in
Databases (ECMLPKDD 2013), 2013.

• [6] R. Sanjana, H. S. S, and S. Sruthi, ‘Big Data Approach for Crime Classification
and Visualization Using Crime Dataset’, Int. J. Innov. Res. Sci. Eng. Technol., vol. 7,
no. 2, pp. 25–29, 2018.

• [7] A. Jain and V. Bhatnagar, ‘Crime Data Analysis Using Pig with Hadoop’, Procedia
Comput. Sci., vol. 78, no. December 2015, pp. 571–578, 2016.

26
Icon Icon Icon

THANK YOU

You might also like