0% found this document useful (0 votes)
22 views16 pages

Uday Nani

Using for reference purpose
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views16 pages

Uday Nani

Using for reference purpose
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 16

A

Mini Project Report on

REAL TIME VIDEO BASED VIOLENCE DETECTION SYSTEM IN PUBLIC


AREA USING NEURAL NETWORK
Submitted for partial fulfilment of the requirements for the award of the degree of

BACHELOR OF TECHNOLOGY
in

COMPUTER SCIENCE AND DESIGN

by

G. VIVEK 21K81A7415
V. ARAVIND 22K85A7404
R. UDAY 22K85A7407

Under the Guidance of

Mr. V. SATHISH

ASSISTANT PROFESSOR

DEPARTMENT OF CSD

St. MARTIN'S ENGINEERING COLLEGE


UGC Autonomous
Affiliated to JNTUH, Approved by AICTE
Accredited by NBA & NAAC A+, ISO 9001-2008 Certified
Dhulapally, Secunderabad-500 100
www.smec.ac.in

NOVEMBER - 2024
St. MARTIN'S ENGINEERING COLLEGE
UGC Autonomous
Affiliated to JNTUH, Approved by AICTE
NBA & NAAC A+ Accredited
Dhulapally, Secunderabad - 500
100

Certificate

This is to certify that the project entitled “REAL TIME VIDEO BASED VIOLENCE
DETECTION SYSTEM IN PUBLIC AREA USING NEURAL NETWORK ” is being
submitted by G. VIVEK (21K81A7415), V. ARAVIND (22K85A7404), R. UDAY
(22K85A7407) in fulfilment of the requirement for the award of degree of BACHELOR
OF TECHNOLOGY IN COMPUTER SCIENCE AND DESIGN is recorded of
bonafide work carried out by them. The result embodied in this report have been verified
and found satisfactory.

Signature of Guide Signature of HOD


Mr. V. SATHISH Dr. P. SAI PRASAD
Assistant Professor Professor and Head of Department
Department of CSD Department of CSD

Internal Examiner External Examiner

Place:

Date:
St. MARTIN'S ENGINEERING COLLEGE
UGC Autonomous
Affiliated to JNTUH, Approved by AICTE,
Accredited by NBA & NAAC A+, ISO 9001:2008 Certified
Dhulapally, Secunderabad - 500 100

DEPARTMENT OF COMPUTER SCIENCE AND DESIGN

DECLARATION

We, the students of “Bachelor of Technology in Department of Computer Science and Design”,
session: 2021 - 2025, St. Martin’s Engineering College, Dhulapally, Kompally, Secunderabad,
hereby declare that the work presented in this project work entitled REAL TIME VIDEO BASED
VIOLENCE DETECTION SYSTEM IN PUBLIC AREA USING NEURAL NETWORKis the
outcome of our own bonafide work and is correct to the best of our knowledge and this work has been
undertaken taking care of Engineering Ethics. This result embodied in this project report has not been
submitted in any university for award of any degree.

G. VIVEK 21K81A7415

V.ARAVIND 22K85A7404

R. UDAY 22K85A7407
ACKNOWLEDGEMENT

The satisfaction and euphoria that accompanies the successful completion of any task
would be incomplete without the mention of the people who made it possible and whose
encouragement and guidance have crowded our efforts with success.
First and foremost, we would like to express our deep sense of gratitude and indebtedness
to our College Management for their kind support and permission to use the facilities available in
the Institute.

We especially would like to express our deep sense of gratitude and indebtedness to
Dr. P. SANTOSH KUMAR PATRA, Professor and Group Director, St. Martin’s Engineering
College, Dhulapally, for permitting us to undertake this project.

We wish to record our profound gratitude to Dr. M. SREENIVAS RAO, Principal,


St. Martin’s Engineering College, for his motivation and encouragement.

We are also thankful to Dr. P. SAI PRASAD, Head of the Department, Computer
Science and Design, St. Martin’s Engineering College, Dhulapally, Secunderabad, for his
support and guidance throughout our project as well as Project Coordinator Dr. B. CHANDRA
SHEKAR, Professor, Computer Science and Design department for his valuable support.

We would like to express our sincere gratitude and indebtedness to our project supervisor
Mr.V. SATHISH, Professor, Information Technology, St. Martins Engineering College,
Dhulapally, for his support and guidance throughout our project.

Finally, we express thanks to all those who have helped us successfully completing this
project. Furthermore, we would like to thank our family and friends for their moral support and
encouragement. We express thanks to all those who have helped us in successfully completing
the project.

G. VIVEK 21K81A7415
V.ARAVIND 22K85A7404
R. UDAY 22K85A7407
ABSTRACT

The increasing concerns over public safety have necessitated the development of
advanced surveillance systems capable of real-time violence detection. This paper
presents a real-time video-based violence detection system leveraging the capabilities of
YOLO (You Only Look Once) and Convolutional Neural Networks (CNN). The system
aims to enhance public security by accurately identifying violent actions in surveillance
footage. YOLO, a state-of-the-art object detection framework, is utilized for its high-
speed processing and efficiency in detecting objects within frames. Combined with
CNNs, which excel in feature extraction and classification, the system is designed to
recognize and categorize violent behaviors effectively. The architecture integrates a
YOLO-based detector to identify and track individuals in the video stream, while a CNN
model is employed to analyze the behavior of detected individuals and classify actions as
violent or non-violent.
The system operates in real-time, processing video feeds from public area surveillance
cameras to provide immediate alerts when violent activities are detected. This capability
not only aids in swift response and intervention but also enhances the overall
effectiveness of public safety measures. The proposed solution is validated through
extensive experimentation, demonstrating its robustness and accuracy in various
scenarios. This paper outlines the design, implementation, and performance evaluation of
the system, highlighting its potential impact on public area surveillance and security
LIST OF FIGURES

Figure No. Figure Title Page No.

4.2.1 CNN for Text Classification 13

4.2.2 Encoding Words 14

4.2.3 One-Hot Vector 15

4.2.6 Max Pooling over Time 16

4.2.7 Sentiment CNN 17

4.3.2 Data Flow Diagram 18


LIST OF TABLES

Table Table Name Page No.


4.5 Dataset View 25
LIST OF ACRONYMS AND DEFINITIONS

S.NO ACRONYM DEFINITION

01. LSTM Long Short-Term Memory

02. CNN Convolutional Neural


Network

03. SVM Support Vector Machine

04. UML Unified Modelling Language

05. RF Random Forest

06. RNN Recurrent Neural Network

07. NB Naïve Bayes

08. MDP Markov Decision Process

09. PTSD Post-traumatic Stress Disorder


CONTENTS

ACKNOWLEDGEMENT i
ABSTRACT ii
LIST OF FIGURES iii
LIST OF TABLES v

LIST OF ACRONYMS AND DEFINITIONS vi


CHAPTER 1 INTRODUCTION 01
1.1 Objective 02

1.2 Overview 03

CHAPTER 2 LITERATURE SURVEY 07


CHAPTER 3 SYSTEM ANALYSIS AND DESIGN 13
3.1 Existing System 13

3.2 Proposed System 13


3.3 System Configuration
CHAPTER 4 SYSTEM REQUIREMENTS & SPECIFICATIONS 15

4.1 Database 15
4.2 CNN Algorithm 15
4.3 Design 20

4.3.1 System Architecture 20


4.3.2 Data Flow Diagram 21
4.3.3 UML Diagram 22

4.3.4 Use Case Diagram 23


4.3.5 Class Diagram 24
4.3.6 Sequence Diagram 25

4.3.7 Activity Diagram 26


4.4 Modules 27
4.4.1 Modules Description 27

4.5 System Requirements 28


4.5.1 Hardware Requirements 28
4.5.2 Software Requirements 28
4.6 Testing 29
4.6.1 Unit Testing 29
4.6.2 Integration Testing 29
4.6.3 Functional Testing 29
4.6.4 System Testing 30
4.6.5 White Box Testing 30
4.6.6 Black Box Testing 30
4.6.7 Unit Testing 31
4.6.8 Integration Testing 31
4.6.9 Acceptance Testing 31
CHAPTER 5 SOURCE CODE 32
CHAPTER 6 EXPERIMENTAL RESULTS 41
CHAPTER 7 CONCLUSION &FUTURE ENHANCEMENT 48
7.1 CONCLUSION 48
7.2 FUTURE ENHANCEMENT 49
REFERENCES 50
CHAPTER 1

INTRODUCTION

In today's digital era, the prevalence of video content across various online platforms has
grown significantly, bringing attention to the need for efficient and automated violence
detection systems. To secure user security, safeguard online communities, and help law
enforcement authorities quickly respond to bad content, violence identification in real-
world recordings is an essential duty. Researchers have looked to pre-trained models as a
possible approach to tackle this problem, relying on their capacity to generalize from in-
depth training on substantial datasets. Public video observation systems are widely
utilized worldwide and can give accurate and detailed information in various security
applications . Although the need to review hours of video material compromises the
capacity to make decisions fast, which is essential in video surveillance for the
prevention of crime and violence.

In sequence to relieve authorities of the burden of having to view hours of movies in


sequence to identify events lasting only a few seconds, several research about the
automatic detection of violent sequences in videos have been presented. Recent research
has demonstrated the accuracy of deep learning-based algorithms in the identification of
violence, in contrast to earlier studies that used hand-crafted features and flow
descriptors typical of classical action recognition methods. Due to their outstanding
performance in image identification tasks, Convolutional Neural Networks (CNNs) are
more typically utilized in computer vision.

In several research areas, CNNs are developing quickly, and that is anticipated that next
solutions will accelerate the implementation of CNNs. These learning algorithms still
have lots of room for progress given the accessibility of massive data and the rapid
increase in computing power. The spatial CNNs, they are used for images recognition
tasks, have recently been extended to the temporal domain for HAR in movies using a
ways of effective methods.
OBJECTIVE

The technological advancement in video and image processing has been unprecedented
due to their importance in finding out intricate contents for various applications and
purposes which includes search, summarization and recognizing action. The emphasis on
recognizing actions from video stream has been growing in recent years due to the rise of
violent acts involving terror groups to various single or multiple person attacks. This has
resulted into the usage of surveillance cameras throughout the world on a ubiquitous
level. The footage that these cameras streams are manually inspected for such violent
acts by humans all the time which is really tenacious and not feasible in the long run-in
order to scale this operation. Also, the process of detecting such scenarios can be error
prone due to the fact that humans can make mistakes, or they might not catch a
significant event due to inspecting other feeds.

There have been millions of cameras around the world for surveillance purposes but
even if the error percentages are low still the dangers are there for thousands of people.
This is just an estimation which allowed us to think about the current situation of
violence detection systems and methodologies. This calls for certain measures and fast
detection of violence without the help of humans. Hence, we turn towards deep learning
methods which are able to learn by itself and it is required as it can be used effectively to
detect potential violent activity before anyone. This is where our system comes in which
proposes to use deep learning algorithms in order to detect violent activity automatically.
This involves various stages of process such as object detection, action detection and
video classification. In this system we propose methodologies that will be able to
recognize violent threats and activities using deep learning methodologies.
OVERVIEW

In sequence to ensure public safety and content filtering, it is becoming increasingly


difficult due to the prevalence of violent content in real-world videos on different digital
platforms. We want to solve this problem by creating a deep learning system that
spontaneously detects violent acts.

putting in spot a violence detection system that is accurate and efficient, able to scan
real-world films and distinguish between violent and nonviolent content. Including a
mail notification system that, when it identifies violence in a video, delivers quick alerts
to a list of recipients. The aim of this work is to examine and develop a real-world film-
based violence detection system.

The study's objectives are as follows Create and improve already-trained models: The
project will concentrate on choosing appropriate pre-trained models for violence
detection, such as two-stream CNNs or 3D CNNs. Using annotated datasets of violent
and nonviolent videos, these models will be refined and tailored to the unique properties
of real-life settings. Improve accuracy and robustness.

Nowadays, tremendous growth is observed in research of surveillance system. The


surveillance cameras installed at various public places like offices, hospitals, schools,
highways, etc. can be used to capture useful actions and movements for event prediction,
online monitoring, goal driven analysis and also in intrusion detection. The goal of this
article is to develop an independent system for automatic analysis to detect presence of
violence and non-violence using Deep Learning techniques. We have put forward a deep
neural network for significant detection accuracy.

A Convolutional Neural Network (CNN) is used to extract frame level features from a
video which are then aggregated using a variant of Long Short -Term Memory (LSTM)
that uses convolutional gates. CNN and LSTM are together used for the analysis of local
motion in a video.
CHAPTER 2

LITERATURE SURVEY

S. No
Authors Title Year
1.

X. Liu, J. Yang, J. Zhang, and Real-Time Video-Based Violence Detection


2020
L. Liu in Public Areas Using Deep Learning

2.
Deep Learning Approaches for Real-Time
A. Patel, M. Sharma, and R.
Detection of Violent Behavior in Video 2021
Kumar
Surveillance Systems

3.
R. Singh, P. Gupta, and K. Efficient Real-Time Violence Detection
2023
Mehta Using YOLO and LSTM Networks

You might also like