Machine Learning Software For The Detect
Machine Learning Software For The Detect
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
Machine Learning Software for the Detection of Violence from CCTV Live
Footage
Adupa Nithin Sai1, Kowdodi Siva Prasad2*
1
Under Graduate Student, Department of Computer Science and Engineering (AIML), Hyderabad
Institute of Technology and Management, Hyderabad, Telangana, India
2
Professor, Department of Mechanical Engineering, Hyderabad Institute of Technology and
Management, Hyderabad, Telangana, India
*
Corresponding Author: [email protected]
Received Date: September 20, 2023 Published Date: September 30, 2023
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
volumes of CCTV video data to spot patterns, detection is equally vital in applications
abnormalities, and possible problems using such as crowd management and
complex algorithms and neural networks. The surveillance, where distinguishing between
software program continuously learns from normal and deviant behaviour is required. A
historical data, increasing its precision and lot of studies have proposed deep learning-
efficacy in recognizing problems and responding based nonviolence detection systems.
to shifting settings and scenarios [2]. This According to one recent study, a CNN is
cutting-edge AI and ML-powered analysis used to extract spatiotemporal data from
enables the software program to recognize issues video frames, which are then fed into a
like abandoned objects, suspicious activities, support vector machine (SVM) for
overcrowding, and odd behaviour, among others classification. On a benchmark dataset, the
and provides vital insights for law enforcement system attained an accuracy rate of 92%,
agencies to take prompt and informed action. Its demonstrating the usefulness of the
capabilities are also significantly influenced by suggested approach [4].
big data analytics to detect trends, patterns, and Frame-Based Violence Detection: Using
correlations by analyzing massive amounts of Inflated 3D Convolutional Neural Network
video data from several CCTV cameras that may (I3D CNN) frame-based violence detection
not be obvious to human operators. systems identify violent events by
The software program represents a analyzing individual frames in a video clip.
ground-breaking approach to enhancing public One recent study offered a frame-based
safety through automated CCTV analytics, as strategy that extracts spatio-temporal data
well as a powerful fusion of advanced from video frames using the I3D algorithm.
technologies, such as AI, ML, computer vision, The collected features are subsequently
and big data analytics, to reform public safety classified using a Long Short-Term
through automated CCTV analytics. Memory (LSTM) network. On a benchmark
dataset, the suggested system attained an
LITERATURE REVIEW accuracy rate of 94.6%, exceeding existing
state-of-the-art algorithms [5].
Video-based Violence Detection: To Deep Learning and Transfer Learning
identify violent events, video-based for Violence Detection: Another recent
violence detection systems use computer work presented a violence detection system
vision techniques to analyze the visual that uses a CNN-based architecture with
aspects of video frames. Motion, colour, transfer learning and achieves an accuracy
texture, and shape are examples of these rate of 95.26% [6].
characteristics. Convolutional Neural Multi-Modal Deep Learning for Violence
Networks (CNNs) and Recurrent Neural Detection: A recent study proposes a multi-
Networks (RNNs) are two deep learning- modal deep learning strategy for violence
based techniques that have been proposed detection that incorporates visual and audio
for video-based violence detection. One information from video material. The study
recent work presented a two-stage extracted spatiotemporal and auditory
technique for violence detection in which a features using a combination of CNN and
CNN first extracts spatiotemporal LSTM networks, with an accuracy rate of
properties from video frames, which are 92.3%. The suggested approach
subsequently input into an RNN for outperformed utilizing solely visual or
classification. On a benchmark dataset, the audio characteristics, indicating the efficacy
system attained a high accuracy rate of of multi-modal deep learning for violence
95%, confirming the usefulness of the detection [7].
proposed approach [3].
Non-Violence Detection: Non-violence
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
METHODOLOGY
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
www.matjournals.com https://doi.org/10.46610/JOIPAI.2023.v09i03.002
invention. https://doi.org/10.1109/ICOMET.2019.867
REFERENCES 3496
5. N Honarjoo, A Abdari and A Mansouri
1. Google-deepmind/Kinetics-i3d, “I3D (2021). Violence detection using pre-
models trained on Kinetics”, [Online] trained models. 2021 5th International
Available at: https://github.com/google- Conference on Pattern Recognition and
deepmind/kinetics-i3d Image Analysis (IPRIA). IEEE, Available
2. I Kennedy Ihianle, A O. Nwajana, S Henry at:
Ebenuwa, et al (2020). A deep learning https://doi.org/10.1109/IPRIA53572.2021.9
approach for human activities recognition 483558
from multimodal sensing devices, IEEE 6. P Sernani, N Falcionelli, S Tomassini, et al
Access, 8, 179028-179038, Available at: (2021). Deep learning for automatic
https://doi.org/10.1109/ACCESS.2020.3027 violence detection: Tests on the AIRTLab
979 dataset, IEEE Access, 9, 160580-160595,
3. İ Üstek, J Desai, I López Torrecillas, et al Available at:
(2023). Two-stage violence detection using https://ieeexplore.ieee.org/stamp/stamp.jsp?
ViTPose and classification models at smart arnumber=9627980
airports, arXiv, Available at: 7. B Peixoto, B Lavi, P Bestagini, et al (2020).
https://doi.org/10.48550/arXiv.2308.16325 Multimodal violence detection in videos.
4. G Mehdi, N Ali, S Hussain, et al (2019). ICASSP 2020 - 2020 IEEE International
Design and fabrication of automatic single Conference on Acoustics, Speech and
axis solar tracker for solar panel. 2019 2nd Signal Processing (ICASSP). IEEE,
International Conference on Computing, Available at:
Mathematics and Engineering Technologies https://doi.org/10.1109/ICASSP40776.2020
(iCoMET). IEEE, Available at: .9054018
Adupa Nithin Sai and Kowdodi Siva Prasad, Machine Learning Software for the Detection of
Violence from CCTV Live Footage, Journal of Image Processing and Artificial Intelligence,
9(3), 12-18.