Video Anomaly Detection For Smart Surveillance: Related Concepts
Definition
Anomalies in videos are broadly defined as events or activities that are unusual and signify irregular behavior. The goal of anomaly detection is to temporally or spatially localize anomaly events in video sequences. Temporal localization (i.e. indicating the start and end frames of the anomaly event in a video) is referred to as frame-level detection. Spatial localization, which is more challenging, means identifying the pixels within each anomaly frame that correspond to the anomaly event. This setting is usually referred to as pixel-level detection.
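As a concrete illustration of frame-level detection, the following minimal Python sketch converts per-frame anomaly scores into temporally localized segments. The function name and threshold are our own illustrative choices, not taken from any cited work.

```python
import numpy as np

def scores_to_segments(scores, threshold=0.5):
    """Convert per-frame anomaly scores into (start, end) frame segments.

    `scores` holds one anomaly score per frame; any contiguous run of
    frames whose score exceeds `threshold` is reported as one temporal
    anomaly segment.
    """
    flags = np.asarray(scores) > threshold
    segments, start = [], None
    for i, flagged in enumerate(flags):
        if flagged and start is None:
            start = i                          # a segment opens here
        elif not flagged and start is not None:
            segments.append((start, i - 1))    # the segment closes
            start = None
    if start is not None:                      # segment runs to the last frame
        segments.append((start, len(flags) - 1))
    return segments

# e.g. scores_to_segments([0.1, 0.2, 0.9, 0.8, 0.3, 0.7]) -> [(2, 3), (5, 5)]
```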
Background
In modern intelligent video surveillance systems, automatic anomaly detection through computer vision analytics plays a pivotal role: it significantly increases monitoring efficiency and reduces the burden of live monitoring. Video anomaly detection has been studied for a long time, yet the problem is far from solved (as witnessed by the low accuracy on the UCF-Crime [22] dataset) due to the difficulty of modeling anomaly events and the scarcity of anomaly data. Identifying anomaly events requires understanding complex visual patterns, and some patterns, e.g. arson, burglary, and shoplifting, can only be detected when long-term temporal relationships and causal reasoning are learned by the model.
Early works mainly follow the setting of general anomaly detection, which may be better described as novelty detection, where all novel events are considered anomalous [13]. This problem is typically formulated as unsupervised learning, where models are trained with only normal video frames and validated with both normal and anomaly frames. A popular idea is to find a set of bases to represent normal frames and, at inference time, identify frames with high reconstruction loss or error as anomalous, e.g. via sparse coding [5,15] or autoencoders [9]. However, due to limitations of data and computation, these approaches [13,12,5,15,23] were conducted on small-scale datasets with relatively simple scenarios, which is not satisfactory for real-world surveillance applications.
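The reconstruction-based idea can be sketched in a few lines: learn a linear basis from features of normal frames (PCA is used here as a simple stand-in for the sparse dictionaries of [5,15]) and flag test frames whose reconstruction error is high. All names below are illustrative.

```python
import numpy as np
from sklearn.decomposition import PCA

def fit_normal_basis(normal_feats, n_components=32):
    """Learn a low-dimensional basis from features of normal frames only."""
    pca = PCA(n_components=n_components)
    pca.fit(normal_feats)          # normal_feats: (num_frames, feat_dim)
    return pca

def anomaly_scores(pca, test_feats):
    """Per-frame reconstruction error; a high error suggests an anomaly."""
    recon = pca.inverse_transform(pca.transform(test_feats))
    return np.linalg.norm(test_feats - recon, axis=1)

# Frames whose score exceeds a threshold chosen on validation data
# would be reported as anomalous.
```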
While it is theoretically pleasing to consider all novel events as anomalies, this setting has drawbacks for practical surveillance applications. Taking the campus scenario [13,14] as an example, riding a bike is novel (i.e. considered an anomaly) since the model only sees people walking [13]; however, it should not generally be considered an anomaly for security purposes. Since some anomalous activities in real-world applications have clear definitions, e.g. different criminal events that follow specific patterns, recent works [22,28] have started to leverage supervision for real-world anomaly detection. UCF-Crime [22] is currently the largest anomaly detection dataset with realistic anomalies, containing thousands of anomaly and normal videos. Its training set contains both anomaly and normal videos with video-level annotation as weak supervision, and frame-level annotation is provided for the validation set. Detection performance has been significantly improved with weakly supervised methods [22,28].
There is also a line of research that focuses on specific anomaly detection tasks where only one type of anomaly is considered, e.g. traffic accidents on highways. Since the camera poses, foreground patterns, and backgrounds are highly similar and stable, geometric prior knowledge and physics principles can be employed in manually designed detection pipelines. Several representative works [21,3] rely on object detection to identify anomaly events.
Representative Approaches
Based on the experimental setting of the training data, video anomaly detection methods can be broadly classified into three categories: unsupervised, weakly supervised, and supervised. We provide a brief overview of recent approaches for each category.
Unsupervised Methods
Since real-world anomaly events happen with low probability, it is hard to capture all types of anomalies. Normal videos, however, are easy to obtain from social media and public surveillance; unsupervised methods are thus motivated to detect anomaly events with only normal videos in the training set. Although unsupervised methods cannot yet achieve satisfactory performance in complex real-world scenarios, they are believed to generalize better to unseen anomaly patterns.
Classic Machine Learning: Early unsupervised methods mainly adopt classic machine learning techniques with hand-crafted features as well as probabilistic models. Kim et al. [12] propose to first extract optical flow features and find typical patterns with a mixture of probabilistic PCA (Principal Component Analysis) models; a space-time MRF (Markov Random Field) is then constructed to model the relationship between spatio-temporal local regions of a video for Bayesian inference. Inspired by studies of crowd behavior such as the social force model, Mehran et al. [18] estimate the interaction forces in a crowd to better model normal crowd behavior; normal and anomaly frames are then classified with BoW (Bag of Words) and LDA (Latent Dirichlet Allocation). Li et al. [13] introduce a mixture of DT (Dynamic Textures) model for temporal normalcy, and discriminant saliency detection is utilized to measure spatial normalcy. Ullah et al. [23] first extract corner features and refine them with interaction flow; a random forest is then trained to classify normal and anomaly frames. Cong et al. [5] introduce sparse coding for anomaly detection, and Lu et al. [15] further propose an efficient sparse combination learning framework that achieves a speed of 150 frames per second (fps).
Deep Learning: Thanks to deep learning techniques, recent works are able to take advantage of large-scale datasets and powerful computational resources. Following the unsupervised anomaly detection setting, a number of works [9,16,17,7] have been proposed based on deep AEs (autoencoders). Hasan et al. [9] propose to learn both motion features and discriminative regular patterns with an FCN (Fully Convolutional Network) based AE; the regularity score is computed from the reconstruction error of the AE model. To better model the temporal relationships within a video, [16] combines an FCN and LSTM (Long Short-Term Memory) into a ConvLSTM-AE, which further improves the performance of the AE framework. [17] explores the combination of sparse coding and RNNs (Recurrent Neural Networks): a temporally-coherent sparse coding framework is proposed to introduce the temporal information of video into the sparse coding formulation. [7] proposes a memory-augmented AE that memorizes prototypical normal patterns for anomaly detection; an attention-based sparse addressing scheme is designed to access the memory and reconstruct future frames. For all of the aforementioned AE-based methods, anomaly events are determined based on the reconstruction error. In contrast, [11] formulates the problem as multi-class classification by applying k-means clustering and one-versus-all SVMs (Support Vector Machines).
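A minimal PyTorch sketch of the AE-based recipe shared by these works follows, assuming a deliberately tiny convolutional autoencoder over stacked frames (this is not the exact architecture of any cited paper). The regularity score follows the inverted, min-max-normalized reconstruction error of [9].

```python
import torch
import torch.nn as nn

class TinyConvAE(nn.Module):
    """A deliberately small convolutional autoencoder over stacked frames."""
    def __init__(self, in_frames=10):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(in_frames, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, in_frames, 4, stride=2, padding=1),
        )

    def forward(self, x):
        # x: (batch, in_frames, H, W) with H, W divisible by 4
        return self.decoder(self.encoder(x))

def regularity_scores(model, clips):
    """Per-clip regularity: 1 - normalized reconstruction error, as in [9]."""
    model.eval()
    with torch.no_grad():
        errs = torch.stack([((model(c) - c) ** 2).mean() for c in clips])
    errs = (errs - errs.min()) / (errs.max() - errs.min() + 1e-8)
    return 1.0 - errs                      # a low score suggests an anomaly
```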
Weakly Supervised Methods
With the increasing amount of video data on social media platforms such as YouTube (https://www.youtube.com/), it is possible to access and annotate a large number of anomaly videos [22]. For certain application scenarios where the anomalous activities are well defined, performance can be significantly improved by introducing supervision information. Recent works [22,28] follow the weakly supervised setting where only video-level annotation is available for training: the training videos are labeled as normal or anomalous, but the temporal location of the anomaly event within each anomalous video is unknown (i.e. weak supervision).
Sultani et al. [22] and He et al. [10] formulate the weakly supervised problem as MIL (Multiple Instance Learning): every frame of a normal video should be normal, while an anomalous video contains at least one anomaly frame. [10] proposes a graph-based MIL framework with anchor dictionary learning; all experiments are conducted on the UCSD [13] dataset under a weakly supervised setting. [22] proposes a deep learning based method along with a large-scale dataset of realistic crime-related anomalies and surveillance videos, namely UCF-Crime [22]. A C3D framework is used to extract spatio-temporal features and generate anomaly scores. To distinguish normal and anomaly frames under this weak supervision, the loss function forces the highest score in an anomalous video to be higher than the highest score in a normal video. With the parameters of the C3D model frozen, [22] outperforms previous works by a large margin on the UCF-Crime dataset.
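The MIL ranking objective of [22] can be sketched as below: the hinge term implements the ranking constraint above, and the smoothness and sparsity terms are the paper's auxiliary regularizers in simplified form. The λ weights shown are illustrative defaults, not guaranteed to match the published configuration.

```python
import torch

def mil_ranking_loss(scores_anom, scores_norm,
                     lambda_smooth=8e-5, lambda_sparse=8e-5):
    """Simplified MIL ranking loss in the spirit of Sultani et al. [22].

    scores_anom / scores_norm: 1-D tensors of per-segment anomaly scores
    for one anomalous and one normal video (a positive and a negative bag).
    """
    # Hinge term: the highest-scoring segment of the anomalous video
    # should outrank the highest-scoring segment of the normal video.
    hinge = torch.clamp(1.0 - scores_anom.max() + scores_norm.max(), min=0.0)
    # Temporal smoothness: adjacent segment scores should vary gradually.
    smooth = ((scores_anom[1:] - scores_anom[:-1]) ** 2).sum()
    # Sparsity: anomalies should occupy only a few segments.
    sparse = scores_anom.sum()
    return hinge + lambda_smooth * smooth + lambda_sparse * sparse
```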
Instead of improving the MIL technique, Zhong et al. [28] treat weakly supervised learning as a noisy-label learning problem, where the labels of some frames in anomaly videos are wrong. They train a GCN (Graph Convolutional Network) based cleaner to refine the noisy labels so that the classification network can be trained end-to-end with frame-level labels.
Supervised Methods
For certain scenarios where the backgrounds and objects are well defined, e.g. roads and cars for highway traffic accident detection, recent works [3,24] typically rely on frame-level annotated training videos (i.e. the temporal annotations of the anomalies in the training videos are available, the supervised setting). A popular solution is to leverage geometric prior knowledge and object detection with additional supervision from other public datasets.
[21] first applies Faster R-CNN to detect vehicles; an attention-based LSTM module is then applied to learn an accident score. Recent works [3,24] on the AI City Challenge (https://www.aicitychallenge.org/) are given frame-level accident annotations on the training set. Apart from applying object detection, [3] models the background and space using semantic segmentation, and geometric priors are leveraged via perspective detection; vehicle dynamics are then represented by a spatial-temporal matrix.
Anomaly events are identified based on the IoU (Intersection over Union) of different objects while applying an NMS (Non-Maximum Suppression) procedure. [24] utilizes YOLOv3 (You Only Look Once) as the object detector and specifically improves the framework for small-object scenarios; multi-object tracking is then introduced to generate the trajectories of anomalous vehicles, and the accident start time is estimated with a curve-fitting algorithm.
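As a hedged illustration of this kind of IoU-based reasoning (not the exact procedure of [3] or [24]), the sketch below flags a tracked vehicle as a stalled-vehicle candidate when its bounding box keeps a high IoU with its own previous position over many consecutive frames. Both thresholds are illustrative and would be tuned on validation data.

```python
def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-8)

def stalled_candidate(track, iou_thresh=0.9, min_frames=120):
    """Flag a track (list of per-frame boxes) whose box barely moves.

    At 30 fps, min_frames=120 corresponds to a vehicle that has been
    nearly stationary for about four seconds.
    """
    still = 0
    for prev, cur in zip(track, track[1:]):
        still = still + 1 if iou(prev, cur) >= iou_thresh else 0
        if still >= min_frames:
            return True
    return False
```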
Datasets
In this section, we briefly review the popular datasets for video anomaly
detection. An overview of all listed datasets is provided in Table 1.
UCSD: The UCSD dataset contains two subsets, denoted Ped1 and Ped2. They are captured with different camera poses at two spots on the UCSD campus where most pedestrians walk. The training set (34 clips for Ped1 and 16 clips for Ped2) contains only normal frames, and the test set (36 clips for Ped1 and 12 clips for Ped2) consists of both normal and anomaly frames. Frame-level annotation is provided for all test clips, and 10 of them have pixel-level ground truth. The UCSD dataset considers walking pedestrians as the normal pattern, so non-pedestrian entities such as bikers and skaters are defined as anomaly instances. Dataset link: http://www.svcl.ucsd.edu/projects/anomaly/dataset.html
Subway: The Subway [2] dataset contains two subsets, i.e. Subway Entrance and Subway Exit, each consisting of a single long surveillance video recorded in a subway station. They were originally proposed for real-time detection of unusual events in crowded subway scenes, e.g. moving in the wrong direction or no payment. Dataset link: http://vision.eecs.yorku.ca/research/anomalous-behaviour-data/
Avenue: The Avenue [15] dataset contains 15 videos, each about 2 minutes long, for a total of 35,240 frames; 8,478 frames from 4 videos are used as the training set. Typical unusual events include running and throwing objects. Dataset link: http://www.cse.cuhk.edu.hk/leojia/projects/detectabnormal/dataset.html
UMN: The UMN [1] (University of Minnesota) dataset consists of five videos
captured from different angles. The normal pattern is defined as walking and the
main anomaly activity is running. Dataset link: http://mha.cs.umn.edu/
DAD: DAD [4] (Dashcam Accident Dataset) is proposed specifically for accident detection. The normal pattern is vehicles moving in traffic, and anomaly events include different traffic accidents, e.g. car hits car or motorbike hits motorbike. The DAD dataset consists of 678 videos from six cities; 58 videos are used for training. From the remaining 620 videos, 620 clips with accidents are sampled as positive clips and 1,130 normal clips are sampled as negative clips. These are then randomly split into two subsets: 455 positive and 829 negative clips for training, and 165 positive and 301 negative clips for testing. Dataset link: https://aliensunmin.github.io/project/dashcam/
CADP: CADP [21] (Car Accident Detection and Prediction) focuses on car accidents captured by CCTV (Closed-Circuit Television) cameras. All 1,416 videos of CADP contain traffic accidents, and 205 of them have both temporal and spatial annotations. CADP contains videos captured with various camera types and qualities and under various weather conditions, and the anomaly events are realistic for real-world applications. Dataset link: https://ankitshah009.github.io/accident_forecasting_traffic_camera
A3D: A3D [26] consists of 1,500 on-road abnormal-event video clips from dashboard cameras. Each video contains an abnormal traffic event whose start and end times are annotated by human annotators. The dataset totals 128,175 frames (individual clips range from 23 to 208 frames) at 10 frames per second, and the events are clustered into 18 types of traffic accidents. Dataset link: https://github.com/MoonBlvd/tad-IROS2019
DADA: DADA [6] is a traffic accident dataset collected for driver attention prediction in accident scenarios. It contains 658,476 available frames in 2,000 videos at a resolution of 1584×660. The videos are divided into 54 categories, such as "hitting" and "out of control", based on the participants of the accidents (e.g. pedestrians, vehicles, cyclists). The spatial crash objects and the temporal windows of the accidents are annotated. Dataset link: https://github.com/JWFangit/LOTVS-DADA
DoTA: DoTA [25] (Detection of Traffic Anomaly) is a recent traffic anomaly detection dataset containing 4,677 videos with temporal, spatial, and categorical annotations. Its objective is to introduce a when-where-what pipeline that detects, localizes, and recognizes anomalous events in egocentric videos. The video clips are collected from YouTube channels and cover diverse dashcam accident videos from different countries under different weather and lighting conditions. Dataset link: https://github.com/MoonBlvd/Detection-of-Traffic-Anomaly
Iowa DOT Traffic: The Iowa DOT (Department of Transportation) Traffic dataset [19] consists of 200 videos, each approximately 15 minutes long, recorded at 30 fps and 800×410 resolution. The training and testing sets each contain 100 videos. As the official dataset for Track 3 of the 2018 AI City Challenge [19], it does not provide annotations for the testing set. The main anomaly patterns are car crashes and stalled vehicles. Dataset link: https://www.aicitychallenge.org/2018-ai-city-challenge/
ShanghaiTech: The ShanghaiTech [14] dataset is collected at ShanghaiTech University and covers 13 scenes with complex lighting conditions and camera viewpoints. It consists of 437 videos with an average of 726 frames each. The training set consists of 330 normal videos, and the testing set contains 107 videos with 130 anomalies. Anomaly events include unusual patterns on campus such as bikers or cars. Dataset link: https://svip-lab.github.io/dataset/campus_dataset.html
UCF Crime: UCF Crime [22] consists of 1,900 untrimmed videos covering 13 real-world anomaly events: Abuse, Arrest, Arson, Assault, Road Accident, Burglary, Explosion, Fighting, Robbery, Shooting, Stealing, Shoplifting, and Vandalism. 950 of them are normal videos, and each of the remaining videos contains at least one anomaly event. The training set contains 800 normal and 810 anomalous videos; the remaining 150 normal and 140 anomalous videos are temporally annotated for validation. Both training and testing sets cover all 13 anomaly events, and some videos contain multiple anomaly categories, e.g. robbery along with fighting, burglary with vandalism, or arrest with shooting. All videos are realistic for real-world surveillance applications. Furthermore, UCF Crime covers different lighting conditions, image resolutions, and camera poses in complex scenarios, and is thus very challenging. Dataset link: https://www.crcv.ucf.edu/projects/real-world/
Street Scene: The Street Scene [20] dataset focuses on single-scene anomaly detection. It consists of 46 training videos and 35 testing videos taken from a static USB camera looking down on a two-lane street with bike lanes and pedestrian sidewalks. There are a total of 203,257 color video frames (56,847 for training and 146,410 for testing) at 1280×720 resolution, extracted from the original videos at 15 frames per second. The dataset presents 17 types of anomalous events/activities, such as jaywalking, loitering, and illegally parked cars. Dataset link: https://www.merl.com/demos/video-anomaly-detection
Benchmarks
In this section, we introduce popular evaluation metrics and summarize existing results on six popular benchmark datasets: UCSD Ped2 [13], Avenue [15], UMN [1], ShanghaiTech [14], UCF Crime [22], and Iowa DOT Traffic [19].
The frame-level evaluation criterion uses frame-level ground-truth annotations to determine which detected frames are true positives (i.e. true anomaly frames) and which are false positives, yielding frame-level true positive and false positive rates. Pixel-level evaluation additionally requires the algorithm to take into account the spatial locations of anomalous objects within frames: a detection is considered correct if it covers at least 40% of the anomaly pixels in the ground truth [13]. Pixel-level evaluation can be conducted only if pixel-level annotations are available for the testing videos.
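A minimal sketch of this 40% coverage rule, assuming binary NumPy masks for the prediction and the ground truth (the function name is our own):

```python
import numpy as np

def pixel_level_true_positive(pred_mask, gt_mask, min_coverage=0.4):
    """A detected frame counts as a pixel-level true positive if the
    predicted mask covers at least `min_coverage` of the ground-truth
    anomaly pixels, following the 40% rule of [13]."""
    gt_pixels = gt_mask.sum()
    if gt_pixels == 0:          # no annotated anomaly in this frame
        return False
    covered = np.logical_and(pred_mask, gt_mask).sum()
    return covered / gt_pixels >= min_coverage
```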
As shown in Tables 2 and 3, the frame-level AUC (Area Under the Curve) of the ROC (Receiver Operating Characteristic) curve is widely used as the evaluation metric for temporal localization of anomaly events. Since anomaly detection can be considered a binary classification of each frame, the ROC curve is generated by applying different thresholds to the per-frame anomaly scores and calculating the TPR (True Positive Rate) and FPR (False Positive Rate).
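Given per-frame binary labels and anomaly scores, the frame-level AUC can be computed directly, e.g. with scikit-learn, which sweeps the thresholds internally; the toy values below are illustrative:

```python
from sklearn.metrics import roc_auc_score

# labels: 1 for ground-truth anomaly frames, 0 for normal frames;
# scores: the model's per-frame anomaly scores (higher = more anomalous).
labels = [0, 0, 1, 1, 1, 0, 0]
scores = [0.1, 0.3, 0.8, 0.7, 0.9, 0.4, 0.2]
print(roc_auc_score(labels, scores))  # frame-level AUC of the ROC curve
```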
Table 3. AUC (%) of existing works on UCSD, ShanghaiTech, and UCF Crime under the weakly supervised setting.
For traffic accident detection under the supervised setting, the F1 score, the RMSE (Root Mean Square Error) of the anomaly start time, and the S3 score are used as evaluation metrics. The F1 score is defined as

$$F_1 = \frac{2TP}{2TP + FP + FN}, \qquad (1)$$

where TP, FP, and FN denote the numbers of true positives, false positives, and false negatives, respectively. The S3 score combines detection accuracy with timing accuracy as $S_3 = F_1 \times (1 - \mathrm{NRMSE})$, where NRMSE denotes the RMSE of the predicted anomaly start times normalized into $[0, 1]$.
Future Directions
– Although different learning frameworks have been adopted, the learned representations are still not satisfactory for distinguishing complex anomalous activities. Promising directions toward better representations include stronger 3D feature extractors, attention mechanisms, and causal reasoning (identifying the cause of an anomaly event, e.g. speeding that leads to an accident).
– Early works mainly focus on the unsupervised setting, while recent works have shown the potential of improving performance by leveraging supervision information for certain scenarios. It would be promising to explore better settings for practical applications, e.g. a better trade-off between generalization ability (unsupervised setting) and performance (weakly supervised setting).
– Anomaly detection systems may be acceptable when operating in public spaces where there is no expectation of privacy. However, what if the technology needs to be applied to non-public spaces where there is a stronger expectation of privacy? It is worth exploring effective ways to de-identify training videos and to train anomaly models with de-identified data.
– Current anomaly detection approaches and systems act as an alerting mechanism. How do we explain the AI decisions and convey them effectively to stakeholders, e.g. law enforcement, attorneys, media, local residents, and the broader community? We expect new techniques to close the gap between high-performing and interpretable AI models.
References
[1] Unusual crowd activity dataset of University of Minnesota:
http://mha.cs.umn.edu/Movies/Crowd-Activity-All.avi.
[2] Amit Adam, Ehud Rivlin, Ilan Shimshoni, and Daviv Reinitz. Robust real-time unusual event detection using multiple fixed-location monitors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(3):555–560, 2008.
[3] Shuai Bai, Zhiqun He, Yu Lei, Wei Wu, Chengkai Zhu, Ming Sun, and
Junjie Yan. Traffic anomaly detection via perspective map based on spatial-
temporal information matrix. In Proc. CVPR Workshops, 2019.
[4] Fu-Hsiang Chan, Yu-Ting Chen, Yu Xiang, and Min Sun. Anticipating
accidents in dashcam videos. In Asian Conference on Computer Vision,
pages 136–153. Springer, 2016.
[5] Yang Cong, Junsong Yuan, and Ji Liu. Sparse reconstruction cost for ab-
normal event detection. In CVPR 2011, pages 3449–3456. IEEE, 2011.
[6] Jianwu Fang, Dingxin Yan, Jiahuan Qiao, and Jianru Xue. DADA: A large-scale benchmark and model for driver attention prediction in accidental scenarios. arXiv preprint arXiv:1912.12148, 2019.
[7] Dong Gong, Lingqiao Liu, Vuong Le, Budhaditya Saha, Moussa Reda Man-
sour, Svetha Venkatesh, and Anton van den Hengel. Memorizing normality
to detect anomaly: Memory-augmented deep autoencoder for unsupervised
anomaly detection. In Proceedings of the IEEE International Conference
on Computer Vision, pages 1705–1714, 2019.
[8] Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Advances in Neural Information Processing Systems, pages 2672–2680, 2014.
[9] Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K Roy-Chowdhury, and Larry S Davis. Learning temporal regularity in video sequences. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 733–742, 2016.
[10] Chengkun He, Jie Shao, and Jiayu Sun. An anomaly-introduced learning
method for abnormal event detection. Multimedia Tools and Applications,
77(22):29573–29588, 2018.
[11] Radu Tudor Ionescu, Fahad Shahbaz Khan, Mariana-Iuliana Georgescu,
and Ling Shao. Object-centric auto-encoders and dummy anomalies for
abnormal event detection in video. In Proceedings of the IEEE Conference
on Computer Vision and Pattern Recognition, pages 7842–7851, 2019.
[12] Jaechul Kim and Kristen Grauman. Observe locally, infer globally: A space-time MRF for detecting abnormal activities with incremental updates. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 2921–2928. IEEE, 2009.
[13] Weixin Li, Vijay Mahadevan, and Nuno Vasconcelos. Anomaly detection and localization in crowded scenes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(1):18–32, 2013.
[14] W. Liu, W. Luo, D. Lian, and S. Gao. Future frame prediction for anomaly detection – a new baseline. In 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
[15] Cewu Lu, Jianping Shi, and Jiaya Jia. Abnormal event detection at 150 fps in MATLAB. In Proceedings of the IEEE International Conference on Computer Vision, pages 2720–2727, 2013.
[16] Weixin Luo, Wen Liu, and Shenghua Gao. Remembering history with convolutional LSTM for anomaly detection. In 2017 IEEE International Conference on Multimedia and Expo (ICME), pages 439–444. IEEE, 2017.
[17] Weixin Luo, Wen Liu, and Shenghua Gao. A revisit of sparse coding based anomaly detection in stacked RNN framework. In Proceedings of the IEEE International Conference on Computer Vision, pages 341–349, 2017.
[18] Ramin Mehran, Alexis Oyama, and Mubarak Shah. Abnormal crowd be-
havior detection using social force model. In 2009 IEEE Conference on
Computer Vision and Pattern Recognition, pages 935–942. IEEE, 2009.
[19] Milind Naphade, Zheng Tang, Ming-Ching Chang, David C Anastasiu, Anuj Sharma, Rama Chellappa, Shuo Wang, Pranamesh Chakraborty, Tingting Huang, Jenq-Neng Hwang, et al. The 2019 AI City Challenge. In CVPR Workshops, 2019.
[20] Bharathkumar Ramachandra and Michael Jones. Street scene: A new
dataset and evaluation protocol for video anomaly detection. In The IEEE
Winter Conference on Applications of Computer Vision, pages 2569–2578,
2020.
[21] Ankit Shah, Jean Baptiste Lamare, Tuan Nguyen Anh, and Alexander Hauptmann. CADP: A novel dataset for CCTV traffic camera based accident analysis. arXiv preprint arXiv:1809.05782, 2018. The first three authors share first authorship.
[22] Waqas Sultani, Chen Chen, and Mubarak Shah. Real-world anomaly de-
tection in surveillance videos. In Proceedings of the IEEE Conference on
Computer Vision and Pattern Recognition, pages 6479–6488, 2018.
[23] Habib Ullah, Mohib Ullah, and Nicola Conci. Dominant motion analysis in
regular and irregular crowd scenes. In International Workshop on Human
Behavior Understanding, pages 62–72. Springer, 2014.
[24] Gaoang Wang, Xinyu Yuan, Aotian Zhang, Hung-Min Hsu, and Jenq-
Neng Hwang. Anomaly candidate identification and starting time esti-
mation of vehicles from traffic videos. In AI City Challenge Workshop,
IEEE/CVF Computer Vision and Pattern Recognition (CVPR) Confer-
ence, Long Beach, California, 2019.
[25] Yu Yao, Xizi Wang, Mingze Xu, Zelin Pu, Ella Atkins, and David Crandall. When, where, and what? A new dataset for anomaly detection in driving videos. arXiv preprint arXiv:2004.03044, 2020.
[26] Yu Yao, Mingze Xu, Yuchen Wang, David J Crandall, and Ella M Atkins.
Unsupervised traffic accident detection in first-person videos. arXiv preprint
arXiv:1903.00618, 2019.
[27] Muchao Ye, Xiaojiang Peng, Weihao Gan, Wei Wu, and Yu Qiao. AnoPCN: Video anomaly detection via deep predictive coding network. In Proceedings of the 27th ACM International Conference on Multimedia, pages 1805–1813, 2019.
[28] Jia-Xing Zhong, Nannan Li, Weijie Kong, Shan Liu, Thomas H Li, and
Ge Li. Graph convolutional label noise cleaner: Train a plug-and-play action
classifier for anomaly detection. In Proceedings of the IEEE Conference on
Computer Vision and Pattern Recognition, pages 1237–1246, 2019.