2017 Synthetic Data Generation For Deep Learning in Counting Pedestrians

This paper discusses the use of synthetic data generation to address the challenge of limited data in deep learning applications, specifically for counting pedestrians. The authors propose an algorithm to create realistic synthetic images that can be used to train a Deep Convolutional Neural Network (DCNN) capable of accurately counting pedestrians in real scenes. The results demonstrate that the model trained on synthetic data performs well, achieving competitive accuracy on real-world pedestrian counting tasks.

Uploaded by

Stella

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views6 pages

2017 Synthetic Data Generation For Deep Learning in Counting Pedestrians

Uploaded by

Stella

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Synthetic Data Generation for Deep Learning in Counting Pedestrians

Hadi Keivan Ekbatani, Oriol Pujol and Santi Segui

Faculty of Mathematics and Computer Science, University of Barcelona,
Gran Via de les Corts Catalanes, 585 08007 Barcelona, Spain
[email protected], oriol [email protected], [email protected]

Keywords: Synthetic Data Generation, Deep Convolutional Neural Network, Deep Learning, Computer Vision.

Abstract: One of the main limitations of the application of Deep Learning (DL) algorithms is when dealing with prob-
lems with small data. One workaround to this issue is the use of synthetic data generators. In this framework,
we explore the benefits of synthetic data generation as a surrogate for the lack of large data when applying DL
algorithms. In this paper, we propose a problem of learning to count the number of pedestrians using synthetic
images as a substitute for real images. To this end, we introduce an algorithm to create synthetic images for
being fed to a designed Deep Convolutional Neural Network (DCNN) to learn from. The model is capable of
accurately counting the number of individuals in a real scene.

1 INTRODUCTION ods in various areas including computer vision where

they have shown promising performances. As one so-
Counting the number of objects in still images or lution to tackle this issue, we introduce a synthetic
video frames is a new approach towards dealing with data generator algorithm to create images highly-
detecting or learning objects which has been recently representative of the real images.
proffered in the literature (Rabaud and Belongie, In this article, we tackle a crowd counting prob-
2006), (Kong et al., 2005). Previously, in order to lem by the means of synthetic images and deep con-
count the objects of interest in an image or video, var- volutional neural network. We generate a set of
ious object features needed to be designed, extracted highly realistic, synthetically generated images to be
or detected during the learning phase which restrict fed to a proposed convolution-based deep architec-
their usage in large-scale computer vision applica- ture. DCNN are well-suited for learning object fea-
tions thus demanding more efficient solutions to al- tures from the scratch and in a hierarchical approach.
leviate, expedite and improve this process. The proposed architecture consist of convolutional
One of the recent and commonly used methods to network to capture discriminative information about
facilitate feature detection process is the application the object we are willing to count, following by fully
of Deep Convolutional Neural Networks (DCNN) connect layers where we count the multiplicity of ob-
(Krizhevsky et al., 2012), (LeCun and Bengio, 2005), ject of interest. Figure 1 illustrates the proposal at a
(Szegedy et al., 2015). One of the promises of DCNN glance. The input instances contain a random set of
is replacing handcrafted features with efficient algo- pedestrians in a walkway. As it’s shown in below,
rithms for feature learning and hierarchical feature ex- our goal is to learn to count the number of people in
traction (Song and Lee, 2013). DCNNs have been synthetic images and thereby, accurately predict the
claimed and practically proven to achieve the most number of pedestrians in similar but real images.
assuring performance in different vision benchmark Our contributions are as follows: We introduce a
problems concerning feature detection and classifica- synthetic image generation algorithm in order to sub-
tion (Ciregan et al., 2012), (Szegedy et al., 2015). stitute the lacking training data in a fully supervised
Although access to fast computers and vast learning problem casted as learning to count the num-
amounts of data has enabled the advances of deep ber of pedestrians in a walkway. Moreover, we pro-
learning algorithms such as DCNN in solving many pose a DCNN capable of learning pedestrians’ fea-
problems that were not solvable using classic AI, they tures. Then, we validate our approach in a similar but
have limitations. For instance, they do not perform real scenario. We test our proposed model which has
well when there is limited data (Griffin et al., 2007). been trained on synthetic images, on real images to
This constrain restricts the application of DL meth- see if synthetic data generation can be incorporated

318
Ekbatani, H., Pujol, O. and Segui, S.
Synthetic Data Generation for Deep Learning in Counting Pedestrians.
DOI: 10.5220/0006119203180323
In Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2017), pages 318-323
ISBN: 978-989-758-222-6
Copyright c 2017 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved
Synthetic Data Generation for Deep Learning in Counting Pedestrians

Figure 1: A schematic of our proposal. In this paper, we show that by creating realistic synthetic images, we are able to train
a DCNN that is able to count the number of pedestrians in similar but real images.

as a surrogate for replacing small training sets when Segui et al. in (Seguı́ et al., 2015) proposed synthetic
applying deep architectures. data generation to counter lack of data issue for learn-
ing to count the number of objects in images using
deep convolutional neural networks. In their work,
2 BACKGROUND AND RELATED they took advantage of existent unlabeled and labeled
datasets to generate synthetic images representative of
WORKS the actual images. The authors introduce two count-
ing problems, counting number of even-digits in im-
2.1 Synthetic Data Generation ages, and counting the amount of pedestrians in a
walkway.
The main purpose of generating synthetic datasets has
been to protect the privacy and confidentiality of the 2.2 Crowd Counting
actual data (Phua et al., 2010), (Yao et al., 2013),
since it does not hold any personal information and Learning to count the objects of interest in an im-
cannot be traced back by any individual. Problems age can be approached from two different perspec-
such as fraud detection (Phua et al., 2010), or health tives: either training an object detector, or training an
care (Yao et al., 2013), are normally tackled by the object counter. In the field of object detection, nu-
use of synthetic data. However, most of the previ- merous works have been previously proposed (Kong
ously mentioned approaches towards synthetic data et al., 2005), (Marana et al., 1998). Furthermore,
generation would not be applicable when it comes Wu and Nevatia in (Wu and Nevatia, 2005) proposed
to synthetic image generation. This is due to the edgelet features (an edgelet is a short segment of line
fact that standard methods such as Probability Den- or curve) as a new type of silhouette-oriented features
sity Function (PDF) or Interpolation operate element- to deal with the problem of detecting individuals in
wise. The need for generating and synthesizing im- crowded still images.
ages using object-wise operations led researchers to As a similar line of work in the course of object
the use image processing tools for creating synthetic counting and more specifically crowd counting, in
images to tackle vision problems. (Leibe et al., 2007) and (Rabaud and Belongie, 2006),
In computer vision, usage of synthetic images has different object tracking approaches were taken to de-
a longstanding history, as in 2000, Cappelli et al. tect and count moving objects in the scene. However,
in (Cappelli et al., 2000) presented an approach to most of object tracking approaches met with skepti-
synthetic fingerprint generation on the basis of some cism by society, given the perception of infringing in-
mathematical models that describe the main features dividuals’ privacy rights.
of real finger prints. More recently, after the success More recently, in (Chan et al., 2008), Chan et al.
of deep convolutional neural networks in various vi- presented a novel approach with no explicit object
sion tasks concerning object detection or classifica- segmentation or tracking to estimate the number of
tion, generation and use of synthetic datasets has been people moving in each direction (towards and away
frequently considered. For example, in (Eggert et al., from camera) in a privacy-preserving manner.
2015), synthetic images are generated to be fed to a On the other hand, in case of feature learning,
DCNN in order to learn how to detect company logo Segui et al. in (Seguı́ et al., 2015) proposed a novel
in the absence of a large training set. approach for counting objects representations using
Moreover, as one of the most recent approaches, deep object features. In their work, objects’ features

319
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

Figure 2: An illustration of image generation process at different steps.

are learned by a counting DCNN and are used to un- make the backgrounds of images as realistic as
derstand the underlying representation. Contrary to possible by:
the previous approaches, their proposal is the first one • making a sparse combination of median back-
where counting problem is handled by learning deep grounds.
features. Additionally, no hints on the object of inter-
• changing the global illumination of the images
est was given besides its’ occurrence multiplicity.
randomly.
• adding some random Gaussian noise to the
backgrounds.
3 SYNTHETIC IMAGE 4. Region Of Interest (ROI). Then, for training and
GENERATION comparison purposes, images are masked with a
filter of Region Of Interest (ROI).
The main hypothesis of this work is that synthetic data
5. Creating Synthetic Images. Afterwards, pedes-
generation algorithms can be used as a workaround
trians are added to the masked background in a
for problems with no or little training sets. On this
way that the center of each person is placed in-
course, we propose an algorithm for creating highly
side white area of the mask. Finally images are
realistic synthetic images of pedestrians in a walkway.
normalized (between 0 and 255) and resized to
We used UCSD unlabeled Anomaly detection dataset
158 × 158 in order to be fed to convolution lay-
of pedestrians collected by Chan et al. and used in
ers.
(Mahadevan et al., 2010) and (Chan et al., 2009).
UCSD Anomaly detection dataset contains clips of
groups of people walking towards and away from the
3.2 Image Improvement
camera, and consists of 34 training video samples and
Although we managed to successfully create syn-
36 testing video samples. Each video has 200 frames
thetic images of people in the street, the generated
of each 238 × 158 pixels.
images were still quite distinguishable from the real
dataset. Thus, in order to make images as highly real-
3.1 Image Generation istic as possible, we improved the dataset as explained
underneath. Figure 3 depicts this procedure.
In our dataset, we employed all 70 training and test-
ing video samples to generate the synthetic pedestrian 1. Remove Non-pedestrians. Amongst the ex-
dataset. We constrained each image by having up to tracted pedestrians, there were some non-
29 pedestrians in the walkway. The process of gener- pedestrians with objects instead of pedestrians,
ating the data includes the following steps while fig- and yet others with more than one person. There-
ure 2 illustrates this process. fore, we manually removed these outliers. After
this edition, we ended with 426 samples of peo-
1. Background Extraction. Firstly, we simply sub- ple.
tract the background from each video frame and
2. Lack of Pedestrians. For the sake of general-
from there, we extract the median backgrounds of
ization, we needed a decent variety of pedestrians
each video (in total, 70 different backgrounds).
in the images to train with. For this purpose, we
2. Pedestrian Extraction. Subtracting each image created 2 versions of current pedestrians list, each
from the mean background, we are able to label darkened by the factor of 20% from each other.
the connected regions (each individual in case of 3. Halos Around the Pedestrians. Due to lack of
our images) using morphological labeling meth- accuracy of the region measuring method, a fine
ods. layer of the background that pedestrians were ex-
3. Background Generation. In this step, we try to tracted from, still remained around the pedestri-

320
Synthetic Data Generation for Deep Learning in Counting Pedestrians

Figure 3: An illustration of each step of image improvement process.

ans. In the created images, depending on where has been set to 400,000 iterations. The output layer is
the person was placed, these thin layers appeared configured as a classification problem.
like a halo around the person. We used morpho- On the validation set, the performance of the
logical erosion on pedestrians’ masks and also model is 0.70 mean absolute error and 0.94 mean
Poisson image editing to remove the halos. squared error. This results improve the achieved re-
4. Image Perspective. Finally, Since pedestrians sults in a similar experiment done by (Seguı́ et al.,
of different sizes were put randomly in the im- 2015) (the comparison is shown in table 2). On the
ages, we considered peoples tallness perspective other hand, on the real test set, we obtained 1.38 mean
in the images. Humans height almost follows a absolute error and 3.61 mean squared error which
Gaussian distribution (Subramanian et al., 2011). closely follow the results in (Chan et al., 2008) which
Therefore, with respect to (Subramanian et al., was obtained by hand-crafting highly specialized im-
2011), we mapped individuals heights with the age features that are dependent on the object class.
length of the walkway in the image, considering This comparison is depicted in table 3 The confusion
a Gaussian noise with mean µ = 0 and σ = 3.5. matrix regarding the model performance is illustrated
in figure 4. As you may notice, due to the inevitable
differences between real and synthetic samples, the
model mostly over-predicts. Moreover, as the number
4 EXPERIMENTS AND RESULTS of pedestrians increases in the images, the prediction
accuracy of the model decreases.
For learning to count the number of pedestrians in a
walkway, we synthetically generated a set of 1 million
Table 2: Performance comparison on the synthetic data be-
images of size 158 × 158 with up to 29 pedestrians in tween our proposal and related work in (Seguı́ et al., 2015).
each image. Maximum overlapping was considered
in the creation of the images. We divided this dataset Experiments MSE MAE
into a training set of 800k images and 200k images Our approach (29 peds) 0.942 0.707
for validation set. To test our model, we used UCSD (Seguı́ et al., 2015) (25 peds) 1.12 0.74
crowd counting dataset with 3375 manually labeled
images of pedestrians. The selected UCSD images Table 3: Performance comparison on the real data between
our proposal and related work in (Chan et al., 2008).
contain from 11 to 29 pedestrians in each image.
We designed a seven layers architecture DCNN Experiments MSE MAE
with four convolutional layers and three fully con- Proposed method 3.61 1.38
nected layers. The architecture is shown in Table 1. (Chan et al., 2008) approach 2.73 1.24

Table 1: Proposed DCNN for counting pedestrians. As you may observe in table 2, in case of synthetic
images, although our images contain more pedestri-
Convolutions Fully connects
ans, our results beat the previous approach in (Seguı́
10 × 15 × 15 & x2 pooling 128
et al., 2015). This proves the improvement we made
10 × 11 × 11 & x2 pooling 64
in synthetic data generation process and the designed
20 × 9 × 9 1
deep architecture.
20 × 5 × 5
Respectively, in case of real images, although we
The algorithm is trained using the Caffe pack- could not improve the work done in (Chan et al.,
age[11] on a GPU NVIDIA Tesla K40. The network 2008), our results follows their results closely. We

321
ICPRAM 2017 - 6th International Conference on Pattern Recognition Applications and Methods

REFERENCES
Cappelli, R., Erol, A., Maio, D., and Maltoni, D. (2000).
Synthetic fingerprint-image generation. In Pattern
Recognition, 2000. Proceedings. 15th International
Conference on. IEEE.
Chan, A. B., Liang, Z.-S. J., and Vasconcelos, N. (2008).
Privacy preserving crowd monitoring: Counting peo-
ple without people models or tracking. In Computer
Vision and Pattern Recognition, 2008. CVPR 2008.
IEEE Conference on. IEEE.
Chan, A. B., Morrow, M., and Vasconcelos, N. (2009).
Analysis of crowded scenes using holistic properties.
In Performance Evaluation of Tracking and Surveil-
Figure 4: Confusion matrix regarding the model perfor- lance workshop at CVPR.
mance on the real test set. The starting point of the graph is Ciregan, D., Meier, U., and Schmidhuber, J. (2012). Multi-
11 since the minimum amount of pedestrians in the real test column deep neural networks for image classification.
set is 11. In Computer Vision and Pattern Recognition (CVPR),
2012 IEEE Conference on. IEEE.
should mention that Chan et.al experiment in (Chan
Eggert, C., Winschel, A., and Lienhart, R. (2015). On the
et al., 2008) was done by hand-crafting highly spe- benefit of synthetic data for company logo detection.
cialized features and exhaustive labeling. This results In Proceedings of the 23rd ACM international confer-
approve the suitability of synthetic data as a surrogate ence on Multimedia. ACM.
for the small real data when using DCNN. Griffin, G., Holub, A., and Perona, P. (2007). Caltech-256
object category dataset. California Institute of Tech-
nology.
5 CONCLUSIONS Kong, D., Gray, D., and Tao, H. (2005). Counting pedes-
trians in crowds using viewpoint invariant training. In
BMVC. Citeseer.
In this paper we explore the benefits of synthetic data
Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Im-
generation for the application of deep convolutional agenet classification with deep convolutional neural
neural networks for a crowd counting problem with networks. In Advances in neural information process-
small training set. We propose an algorithm for cre- ing systems.
ating a highly realistic synthetic dataset of pedestri- LeCun, Y. and Bengio, Y. (2005). Convolutional networks
ans in a walkway to train the proposed DCNN with. for images, speech, and time series. In BMVC. Cite-
Moreover, we provide a system trained with synthetic seer.
images capable of predicting the number of pedestri- Leibe, B., Schindler, K., and Van Gool, L. (2007). Coupled
ans in an image to a satisfactory extent. The obtained detection and trajectory estimation for multi-object
results suggest the incorporation of synthetic data as tracking. In 2007 IEEE 11th International Conference
a well-suited surrogate for the missing real along with on Computer Vision. IEEE.
alleviating required exhaustive labeling. Mahadevan, V., Li, W., Bhalodia, V., and Vasconcelos, N.
There are still many open questions to be ad- (2010). Anomaly detection in crowded scenes. In
dressed such as, when and to what extent synthetic CVPR.
images are applicable as a substitute to solve real Marana, A., Costa, L. d. F., Lotufo, R., and Velastin, S.
world problems. which is the best network architec- (1998). On the efficacy of texture analysis for crowd
monitoring. In Computer Graphics, Image Process-
ture for counting the crowd? ing, and Vision, 1998. Proceedings. SIBGRAPI’98. In-
ternational Symposium on. IEEE.
Phua, C., Lee, V., Smith, K., and Gayler, R. (2010). A
ACKNOWLEDGEMENTS comprehensive survey of data mining-based fraud de-
tection research. In arXiv preprint arXiv:1009.6119.
This work has been partially funded by the Spanish Rabaud, V. and Belongie (2006). Counting crowded moving
MINECO Grants TIN2013-43478-P and TIN2012- objects. In 2006 IEEE Computer Society Conference
38187- C03. We gratefully acknowledge the support on Computer Vision and Pattern Recognition. IEEE.
of NVIDIA Corporation with the donation of a Tesla Seguı́, S., Pujol, O., and Vitria, J. (2015). Learning to count
K40 GPU used for this research. with deep object features. In Proceedings of the IEEE
Conference on Computer Vision and Pattern Recogni-
tion Workshops.

322
Synthetic Data Generation for Deep Learning in Counting Pedestrians

Song, H. A. and Lee, S.-Y. (2013). Hierarchical representa-

tion using nmf. In International Conference on Neural
Information Processing. Springer.
Subramanian, S., Özaltin, E., and Finlay, J. E. (2011).
Height of nations: a socioeconomic analysis of cohort
differences and patterns among women in 54 low-to
middle-income countries. In PLoS One. Public Li-
brary of Science.
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.,
Anguelov, D., Erhan, D., Vanhoucke, V., and Rabi-
novich, A. (2015). Going deeper with convolutions.
In Proceedings of the IEEE Conference on Computer
Vision and Pattern Recognition.
Wu, B. and Nevatia, R. (2005). Detection of multi-
ple, partially occluded humans in a single image by
bayesian combination of edgelet part detectors. In
Tenth IEEE International Conference on Computer
Vision (ICCV’05) Volume 1. IEEE.
Yao, W., Basu, S., Wei-Nchih, L., and Singhal, S. (2013).
Synthetic healthcare data generation. Google Patents.

323

Zhou Et Al. (2022)
No ratings yet
Zhou Et Al. (2022)
13 pages
DALL-E For Detection: Language-Driven Compositional Image Synthesis For Object Detection
No ratings yet
DALL-E For Detection: Language-Driven Compositional Image Synthesis For Object Detection
22 pages
A Survey of Synthetic Data Augmentation Methods in Computer Vision
No ratings yet
A Survey of Synthetic Data Augmentation Methods in Computer Vision
33 pages
CNN-based Density Estimation and Crowd Counting: A Survey
No ratings yet
CNN-based Density Estimation and Crowd Counting: A Survey
25 pages
Application of Data Augmentation On Deep Learning
No ratings yet
Application of Data Augmentation On Deep Learning
13 pages
Recent Advances in Deep Learning For Object Detection
No ratings yet
Recent Advances in Deep Learning For Object Detection
26 pages
Sayali
No ratings yet
Sayali
7 pages
Object Detection Using Deep CNNs Trained On Synthetic Images
No ratings yet
Object Detection Using Deep CNNs Trained On Synthetic Images
8 pages
Berrahal 2020
No ratings yet
Berrahal 2020
8 pages
CVPR2019 Residual Regression With Semantic Prior For Crowd Counting
No ratings yet
CVPR2019 Residual Regression With Semantic Prior For Crowd Counting
10 pages
Single-Image Crowd Counting Via Multi-Column Convolutional Neural Network 2016
No ratings yet
Single-Image Crowd Counting Via Multi-Column Convolutional Neural Network 2016
9 pages
Mca 104 Unit 3 Information Technology Notes
No ratings yet
Mca 104 Unit 3 Information Technology Notes
60 pages
Towards The Interpretability of Machine Learning Predictions For Medical Applications Targeting Personalised Therapies: A Cancer Case Survey
No ratings yet
Towards The Interpretability of Machine Learning Predictions For Medical Applications Targeting Personalised Therapies: A Cancer Case Survey
31 pages
2019 UNESCO AI SustDev
No ratings yet
2019 UNESCO AI SustDev
59 pages
Face Recognition Based Attendance System
No ratings yet
Face Recognition Based Attendance System
59 pages
Crowd Counting Using Deep Learning Based Head Dete
No ratings yet
Crowd Counting Using Deep Learning Based Head Dete
6 pages
Artificial Intelligence Research Center
No ratings yet
Artificial Intelligence Research Center
20 pages
cs42 PROJECT REPORT
No ratings yet
cs42 PROJECT REPORT
17 pages
Genetic Learn
No ratings yet
Genetic Learn
21 pages
Denoisng of Images
No ratings yet
Denoisng of Images
59 pages
Instagen: Enhancing Object Detection by Training On Synthetic Dataset
No ratings yet
Instagen: Enhancing Object Detection by Training On Synthetic Dataset
13 pages
A Method For Improving CNN-Based Image Recognition Using Dcgan
No ratings yet
A Method For Improving CNN-Based Image Recognition Using Dcgan
12 pages
New Smart Face Generation
No ratings yet
New Smart Face Generation
9 pages
Project Title - Crowd Counting System Using Switch Convolution Neural Network-1
No ratings yet
Project Title - Crowd Counting System Using Switch Convolution Neural Network-1
6 pages
Crowd Counting
No ratings yet
Crowd Counting
11 pages
Abstract Booklet BESE2022
No ratings yet
Abstract Booklet BESE2022
13 pages
Chandra CVPR 2019
No ratings yet
Chandra CVPR 2019
10 pages
Beery Synthetic Examples Improve Generalization For Rare Classes WACV 2020 Paper
No ratings yet
Beery Synthetic Examples Improve Generalization For Rare Classes WACV 2020 Paper
11 pages
G M C N - N: Enerative Odeling of Onvolutional EU RAL Etworks
No ratings yet
G M C N - N: Enerative Odeling of Onvolutional EU RAL Etworks
12 pages
Artificial Intelligence A-Z™ 2023 Build An AI With
No ratings yet
Artificial Intelligence A-Z™ 2023 Build An AI With
19 pages
Lecun 20181015 Ihes Gomax PDF
No ratings yet
Lecun 20181015 Ihes Gomax PDF
109 pages
Project Paper PDF
No ratings yet
Project Paper PDF
9 pages
Towards Learning 3d Object Detection and 6d Pose Estimation From Synthetic Data
No ratings yet
Towards Learning 3d Object Detection and 6d Pose Estimation From Synthetic Data
4 pages
2019 Apolinario Et Al. Open Set Recognition of Timber Species Using Deep Learning For Embedded Systems PDF
No ratings yet
2019 Apolinario Et Al. Open Set Recognition of Timber Species Using Deep Learning For Embedded Systems PDF
8 pages
Ethhadmur1lolfe6bymk0if90.tobias Berninger Dissertation
No ratings yet
Ethhadmur1lolfe6bymk0if90.tobias Berninger Dissertation
221 pages
Project Report Vision Alerting 466
No ratings yet
Project Report Vision Alerting 466
20 pages
Thesis - Jiang Xiaoyue
No ratings yet
Thesis - Jiang Xiaoyue
193 pages
Newzen - Python List - 2021
No ratings yet
Newzen - Python List - 2021
3 pages
1 s2.0 S0167404821003230 Main
No ratings yet
1 s2.0 S0167404821003230 Main
21 pages
Minsky1969 - An Introduction To Computational Geometry
No ratings yet
Minsky1969 - An Introduction To Computational Geometry
9 pages
Crowdnet: A Deep Convolutional Network For Dense Crowd Counting
No ratings yet
Crowdnet: A Deep Convolutional Network For Dense Crowd Counting
5 pages
Kim2019 Article LatentTransformationsNeuralNet
No ratings yet
Kim2019 Article LatentTransformationsNeuralNet
15 pages
1.thesis Book Omar
No ratings yet
1.thesis Book Omar
55 pages
Inception - GoogLeNet
No ratings yet
Inception - GoogLeNet
10 pages
EI 2017 Art00005 Daniel-Mas-Montserrat
No ratings yet
EI 2017 Art00005 Daniel-Mas-Montserrat
10 pages
People Counting in Crowd Faster R-CNN
No ratings yet
People Counting in Crowd Faster R-CNN
9 pages
Design of Intelligent Classroom Facial Recognition
No ratings yet
Design of Intelligent Classroom Facial Recognition
9 pages
CrowdGAN - Identity-Free Interactive Crowd Video Generation and Beyond
No ratings yet
CrowdGAN - Identity-Free Interactive Crowd Video Generation and Beyond
16 pages
1 s2.0 S0031320317304120 Main
No ratings yet
1 s2.0 S0031320317304120 Main
24 pages
2448 Self Supervised Visual Re
No ratings yet
2448 Self Supervised Visual Re
109 pages
Advanced Techniques For Fault Detection and Classification in Electrical Power Transmission Systems: An Overview
No ratings yet
Advanced Techniques For Fault Detection and Classification in Electrical Power Transmission Systems: An Overview
10 pages
JETIR2209375
No ratings yet
JETIR2209375
6 pages
Approximate Softmax Functions For Energy-Efficient Deep Neural Networks
No ratings yet
Approximate Softmax Functions For Energy-Efficient Deep Neural Networks
13 pages
Scirobotics Abm1421
No ratings yet
Scirobotics Abm1421
13 pages
When Face Recognition Meets Occlusion: A New Benchmark
No ratings yet
When Face Recognition Meets Occlusion: A New Benchmark
5 pages
Major Project PPT - Phase 1
No ratings yet
Major Project PPT - Phase 1
13 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Object Detection With Deep Learning: A Review
No ratings yet
Object Detection With Deep Learning: A Review
21 pages
1 PB
No ratings yet
1 PB
8 pages
Final Paper Imgprocessing
No ratings yet
Final Paper Imgprocessing
11 pages
Predicting Rapid Impact Compaction - Case Study
No ratings yet
Predicting Rapid Impact Compaction - Case Study
36 pages
Crop Disease Detection Using ResNet
No ratings yet
Crop Disease Detection Using ResNet
20 pages
Irjet V10i1067
No ratings yet
Irjet V10i1067
5 pages
AI&Green - Engg - Content - Outline1 1
No ratings yet
AI&Green - Engg - Content - Outline1 1
8 pages
Design Variable Structure Fuzzy Control Based On Deep Neural Network Model For Servomechanism Drive System
No ratings yet
Design Variable Structure Fuzzy Control Based On Deep Neural Network Model For Servomechanism Drive System
12 pages
AAI Extra
No ratings yet
AAI Extra
7 pages
Transfer Learning For Object Detection Using State-of-the-Art Deep Neural Networks
No ratings yet
Transfer Learning For Object Detection Using State-of-the-Art Deep Neural Networks
7 pages
Aazain Resume 2024
No ratings yet
Aazain Resume 2024
2 pages
2017 Supervised Machine Learning Based Surface Inspection by Synthetizing Artificial Defects
No ratings yet
2017 Supervised Machine Learning Based Surface Inspection by Synthetizing Artificial Defects
6 pages
Harsha Thesis
No ratings yet
Harsha Thesis
62 pages
AIML (3rd - Year) Syllabus Igdtuw
No ratings yet
AIML (3rd - Year) Syllabus Igdtuw
34 pages
Mastering Data Science
No ratings yet
Mastering Data Science
10 pages
Counting in Dense Crowds Using Deep Learning
No ratings yet
Counting in Dense Crowds Using Deep Learning
6 pages
Generative Adversarial Networks For Image and Video Synthesis: Algorithms and Applications
No ratings yet
Generative Adversarial Networks For Image and Video Synthesis: Algorithms and Applications
24 pages
Empowering Edge Intelligence: A Comprehensive Survey On On-Device AI Models
No ratings yet
Empowering Edge Intelligence: A Comprehensive Survey On On-Device AI Models
42 pages
Image Sorting Using Object Detection and Face Recognition
No ratings yet
Image Sorting Using Object Detection and Face Recognition
6 pages
Synopsis
No ratings yet
Synopsis
9 pages
An Investigation of Deep Neural Network Based Techniques For Object Detection An
No ratings yet
An Investigation of Deep Neural Network Based Techniques For Object Detection An
6 pages
Paper4 (GAN)
No ratings yet
Paper4 (GAN)
24 pages
A Review On Deep Learning Approaches To Image Classification and Object Segmentation 1
No ratings yet
A Review On Deep Learning Approaches To Image Classification and Object Segmentation 1
23 pages
H13-311 - V3.5 - Unlocked
100% (1)
H13-311 - V3.5 - Unlocked
132 pages
Hedge Fund Use of AI
No ratings yet
Hedge Fund Use of AI
46 pages
Real Time Object Recognition and Classification
No ratings yet
Real Time Object Recognition and Classification
6 pages
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
No ratings yet
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
17 pages
Synthetic Generation of High Dimensional Dataset
No ratings yet
Synthetic Generation of High Dimensional Dataset
8 pages
Anime Face Generation Using DC-GANs
No ratings yet
Anime Face Generation Using DC-GANs
6 pages
DL Unit - 5
No ratings yet
DL Unit - 5
14 pages
Unlocking Business Potential With ENCS Networks' Data Science Services
No ratings yet
Unlocking Business Potential With ENCS Networks' Data Science Services
10 pages
Activity Recognition: Fundamentals and Applications
From Everand
Activity Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
From Everand
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
Fouad Sabry
No ratings yet
Computer Vision: Exploring the Depths of Computer Vision
From Everand
Computer Vision: Exploring the Depths of Computer Vision
Fouad Sabry
No ratings yet
Computer Vision: Fundamentals and Applications
From Everand
Computer Vision: Fundamentals and Applications
Fouad Sabry
No ratings yet
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
From Everand
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
Fouad Sabry
No ratings yet

2017 Synthetic Data Generation For Deep Learning in Counting Pedestrians

Uploaded by

2017 Synthetic Data Generation For Deep Learning in Counting Pedestrians

Uploaded by

Synthetic Data Generation for Deep Learning in Counting Pedestrians

Hadi Keivan Ekbatani, Oriol Pujol and Santi Segui

1 INTRODUCTION ods in various areas including computer vision where

Figure 2: An illustration of image generation process at different steps.

Figure 3: An illustration of each step of image improvement process.

Song, H. A. and Lee, S.-Y. (2013). Hierarchical representa-

You might also like