Study of Image Inpainting Using Generative Adversarial Network Architecture
Abstract- Image inpainting is the process of reconstructing lost or deteriorated parts of images and videos. It is an important
problem in computer vision with many imaging and graphics applications, e.g. restoring old photos and videos, automatic
scene editing, denoising, compression and image-based rendering. Traditional inpainting methods, which are mostly based on
machine learning models, work well for background inpainting, but they cannot hallucinate novel image content for
challenging tasks such as inpainting faces and complex objects, and they fail to capture high-level object semantics. It has
also been observed that introducing a small amount of noise into the original data can readily mislead most mainstream
neural networks into misclassifying items. This is attributed to most machine learning models learning from a limited
quantity of data with a nearly linear input-to-output mapping, which is a major disadvantage and leads to overfitting. The
present method uses Generative Adversarial Networks (GANs), a type of generative modelling that employs deep learning
techniques such as convolutional neural networks. GANs can learn from unstructured or unlabeled data; they learn through
feature extraction, which is a very different, more reliable and fully automatic approach. The CelebFaces Attributes Dataset
(CelebA) is a large-scale face attributes dataset with more than 200K celebrity images, each with 40 attribute annotations.
Keywords: Image Inpainting, Generative Adversarial Networks (GANs), CelebFaces Attributes Dataset (CelebA),
Generator, Discriminator.
1. INTRODUCTION
The task of filling empty pixels in a picture, known as image inpainting or completion, is crucial in computer
vision. It is used in photo editing, image-based rendering, and computational photography, among other applications
[1]. The most difficult aspect of image inpainting is creating visually realistic and semantically plausible pixels for
the missing regions that are consistent with the existing pixels. Many non-deep-learning approaches exist, such as
diffusion-based and patch-based methods, exemplar-based image inpainting, patch offset statistics for image completion,
and Content-Aware Fill in Adobe Photoshop; these work well for background inpainting and are frequently used in
practical applications. The primary idea of patch-based approaches is to outline the boundary of the missing region and
then fill in the empty area. Barnes proposed a method that searches the rest of the image for matching patches and uses
them to fill in the missing parts, so that the texture information is more appropriate. However, it fails when confronted
with complex inpainting cases (faces, natural images), where the result is blurry. Exemplar-based methods are poor at
filling gaps that contain intricate structures [2]. The reason is that the texture synthesis method is inefficient.
Because these methods presume that missing patches can be found somewhere in the background regions, they cannot
hallucinate novel image content for difficult cases in which the inpainting regions involve complicated, non-repetitive
structures (e.g. faces, objects) [3].
1 | Page
Publisher: Noida Institute of Engineering & Technology,
19, Knowledge Park-II, Institutional Area, Greater Noida (UP), India.
NIET Journal of Engineering & Technology (NIETJET)
Volume 5, Issue Winter 2016 ISSN: 2229-5828 (Print)
The proposed method uses GANs, which are a clever way of training a generative model by framing the problem as a
supervised learning problem with two sub-models: the generator model, which is trained to generate new examples,
and the discriminator model, which tries to classify examples as real (from the domain) or fake (generated) [4].
The two models are trained together in an adversarial zero-sum game until the discriminator is fooled about half of
the time, indicating that the generator is producing plausible examples. The images in the CelebFaces Attributes
Dataset (CelebA) cover large pose variations and background clutter. CelebA has large diversity, large quantity, and
rich annotations, including 10,177 identities, 202,599 face images, and 5 landmark locations and 40 binary attribute
annotations per image.
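For reference, the adversarial game described above is commonly written as the minimax objective of the original GAN formulation. This is the standard textbook form and not necessarily the exact objective optimised in this study; the symbols $p_{\text{data}}$ (distribution of real images) and $p_z$ (distribution of the noise vector) are notation introduced here for illustration.

```latex
\min_{G} \max_{D} \;
  \mathbb{E}_{x \sim p_{\text{data}}}\bigl[\log D(x)\bigr]
  + \mathbb{E}_{z \sim p_z}\bigl[\log\bigl(1 - D(G(z))\bigr)\bigr]
```

When the generator matches the data distribution, the best possible discriminator outputs $D(x) = 1/2$ for every input, which corresponds to the discriminator being fooled about half of the time.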
2. METHODOLOGY
There are three stages involved in our proposed method:
• Image Data Preprocessing
• Deep Convolutional GAN Architecture used for implementation
• Performance Measure
2.2.1 Loss Functional Model-The loss function used for our model is given by,
Prior Loss - focuses on high-level image feature representations instead of pixel-wise differences. The prior loss
encourages the recovered image to be similar to samples drawn from the training set.
$L_p(z) = \lambda \log(1 - D(G(z)))$ (2)
where $\lambda$ is a parameter that balances the two losses; z is updated to fool D and make the corresponding
generated image more realistic.
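A minimal sketch of equation (2) for a single discriminator output is given below. It assumes D(G(z)) is available as a probability in [0, 1]; the function name `prior_loss`, the default `lam=0.1`, and the `eps` guard are illustrative choices and do not come from the paper.

```python
import math


def prior_loss(d_of_g_z: float, lam: float = 0.1, eps: float = 1e-12) -> float:
    """Prior loss L_p(z) = lam * log(1 - D(G(z))) for one discriminator output.

    d_of_g_z is the discriminator's probability that G(z) is real.
    lam (hypothetical default) balances this term against the other loss.
    """
    if not 0.0 <= d_of_g_z <= 1.0:
        raise ValueError("discriminator output must be a probability in [0, 1]")
    # eps keeps the logarithm finite when the discriminator outputs exactly 1.
    return lam * math.log(max(1.0 - d_of_g_z, eps))
```

The loss becomes more negative as D(G(z)) approaches 1, so minimising it over z pushes the generated image towards what the discriminator regards as real.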
The generative model, G, takes a 100-dimensional vector drawn from a uniform distribution over [-1, 1] and
generates a 64×64×3 image.
For the discriminator model, D, the input layer is a 64×64×3 image, followed by a sequence of convolution layers,
each with half the spatial dimensions and twice the number of channels of the preceding layer; the output layer is a
two-class softmax [6].
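The input sampling and the halving and doubling pattern described above can be traced with a short sketch. The paper does not state the channel count of the first convolution layer or where the downsampling stops, so `first_channels=64` and `min_size=4` are assumptions, and both function names are hypothetical.

```python
import random


def sample_latent(dim=100, seed=None):
    """Draw the generator input: dim values sampled uniformly from [-1, 1]."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def discriminator_shapes(size=64, in_channels=3, first_channels=64, min_size=4):
    """List (height, width, channels) for the input and after each convolution.

    Each layer halves the spatial size; after the first layer (whose width is
    an assumed 64 channels) each layer doubles the channel count.
    """
    shapes = [(size, size, in_channels)]
    channels = first_channels
    while size > min_size:
        size //= 2
        shapes.append((size, size, channels))
        channels *= 2
    return shapes


if __name__ == "__main__":
    for shape in discriminator_shapes():
        print(shape)
```

Under these assumptions the feature maps shrink from 64×64 to 4×4 over four convolution layers before the softmax output.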
During the training step, we use Adam for optimization with a value of 0.003.
In the inpainting step, z is found using Adam, with z constrained to [-1, 1] in each iteration.
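The search for z can be illustrated with a small pure-Python version of Adam that clips z to [-1, 1] after every update. This is a sketch under assumptions: the value 0.003 is used here as the step size, the remaining Adam constants are the commonly used defaults, and the quadratic `toy_grad` stands in for the gradient of the real inpainting loss, which would come from backpropagating through G and D.

```python
import math


def adam_clipped(grad_fn, z, lr=0.003, beta1=0.9, beta2=0.999, eps=1e-8, steps=1000):
    """Minimise a loss over z with Adam, clipping z to [-1, 1] after each step."""
    z = list(z)
    m = [0.0] * len(z)  # running mean of the gradients
    v = [0.0] * len(z)  # running mean of the squared gradients
    for t in range(1, steps + 1):
        grad = grad_fn(z)
        for i, g in enumerate(grad):
            m[i] = beta1 * m[i] + (1.0 - beta1) * g
            v[i] = beta2 * v[i] + (1.0 - beta2) * g * g
            m_hat = m[i] / (1.0 - beta1 ** t)  # bias-corrected first moment
            v_hat = v[i] / (1.0 - beta2 ** t)  # bias-corrected second moment
            z[i] -= lr * m_hat / (math.sqrt(v_hat) + eps)
            z[i] = max(-1.0, min(1.0, z[i]))   # keep z inside [-1, 1]
    return z


# Toy stand-in loss: sum of (z_i - target_i)^2, with one target outside [-1, 1].
TARGET = [0.5, -2.0]


def toy_grad(z):
    return [2.0 * (zi - ti) for zi, ti in zip(z, TARGET)]


z_hat = adam_clipped(toy_grad, [0.0, 0.0])
```

In the toy run the first coordinate settles near its target of 0.5, while the second is held at the boundary -1 because its unconstrained optimum lies outside the allowed range.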
DCGAN Architecture (figure panels: Generator, Discriminator)
FIGURE 4. Comparison of inpainting results on the CelebA test dataset with a regular missing region.
Figures 4 and 5 compare the results obtained with our proposed technique against the FMM-based method, the GLCIC
model, and the DIP model. The ground truth images are in column (a), and the masked images are in column (b).
Columns (c), (d), and (e) show the FMM, GLCIC, and DIP results, respectively, and the last column shows our results.
Compared with the previous methods, the images generated by our model are more similar to the ground truth images [9].
FIGURE 5. Comparison of inpainting results on the CelebA test dataset with irregular missing patches.
FIGURE 6. Quantitative evaluations in terms of PSNR and SSIM at different mask sizes.
Method   PSNR    SSIM
FMM      23.52   0.79
GLCIC    27.81   0.82
DIP      25.46   0.80
GAN      28.90   0.84
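As an illustration of how the PSNR column can be computed, the sketch below applies the standard definition, PSNR = 10 log10(MAX^2 / MSE), to two images given as flat sequences of pixel values. The flat-list representation, the 8-bit peak value of 255, and the function name `psnr` are assumptions made for the example; the paper does not describe its evaluation code. SSIM is omitted because it needs windowed local statistics.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(restored) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel values.
    mse = sum((a - b) ** 2 for a, b in zip(reference, restored)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher values indicate a restored image closer to the ground truth, which is the sense in which the rows of the table are compared.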
References
[1] Anju Gauniya Pandey, Sanjita Das, S. P. Basu, Palak Srivastava; 'Design and Evaluation of Nanoemulsion
for Delivery of Diclofenac Sodium', Volume No. 2, Issue No. 1, 2013, pp. 079-082, ISSN: 2229-5828
[2] Raj Kumar Goel, Rinku Sharma Dixit, Dr. Manu Pratap Singh; 'Implementation of Pattern Storage Neural
Network as Associative Memory for Storage and Recalling of Finger Prints', Volume No. 2, Issue No. 1, 2013,
pp. 083-090, ISSN: 2229-5828
[3] Amit Kumar Yadav, Satyendra Sharma; 'Design and Simulation of Multiplier for High-speed
Application', Volume No. 2, Issue No. 2, 2014, pp. 001-007, ISSN: 2229-5828
[4] Deepak Kumar, Anjana Rani Gupta, Somesh Kumar; 'Dynamic Simulation of Multiple Effect Evaporators
in Paper Industry Using MATLAB', Volume No. 2, Issue No. 2, 2014, pp. 008-014, ISSN: 2229-5828
[5] Devendra Pratap, Satyendra Sharma; 'Planning and Modelling of Indoor WLAN Through Field Measurement
at 2.437 GHz Frequency', Volume No. 2, Issue No. 2, 2014, pp. 015-019, ISSN: 2229-5828