Generative Adversarial Text To Image Synthesis
Abstract

We implemented a deep recurrent neural network architecture and Generative Adversarial Network (GAN) formulation to effectively bridge the advances in text and image modeling, translating visual concepts from characters to pixels. We show the capability of the model to generate images of flowers from detailed text descriptions.

Conclusions

• We developed a simple and effective model for generating images based on detailed visual descriptions.
• The images capture the shape and color of the flower but lack other significant details needed to pass as realistic samples.
• The model could not generalize to images with multiple objects.
Introduction

• Artificial synthesis of images from text descriptions could have profound applications in visual editing, animation, and digital design.
• The distribution of images conditioned on a text description is highly multimodal.
• In GANs, the discriminator D tries to distinguish real images from synthesized images, while the generator G tries to fool D.
• The discriminator views (text, image) pairs as joint observations and is trained to judge a pair as real or fake.

Subproblems

• Learn a text feature representation that captures the important visual details.
• Use these features to synthesize a compelling image.

Literature Survey

• [1] estimated generative models via an adversarial process to generate images conditioned on text and input noise.
• In [2, 4] the authors describe architectural guidelines for stable GANs.
• In [3] the authors present an unsupervised approach to train a generic sentence encoder.

DCGAN

The GAN training procedure is similar to a two-player min-max game with the following objective function:

min_G max_D V(D, G) = E_{x∼p_data}[log D(x)] + E_{z∼p_z}[log(1 − D(G(z)))]

where x is a real image from the true distribution, and z is a noise vector sampled from p_z, which might be a Gaussian or uniform distribution.

Skip Thought Vectors

An unsupervised approach to train a generic, distributed sentence encoder. We train an encoder-decoder model where the encoder maps the input sentence to a vector h_i and the decoder generates the surrounding sentences. The objective is to maximize the sum of the log-probabilities for the forward and backward sentences conditioned on the encoder output:

Σ_t log P(w_{i+1}^t | w_{i+1}^{<t}, h_i) + Σ_t log P(w_{i−1}^t | w_{i−1}^{<t}, h_i)

Results

Figures generated from the corresponding captions using the trained model (images omitted). Example captions:
• "this flower has petals that are red and are bunched together"
• "the flower has an abundance of yellow petals and brown anthers"
• "flower is purple and pink in petal and feature a dark, dense core"

Future Work

• Improve generator learning with manifold interpolation.
• Implement Stacked GANs to produce high-quality images.
• Explore the possibility of using Wasserstein GANs and Cyclic GANs.
• Generalize the model to images with multiple objects and variable backgrounds using the MS-COCO dataset.

References

[1] S. Reed, Z. Akata, X. Yan, L. Logeswaran, B. Schiele, and H. Lee. Generative adversarial text-to-image synthesis. In ICML, 2016.
[2] A. Radford, L. Metz, and S. Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR, 2016.
[3] R. Kiros, Y. Zhu, R. R. Salakhutdinov, R. Zemel, R. Urtasun, A. Torralba, and S. Fidler. Skip-thought vectors. In NIPS, 2015.
[4] I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. C. Courville, and Y. Bengio. Generative adversarial nets. In NIPS, 2014.
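The GAN min-max objective V(D, G) described in the DCGAN section can be evaluated numerically. The following is a minimal numpy sketch with a toy 1-D "image" distribution, not the poster's DCGAN implementation; the sigmoid discriminator, shift generator, and the names `gan_value`, `discriminator`, and `generator` are illustrative assumptions. A text-conditional variant would additionally feed a text embedding to both networks.

```python
import numpy as np

rng = np.random.default_rng(0)

def discriminator(x, w=2.0, b=0.0):
    # Toy discriminator: sigmoid of a linear score, D(x) in (0, 1).
    return 1.0 / (1.0 + np.exp(-(w * x + b)))

def generator(z, shift=4.0):
    # Toy generator: shifts noise toward the "real" data region.
    return z + shift

def gan_value(x_real, z, D, G):
    # V(D, G) = E_{x~p_data}[log D(x)] + E_{z~p_z}[log(1 - D(G(z)))],
    # with expectations approximated by sample means.
    return np.mean(np.log(D(x_real))) + np.mean(np.log(1.0 - D(G(z))))

# "Real" samples stand in for images from the true distribution p_data;
# z is drawn from the noise prior p_z (here a standard Gaussian).
x_real = rng.normal(5.0, 1.0, size=1000)
z = rng.normal(0.0, 1.0, size=1000)

v = gan_value(x_real, z, discriminator, generator)
```

Training alternates a gradient ascent step on V for D with a descent step for G; here we only evaluate the objective at fixed toy parameters.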
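The skip-thought objective, the sum of log-probabilities of the forward sentence w_{i+1} and backward sentence w_{i−1} conditioned on the encoder output h_i, can be sketched as follows. This is a hedged illustration, not the encoder-decoder from [3]: the decoder's per-step vocabulary distributions are faked with random draws, and `sentence_log_prob` and `skip_thought_objective` are names we introduce here.

```python
import numpy as np

def sentence_log_prob(probs, word_ids):
    # Sum over timesteps t of log P(w^t | w^{<t}, h_i), where probs[t] is the
    # decoder's predicted distribution over the vocabulary at step t.
    return float(np.sum(np.log(probs[np.arange(len(word_ids)), word_ids])))

def skip_thought_objective(next_probs, next_ids, prev_probs, prev_ids):
    # Log-prob of the next sentence w_{i+1} plus log-prob of the previous
    # sentence w_{i-1}; training maximizes this quantity.
    return (sentence_log_prob(next_probs, next_ids)
            + sentence_log_prob(prev_probs, prev_ids))

# Hypothetical decoder outputs: 3-step sentences over a 5-word vocabulary.
rng = np.random.default_rng(1)
next_probs = rng.dirichlet(np.ones(5), size=3)  # each row sums to 1
prev_probs = rng.dirichlet(np.ones(5), size=3)
next_ids = np.array([0, 3, 1])
prev_ids = np.array([2, 2, 4])

obj = skip_thought_objective(next_probs, next_ids, prev_probs, prev_ids)
```

In the real model the distributions come from GRU decoders conditioned on h_i, and the resulting encoder vectors serve as the text features for the GAN.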