Module 5

The document provides an overview of various types of autoencoders, including Stacked, Convolutional, Denoising, Sparse, and Variational Autoencoders, detailing their architectures and functionalities. It also discusses Generative Adversarial Networks (GANs), their training challenges, and proposed solutions, as well as the advancements in diffusion models for image generation. Key concepts include the probabilistic nature of variational autoencoders and the competitive dynamics of GANs, highlighting their respective advantages and limitations.

Stacked Autoencoders

stacked_encoder = tf.keras.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(30, activation="relu"),
])

stacked_decoder = tf.keras.Sequential([
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(28 * 28),
    tf.keras.layers.Reshape([28, 28])
])

stacked_ae = tf.keras.Sequential([stacked_encoder, stacked_decoder])
stacked_ae.compile(loss="mse", optimizer="nadam")
history = stacked_ae.fit(X_train, X_train, epochs=20,
                         validation_data=(X_valid, X_valid))
Convolutional Autoencoders
Encoder:

conv_encoder = tf.keras.Sequential([
    tf.keras.layers.Reshape([28, 28, 1]),
    tf.keras.layers.Conv2D(16, 3, padding="same", activation="relu"),
    tf.keras.layers.MaxPool2D(pool_size=2),  # output: 14 x 14 x 16
    tf.keras.layers.Conv2D(32, 3, padding="same", activation="relu"),
    tf.keras.layers.MaxPool2D(pool_size=2),  # output: 7 x 7 x 32
    tf.keras.layers.Conv2D(64, 3, padding="same", activation="relu"),
    tf.keras.layers.MaxPool2D(pool_size=2),  # output: 3 x 3 x 64
    tf.keras.layers.Conv2D(30, 3, padding="same", activation="relu"),
    tf.keras.layers.GlobalAvgPool2D()        # output: 30
])
Decoder:
conv_decoder = tf.keras.Sequential([
    tf.keras.layers.Dense(3 * 3 * 16),
    tf.keras.layers.Reshape((3, 3, 16)),
    tf.keras.layers.Conv2DTranspose(32, 3, strides=2, activation="relu"),
    tf.keras.layers.Conv2DTranspose(16, 3, strides=2, padding="same", activation="relu"),
    tf.keras.layers.Conv2DTranspose(1, 3, strides=2, padding="same"),
    tf.keras.layers.Reshape([28, 28])
])

conv_ae = tf.keras.Sequential([conv_encoder, conv_decoder])
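The convolutional autoencoder is trained the same way as the stacked one; a minimal sketch follows, assuming X_train and X_valid are the same 28 x 28 image arrays used earlier (the number of epochs is illustrative):

conv_ae.compile(loss="mse", optimizer="nadam")
history = conv_ae.fit(X_train, X_train, epochs=10,
                      validation_data=(X_valid, X_valid))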


Denoising Autoencoders
Encoder:

dropout_encoder = tf.keras.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(30, activation="relu")
])

Decoder:

dropout_decoder = tf.keras.Sequential([
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(28 * 28),
    tf.keras.layers.Reshape([28, 28])
])

dropout_ae = tf.keras.Sequential([dropout_encoder, dropout_decoder])
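Training follows the same pattern as before; a minimal sketch is shown below (the number of epochs is illustrative). Note that Keras applies Dropout only during training, so the input corruption is switched off automatically at inference time.

dropout_ae.compile(loss="mse", optimizer="nadam")
history = dropout_ae.fit(X_train, X_train, epochs=10,
                         validation_data=(X_valid, X_valid))

# At inference time Dropout is inactive, so this reconstructs uncorrupted inputs.
reconstructions = dropout_ae.predict(X_valid)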
Sparse Autoencoders
Encoder:

sparse_l1_encoder = tf.keras.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(300, activation="sigmoid"),
    tf.keras.layers.ActivityRegularization(l1=1e-4)
])

Decoder:

sparse_l1_decoder = tf.keras.Sequential([
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(28 * 28),
    tf.keras.layers.Reshape([28, 28])
])

sparse_l1_ae = tf.keras.Sequential([sparse_l1_encoder, sparse_l1_decoder])
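The ActivityRegularization layer passes its inputs through unchanged while adding an L1 penalty on the activations (the codings) to the training loss, which pushes many of them toward zero. An equivalent way to express this, sketched below, is to attach the penalty directly to the coding layer via Keras's activity_regularizer argument:

# Equivalent sketch: put the L1 activity penalty on the coding layer itself.
sparse_l1_encoder = tf.keras.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(300, activation="sigmoid",
                          activity_regularizer=tf.keras.regularizers.l1(1e-4))
])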
Variational Autoencoders
Introduction and Historical Context

● Introduced in 2013 by Diederik Kingma and Max Welling


● Quickly became one of the most popular autoencoder variants

Probabilistic Nature

● Outputs are partially determined by chance, even after training


● This differs from denoising autoencoders, which only use randomness during
training

Generative Capabilities
● Can generate new instances that appear to come from the training set
● Similar to Restricted Boltzmann Machines (RBMs) but with advantages:
○ Easier to train
○ Faster sampling process (no need to wait for thermal equilibrium)
Technical Foundation
● Based on variational Bayesian inference
● Performs approximate Bayesian inference efficiently
● Updates probability distributions using Bayes' theorem
● Works with prior and posterior distributions
Architecture and Operation
● Has the standard encoder-decoder structure
● Encoder produces two outputs:
○ Mean coding (μ)
○ Standard deviation (σ)
● Actual coding is sampled from a Gaussian distribution using μ and σ
● Decoder then processes this sampled coding
● Final output resembles the training instance
Variational Autoencoder latent loss (the sum runs over the dimensions of the codings):
L = -1/2 * Σᵢ [1 + log(σᵢ²) - σᵢ² - μᵢ²]
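To make the sampling step concrete, here is a minimal sketch (not the full VAE from these notes): a custom Keras layer that draws a coding z = μ + σ·ε with ε ~ N(0, I), together with the latent loss above written in terms of the encoder's mean and log-variance outputs. The names Sampling and latent_loss are placeholders introduced for this sketch.

import tensorflow as tf

class Sampling(tf.keras.layers.Layer):
    # Reparameterization trick: z = mean + sigma * epsilon, with epsilon ~ N(0, I)
    def call(self, inputs):
        mean, log_var = inputs                        # encoder outputs: mu and log(sigma^2)
        epsilon = tf.random.normal(tf.shape(log_var))
        return mean + tf.exp(log_var / 2) * epsilon

def latent_loss(mean, log_var):
    # Matches the formula above with log_var = log(sigma^2):
    # -1/2 * sum_i [1 + log(sigma_i^2) - sigma_i^2 - mu_i^2], averaged over the batch
    return tf.reduce_mean(
        -0.5 * tf.reduce_sum(1 + log_var - tf.exp(log_var) - tf.square(mean), axis=-1))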
Generative Adversarial Networks (GANs)
Historical Context
● Proposed in 2014 by Ian Goodfellow and colleagues
● Generated immediate excitement in the research community
● Initial training difficulties took years to overcome
Core Concept
● Based on competition between neural networks
● Competition drives improvement in both networks
● Composed of two distinct neural networks working against each
other
Generator Network
● Takes random noise (typically Gaussian) as input
● Produces data (typically images) as output
● Random inputs serve as latent representations
● Functions similarly to a decoder in a VAE
● Can generate new images from random noise
Discriminator Network
● Takes images as input from two sources:
○ Fake images from the generator
○ Real images from the training set
● Must classify whether each input image is real or fake
● Acts as a binary classifier
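As a minimal sketch of these two roles (layer sizes are illustrative, not taken from these notes), the generator below maps a 30-dimensional Gaussian noise vector to a 28 x 28 image, and the discriminator is an ordinary binary classifier ending in a sigmoid unit:

codings_size = 30   # size of the random noise vector (illustrative)

generator = tf.keras.Sequential([
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(150, activation="relu"),
    tf.keras.layers.Dense(28 * 28, activation="sigmoid"),
    tf.keras.layers.Reshape([28, 28])
])

discriminator = tf.keras.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(150, activation="relu"),
    tf.keras.layers.Dense(100, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid")   # 1 = real, 0 = fake
])

gan = tf.keras.Sequential([generator, discriminator])

# The discriminator is trained as a plain binary classifier; when the generator is
# trained through the combined gan model, the discriminator's weights are frozen.
discriminator.compile(loss="binary_crossentropy", optimizer="rmsprop")
discriminator.trainable = False
gan.compile(loss="binary_crossentropy", optimizer="rmsprop")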
The Difficulties of Training GANs
1. Training Dynamics
● Functions as a zero-sum game between generator and discriminator
● Aims to reach a Nash equilibrium state
● Only one theoretical optimal equilibrium exists:
○ Generator produces perfectly realistic images
○ Discriminator is forced to random guessing (50/50)

2. Nash Equilibrium Concept


● State where no player benefits from changing strategy alone
● Can have a single optimal strategy (e.g., everyone driving on the same side of the road)
● Can involve multiple competing strategies (predator-prey example)
3. Major Challenges

● Reaching equilibrium isn't guaranteed


● Mode collapse is a significant issue:
○ Generator focuses on one type of output
○ Gradually loses diversity
○ May cycle between different classes
● Training instability:
○ Parameters can oscillate
○ Training may suddenly diverge
○ Very sensitive to hyperparameters
4. Proposed Solutions
● Experience replay:
○ Stores generated images in buffer
○ Trains discriminator on mix of current and stored fake images
○ Reduces discriminator overfitting (a minimal sketch follows this list)
● Mini-batch discrimination:
○ Measures similarity across image batches
○ Helps discriminator identify lack of diversity
○ Encourages generator variety
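A minimal sketch of the experience-replay idea, assuming a generator and real images X_real like those sketched earlier; the buffer size, batch size, and function name are illustrative. Fresh fakes are mixed with older fakes drawn from the buffer before the discriminator is trained on them together with real images.

from collections import deque
import numpy as np
import tensorflow as tf

replay_buffer = deque(maxlen=10_000)        # stores previously generated (fake) images

def discriminator_batch(generator, X_real, batch_size=32, codings_size=30):
    # Generate fresh fakes with the current generator and remember them.
    noise = tf.random.normal([batch_size // 2, codings_size])
    fresh_fakes = generator(noise).numpy()
    replay_buffer.extend(fresh_fakes)

    # Mix the fresh fakes with older fakes sampled from the replay buffer.
    idx = np.random.randint(len(replay_buffer), size=batch_size // 2)
    old_fakes = np.array([replay_buffer[i] for i in idx])
    fakes = np.concatenate([fresh_fakes, old_fakes])

    # Label fakes 0 and real images 1 for the discriminator.
    X = np.concatenate([fakes, X_real[:batch_size]])
    y = np.concatenate([np.zeros(len(fakes)), np.ones(batch_size)])
    return X, y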
5. Current State
● Remains active research field
● GAN dynamics not fully understood
● Significant progress made
● Results can be impressive
● Moving toward more complex architectures
Diffusion Model
The modern formalization of diffusion models came from a 2015 paper by Sohl-Dickstein
et al. from Stanford and UC Berkeley. They used thermodynamics principles to model a
diffusion process similar to milk mixing in tea, but aimed to reverse the process.

In 2020, Jonathan Ho et al. from UC Berkeley created the denoising diffusion probabilistic model (DDPM), which could generate highly realistic images. Their work marked a significant advancement in the field.

A 2021 paper by OpenAI researchers (Nichol and Dhariwal) improved DDPMs to surpass
GANs in performance. The advantages were:

● Easier to train than GANs


● Generated more diverse images
● Produced higher quality images
● Main drawback: Much slower image generation compared to GANs or VAEs
The DDPM process works as follows:
● Start with an initial image (x0)
● Add Gaussian noise at each time step t (with mean 0 and variance βₜ)
● Noise is added independently for each pixel (isotropic)
● Process continues until the original image is completely hidden
Technical implementation details:
● Original DDPM paper used 1,000 time steps
● Improved version increased to 4,000 time steps
● Pixel values are rescaled at each step by √(1 − βₜ)
● Mean of pixel values approaches 0
● Variance converges to 1
The forward process probability distribution is defined by:
q(xₜ | xₜ₋₁) = N(√(1 − βₜ) · xₜ₋₁, βₜI)
Where:
● N represents a Gaussian distribution
● βₜ is the variance at time step t
● I is the identity matrix
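A minimal sketch of this forward (noising) step, assuming inputs already scaled to roughly zero mean and unit variance and using an illustrative constant βₜ schedule (the papers use carefully tuned schedules):

import tensorflow as tf

T = 1000                    # number of time steps, as in the original DDPM paper
beta = tf.fill([T], 0.02)   # illustrative constant variance schedule

def forward_step(x_prev, t):
    # q(x_t | x_{t-1}) = N(sqrt(1 - beta_t) * x_{t-1}, beta_t * I):
    # rescale the previous image, then add isotropic Gaussian noise.
    noise = tf.random.normal(tf.shape(x_prev))
    return tf.sqrt(1.0 - beta[t]) * x_prev + tf.sqrt(beta[t]) * noise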
The ultimate goal is image generation:
● Train a model to perform the reverse process (xt to xt-1)
● Start with pure Gaussian noise
● Gradually remove noise until a new image emerges
● Model trained on specific image types (like cats) will generate
similar images
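For completeness, here is a heavily simplified sketch of that generation loop under the standard DDPM formulation (these symbols are not defined in the notes: αₜ = 1 − βₜ, ᾱₜ is the cumulative product of the αs, and model(x, t) is assumed to be a trained network that predicts the noise added at step t). It reuses T and beta from the previous sketch.

alpha = 1.0 - beta                     # uses the beta schedule from the sketch above
alpha_bar = tf.math.cumprod(alpha)

def generate(model, shape=(1, 28, 28)):
    x = tf.random.normal(shape)        # start from pure Gaussian noise
    for t in reversed(range(T)):
        eps = model(x, t)              # predicted noise at step t
        x = (x - (1 - alpha[t]) / tf.sqrt(1 - alpha_bar[t]) * eps) / tf.sqrt(alpha[t])
        if t > 0:                      # add a little fresh noise except at the final step
            x += tf.sqrt(beta[t]) * tf.random.normal(shape)
    return x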
