Sessional-II Exam Solution Spring 2024

Uploaded by

eysha raazia

National University of Computer and Emerging Sciences

FAST School of Computing Spring-2024 Islamabad Campus

AI-4009: Generative AI Sessional-II Exam

Total Time: 1 Hour
Date: 4th April, 2024
Total Marks: 50
Course Instructor: Dr. Akhtar Jamil

______________________ ______________ _______________ __________________

Student Name Roll No. Course Section Student Signature

Do not write anything on the question paper except the information required above.
Instructions:
1. Read the question carefully, understand the question, and then attempt your answers in the
provided answer booklet.
2. Verify that you have two (2) printed pages including this page. There are Four (4) questions.
3. Calculator sharing is strictly prohibited.
4. Write concise answers where necessary.

Q1: Write short answers to the following questions [10 x 2 = 20]

1. Why do latent variable models approximate the expected log-likelihood rather than
computing the actual probability directly?
In latent variable models, directly calculating the actual probability of the observed data involves
integrating over all possible values of the latent variables, which can be mathematically intractable or
computationally prohibitive, especially in high-dimensional spaces. This integration is necessary
because the latent variables are not directly observed, yet they influence the generation of the
observed data. The true likelihood function of the observed data thus involves summing or integrating
over these hidden variables to account for all their possible configurations.

To manage this complexity, we approximate the expected log likelihood instead of calculating the
actual probability. This approximation makes the problem more tractable by allowing us to work with
simpler forms that can be efficiently computed.
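
The intractable integral and the bound that replaces it can be written out explicitly. The notation below (prior p(z), decoder p_θ(x|z), approximate posterior q_φ(z|x)) is the usual VAE convention and is assumed here rather than taken from the exam text.

```latex
\begin{aligned}
p_\theta(x) &= \int p_\theta(x \mid z)\, p(z)\, dz \\
\log p_\theta(x) &\ge \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big]
  - D_{\mathrm{KL}}\big(q_\phi(z \mid x)\,\|\,p(z)\big)
\end{aligned}
```

The right-hand side of the second line is the quantity that is maximized in place of the exact log-likelihood.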

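2. What will be the impact if the KL divergence between q_φ(z|x) and p(z) is high?
If the KL divergence between the approximate posterior q_φ(z|x) and the prior p(z) is too high, the latent
codes produced by the encoder lie far from the region covered by the prior. A random z drawn from p(z) at
generation time then falls in a part of the latent space the decoder was never trained on, so the
generated image is garbage.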
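
To make "high KL divergence" concrete, here is a minimal sketch that assumes a diagonal Gaussian posterior and a standard normal prior, for which the divergence has a closed form. The function name and the example values are illustrative assumptions, not part of the exam solution.

```python
import math

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv
        for m, lv in zip(mu, log_var)
    )

# Posterior identical to the prior: the divergence is zero.
kl_matched = kl_to_standard_normal([0.0, 0.0], [0.0, 0.0])

# Posterior centred far from the origin with a tiny variance: the divergence is large,
# so a z drawn from N(0, I) rarely resembles the codes the decoder saw in training.
kl_far = kl_to_standard_normal([4.0, -4.0], [-4.0, -4.0])

print(kl_matched, kl_far)
```
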
3. Explain the concept of uniform dequantization in the context of applying flow-based models.
Uniform dequantization is a technique used to adapt flow-based models for discrete data. Flow-based
models, which are designed to model distributions of continuous data, rely on the ability to perform
exact density estimation and to invertibly map between data spaces. However, many types of data
encountered in practice, like images, are inherently discrete, with pixel values typically represented as
integers within a certain range (e.g., 0 to 255 for 8-bit images).
The process of uniform dequantization involves adding a small amount of uniform noise to each
discrete data point. Specifically, for a discrete data point y, noise u sampled from a uniform
distribution U over an interval [0,1) or [−0.5, 0.5) is added to y to produce a continuous variable
x = y + u. This noise addition spreads the discrete data points across the continuous interval
between neighbouring integer values, making the data distribution continuous. Without it, a
continuous density model fitted to discrete values can assign arbitrarily high density to those few
points, which gives a degenerate solution.
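
The noise step x = y + u can be sketched in a few lines. This is only an illustration that assumes u is drawn from [0, 1); the function name and the sample pixel values are hypothetical.

```python
import math
import random

def dequantize(pixels, rng):
    """Add independent noise u ~ U[0, 1) to every integer pixel value."""
    return [y + rng.random() for y in pixels]

pixels = [0, 17, 128, 255]                  # 8-bit intensities
x = dequantize(pixels, random.Random(0))    # continuous values, each in [y, y + 1)

# Taking the floor recovers the original integers, so no information is lost.
recovered = [math.floor(v) for v in x]
print(x, recovered)
```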

4. Why do VAEs generally generate blurry images as output?

Variational Autoencoders (VAEs) tend to generate blurry image outputs primarily due to their
underlying objective function, which balances reconstruction accuracy with a regularization term that
encourages the learned latent space to follow a specific distribution, typically a Gaussian. This
regularization term, which promotes the smoothing of the latent space to ensure a continuous and
complete representation, often leads to the averaging of similar data points. When generating new
samples, the decoder part of the VAE thus tends to produce outputs that are averages of similar
training examples, resulting in images that lack the sharpness and detail of the original data.
Additionally, the use of a pixel-wise loss function, such as mean squared error, in the reconstruction
objective can further exacerbate the blurriness by emphasizing the overall structure at the expense of
high-frequency details.

5. Why are GANs considered to be robust against the overfitting problem?

The generator never receives real data directly; it learns only from the discriminator's feedback. This
reduces the risk of memorizing the training dataset and thereby improves the model's generalization.

6. Can we use all available labels in the dataset to train a discriminator in the GAN model, or is it
always designed to be binary (to distinguish between fake and real)? Explain.

Yes, it's possible to extend beyond binary classification in more complex GAN variants, incorporating
multiple labels or attributes into the training process. Using all available labels in the dataset to train a
discriminator in a GAN model can enrich the learning process, enabling the generation of more
diverse and high-quality data. It can also help the discriminator become more robust by giving it a
deeper understanding of the data's underlying structure and characteristics.

7. How can the image de-duplication process help decrease the likelihood that a GAN memorizes and
directly replicates its training images?
Image de-duplication is a process that removes duplicate or highly similar images from a dataset. In
the context of training Generative Adversarial Networks (GANs), de-duplication plays a crucial role
in promoting the generation of novel images and reducing the likelihood that the GAN simply
memorizes and replicates its training images. Here’s how image de-duplication helps in this context:
- Enhances generalization by forcing the GAN to learn broader dataset features instead of memorizing
specific images.
- Reduces overfitting by removing bias towards repeated patterns, helping the model to better
generalize to unseen data.
- Improves model robustness by presenting a more challenging and varied set of training examples,
enhancing discriminator accuracy.

- Encourages creative generation by pushing the generator to explore the dataset's underlying space
and produce varied outputs.
- Prevents mode collapse by ensuring a broad representation of the dataset's variance, encouraging
diversity in generated images.
8. How is semantic hashing performed in the image de-duplication process?
An autoencoder is trained to compress and then reconstruct the images, which helps to remove noise
and unnecessary details.
Binarization and semantic hashing:
After training, the latent vector z of each image is used as its representation.
Each element of z is made binary (0 or 1) by thresholding: values above the threshold are set to 1,
and those below are set to 0.
The resulting binary code acts as a semantic hash: similar images are likely to have similar binary
codes, which allows efficient comparison and de-duplication.
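
The thresholding and comparison steps can be sketched as follows. The latent vectors, the 0.5 threshold and the zero-bit tolerance are assumptions chosen for illustration; in practice the vectors would come from the trained autoencoder's bottleneck.

```python
def binarize(z, threshold=0.5):
    """Turn a real-valued latent vector into a tuple of bits by thresholding."""
    return tuple(1 if v > threshold else 0 for v in z)

def hamming(a, b):
    """Number of bit positions in which two codes differ."""
    return sum(x != y for x, y in zip(a, b))

# Hypothetical latent vectors, e.g. sigmoid activations of an encoder bottleneck.
latents = {
    "img_a": [0.91, 0.08, 0.77, 0.12],
    "img_b": [0.88, 0.15, 0.69, 0.05],   # near-duplicate of img_a
    "img_c": [0.10, 0.95, 0.20, 0.81],
}
codes = {name: binarize(z) for name, z in latents.items()}

# An image is kept only if its code differs by more than max_dist bits
# from the code of every image kept so far.
max_dist = 0
kept = []
for name, code in codes.items():
    if all(hamming(code, codes[k]) > max_dist for k in kept):
        kept.append(name)

print(codes, kept)
```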

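9. Consider a Maxout layer that has 12 units with 4 pieces. Calculate the output of the Maxout layer
(y) when the following input is fed to it.
x = [3, −1, 2, 6, 4, 5, −2, 0, 1, 7, 9, 8]
Splitting the 12 inputs into 4 consecutive groups of 3 and taking the maximum of each group:
max(3, −1, 2) = 3, max(6, 4, 5) = 6, max(−2, 0, 1) = 1, max(7, 9, 8) = 9
y = [3, 6, 1, 9]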
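
The answer can be checked with a short sketch. It assumes, as the stated result does, that the 12 inputs are split into 4 consecutive groups of 3 and the maximum of each group is kept; the function name is illustrative.

```python
def maxout(x, num_groups):
    """Split x into num_groups consecutive, equally sized groups and keep each group's maximum."""
    if len(x) % num_groups != 0:
        raise ValueError("input length must be divisible by the number of groups")
    size = len(x) // num_groups
    return [max(x[i:i + size]) for i in range(0, len(x), size)]

x = [3, -1, 2, 6, 4, 5, -2, 0, 1, 7, 9, 8]
y = maxout(x, num_groups=4)
print(y)
```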

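10. How can cycle consistency losses be calculated in CycleGANs? Write the formulation.
CycleGANs consist of two mapping functions, G: X → Y and F: Y → X, where G translates images from
domain X to domain Y, and F translates images from domain Y to domain X. The cycle consistency loss
consists of two parts, a forward cycle term that requires F(G(x)) ≈ x and a backward cycle term that
requires G(F(y)) ≈ y:

$$\mathcal{L}_{cyc}(G, F) = \mathbb{E}_{x \sim p_{data}(x)}\big[\lVert F(G(x)) - x \rVert_1\big] + \mathbb{E}_{y \sim p_{data}(y)}\big[\lVert G(F(y)) - y \rVert_1\big]$$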
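
A numeric sketch of the forward term (F(G(x)) compared with x) and the backward term (G(F(y)) compared with y) is given below. The two "generators" are toy scalar functions standing in for trained networks, and the mean absolute difference is used in place of the L1 norm; both are assumptions made for illustration.

```python
def l1(a, b):
    """Mean absolute difference between two equally sized vectors."""
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)

def cycle_consistency_loss(G, F, xs, ys):
    """Forward term (x -> G -> F -> x) plus backward term (y -> F -> G -> y)."""
    forward = sum(l1(F(G(x)), x) for x in xs) / len(xs)
    backward = sum(l1(G(F(y)), y) for y in ys) / len(ys)
    return forward + backward

# Toy stand-ins for the two mappings (hypothetical, not trained networks).
def G(x):            # X -> Y
    return [2.0 * v for v in x]

def F_exact(y):      # Y -> X, exact inverse of G
    return [0.5 * v for v in y]

def F_off(y):        # Y -> X, slightly wrong inverse
    return [0.5 * v + 0.1 for v in y]

xs = [[1.0, 2.0], [3.0, 4.0]]
ys = [[2.0, 4.0], [6.0, 8.0]]

loss_exact = cycle_consistency_loss(G, F_exact, xs, ys)   # perfect cycles
loss_off = cycle_consistency_loss(G, F_off, xs, ys)       # imperfect cycles
print(loss_exact, loss_off)
```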

Q2: [5+5]

a) Given a 4x4 image of 3 bits as shown below. Calculate the entropy of this image.
Hint: Calculate histogram.

0 1 2 3
4 5 6 7
7 6 5 4
3 2 1 0

x_i   r_i   Prob(x_i)
0     2     2/16 = 0.125
1     2     2/16 = 0.125
2     2     2/16 = 0.125
3     2     2/16 = 0.125
4     2     2/16 = 0.125
5     2     2/16 = 0.125
6     2     2/16 = 0.125
7     2     2/16 = 0.125

$$H(X) = -\sum_{i=1}^{8} p_i \log_2(p_i)$$

$$H(X) = -\sum_{i=1}^{8} 0.125 \log_2(0.125)$$

$$H(X) = -8 \times 0.125 \times (-3) = 3 \text{ bits}$$
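
The histogram and entropy computation can be reproduced with a short sketch. The 4x4 array is the image from the question; the function name is illustrative.

```python
import math
from collections import Counter

image = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [7, 6, 5, 4],
    [3, 2, 1, 0],
]

def entropy_bits(img):
    """Shannon entropy (base 2) of the grey-level histogram of a 2-D image."""
    pixels = [p for row in img for p in row]
    counts = Counter(pixels)
    n = len(pixels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

histogram = Counter(p for row in image for p in row)   # grey level -> count
h = entropy_bits(image)
print(dict(histogram), h)
```
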
b) What are Variational Autoencoders? How can we train a VAE and then use it for a
classification task?

Variational Autoencoders (VAEs) are a class of generative models. VAEs learn the parameters of
probability distributions representing the data in a latent space. This allows VAEs to generate new
data points similar to the ones in the training set.

A VAE consists of two main components: an encoder and a decoder.

Encoder: This part of the model takes an input x and encodes it into a latent space representation z.
The encoder outputs the parameters (mean μ and variance σ²) of a Gaussian distribution over the
possible values of z in the latent space.

Decoder: The decoder part takes a sampled point from the latent space and attempts to reconstruct the
original input x. The goal of the reconstruction process is to be as accurate as possible, which trains
the model to learn a meaningful representation of the data.

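Training a VAE involves optimizing the encoder and the decoder jointly. The loss function is the sum
of a reconstruction loss, which measures how closely the decoder output matches the input x, and a
KL divergence term, which keeps the encoder's distribution over z close to the prior.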

Classification
Once the VAE is trained, the encoder can be used as a feature extractor: each input is passed through
the encoder, and its latent representation (for example, the mean μ) becomes the input to a separate
classifier trained on the labelled data. Depending on the performance, you might need to fine-tune
the classifier or the entire model, or adjust hyperparameters, to improve the classification accuracy.

Q-3: [5+5]

a) Consider a corpus containing the following sentences (documents):

1. The quick brown fox jumps over the lazy dog.

2. Lazy foxes lie low.
3. The quick yellow bird flies high.
4. High and low, the bird flies.
5. A quick bird jumps over lazy dogs.
Calculate the following:
TF("quick", Document1)
IDF("quick", Corpus)
TF-IDF("quick", Document1, Corpus)

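TF("quick", Document1) = 1/9 ≈ 0.111 ("quick" occurs once among the 9 words of Document 1)
IDF("quick", Corpus) = ln(5/3) ≈ 0.511 ("quick" occurs in 3 of the 5 documents: 1, 3 and 5)
TF-IDF("quick", Document1, Corpus) = TF × IDF ≈ 0.111 × 0.511 ≈ 0.057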
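
The three values can be checked with a small sketch. It assumes a simple tokenizer (lowercase, punctuation stripped, split on whitespace) and the natural logarithm, which is what reproduces the 0.511 in the worked answer; the helper names are illustrative.

```python
import math
import string

corpus = [
    "The quick brown fox jumps over the lazy dog.",
    "Lazy foxes lie low.",
    "The quick yellow bird flies high.",
    "High and low, the bird flies.",
    "A quick bird jumps over lazy dogs.",
]

def tokenize(text):
    """Lowercase the text, strip punctuation and split on whitespace."""
    cleaned = text.lower().translate(str.maketrans("", "", string.punctuation))
    return cleaned.split()

def tf(term, doc):
    """Term count divided by the number of tokens in the document."""
    tokens = tokenize(doc)
    return tokens.count(term) / len(tokens)

def idf(term, docs):
    """Natural log of (number of documents / documents containing the term)."""
    containing = sum(1 for d in docs if term in tokenize(d))
    return math.log(len(docs) / containing)

tf_quick = tf("quick", corpus[0])
idf_quick = idf("quick", corpus)
tfidf_quick = tf_quick * idf_quick
print(round(tf_quick, 3), round(idf_quick, 3), round(tfidf_quick, 3))
```

Note that `idf` as written divides by zero for a term that appears in no document; a real implementation would need to handle that case.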

b) Explain the working of Mini Batch GANs. Which problem, commonly found in standard GANs, does
this type of GAN tackle?
Working of Mini Batch GANs
Mini Batch Generative Adversarial Networks (GANs) adjust the standard GAN framework to
improve the learning process, specifically addressing common issues like mode collapse and
training instability. The core innovation in Mini Batch GANs is in how the discriminator processes
information.

Mini-Batch Discrimination Technique

Mini Batch GANs incorporate a technique known as mini-batch discrimination. This technique
allows the discriminator to look at multiple examples (a mini-batch) at once, rather than making
decisions based on single samples. The idea is to give the discriminator context about the
diversity (or lack thereof) of samples it's evaluating, helping it to distinguish between real and
fake batches more effectively.

Calculating a Diversity Score

The discriminator calculates a score that reflects the diversity of the samples in a mini-batch. If
the generator is producing varied and realistic samples, the diversity score will be higher,
indicating a batch of samples that resembles the variation seen in real data. Conversely, a low
diversity score suggests that the generator's outputs are too similar to each other, signaling a
problem like mode collapse.

Based on the diversity score and the discriminator's ability to distinguish real from fake samples
considering batch context, the feedback to the generator is adjusted. The generator then uses this
feedback to update its parameters, aiming to produce more diverse and realistic samples in the
next iteration.
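
As a rough illustration of a batch-level diversity signal, the sketch below uses the mean pairwise L1 distance between the samples of a mini-batch. This is a simplified stand-in chosen for clarity, not the mini-batch discrimination layer itself, which compares learned discriminator features across the batch; the function name and the toy batches are assumptions.

```python
from itertools import combinations

def diversity_score(batch):
    """Mean pairwise L1 distance between the vectors of a mini-batch."""
    pairs = list(combinations(batch, 2))
    total = sum(sum(abs(a - b) for a, b in zip(u, v)) for u, v in pairs)
    return total / len(pairs)

# A varied batch versus a batch in which every sample is identical (mode collapse).
varied = [[0.0, 1.0], [1.0, 0.0], [0.5, 0.5], [1.0, 1.0]]
collapsed = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]

score_varied = diversity_score(varied)
score_collapsed = diversity_score(collapsed)
print(score_varied, score_collapsed)
```

A discriminator that sees a near-zero score for generated batches but a clearly positive score for real batches can use that gap as evidence that the batch is fake.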

Mini Batch GANs Solution:

The primary issue with standard GANs is the lack of diversity in the generated samples, often
referred to as mode collapse. In standard GANs, the generator may learn to produce only a small
set of highly realistic outputs that consistently fool the discriminator, neglecting the variety
present in the real data distribution. This leads to poor generalization and limits the usefulness
of the generated data.

By considering multiple instances within a mini-batch, the discriminator in a Mini Batch GAN becomes
more adept at recognizing small differences and patterns across a wider range of data. This forces
the generator to create more varied outputs to successfully fool the discriminator.

Q-4: [5+5]

a) Write down at least three limitations of CycleGAN, from the paper titled “Unpaired Image-to-Image
Translation using Cycle-Consistent Adversarial Networks”.
a) The model was trained on specific synsets (wild horse and zebra) from ImageNet, which does not include
images of a person riding a horse or zebra. This limitation in the diversity of the training data can restrict
the model's generalization ability to unseen or varied scenarios.
b) The method may incorrectly swap labels, such as tree and building labels, in the output of tasks like
photos→labels. This indicates a challenge in maintaining semantic consistency without explicit
paired guidance.
c) Although unpaired data is abundantly available, relying on it alone limits the precision and
reliability the model can achieve compared with training on paired data.

b) With the help of a diagram explain the working of conditional GANs. Write their objective
function.
• Conditional Generative Adversarial Nets (Conditional GANs) are an extension of the
original Generative Adversarial Networks (GANs) framework
• It incorporates conditional information into the data generation process.
• Both the generator and discriminator are provided with additional conditional data
– class labels or part of data features
• This allows the generated data to be more specific to the given condition
– More controlled and diverse data generation.

• Generator:
– The generator G takes a noise vector z and conditional information y to produce data
G(z|y)
– Not only produces realistic output but also matches the given condition.
• Discriminator:
– The discriminator ( D ) also receives the conditional information y alongside the real
data or the generated data from the generator.
– Its task is to determine whether the given data is real or fake and whether it
corresponds to the given condition.
– The discriminator assesses D(x, y), where ( x ) is either real or generated data.
• Objective Function:
– The generator is trained to create data that fools the discriminator into believing it is
real and correctly conditioned.
– The discriminator is trained to distinguish between real and fake data and to check that
the data adheres to the conditional context.
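
Written out, the conditional GAN objective is the standard GAN minimax game with the condition y supplied to both networks. The form below follows the notation used above, D(x, y) and G(z|y); the noise prior p_z(z) and data distribution p_data(x) are the usual symbols and are assumed here, since the diagram and formula from the original answer are not reproduced in the text.

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x, y)\big]
  + \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z \mid y),\, y)\big)\big]
```

The discriminator maximizes both terms, while the generator minimizes the second term by making D(G(z|y), y) close to 1.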
