
MINI-PROJECT 2023

CLASSIFICATION USING GLOW MODEL

Professor: Prof. Shekhar Verma
Mentor: Jagriti Singh
Team Members

IIT2021001 KSHITIJ
IIT2021235 TEJASWI.N
IIT2021033 SATHWIKA.CH
IIB2021039 JISHWITHA REDDY
IIB2021022 GAYATHRI NARAYANAM


PROBLEM STATEMENT

The objective is to adapt Glow, a generative model typically used for generating new data samples, to the task of classifying digits from the MNIST dataset. Specifically, the aim is to classify two distinct digits, '0' and '1', by training a separate Glow model for each digit and then using these models to classify new images.
GENERATIVE MODELS

• Unlike discriminative models, which learn to separate classes, generative models learn the underlying data distribution and create new instances from the learned patterns.
• This gives a principled way to capture and exploit the structure of complex datasets.
• These models find extensive application in tasks like image generation, data augmentation, and the creation of realistic simulations.
GENERATIVE ADVERSARIAL NETWORKS

• The core of a GAN consists of two parts: the Generator and the Discriminator.
• The Generator learns to create realistic-looking images starting from simple noise patterns; the Discriminator's task is to identify whether a given image is real or created by the Generator.
• The Generator aims to create images so convincing that the Discriminator judges them real, while the Discriminator gets better at telling real images from fake ones.
• This process is a continuous game in which each part's success depends on outsmarting the other; the standard objective for this game is written out below.
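For concreteness (this is standard GAN theory rather than a formula from the slides), with Generator G, Discriminator D, and input noise z, the game is the minimax objective:

```latex
\min_G \max_D \; \mathbb{E}_{x \sim p_{\text{data}}}\big[\log D(x)\big] \;+\; \mathbb{E}_{z \sim p_z}\big[\log\big(1 - D(G(z))\big)\big]
```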

VAE

• Variational Autoencoders (VAEs) are a popular unsupervised approach for learning complex data distributions, such as images, using neural networks.
• VAEs aim to learn and sample from the probability distribution of the training data via a low-dimensional latent representation; the usual training objective is sketched below.
• Latent variables in VAEs are inferred rather than directly observed, and they are crucial for generating data that closely resembles the training set.
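As a reference point (standard VAE theory, not spelled out in the slides), VAEs are trained by maximizing the evidence lower bound (ELBO), which trades off reconstruction quality against keeping the approximate posterior close to the prior:

```latex
\mathcal{L}(\theta, \phi; x) \;=\; \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big] \;-\; D_{\mathrm{KL}}\big(q_\phi(z \mid x) \,\|\, p(z)\big)
```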

NORMALIZING FLOWS

• Normalizing Flows turn simple distributions into complex ones using invertible neural networks, making exact density estimation tractable.
• They apply a series of transformations that keep density calculations manageable, thanks to the change of variables formula.
• Unlike GANs and VAEs, Normalizing Flows can compute likelihoods exactly and invert their transformations, leading to more straightforward and efficient modeling.
• This combination of invertible transformations and exact density estimation is what makes them unique in deep generative modeling.

CHANGE OF VARIABLES IN NORMALIZING FLOWS

• The change of variables formula is crucial in normalizing flows: it allows a simple probability distribution to be transformed into a more complex one that represents the data more accurately, as written out below.
• This transformation is achieved with an invertible function that maps the variables of the original distribution while preserving the overall probability structure.
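Concretely (the slides reference the formula but do not display it), if an invertible map f sends data x to a latent z = f(x) with a simple base density p_Z, the change of variables formula gives the exact data density:

```latex
p_X(x) \;=\; p_Z\big(f(x)\big)\,\left|\det \frac{\partial f(x)}{\partial x}\right|,
\qquad
\log p_X(x) \;=\; \log p_Z\big(f(x)\big) \;+\; \sum_{i=1}^{K} \log\left|\det \frac{\partial f_i}{\partial h_{i-1}}\right|
```

The second form, for a composition of K invertible layers with intermediate activations h_i, is what flow models such as Glow actually optimize.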

GLOW-MODEL

• The Glow model is a neural network architecture characterized by its multi-scale structure. It is built from a series of repeating layers, each referred to as a scale, which are fundamental to its functioning.
• At the heart of each scale is a squeeze function that reshapes the input tensor from [c, h, w] to [4*c, h/2, w/2], increasing the channel depth while halving the spatial dimensions. This transformation, sketched below, is crucial for the model's ability to process images at different scales.
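A minimal sketch of the squeeze operation and its inverse, assuming PyTorch tensors with a leading batch dimension (the function names here are ours, not necessarily those in the project code):

```python
import torch

def squeeze2d(x: torch.Tensor) -> torch.Tensor:
    """Reshape [b, c, h, w] -> [b, 4*c, h/2, w/2] by folding each 2x2 spatial block into channels."""
    b, c, h, w = x.shape
    x = x.view(b, c, h // 2, 2, w // 2, 2)        # split each spatial dim into pairs
    x = x.permute(0, 1, 3, 5, 2, 4).contiguous()  # move the 2x2 block dims next to the channels
    return x.view(b, 4 * c, h // 2, w // 2)

def unsqueeze2d(x: torch.Tensor) -> torch.Tensor:
    """Exact inverse: [b, 4*c, h/2, w/2] -> [b, c, h, w]."""
    b, c4, h2, w2 = x.shape
    x = x.view(b, c4 // 4, 2, 2, h2, w2)
    x = x.permute(0, 1, 4, 2, 5, 3).contiguous()  # restore the original pixel layout
    return x.view(b, c4 // 4, h2 * 2, w2 * 2)
```

Because the reshape is a pure permutation of values, unsqueeze2d(squeeze2d(x)) returns x exactly, which is the reversibility the next slide relies on.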

CONT..

• Following the squeeze function is a flow step consisting of ActNorm for normalization, an invertible 1x1 convolution for channel-wise information mixing, and an affine coupling layer for the learned transformation; a sketch of the coupling layer follows below.
• The Glow model is designed around reversible operations: each layer can be applied in both the forward and reverse directions, allowing greater flexibility during training and testing. For instance, during testing the reshaping function can revert a tensor from [4*c, h/2, w/2] back to its original size [c, h, w], demonstrating the model's adaptability.
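A simplified sketch of the affine coupling layer, the part of the flow step that makes inversion cheap (our own PyTorch illustration under stated assumptions, not the exact project code; the channel count must be even):

```python
import torch
import torch.nn as nn

class AffineCoupling(nn.Module):
    """Split channels in half; transform one half conditioned on the other.
    Both directions are exact, and the log-determinant is cheap to compute."""
    def __init__(self, channels: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(channels // 2, hidden, 3, padding=1), nn.ReLU(),
            nn.Conv2d(hidden, channels, 3, padding=1),  # outputs log-scale and shift
        )

    def forward(self, x: torch.Tensor):
        x_a, x_b = x.chunk(2, dim=1)
        log_s, t = self.net(x_a).chunk(2, dim=1)
        log_s = torch.tanh(log_s)               # bound the scales for stability
        y_b = x_b * log_s.exp() + t             # affine transform of the second half
        log_det = log_s.flatten(1).sum(dim=1)   # log|det J| term for the likelihood
        return torch.cat([x_a, y_b], dim=1), log_det

    def inverse(self, y: torch.Tensor):
        y_a, y_b = y.chunk(2, dim=1)            # the untouched half reproduces log_s, t
        log_s, t = self.net(y_a).chunk(2, dim=1)
        log_s = torch.tanh(log_s)
        x_b = (y_b - t) * (-log_s).exp()        # undo the affine transform exactly
        return torch.cat([y_a, x_b], dim=1)
```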

ARCHITECTURE OF GLOW-MODEL
GENERATED DATA USING GLOW-MODEL
USING THE MODEL FOR CLASSIFICATION

• We trained two separate Glow models: one on the '0' digit images and another on the '1' digit
images from the MNIST dataset, enabling us to learn the distinct distributions specific to each
digit.
• After training, each model can map its respective images of '0' or '1' to points in its latent space.
This space is a lower-dimensional area where the model encodes information from each input
image.
• In this latent space, each point represents an encoding of an image, capturing essential features
of '0' or '1' as learned by the respective models.
CONT..

• Passing images of '0' and '1' through their trained models gives us their
corresponding latent representations.
• We calculate the mean of all latent vectors for '0' and '1' separately,
obtaining two distinct vectors. These vectors represent the "average"
points in the latent space for the digits '0' and '1', respectively.
• These mean latent vectors symbolize the typical characteristics of the
digits '0' and '1' as learned from our training datasets for each digit.
• The mean vectors serve as reference points in the latent space,
allowing us to see how other images or different digits differ from the
mean latent vectors of '0' and '1'.
CONT..

• Test images from the MNIST dataset are then passed through both Glow
models. Each model, having been trained on a specific digit, encodes
the test images into latent vectors, capturing the key features as
learned for '0' and '1', respectively.
• We calculate the Euclidean distance between each test image's latent
vector (from both models) and the mean latent vectors of '0' and '1'.
• A test image is then classified based on which mean latent vector—'0'
or '1'—it is closer to, as determined by the calculated distances from
both models.
• This proximity-based approach, using the dual encoding by the separate
models, enables us to accurately predict the identity of each digit. We
do this by comparing the test image's position in the latent space
relative to the mean positions of '0' and '1'.
GET-LATENT-MEAN

• We implemented the get_latent_mean_vector function to calculate a mean latent vector for each class by averaging the individual latent representations; a sketch of this function follows below. This process captures the key features of each class in the latent space.
• These mean vectors act as benchmarks for classifying new data: they let the model identify the class that most closely matches an input by comparing it against these class-specific representations.
• We used get_latent_mean_vector to calculate the mean latent vectors for '0' and '1', giving us insight into how each model uniquely represents its digit.
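The slides describe the function but do not show its code; a minimal sketch of what get_latent_mean_vector could look like, assuming the trained Glow model's forward pass returns the latent tensor for a batch and that the loader yields images of a single digit class (both interfaces are our assumptions):

```python
import torch

@torch.no_grad()
def get_latent_mean_vector(model, loader, device="cpu"):
    """Encode every image of one class and average the flattened latent vectors."""
    total, count = None, 0
    for images, _ in loader:                  # loader contains only one digit class
        z = model(images.to(device))          # assumed: forward pass returns latents
        z = z.flatten(start_dim=1)            # [batch, latent_dim]
        batch_sum = z.sum(dim=0)
        total = batch_sum if total is None else total + batch_sum
        count += z.shape[0]
    return total / count                      # mean latent vector for this class
```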
CLASSIFICATION OF IMAGE USING PRE-CALCULATED MEAN VECTOR

• We implemented the classify function to assign a class to each test instance by comparing the distance of its latent vector to the mean latent vectors of '0' and '1', classifying each digit by closest proximity; a sketch follows below.
• Classification is determined by the shortest Euclidean distance to the mean vectors of '0' and '1', which keeps the decision rule simple.
• The function's accuracy is measured as the percentage of correct predictions against the actual labels in the test dataset.
• Our results report the accuracy for both '0' and '1', indicating the model's ability to distinguish effectively between these two digits.
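A sketch of the classify step as described above: encode the test image with both models and pick the digit whose mean latent vector is nearest in Euclidean distance (model interfaces assumed as in the previous sketch):

```python
import torch

@torch.no_grad()
def classify(image, model_0, model_1, mean_0, mean_1, device="cpu"):
    """Return 0 or 1 depending on which class mean the image's latent is closest to."""
    x = image.unsqueeze(0).to(device)
    z0 = model_0(x).flatten()                   # latent under the '0' model
    z1 = model_1(x).flatten()                   # latent under the '1' model
    d0 = torch.linalg.vector_norm(z0 - mean_0)  # Euclidean distance to the '0' mean
    d1 = torch.linalg.vector_norm(z1 - mean_1)  # Euclidean distance to the '1' mean
    return 0 if d0 < d1 else 1
```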
EXTENDED PERFORMANCE METRICS: PRECISION, RECALL, AND F1 SCORE

• We also built a single confusion matrix covering both '0' and '1', and calculated precision, recall, and the F1 score to further evaluate the model's performance; one way to compute these is sketched below.
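The slides do not say which library was used; a common approach is scikit-learn's metrics module, with the 0/1 labels collected during the test loop:

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score, f1_score

# Placeholder labels; in practice these come from the test loop described above.
y_true = [0, 0, 1, 1, 0, 1]
y_pred = [0, 0, 1, 0, 0, 1]

print("Confusion matrix:\n", confusion_matrix(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall:", recall_score(y_true, y_pred))
print("F1 score:", f1_score(y_true, y_pred))
```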
CONCLUSION AND FURTHER SCOPE

In conclusion, our exploration of the Glow generative model for classifying the digits '0' and '1' has been both enlightening and rewarding.
A natural next step is to extend the approach to classify more than two classes.
The positive outcomes of our project invite further research and development.
This venture into generative models for classification tasks hints at a promising future, particularly in addressing more complex challenges in image recognition.

Final Project Link
REFERENCE LINKS

• https://github.com/VincentStimper/normalizing-flows
• https://lyusungwon.github.io/studies/2018/07/23/glow/
• https://towardsdatascience.com/introduction-to-normalizing-flows-d002af262a4b
• https://medium.com/@sairajreddy/gan-generative-adversarial-nets-e8520157ec62
• https://learnopencv.com/wp-content/uploads/2020/11/vae-diagram-scaled.jpg
• https://ankurdhuriya.medium.com/what-are-normalizing-flows-ce7ccd222ee7
THANK YOU!
