
UNIT VI

Auto-Encoders

P Jyothi, Asst. Prof., CSE Dept.


Auto-Encoders Introduction

 Encoder:
• The encoder is a neural network with one or more hidden layers.
• It receives noisy input data instead of the original input and generates an encoding in a low-dimensional space.
• There are several ways to generate a corrupted input; the most common are adding Gaussian noise or randomly masking some of the inputs.
 Decoder:
• Like the encoder, the decoder is implemented as a neural network with one or more hidden layers.
• It takes the encoding generated by the encoder as input and reconstructs the original data.
• When calculating the loss function, the output values are compared with the original input, not with the corrupted input.
Auto-Encoders Introduction conti..

 An Autoencoder has the following parts (a code sketch follows the list):

1. Encoder: the part of the network that takes in the input and produces a lower-dimensional encoding.
2. Bottleneck: the lower-dimensional hidden layer where the encoding is produced. The bottleneck layer has a smaller number of nodes, and the number of nodes in the bottleneck layer gives the dimension of the encoding of the input.
3. Decoder: takes in the encoding and reconstructs the input.
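To make the three parts concrete, here is a minimal sketch of a fully connected autoencoder in PyTorch. The framework choice and the layer sizes (784 inputs, a 32-unit bottleneck) are illustrative assumptions, not something prescribed by the slides.

```python
import torch
import torch.nn as nn

class AutoEncoder(nn.Module):
    def __init__(self, input_dim=784, bottleneck_dim=32):
        super().__init__()
        # Encoder: maps the input to a lower-dimensional encoding.
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, 128),
            nn.ReLU(),
            nn.Linear(128, bottleneck_dim),   # bottleneck layer
        )
        # Decoder: reconstructs the input from the encoding.
        self.decoder = nn.Sequential(
            nn.Linear(bottleneck_dim, 128),
            nn.ReLU(),
            nn.Linear(128, input_dim),
            nn.Sigmoid(),                     # assumes inputs scaled to [0, 1]
        )

    def forward(self, x):
        code = self.encoder(x)                # encoding produced at the bottleneck
        return self.decoder(code)             # reconstruction of the input
```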
Auto-Encoders Introduction conti..

[Figure: autoencoder architecture showing the encoder (parameters φ), the bottleneck code, and the decoder (parameters θ)]


Auto-Encoders Introduction conti..

 The bottleneck layer is the lower-dimensional layer. In the diagram, we have the encoder and decoder neural networks; φ (phi) and θ (theta) are the parameters of the encoder and decoder, respectively.
 The goal of this model is for the input to be equivalent to the reconstructed output. To achieve this we minimize a loss function called the Reconstruction Loss. The reconstruction loss is the error between the input and the reconstructed output, and is usually given by the mean squared error or the binary cross-entropy between the input and the reconstructed output. Binary cross-entropy is used if the data is binary.
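A minimal training step for this objective, continuing the assumed PyTorch sketch from above, might look as follows. `data_loader` is a hypothetical iterator over input batches; mean squared error is used here, and for binary data `F.binary_cross_entropy` would replace it.

```python
import torch
import torch.nn.functional as F

model = AutoEncoder()                         # model sketched earlier
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for x in data_loader:                         # hypothetical batch iterator
    x = x.view(x.size(0), -1)                 # flatten each sample to a vector
    x_hat = model(x)                          # reconstructed output

    # Reconstruction loss: error between the input and the reconstructed output.
    loss = F.mse_loss(x_hat, x)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```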
Auto-Encoders Introduction conti..

Autoencoders:
 Autoencoders present an efficient way to learn a
representation of your data, which helps with tasks such as
dimensionality reduction or feature extraction. You can even
train an autoencoder to identify and remove noise from
your data.



Regularization in auto-encoders

 Regularization counteracts the effect of out-of-control parameters by using different methods to minimize parameter size over time.
 In mathematical notation, regularization is represented by the coefficient lambda (λ), which controls the trade-off between finding a good fit and keeping the values of certain feature weights low as the exponents on the features increase.
 L1 and L2 regularization help fight overfitting by making certain weights smaller. Smaller-valued weights lead to simpler hypotheses, which are the most generalizable. Unregularized weights with several higher-order polynomials in the feature set tend to overfit the training set.
 As the training set grows, the effect of regularization decreases and the parameters tend to increase in magnitude. This is appropriate, because an excess of features relative to training examples is what leads to overfitting in the first place. Bigger data is the ultimate regularizer.
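As a rough sketch of how the coefficient λ (lambda) enters the objective, the helper below adds an L2 penalty on the weights to the reconstruction loss. The penalty strength `lam` is an illustrative value; in PyTorch the same effect is usually obtained through the optimizer's `weight_decay` argument.

```python
import torch
import torch.nn.functional as F

def l2_regularized_loss(model, x, lam=1e-4):
    """Reconstruction loss plus an L2 penalty on the weights, scaled by lambda."""
    x_hat = model(x)
    recon_loss = F.mse_loss(x_hat, x)
    # L2 regularization: penalize the squared magnitude of every weight.
    l2_penalty = sum(p.pow(2).sum() for p in model.parameters())
    return recon_loss + lam * l2_penalty
```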
Regularization in auto-encoders conti..

 There are other ways to constrain the reconstruction of an autoencoder than to impose a hidden layer of smaller dimension than the input. Rather than limiting the model capacity by keeping the encoder and decoder shallow and the code size small, regularized autoencoders use a loss function that encourages the model to have other properties besides the ability to copy its input to its output. In practice, we usually find two types of regularized autoencoder: the sparse autoencoder and the denoising autoencoder.




Regularization in auto-encoders conti..

 Sparse autoencoder : Sparse autoencoders are typically


used to learn features for another task such as
classification. An autoencoder that has been regularized to
be sparse must respond to unique statistical features of the
dataset it has been trained on, rather than simply acting as
an identity function. In this way, training to perform the
copying task with a sparsity penalty can yield a model that
has learned useful features as a byproduct.



Regularization in auto-encoders conti..

 Another way to constrain the reconstruction of an autoencoder is to impose a constraint on its loss. We could, for example, add a regularization term to the loss function. Doing this makes the autoencoder learn a sparse representation of the data.
 Denoising autoencoder: Rather than adding a penalty to the loss function, we can obtain an autoencoder that learns something useful by changing the reconstruction error term of the loss function. This can be done by adding some noise to the input image and making the autoencoder learn to remove it. By this means, the encoder will extract the most important features and learn a more robust representation of the data.


Regularization in auto-encoders conti..

 There are two different ways to construct the sparsity penalty: L1 regularization and KL-divergence. Here we will only talk about L1 regularization.
 L1 regularization and L2 regularization are widely used in machine learning and deep learning. L1 regularization adds the "absolute value of magnitude" of the coefficients as the penalty term, while L2 regularization adds the "squared magnitude" of the coefficients as the penalty term.
 Although L1 and L2 can both be used as regularization terms, the key difference between them is that L1 regularization tends to shrink coefficients all the way to zero, while L2 regularization moves coefficients towards zero but never makes them exactly zero. Thus L1 regularization is often used as a method of feature selection. But why does L1 regularization lead to sparsity?
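Written out, the two penalized objectives differ only in the penalty term (J is the unregularized loss, w_i the weights, λ the regularization coefficient). The kink of |w| at zero is what lets the L1 penalty push weights exactly to zero, whereas the smooth quadratic of L2 only shrinks them towards zero.

```latex
J_{L1} = J + \lambda \sum_i |w_i|
\qquad\qquad
J_{L2} = J + \lambda \sum_i w_i^{2}
```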


Denoising Autoencoders

 Autoencoders are neural networks which are commonly used for feature selection and extraction. However, when there are more nodes in the hidden layer than there are inputs, the network risks learning the so-called "identity function", also called the "null function", meaning that the output simply equals the input, rendering the autoencoder useless.
 Denoising Autoencoders (DAE) solve this problem by corrupting the data on purpose, randomly turning some of the input values to zero. In general, the percentage of input nodes being set to zero is about 50%; other sources suggest a lower fraction, such as 30%. It depends on the amount of data and the number of input nodes you have.
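The two common corruption schemes mentioned above can be sketched as small helper functions (the 30% masking fraction and the noise scale are illustrative assumptions):

```python
import torch

def mask_noise(x, drop_prob=0.3):
    """Masking corruption: randomly set a fraction of input values to zero."""
    mask = (torch.rand_like(x) > drop_prob).float()
    return x * mask

def gaussian_noise(x, std=0.1):
    """Additive corruption: add zero-mean Gaussian noise to the input."""
    return x + std * torch.randn_like(x)
```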


Denoising Autoencoders Conti..

• If DAEs are trained with partially corrupted inputs (e.g., with masked values), they learn to impute or fill in missing information during the reconstruction process. This makes them useful for tasks involving incomplete datasets.
• If DAEs are trained with partially noisy inputs (Gaussian noise), they tend to generalize well to unseen, real-world data with different levels of noise or corruption, because they learn to extract robust features. This is beneficial in applications where data quality is compromised, such as image denoising or signal processing.
Denoising Autoencoders Conti..

Objective Function of DAE

 The objective of a DAE is to minimize the difference between the original input (the clean input, without the noise) and the reconstructed output. This is quantified using a reconstruction loss function. Two types of loss function are generally used, depending on the type of input data.
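In symbols, with x the clean input, x̃ its corrupted version, f the encoder and g the decoder, the two commonly used reconstruction losses can be written as follows (a standard formulation, not spelled out on the slide): mean squared error for continuous data and binary cross-entropy for binary data.

```latex
L_{\mathrm{MSE}} = \lVert x - g(f(\tilde{x})) \rVert^{2}
\qquad
L_{\mathrm{BCE}} = -\sum_i \left[ x_i \log \hat{x}_i + (1 - x_i)\log(1 - \hat{x}_i) \right],
\quad \hat{x} = g(f(\tilde{x}))
```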


Denoising Autoencoders Conti..

 A denoising autoencoder is a modification of the original autoencoder in which, instead of giving the original input, we give a corrupted or noisy version of the input to the encoder, while the loss is calculated with respect to the original input only. This results in more efficient learning, and the risk of the autoencoder becoming an identity function is significantly reduced.


Denoising Autoencoders Conti..

 Applications of DAE
• Image Denoising: DAEs are widely employed for cleaning and
enhancing images by removing noise.
• Audio Denoising: DAEs can be applied to denoise audio signals, making
them valuable in speech-enhancement tasks.
• Sensor Data Processing: DAEs are valuable in processing sensor data,
removing noise, and extracting relevant information from sensor
readings.
• Data Compression: Autoencoders, including DAEs, can be utilized for
data compression by learning compact representations of input data.
• Feature Learning: DAEs are effective in unsupervised feature learning,
capturing relevant features in the data without explicit labels.



Denoising Autoencoders Conti..

 Advantages
1. This type of autoencoder can extract important features and reduce the noise or the useless features.
2. Denoising autoencoders can be used as a form of data augmentation: the restored images can be used as augmented data, thus generating additional training samples.
 Disadvantages
1. Selecting the right type and level of noise to introduce can be challenging and may require domain knowledge.
2. The denoising process can result in the loss of some information that is needed from the original input. This loss can impact the accuracy of the output.


Denoising Autoencoders Conti..

Architecture

[Figure: denoising autoencoder architecture (corrupted input -> encoder -> code -> decoder -> reconstruction compared against the clean input)]


Denoising Autoencoders Conti..

 When calculating the Loss function, it is important to


compare the output values with the original input, not with
the corrupted input. That way, the risk of learning the
identity function instead of extracting features is eliminated.



Sparse Autoencoder

 A sparse autoencoder is simply an autoencoder whose training criterion involves a sparsity penalty. In most cases, we construct the loss function by penalizing activations of the hidden layers so that only a few nodes are encouraged to activate when a single sample is fed into the network.
 This type of autoencoder typically contains more hidden units than the input, but only a few are allowed to be active at once. This property is called the sparsity of the network. The sparsity of the network can be controlled by manually zeroing the required hidden units, tuning the activation functions, or adding a loss term to the cost function.
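A sketch of the loss-term approach, continuing the assumed PyTorch setting: the L1 penalty is applied to the bottleneck activations rather than to the weights, so only a few hidden units stay active for any given input. The penalty weight is an illustrative value, and the model is assumed to expose `encoder` and `decoder` parts as in the earlier sketch.

```python
import torch
import torch.nn.functional as F

def sparse_ae_loss(model, x, sparsity_weight=1e-3):
    """Reconstruction loss plus an L1 penalty on the hidden activations."""
    code = model.encoder(x)                   # bottleneck activations
    x_hat = model.decoder(code)
    recon_loss = F.mse_loss(x_hat, x)
    # L1 penalty on activations: encourages most hidden units to stay near zero.
    sparsity_penalty = code.abs().mean()
    return recon_loss + sparsity_weight * sparsity_penalty
```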


Sparse Autoencoder Conti..

 The intuition behind this method is that, for example, if a


man claims to be an expert in mathematics, computer
science, psychology, and classical music, he might be just
learning some quite shallow knowledge in these subjects.
However, if he only claims to be devoted to mathematics,
we would like to anticipate some useful insights from him.
And it’s the same for autoencoders we’re training — fewer
nodes activating while still keeping its performance would
guarantee that the autoencoder is actually learning latent
representations instead of redundant information in our
input data.


Sparse Autoencoder Conti..

 Advantages
1. The sparsity constraint in sparse autoencoders helps in filtering out noise and irrelevant features during the encoding process.
2. These autoencoders often learn important and meaningful features due to their emphasis on sparse activations.
 Disadvantages
1. The choice of hyperparameters plays a significant role in the performance of this autoencoder. Different inputs should result in the activation of different nodes of the network.
2. The application of the sparsity constraint increases computational complexity.


Contractive auto-encoders

 A Contractive Autoencoder (CAE) is a specific type of


autoencoder used in unsupervised machine learning.
Autoencoders are neural networks designed to learn
efficient representations of the input data, called encodings,
by training the network to ignore insignificant data (“noise”).
These encodings can then be used for tasks such as
dimensionality reduction, feature learning, and more.



Contractive auto-encoders conti..

 A Contractive Autoencoder consists of two main components: an


encoder and a decoder. The encoder compresses the input into a
lower-dimensional representation, and the decoder reconstructs the
input from this representation. The goal is for the reconstructed output
to be as close as possible to the original input.
 The training process involves minimizing a loss function that has two
terms. The first term is the reconstruction loss, which measures the
difference between the original input and the reconstructed output. The
second term is the regularization term, which measures the sensitivity
of the encoded representations to the input. By penalizing the
sensitivity, the CAE learns to produce encodings that do not change
much when the input is perturbed slightly, leading to more robust
features.
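For a single sigmoid encoding layer h = σ(Wx + b), the Frobenius norm of the encoder's Jacobian has a simple closed form, so the two-term loss described above can be sketched as follows. The layer sizes and the penalty weight λ are illustrative assumptions, and this is only one common way to implement the contractive penalty.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ContractiveAE(nn.Module):
    def __init__(self, input_dim=784, code_dim=32):
        super().__init__()
        self.enc = nn.Linear(input_dim, code_dim)
        self.dec = nn.Linear(code_dim, input_dim)

    def forward(self, x):
        h = torch.sigmoid(self.enc(x))        # encoding
        return torch.sigmoid(self.dec(h)), h

def contractive_loss(model, x, lam=1e-4):
    x_hat, h = model(x)
    recon = F.mse_loss(x_hat, x)              # reconstruction term
    # Contractive term: squared Frobenius norm of the Jacobian dh/dx.
    # For a sigmoid layer, J = diag(h * (1 - h)) @ W, so
    # ||J||_F^2 = sum_i (h_i (1 - h_i))^2 * sum_j W_ij^2.
    W = model.enc.weight                      # shape (code_dim, input_dim)
    dh = (h * (1 - h)).pow(2)                 # shape (batch, code_dim)
    w_sq = W.pow(2).sum(dim=1)                # shape (code_dim,)
    jacobian_penalty = (dh * w_sq).sum(dim=1).mean()
    return recon + lam * jacobian_penalty
```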
Contractive auto-encoders conti..

 The principle that contractive autoencoders are based on is quite similar to that of denoising autoencoders: the encodings produced for similar inputs should be similar. In other words, if we change or slightly perturb the inputs, the encodings should change very little. They are used for feature extraction.
 Autoencoders can be implemented using any kind of neural network; for image data we can use convolutional neural nets, and for time-series data we can use recurrent neural nets.
Contractive auto-encoders conti..

Applications of Contractive Autoencoders


 Contractive Autoencoders have several applications in the field of machine learning
and artificial intelligence:
• Feature Learning: CAEs can learn to capture the most salient features in the data,
which can then be used for various downstream tasks such as classification or
clustering.
• Dimensionality Reduction: Like other autoencoders, CAEs can reduce the dimensionality of data, which is useful for visualization or as a preprocessing step for other algorithms that perform poorly with high-dimensional data.
• Denoising: Due to their contractive property, CAEs can be used to remove noise
from data, as they learn to ignore small variations in the input.
• Data Generation: While not their primary application, autoencoders can generate
new data points by decoding samples from the learned encoding space.



Contractive auto-encoders conti..

Advantages of Contractive Autoencoders


 Contractive Autoencoders offer several advantages:
• Robustness to Noise: By design, CAEs are robust to small perturbations or
noise in the input data.
• Improved Generalization: The contractive penalty encourages the model to
learn more general features that do not depend on the specific noise or
variations present in the training data.
• Stability: The regularization term helps to stabilize the training process by
preventing the model from learning trivial or overfitted representations.



Contractive auto-encoders conti..

 Challenges with Contractive Autoencoders


 Despite their advantages, CAEs also present some challenges:
• Computational Complexity: Calculating the Jacobian matrix for the
contractive penalty can be computationally expensive, especially for large
neural networks.
• Hyperparameter Tuning: The strength of the contractive penalty is controlled by a hyperparameter that needs to be carefully tuned to balance the reconstruction loss and the regularization term.
• Choice of Regularization: The effectiveness of the CAE can depend on the
choice of regularization term, and different problems may require different forms
of the contractive penalty.



Structured Probabilistic Models for Deep
Learning
 A structured probabilistic model is a way of describing a probability distribution, using a graph (consisting of nodes and edges) to describe which random variables in the probability distribution interact with each other directly.
 Structured probabilistic models are often also referred to as graphical models.


Structured Probabilistic Models for Deep
Learning Conti..
 They are a way of describing probability distributions using a graph to describe which variables interact with each other directly.
 "Graph" is used here in the sense of graph theory: vertices connected to one another by edges.


Structured Probabilistic Models for Deep
Learning Conti..
 Structured probabilistic models use graphs to represent
interactions between random variables. Each node
represents a random variable. Each edge represents a
direct interaction. These direct interactions imply other,
indirect interactions, but only the direct interactions need
to be explicitly modeled. In the following sections we
describe two categories of graphical models: models
based on directed acyclic graphs, and models based on
undirected graphs.



Structured Probabilistic Models for Deep
Learning Conti..
1. Directed Models:

 One kind of structured probabilistic model is the directed graphical model, otherwise known as the belief network or Bayesian network. Its edges are directed; that is, they point from one vertex to another. Drawing an arrow from a to b means that the distribution over b depends on the value of a.


Structured Probabilistic Models for Deep
Learning Conti..
 As an example, consider a relay race in which Alice runs the first leg, Bob the second, and Carol the third, and suppose we name Alice's, Bob's and Carol's finishing times t0, t1 and t2 respectively. Our estimate of t1 depends on t0, while our estimate of t2 depends directly on t1 but only indirectly on t0. We can draw this relationship as a directed graphical model with arrows t0 -> t1 -> t2 (figure 1).
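For this three-variable chain, the general factorization given on the next slide specializes to a product of one marginal and two conditionals:

```latex
p(t_0, t_1, t_2) = p(t_0)\, p(t_1 \mid t_0)\, p(t_2 \mid t_1)
```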


Structured Probabilistic Models for Deep
Learning Conti..
 Formally, a directed graphical model defined on variables x is defined by a directed acyclic graph G whose vertices are the random variables in the model, and a set of local conditional probability distributions p(x_i | Pa_G(x_i)), where Pa_G(x_i) gives the set of parents of x_i in G. The probability distribution over x is given by

p(x) = Π_i p(x_i | Pa_G(x_i))




Structured Probabilistic Models for Deep
Learning Conti..
2. Undirected Models:

 Another popular language is that of undirected models, otherwise known as Markov random fields (MRFs) or Markov networks.
 Not all situations we might want to model have such a clear direction to their interactions. When the interactions seem to have no intrinsic direction, or to operate in both directions, it may be more appropriate to use an undirected model.


Structured Probabilistic Models for Deep
Learning Conti..
 As an example of such a situation, suppose we want to model a distribution over three binary variables: whether or not you are sick, whether or not your coworker is sick, and whether or not your roommate is sick, represented by h_y, h_c and h_r. Assuming that your coworker and your roommate do not know each other, it is very unlikely that one of them will give the other an infection directly. However, it is reasonably likely that either of them could give you a cold, and that you could pass it on to the other.
 We can model the indirect transmission of a cold from your coworker to your roommate by modeling the transmission of the cold from your coworker to you and the transmission of the cold from you to your roommate. See figure 2 for the drawing representing this scenario. Unlike in directed models, an edge in an undirected model has no arrow and is not associated with a conditional probability distribution.
Structured Probabilistic Models for Deep
Learning Conti..

[Figure 2: undirected graph with edges h_c -- h_y and h_y -- h_r; there is no direct edge between h_c and h_r]


Structured Probabilistic Models for Deep
Learning Conti..
 Formally, an undirected graphical model is a structured probabilistic model defined on an undirected graph G. For each clique C in the graph, a factor φ(C) (also called a clique potential) measures the affinity of the variables in that clique for being in each of their possible joint states. The factors are constrained to be non-negative. Together they define an unnormalized probability distribution

p̃(x) = Π_{C ∈ G} φ(C)


Structured Probabilistic Models for Deep
Learning Conti..
 To complete the model, we would also need to define a similar factor for the clique containing h_y and h_r.
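Putting the two clique factors together, the full distribution for the cold example can be written in the standard normalized form, where Z is the partition function that sums the unnormalized probability over all joint states:

```latex
p(h_y, h_c, h_r) = \frac{1}{Z}\, \phi_1(h_y, h_c)\, \phi_2(h_y, h_r),
\qquad
Z = \sum_{h_y, h_c, h_r} \phi_1(h_y, h_c)\, \phi_2(h_y, h_r)
```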


Structured Probabilistic Models for Deep
Learning Conti..
Advantages of Structured Modeling:
 The primary advantage of using structured probabilistic models
is that they allow us to dramatically reduce the cost of
representing probability distributions as well as learning and
inference. Sampling is also accelerated in the case of directed
models, while the situation can be complicated with undirected
models. A less quantifiable benefit of using structured
probabilistic models is that they allow us to explicitly separate
representation of knowledge from learning of knowledge or
inference given existing knowledge. This makes our models
easier to develop and debug.

