Unit 4

Uploaded by

MALATHI JANAPATI

4.

CONVOLUTIONAL NEURAL NETWORKS

Convolutional Neural Networks (CNNs) are the standard neural network architecture
used for prediction when the input observations are images, which is the case in a wide range
of neural network applications. A CNN typically consists of three types of layers:
convolutional layers, pooling layers, and fully connected layers. The main advantage of
CNNs is that they learn the important features of an image automatically during training,
without requiring hand-engineered features.

NEURAL NETWORKS AND REPRESENTATION LEARNING

 Neural networks initially receive data on observations, with each observation
represented by some number n of features.
 A simple neural network model with one hidden layer performs better than a model
without that hidden layer.
 One reason is that the neural network can learn nonlinear relationships between
input and output.
 However, a more general reason is that in machine learning, we often need linear
combinations of our original features in order to effectively predict our target.
 Let's say that the pixel values for an MNIST digit are x1 through x784. A particular
combination of these values, such as some pixels being brighter than average while
others are darker than average, may strongly predict that an image shows a certain digit.
 There may be many other such combinations, all of which contribute positively or
negatively to the probability that an image is of a particular digit.
 Neural networks can automatically discover combinations of the original features that
are important through their training process.
 This process of learning which combinations of features are important is known as
representation learning, and it's the main reason why neural networks are successful
across different domains.

The above represents a neural network that starts with n features and then learns
somewhere between √n and n “combinations” of these features to make predictions.

A Different Architecture for Image Data:

Here, we create combinations of features, as before, but an order of magnitude more
of them, and have each one be only a combination of the pixels from a small rectangular
patch in the input image. With image data, we can define each learned feature to be a
function of a small patch of data, and thus define somewhere between n and n² output
neurons.

Having our neural network learn combinations of all of the input features (that is,
combinations of all of the pixels in the input image) turns out to be very inefficient, since it
ignores the insight described above: most of the interesting combinations of features in
images occur in small patches.
Nevertheless, with the fully connected approach it was at least extremely easy to compute
new features that were combinations of all the input features: if we had f input features and
wanted to compute n new features, we could simply multiply the ndarray containing our input
features by an f × n matrix. The convolution operation plays the corresponding role for image
data: it computes many combinations of the pixels from local patches of the input image.
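
The fully connected case described above can be sketched in a few lines of NumPy. This is only an illustration: the batch size, the value n = 32, and the random weights are assumptions chosen for the example, not values from these notes.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumed): 4 observations, f = 784 input features, n = 32 new features
batch_size, f, n = 4, 784, 32

X = rng.normal(size=(batch_size, f))   # one row of f input features per observation
W = rng.normal(size=(f, n))            # f x n weight matrix (random here, learned in practice)

# Each of the n new features is a linear combination of ALL f input features
new_features = X @ W
print(new_features.shape)
```

Every output column depends on every pixel, which is exactly the inefficiency that the convolution operation avoids.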

CONVOLUTIONAL LAYERS
A Convolutional Neural Network (CNN) is an extension of the artificial neural network
(ANN) that is predominantly used to extract features from grid-like data, for example
visual datasets such as images or videos, where spatial patterns play an important role.
A CNN consists of multiple layers: an input layer, convolutional layers, pooling layers,
and fully connected layers.

A CNN takes an image as input, processes it, and classifies it under a certain category
such as dog, cat, lion, or tiger. The computer sees an image as an array of pixels whose size
depends on the resolution of the image. Based on the image resolution, it sees an array of size
h * w * d, where h = height, w = width, and d = depth (the number of channels). For example,
a 6 * 6 RGB image is a 6 * 6 * 3 array, and a 4 * 4 grayscale image is a 4 * 4 * 1 array.
In a CNN, each input image passes through a sequence of convolutional layers (whose
weights are called filters or kernels), pooling layers, and fully connected layers. After that,
we apply the softmax function to classify the object with probability values between 0 and 1.

Convolution Layer:
The convolution layer is the first layer to extract features from an input image. By
learning image features from small squares of input data, the convolutional layer preserves
the spatial relationship between pixels. Convolution is a mathematical operation that takes
two inputs: an image matrix and a kernel or filter. With no padding and a stride of 1:
 The dimension of the image matrix is h × w × d
 The dimension of the filter is f_h × f_w × d
 The dimension of the output is (h − f_h + 1) × (w − f_w + 1) × 1
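
The output dimension above can be checked with a small NumPy sketch. The function name conv_valid and the all-ones image and filter are assumptions made for illustration; the loop implements the sliding elementwise product and sum that deep learning libraries call convolution (strictly a cross-correlation), with no padding and a stride of 1.

```python
import numpy as np

def conv_valid(image, kernel):
    """Slide `kernel` over `image` with stride 1 and no padding."""
    h, w, d = image.shape
    f_h, f_w, f_d = kernel.shape
    assert d == f_d, "filter depth must match image depth"
    out = np.zeros((h - f_h + 1, w - f_w + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + f_h, j:j + f_w, :]
            out[i, j] = np.sum(patch * kernel)  # elementwise product, then sum
    return out

image = np.ones((6, 6, 3))    # a 6 x 6 RGB image of ones (illustrative)
kernel = np.ones((3, 3, 3))   # a 3 x 3 filter spanning all 3 channels
feature_map = conv_valid(image, kernel)
print(feature_map.shape)      # (6 - 3 + 1, 6 - 3 + 1) = (4, 4)
```

Because the filter spans all d channels, each filter produces a single two-dimensional map, which is why the last output dimension is 1.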

Figure: Image matrix multiplies kernel or filter matrix

Filters / Kernels:
 A filter provides a measure for how close a patch or a region of the input resembles a
feature. A feature may be any prominent aspect – a vertical edge, a horizontal edge,
an arch, a diagonal, etc.
 A filter acts as a single template or pattern, which, when convolved across the input,
finds similarities between the stored template & different locations/regions in the
input image.
 Let us consider an example of detecting a vertical edge in the input image.
 Each element of the 4×4 output matrix looks at exactly three columns & three rows
of the input image (the coloured boxes show the output of the filter as it moves over
the input image). The values in the output matrix represent the change in intensity
along the horizontal direction across the corresponding columns of the input image.
 The output image has the value 0 in the 1st & last column. It means there is no change
in intensity in the first three columns & the last three columns of the input image.
On the other hand, the output is 30 in the 2nd & 3rd column, indicating a change in the
intensity of the corresponding columns of the input image.
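
The vertical edge example can be reproduced numerically. The original figure is not available here, so the 6×6 input (left half 10, right half 0) and the filter values are assumptions chosen to give the 0 and 30 outputs described above.

```python
import numpy as np

# Assumed 6 x 6 input: bright left half (10), dark right half (0)
image = np.array([[10, 10, 10, 0, 0, 0]] * 6)

# Assumed 3 x 3 vertical edge filter: left column +1, right column -1
kernel = np.array([[1, 0, -1]] * 3)

out = np.zeros((4, 4), dtype=int)
for i in range(4):
    for j in range(4):
        # Compare the 3 x 3 patch at (i, j) with the filter template
        out[i, j] = np.sum(image[i:i + 3, j:j + 3] * kernel)

print(out)
# Every row is [0, 30, 30, 0]: the response is nonzero only where the
# patch straddles the boundary between the bright and dark regions.
```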

Let’s start by considering a 5*5 image whose pixel values are 0 or 1, and a 3*3 filter
matrix:

The result of convolving the 5*5 image matrix with the 3*3 filter matrix is called the
"feature map" and is shown as the output.

Convolving an image with different filters can perform operations such as blurring,
sharpening, and edge detection.

Strides:
During convolution, the filter slides from left to right and from top to bottom until it
passes through the entire input image. We define the stride S as the step size of the filter.
A stride of S = 1 moves the filter one pixel at a time; when we want to downsample the input
image and end up with a smaller output, we set S > 1.

Padding:
In a convolutional layer, we observe that the pixels located on the corners and the
edges are used much less than those in the middle. A simple and powerful solution to this
problem is padding, which adds rows and columns of zeros to the input image. If we apply
padding P to an input image of size W×H, the padded image has dimensions (W+2P)×(H+2P).
For example, padding a 5×5 image with P=2 increases its dimensions from 5×5 to 9×9.

By using padding in a convolutional layer, we increase the contribution of pixels at
the corners and the edges to the learning procedure.
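
Stride and padding together determine the output size of a convolution. The sketch below uses the standard formula floor((n + 2p − f) / s) + 1 along each dimension; the helper name conv_output_size is an assumption for illustration, and np.pad is used to show the 5×5 to 9×9 padding example.

```python
import numpy as np

def conv_output_size(n, f, p=0, s=1):
    """Output length along one dimension: floor((n + 2p - f) / s) + 1."""
    return (n + 2 * p - f) // s + 1

# Zero padding with P = 2 turns a 5 x 5 image into a 9 x 9 image
image = np.arange(25).reshape(5, 5)
padded = np.pad(image, pad_width=2, mode="constant", constant_values=0)
print(padded.shape)                      # (9, 9)

print(conv_output_size(5, 3))            # 3: no padding, stride 1
print(conv_output_size(5, 3, p=1))       # 5: padding 1 keeps the input size
print(conv_output_size(5, 3, p=1, s=2))  # 3: stride 2 downsamples
```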

Pooling Layer:
The pooling layer reduces the spatial size of the feature maps, which reduces the amount
of computation and the number of parameters in the later layers when the images are large.
Pooling is a "downscaling" of the feature maps obtained from the previous layers. It can be
compared to shrinking an image to reduce its pixel density. Spatial pooling is also called
downsampling or subsampling; it reduces the dimensionality of each map but retains the
important information. Two types of pooling are commonly used:

1. Max pooling: Max pooling is a pooling operation that selects the maximum element
from the region of the feature map covered by the filter. Thus, the output after max-
pooling layer would be a feature map containing the most prominent features of the
previous feature map.

2. Average pooling: Average pooling computes the average of the elements present in
the region of feature map covered by the filter. Thus, while max pooling gives the
most prominent feature in a particular patch of the feature map, average pooling gives
the average of features present in a patch.
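
Both pooling types can be sketched with a reshape in NumPy. The function name pool2d and the 4×4 feature map are assumptions for illustration; the sketch uses non-overlapping 2×2 regions (pool size 2, stride 2).

```python
import numpy as np

def pool2d(feature_map, size=2, mode="max"):
    """Non-overlapping pooling over size x size regions."""
    h, w = feature_map.shape
    assert h % size == 0 and w % size == 0, "map must divide evenly into regions"
    # blocks[i, a, j, b] == feature_map[size * i + a, size * j + b]
    blocks = feature_map.reshape(h // size, size, w // size, size)
    if mode == "max":
        return blocks.max(axis=(1, 3))
    return blocks.mean(axis=(1, 3))

fm = np.array([[1, 3, 2, 4],
               [5, 6, 7, 8],
               [3, 2, 1, 0],
               [1, 2, 3, 4]])

max_pooled = pool2d(fm, mode="max")
avg_pooled = pool2d(fm, mode="avg")
print(max_pooled)   # [[6 8] [3 4]]
print(avg_pooled)   # [[3.75 5.25] [2.   2.  ]]
```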

Fully Connected Layer / Dense Layer:
In the fully connected layer, the feature maps produced by the previous layers are
flattened into a vector and passed through one or more dense layers. The final layer
transforms this vector into one output per class.

In the above diagram, the feature map matrix is flattened into a vector x1, x2, x3, ..., xn,
which is fed to the fully connected layers. These layers combine the features, and an
activation function such as softmax or sigmoid is applied to classify the output as
a car, dog, truck, etc.
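
A minimal sketch of the flatten, dense, and softmax steps is shown here. The 16 feature maps of size 5×5, the three classes, and the random weights are all assumptions for illustration.

```python
import numpy as np

def softmax(z):
    z = z - np.max(z)            # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

rng = np.random.default_rng(0)

feature_maps = rng.normal(size=(16, 5, 5))   # assumed output of the last pooling layer
x = feature_maps.reshape(-1)                 # flatten into a vector x1 ... xn (n = 400)

# One dense layer mapping the n features to 3 classes, e.g. car, dog, truck
W = rng.normal(size=(3, x.size)) * 0.01
b = np.zeros(3)

probs = softmax(W @ x + b)
print(probs, probs.sum())        # three probabilities that sum to 1
```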

MULTICHANNEL CONVOLUTION OPERATION

Convolutional Neural Networks differ from regular neural networks in that they
create an order of magnitude more features, and in that each feature is a function of just a
small patch from the input image.
Now we can get more specific: starting with n input pixels, the convolution operation
just described will create n output features, one for each location in the input image.
What actually happens in a convolutional layer in a neural network goes one step
further: there, we'll create f sets of n features, each with a corresponding (initially random)
set of weights defining a visual pattern whose detection at each location in the input image
will be captured in the feature map. These f feature maps will be created via f convolution
operations.

While each “set of features” detected by a particular set of weights is called a feature
map, in the context of a convolutional layer, the number of feature maps is referred to as the
number of channels of the layer; this is why the operation involved with the layer is called
the multichannel convolution. In addition, the f sets of weights W_i are called the
convolutional filters.
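
The multichannel convolution can be sketched with NumPy as follows. The sizes (an 8×8 image with 3 channels, f = 4 filters of size 3×3) are assumptions for illustration, and no padding is used, so each feature map here is slightly smaller than the input.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

rng = np.random.default_rng(0)

h, w, d = 8, 8, 3        # input height, width, channels (assumed sizes)
f, k = 4, 3              # f filters, each of size k x k x d

image = rng.normal(size=(h, w, d))
filters = rng.normal(size=(f, k, k, d))   # initially random weights W_1 ... W_f

# windows[i, j] is the k x k x d patch whose top-left corner is at (i, j)
windows = sliding_window_view(image, (k, k, d))[:, :, 0]

# One convolution per filter: multiply each patch by each filter and sum
feature_maps = np.einsum("ijabc,fabc->ijf", windows, filters)
print(feature_maps.shape)   # (6, 6, 4): f = 4 output channels
```

The last axis of the result indexes the f feature maps, that is, the output channels of the layer.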

4. RECURRENT NEURAL NETWORKS

INTRODUCTION TO RNN

Sequence Learning Problems: Sequence learning problems are different from other
machine learning problems in two key ways:
 The inputs to the model are not of a fixed size
 The inputs to the model are dependent on each other

Examples of sequence learning problems include:

 Auto completion
 Part-of-speech tagging
 Sentiment analysis
 Video classification

Example:
Consider the task of auto completion. Given a sequence of characters, we want to
predict the next character. For example, given the sequence "d", we want to predict the next
character, which is "e".
An RNN would solve this problem by maintaining a hidden state. The hidden state
would be initialized with the information from the first input character, "d". Then, at the next
time step, the RNN would take the current input character, "e", and the hidden state as input
and produce a prediction for the next character. The hidden state would then be updated with
the new information. This process would be repeated until the end of the sequence. At the end
of the sequence, the RNN would output the final prediction.

Advantages of RNNs for sequence learning problems:

 RNNs can handle inputs of any length
 RNNs can, in principle, capture dependencies between inputs that are far apart in a
sequence

Disadvantages of RNNs:
 RNNs can be difficult to train
 RNNs can be susceptible to vanishing and exploding gradients

RNNs are a powerful tool for solving sequence learning problems. They have been
used to achieve state-of-the-art results in many tasks, such as machine translation, text
summarization, and speech recognition.

Recurrent Neural Networks:

Recurrent neural networks (RNNs) are a type of neural network that is well suited
to solving sequence learning problems. RNNs work by maintaining a hidden state that is
updated at each time step. The hidden state captures information from the previous inputs,
which allows the model to predict the next output.
RNNs have several advantages over other types of neural networks for sequence
learning problems:
 RNNs can handle inputs of any length
 RNNs can, in principle, capture dependencies between inputs that are far apart in a
sequence
 RNNs can be used to solve a wide variety of sequence learning problems in areas such
as natural language processing, machine translation, and speech recognition

How to model sequence learning problems with RNNs:

To model a sequence learning problem with an RNN, we first need to define the
function that the RNN will compute at each time step. The function should take as input the
current input and the hidden state from the previous time step, and output the next hidden
state and the prediction for the current time step.
Once we have defined the function, we can train the RNN using backpropagation
through time (BPTT). BPTT is a specialized training algorithm for RNNs that allows us to
train the network even though it has recurrent connections.
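
The per-time-step function described above can be sketched in NumPy. This is one common formulation (a tanh hidden state with a linear output); the function name rnn_step, the parameter names, and the sizes are assumptions for illustration.

```python
import numpy as np

def rnn_step(x_t, h_prev, params):
    """One time step: returns the next hidden state and the prediction."""
    W_xh, W_hh, b_h, W_hy, b_y = params
    h_t = np.tanh(W_xh @ x_t + W_hh @ h_prev + b_h)   # update the hidden state
    y_t = W_hy @ h_t + b_y                            # prediction for this step
    return h_t, y_t

rng = np.random.default_rng(0)
input_size, hidden_size, output_size = 5, 8, 5        # illustrative sizes

params = (
    rng.normal(scale=0.1, size=(hidden_size, input_size)),
    rng.normal(scale=0.1, size=(hidden_size, hidden_size)),
    np.zeros(hidden_size),
    rng.normal(scale=0.1, size=(output_size, hidden_size)),
    np.zeros(output_size),
)

h = np.zeros(hidden_size)                       # initial hidden state
sequence = rng.normal(size=(7, input_size))     # 7 time steps; any length works
outputs = []
for x_t in sequence:
    # The same parameters are reused at every time step
    h, y_t = rnn_step(x_t, h, params)
    outputs.append(y_t)

print(len(outputs), h.shape)
```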

Backpropagation through time (BPTT):

BPTT is a training algorithm for recurrent neural networks (RNNs). It is used to
compute the gradients of the loss function with respect to the RNN's parameters, which are
then used to update the parameters using gradient descent.

To compute the gradients using BPTT, we need to first compute the explicit
derivative of the loss function with respect to the RNN's parameters. This is done by treating
all of the other inputs to the RNN as constants.
However, RNNs also have implicit dependencies: the output of the RNN at a given time
step depends on its outputs at previous time steps, which in turn depend on the same
parameters. This means the explicit derivative alone does not give the full gradient.
To address this problem, BPTT uses the chain rule to recursively compute the implicit
derivatives of the loss function with respect to the RNN's parameters. This involves summing
over all of the paths from the loss function to each parameter, where each path is a sequence
of RNN outputs and weights.
BPTT can be computationally expensive, but it is a powerful tool for training RNNs.
It has been used to achieve state-of-the-art results on a variety of sequence learning tasks,
such as natural language processing, machine translation, and speech recognition.
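
BPTT can be illustrated on a deliberately tiny RNN with scalar weights. The model h_t = tanh(w·h_(t−1) + u·x_t), the squared-error loss on the final state, and all numeric values are assumptions for this sketch; the result is checked against a finite-difference estimate.

```python
import math

def forward(w, u, xs):
    hs = [0.0]                                  # h_0 = 0
    for x in xs:
        hs.append(math.tanh(w * hs[-1] + u * x))
    return hs

def loss(w, u, xs, target):
    return 0.5 * (forward(w, u, xs)[-1] - target) ** 2

def bptt(w, u, xs, target):
    hs = forward(w, u, xs)
    dw = du = 0.0
    dh = hs[-1] - target                        # dL/dh_T
    for t in range(len(xs), 0, -1):
        da = dh * (1.0 - hs[t] ** 2)            # back through tanh at step t
        dw += da * hs[t - 1]                    # explicit contribution of step t
        du += da * xs[t - 1]
        dh = da * w                             # implicit path to h_{t-1}
    return dw, du

xs, target, w, u = [0.5, -0.1, 0.3, 0.8], 0.2, 0.7, 1.1
dw, du = bptt(w, u, xs, target)

# Finite-difference check of dL/dw
eps = 1e-6
dw_num = (loss(w + eps, u, xs, target) - loss(w - eps, u, xs, target)) / (2 * eps)
du_num = (loss(w, u + eps, xs, target) - loss(w, u - eps, xs, target)) / (2 * eps)
print(dw, dw_num)
```

Summing the per-step terms into dw and du is the "sum over all paths" described above; dropping the line that updates dh would leave only the explicit derivative of the last step.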

The problem of Exploding and Vanishing Gradients:

The problem of vanishing and exploding gradients is a common problem when
training recurrent neural networks (RNNs). It occurs because the gradients of the loss
function with respect to the RNN's parameters can become very small or very large as the
backpropagation algorithm progresses. This can make it difficult for the RNN to learn to
perform the desired task.
There are two main reasons why vanishing and exploding gradients can occur:
1. Bounded activations: RNNs typically use bounded (saturating) activation functions,
such as the sigmoid or tanh function. The derivatives of these functions are at most 1
(at most 0.25 for the sigmoid) and close to 0 when a unit saturates. Multiplying many
such factors can lead to vanishing gradients, especially if the sequence has a large
number of time steps.
2. Product of weights: The gradients of the loss function with respect to the RNN's
parameters are computed by multiplying together, once per time step, the recurrent
weights and the derivatives of the activations. If these factors are consistently
smaller than 1, the product shrinks exponentially; if they are consistently larger
than 1, it grows exponentially.

Vanishing and exploding gradients can be a major problem for training RNNs. If the
gradients vanish, the RNN will not be able to learn dependencies that span many time steps.
If the gradients explode, the parameter updates become very large and training becomes
unstable, often diverging or producing numerical overflow.
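
The effect of repeatedly multiplying by the same factor can be seen in a few lines of plain Python. The scalar setting, the helper name backprop_factor, and the choice of 50 time steps are assumptions for illustration.

```python
def backprop_factor(w, steps, activation_grad=1.0):
    """Product of `steps` identical factors w * activation_grad."""
    grad = 1.0
    for _ in range(steps):
        grad *= w * activation_grad
    return grad

for w in (0.9, 1.0, 1.1):
    print(w, backprop_factor(w, 50))
# A factor of 0.9 shrinks the gradient by more than two orders of magnitude over
# 50 steps, while a factor of 1.1 grows it by more than two orders of magnitude.

# With the sigmoid's maximum derivative of 0.25, even w = 1.0 vanishes quickly
print(backprop_factor(1.0, 50, activation_grad=0.25))
```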

There are a number of techniques that can be used to address the problem of
vanishing and exploding gradients, such as:
 Truncated backpropagation: Truncated backpropagation through time only
backpropagates the gradients through a fixed number of time steps. This bounds the
length of the gradient product, which limits the cost of training and how far the
gradients can explode or vanish, at the price of not learning dependencies longer
than the truncation window.
 Gradient clipping: Gradient clipping rescales the gradients so that their norm
does not exceed a certain threshold. This helps to prevent the gradients from
exploding.
 Weight initialization: The way that the RNN's parameters are initialized can have a
big impact on the problem of vanishing and exploding gradients. It is important to
initialize the parameters in a way that prevents the gradients from becoming too small
or too large.
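
Gradient clipping by norm can be sketched as follows. The function name clip_by_global_norm and the example gradients are assumptions for illustration; deep learning frameworks ship their own clipping utilities.

```python
import numpy as np

def clip_by_global_norm(grads, max_norm):
    """Rescale all gradients together so their combined L2 norm is at most max_norm."""
    total_norm = np.sqrt(sum(np.sum(g ** 2) for g in grads))
    scale = min(1.0, max_norm / (total_norm + 1e-12))
    return [g * scale for g in grads], total_norm

# Example gradients with combined norm sqrt(3^2 + 4^2 + 12^2) = 13
grads = [np.array([3.0, 4.0]), np.array([12.0])]
clipped, norm_before = clip_by_global_norm(grads, max_norm=1.0)
norm_after = np.sqrt(sum(np.sum(g ** 2) for g in clipped))
print(norm_before, norm_after)
```

Rescaling all gradients by the same factor keeps the direction of the update and only limits its size.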

Truncated backpropagation is a common technique used to address the problem of
vanishing and exploding gradients in recurrent neural networks (RNNs). However, it is not
the only solution.
Another common solution is to use gated recurrent units (GRUs) or long short-term
memory (LSTM) cells. These units are specifically designed to mitigate the vanishing
gradient problem.

Long Short Term Memory (LSTM) and Gated Recurrent Units (GRUs):
Long Short Term Memory (LSTM) and Gated Recurrent Units (GRU) are two types
of recurrent neural networks (RNNs) that are specifically designed to learn long-term
dependencies in sequential data. They are both widely used in a variety of tasks, including
natural language processing, machine translation, speech recognition, and time series
forecasting.
Both LSTMs and GRUs use a gating mechanism to control the flow of information
through the network. This allows them to learn which parts of the input sequence are
important to remember and which parts can be forgotten.

LSTM Architecture:
An LSTM cell has three gates: an input gate, a forget gate, and an output gate.
 The input gate controls how much of the new candidate information computed from
the current input is added to the cell state
 The forget gate controls how much of the previous cell state is forgotten
 The output gate controls how much of the cell state is exposed as the hidden state,
which is the output passed to the next time step
The LSTM cell also has a cell state, which is a long-term memory that stores information
about the previous inputs. The cell state is updated at each time step based on the forget
gate and the input gate, while the output gate determines the hidden state.
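
One time step of an LSTM cell can be sketched in NumPy. This follows the standard LSTM equations; the function name lstm_step, the stacking order of the weight rows, and the sizes are assumptions for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """W has shape (4 * hidden, input + hidden); rows are stacked as [i, f, o, g]."""
    hidden = h_prev.size
    z = W @ np.concatenate([x_t, h_prev]) + b
    i = sigmoid(z[0 * hidden:1 * hidden])   # input gate
    f = sigmoid(z[1 * hidden:2 * hidden])   # forget gate
    o = sigmoid(z[2 * hidden:3 * hidden])   # output gate
    g = np.tanh(z[3 * hidden:4 * hidden])   # candidate values
    c_t = f * c_prev + i * g                # forget old memory, add new information
    h_t = o * np.tanh(c_t)                  # expose part of the cell state
    return h_t, c_t

rng = np.random.default_rng(0)
input_size, hidden = 3, 4                   # illustrative sizes
W = rng.normal(scale=0.1, size=(4 * hidden, input_size + hidden))
b = np.zeros(4 * hidden)

h, c = np.zeros(hidden), np.zeros(hidden)
for x_t in rng.normal(size=(5, input_size)):
    h, c = lstm_step(x_t, h, c, W, b)
print(h, c)
```

The additive update of c_t is what lets gradients flow across many time steps when the forget gate stays close to 1.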

GRU Architecture:
A GRU cell has two gates: a reset gate and an update gate.
 The reset gate controls how much of the previous hidden state is used when
computing the new candidate state.
 The update gate controls how the previous hidden state is blended with the new
candidate state to form the new hidden state.
The GRU cell has no separate cell state and no output gate. Instead, the output of the
GRU cell is simply the updated hidden state.
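
A GRU time step can be sketched in the same style. Biases are omitted for brevity, and the blending convention used here (update gate weighting the candidate) is one of two equivalent conventions found in the literature; the function and parameter names are assumptions for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_step(x_t, h_prev, W_r, W_z, W_h):
    """One GRU time step (biases omitted for brevity)."""
    xh = np.concatenate([x_t, h_prev])
    r = sigmoid(W_r @ xh)                    # reset gate
    z = sigmoid(W_z @ xh)                    # update gate
    # Candidate state: the reset gate scales how much of h_prev is used
    h_cand = np.tanh(W_h @ np.concatenate([x_t, r * h_prev]))
    # Blend the previous state with the candidate
    return (1.0 - z) * h_prev + z * h_cand

rng = np.random.default_rng(0)
input_size, hidden = 3, 4                    # illustrative sizes
shape = (hidden, input_size + hidden)
W_r, W_z, W_h = (rng.normal(scale=0.1, size=shape) for _ in range(3))

h = np.zeros(hidden)
for x_t in rng.normal(size=(5, input_size)):
    h = gru_step(x_t, h, W_r, W_z, W_h)
print(h)
```

Compared with the LSTM, there is one state vector instead of two and three weight matrices instead of four, which is where the GRU's lower cost comes from.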

The best choice of architecture for a particular task depends on a number of factors,
including the size and complexity of the dataset, the available computing resources, and the
specific requirements of the task.
As a rule of thumb, LSTMs are often preferred for tasks where the input sequences are
very long or complex, or where the task requires a high degree of accuracy. GRUs have fewer
parameters and are a good choice for tasks where the input sequences are shorter or less
complex, or where speed and efficiency are important considerations.

RNN CODE
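import numpy as np
import keras

# Placeholder data so that the example runs end to end (an assumption, not part of
# the original notes): sequences of 10 time steps with 256 features, binary labels
x_train = np.random.rand(100, 10, 256)
y_train = np.random.randint(0, 2, size=(100,))
x_test = np.random.rand(20, 10, 256)
y_test = np.random.randint(0, 2, size=(20,))

# Define the model
model = keras.Sequential([
    keras.Input(shape=(10, 256)),
    keras.layers.LSTM(128),
    keras.layers.Dense(64, activation='relu'),
    keras.layers.Dense(1, activation='sigmoid')
])

# Compile the model
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

# Train the model
model.fit(x_train, y_train, epochs=10)

# Evaluate the model
model.evaluate(x_test, y_test)

# Make predictions
predictions = model.predict(x_test)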

This code defines a simple RNN model with one LSTM layer, one dense layer, and
one output layer. The LSTM layer has 128 hidden units, and the dense layer has 64 hidden
units. The output layer has a single unit, and it uses the sigmoid activation function to
produce a probability score.
The model is compiled using the binary cross-entropy loss function and the Adam
optimizer. The model is then trained on the training data for 10 epochs.
Once the model is trained, it can be evaluated on the test data to assess its
performance. The model can also be used to make predictions on new data. Here is an
example of how to use the model to make predictions:

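# Make predictions on a new data sample
# The sample must match the model's input shape: (batch, 10 time steps, 256 features).
# Random values are used here only as a placeholder.
import numpy as np
x_new = np.random.rand(1, 10, 256)

# Get the prediction
prediction = model.predict(x_new)

# Print the prediction
print(prediction)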

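This code will print the prediction for the new data sample, which is a probability
score between 0 and 1. A score closer to 1 means that the model is more confident that the
sample belongs to the positive class; a score closer to 0 means that it is more confident that
the sample belongs to the negative class.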
This is just a simple example of RNN code, and there are many other ways to
implement RNNs in Python. For more complex tasks, you may need to use a different RNN
architecture or add additional layers to the model.

PYTORCH TENSORS
PyTorch is an optimized deep learning tensor library based on Python and Torch that
runs on both GPUs and CPUs. PyTorch is often favored over other deep learning frameworks
such as TensorFlow and Keras because it uses dynamic computation graphs and has a
Pythonic interface.

Why is PyTorch used for deep learning?

It is open source, and is based on the popular Torch library. PyTorch is designed to
provide good flexibility and high speeds for deep neural network implementation. PyTorch is
different from other deep learning frameworks in that it uses dynamic computation graphs.
The major features of PyTorch are:
 Easy Interface: PyTorch offers easy to use API; hence it is considered to be very
simple to operate and runs on Python. The code execution in this framework is quite
easy.
 Python usage: This library is considered to be Pythonic which smoothly integrates
with the Python data science stack. Thus, it can leverage all the services and
functionalities offered by the Python environment.
 Computational graphs: PyTorch provides an excellent platform which offers dynamic
computational graphs. Thus a user can change them during runtime. This is highly
useful when a developer has no idea of how much memory is required for creating a
neural network model.

Advantages of PyTorch:
 It is easy to debug and understand the code
 It includes many of the layers available in Torch
 It includes a lot of loss functions
 It can be considered a NumPy extension to GPUs
 It allows building networks whose structure is dependent on the computation itself
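
A short sketch of PyTorch tensors illustrates the "NumPy extension to GPUs" point and the dynamic computation graph. The values are arbitrary, and the code falls back to the CPU when no GPU is available.

```python
import torch

# Tensors behave much like NumPy arrays
a = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
b = torch.ones(2, 2)
c = a @ b + 1                      # matrix product, then broadcasted addition

# The same tensor can be moved to a GPU when one is available
device = "cuda" if torch.cuda.is_available() else "cpu"
c = c.to(device)
print(c)

# Autograd builds the computation graph dynamically as operations run
x = torch.tensor(3.0, requires_grad=True)
y = x ** 2 + 2 * x
y.backward()                       # dy/dx = 2x + 2
print(x.grad)                      # tensor(8.)
```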

DEEP LEARNING WITH PYTORCH

PyTorch elements are the building blocks of PyTorch models. These elements are:
 Model: A model is a representation of a machine learning algorithm. It is made up of
layers and parameters.

 Layer: A layer is a unit of computation in a neural network. It performs a specific
mathematical operation on the input data.
 Optimizer: An optimizer is an algorithm that updates the model's parameters during
training.
 Loss: A loss function measures the error between the model's predictions and the
ground truth labels.

Models are created by subclassing the torch.nn.Module class. Layers are created using
the different classes provided by the torch.nn module. For example, to create a linear layer,
you would use the torch.nn.Linear class.
Optimizers are created using the classes provided by the torch.optim module. For
example, to create an Adam optimizer, you would use the torch.optim.Adam class.
Loss functions are provided both as classes in the torch.nn module and as functions in
the torch.nn.functional module. For example, to compute a mean squared error loss, we could
use the torch.nn.MSELoss class or the torch.nn.functional.mse_loss function.
Once you have created the model, layers, optimizer, and loss function, you can train
the model using the following steps:
 Forward pass: The input data is passed through the model to produce predictions.
 Loss calculation: The loss function is used to calculate the error between the
predictions and the ground truth labels.
 Backward pass: The gradients of the loss function with respect to the model's
parameters are calculated.
 Optimizer step: The optimizer uses the gradients to update the model's parameters.

This process is repeated for a number of epochs until the model converges and
achieves the desired performance.
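
The four training steps above can be sketched as a complete loop. The synthetic regression data, the single linear layer, the learning rate, and the number of epochs are all assumptions chosen to keep the example small.

```python
import torch

torch.manual_seed(0)

# Synthetic regression data (assumed for illustration): y = 2x + 1 plus noise
x = torch.linspace(-1, 1, 64).unsqueeze(1)
y = 2 * x + 1 + 0.05 * torch.randn_like(x)

model = torch.nn.Linear(1, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=0.05)
loss_fn = torch.nn.MSELoss()

losses = []
for epoch in range(200):
    optimizer.zero_grad()            # clear gradients left over from the previous step
    predictions = model(x)           # forward pass
    loss = loss_fn(predictions, y)   # loss calculation
    loss.backward()                  # backward pass: compute gradients
    optimizer.step()                 # optimizer step: update the parameters
    losses.append(loss.item())

print(losses[0], losses[-1])
```

The call to optimizer.zero_grad() is needed because PyTorch accumulates gradients across backward passes by default.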

CNN IN PYTORCH
Convolutional neural networks (CNNs) are a type of neural network that are
specifically designed to work with image data. CNNs are able to learn spatial features in
images, which makes them very effective for tasks such as image classification, object
detection, and image segmentation.
PyTorch is a popular Python library for machine learning. It provides a number of
features that make it easy to build, train, and deploy CNNs.

To implement a CNN in PyTorch, you can use the torch.nn.Conv2d layer. This layer
performs a convolution operation on the input data. The convolution operation is a
mathematical operation that extracts features from the input data.
CNNs also use pooling layers to reduce the spatial size of the input data. This helps to
reduce the number of parameters in the network and makes it more efficient to train. Here is a
simple example of a CNN in PyTorch:

import torch

class CNN(torch.nn.Module):
    def __init__(self):
        super(CNN, self).__init__()

        # Define the convolutional layers
        self.conv1 = torch.nn.Conv2d(3, 6, 5)
        self.conv2 = torch.nn.Conv2d(6, 16, 5)

        # Define the pooling layers
        self.pool1 = torch.nn.MaxPool2d(2, 2)
        self.pool2 = torch.nn.MaxPool2d(2, 2)

        # Define the fully connected layers
        # (16 * 5 * 5 is the flattened size for 3 x 32 x 32 input images)
        self.fc1 = torch.nn.Linear(16 * 5 * 5, 120)
        self.fc2 = torch.nn.Linear(120, 84)
        self.fc3 = torch.nn.Linear(84, 10)

    def forward(self, x):
        # Pass the input data through the convolutional layers
        x = self.conv1(x)
        x = self.pool1(x)
        x = self.conv2(x)
        x = self.pool2(x)

        # Flatten the output of the convolutional layers
        x = x.view(-1, 16 * 5 * 5)

        # Pass the flattened output through the fully connected layers
        x = self.fc1(x)
        x = self.fc2(x)
        x = self.fc3(x)
        return x

# Create the model
model = CNN()

# Train the model
...

This code defines a simple CNN with two convolutional layers, two pooling layers,
and three fully connected layers. The convolutional layers have 6 and 16 filters, respectively.
The pooling layers have a kernel size of 2x2 and a stride of 2. The fully connected layers
have 120, 84, and 10 units, respectively.
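Unlike Keras, a PyTorch torch.nn.Module has no built-in model.fit() or model.predict()
method. The model is trained with an explicit training loop (forward pass, loss calculation,
backward pass, optimizer step), and predictions on new data are made by calling the model
directly, for example model(x), typically inside a torch.no_grad() block.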
For more complex tasks, we may need to use a different CNN architecture or add
additional layers to the model. We can also use PyTorch to implement other types of neural
networks, such as recurrent neural networks (RNNs) and long short-term memory (LSTM)
networks.
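
The following sketch runs one training step and one prediction with the same layer configuration as the CNN class above, written with torch.nn.Sequential to keep it short. The random 32×32 images, the random labels, the SGD optimizer, and the learning rate are placeholders assumed for illustration.

```python
import torch

torch.manual_seed(0)

# Same layer configuration as the CNN class above
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 6, 5), torch.nn.MaxPool2d(2, 2),    # 32 -> 28 -> 14
    torch.nn.Conv2d(6, 16, 5), torch.nn.MaxPool2d(2, 2),   # 14 -> 10 -> 5
    torch.nn.Flatten(),                                    # 16 * 5 * 5 = 400
    torch.nn.Linear(16 * 5 * 5, 120),
    torch.nn.Linear(120, 84),
    torch.nn.Linear(84, 10),
)

images = torch.randn(8, 3, 32, 32)       # placeholder batch of 8 RGB images
labels = torch.randint(0, 10, (8,))      # placeholder class labels

optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = torch.nn.CrossEntropyLoss()

# One training step
optimizer.zero_grad()
logits = model(images)
loss = loss_fn(logits, labels)
loss.backward()
optimizer.step()

# Prediction: call the model directly, without tracking gradients
model.eval()
with torch.no_grad():
    predicted = model(images).argmax(dim=1)
print(logits.shape, predicted.shape)
```

The shape comments show why the first linear layer expects 16 * 5 * 5 inputs: a different image size would require a different value there.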

RNN WITH PYTORCH

PyTorch is an open source machine learning (ML) framework based on the Python
programming language and the Torch library. Torch is an open source ML library used for
creating deep neural networks and is written in the Lua scripting language. It's one of the
preferred platforms for deep learning research. The framework is built to speed up the
process between research prototyping and deployment.
To implement RNNs in PyTorch, we can use the recurrent layers in the torch.nn module:
torch.nn.RNN implements a basic RNN, while torch.nn.LSTM and torch.nn.GRU implement
the LSTM and GRU architectures.

A simple example of how to implement an LSTM in PyTorch is given under "RNN Code"
below:

 The code defines a simple model consisting of a single LSTM layer with 128 hidden
units.
 As with any PyTorch model, there is no model.fit() or model.predict() method; the
model is trained with an explicit training loop and makes predictions when it is
called directly on new data.
 For more complex tasks, we may need to use a different RNN architecture or add
additional layers to the model. We can also use PyTorch to implement bidirectional
RNNs, stacked RNNs, and other advanced RNN architectures.
 PyTorch also provides a number of tools for training and evaluating RNNs, such as
the torch.optim module and the torch.nn.functional module.

RNN Code:
import torch

class LSTM(torch.nn.Module):
    def __init__(self, input_size, hidden_size, num_layers):
        super(LSTM, self).__init__()
        self.lstm = torch.nn.LSTM(input_size, hidden_size, num_layers)

    def forward(self, x):
        # x has shape (seq_len, batch, input_size), the default layout of torch.nn.LSTM
        output, (hn, cn) = self.lstm(x)
        return output

# Define the model
model = LSTM(10, 128, 1)

# Train the model
...

# Make predictions
...
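
As a hedged usage sketch, the snippet below passes a random batch through a torch.nn.LSTM with the same sizes as above and inspects the shapes. The linear output layer named head is hypothetical (it is not part of the code above), and the random input stands in for real data.

```python
import torch

torch.manual_seed(0)

lstm = torch.nn.LSTM(input_size=10, hidden_size=128, num_layers=1)
head = torch.nn.Linear(128, 1)     # hypothetical output layer, not in the notes

x = torch.randn(20, 4, 10)         # (seq_len, batch, input_size)
output, (hn, cn) = lstm(x)
print(output.shape)                # torch.Size([20, 4, 128])
print(hn.shape, cn.shape)          # torch.Size([1, 4, 128]) each

# A prediction per sequence can be read from the last time step
scores = head(output[-1])
print(scores.shape)                # torch.Size([4, 1])
```

For a single-layer, unidirectional LSTM, output[-1] and hn[0] hold the same values: the hidden state after the final time step.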
