
APEX INSTITUTE OF TECHNOLOGY

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

DEEP LEARNING (20CSF-432)


Faculty: Dr. Amit Kukker (E16298)

Lecture 26-30
Training CNNs: weight initialization, batch normalization, hyperparameter optimization
Deep Learning: Course Objectives
The course aims to:
1. Understand the key features in a neural network’s architecture
2. Understand the main fundamentals that drive Deep Learning
3. Be able to build, train and apply fully connected deep neural networks
4. Know how to implement efficient CNN, LSTM, Bi-LSTM, Autoencoder, RNN, Generative
Adversarial Network (GAN) models, etc.
5. Implement the fundamental methods involved in deep learning, including the underlying
optimization concepts (gradient descent and backpropagation) and how they can be combined to
solve real-world problems.



COURSE OUTCOMES

On completion of this course, the students shall be able to:


CO1 Understand neural networks, their working and parameters, and various optimization methods for
neural networks.
CO2 Differentiate between the major types of neural network architectures and the use cases for
different problems (classification/recognition) addressed by these architectures.

CO3 Understand different deep neural network model architectures and their parameter tuning.

CO4 Design sequence models using different neural network architectures for new data problems based
on their requirements and problem characteristics, and analyse their performance.

CO5 Describe the latest research being conducted in the field and open problems that are yet to be solved.


Unit-2 Syllabus
Unit-2: Second Order Methods

• Second order methods for training
• Regularization methods (dropout, DropConnect, batch normalization)
• Introduction to CNNs: convolution, pooling
• Deep CNNs; different deep CNN architectures: LeNet, AlexNet, VGG
• Training CNNs: weight initialization, batch normalization, hyperparameter optimization
• Understanding and visualizing CNNs


SUGGESTIVE READINGS
TEXT BOOKS:
• T1: Deep Learning with Python by Francois Chollet, Manning Publications.
• T2: Deep Learning from Scratch: Building with Python from First Principles by Seth Weidman, O'Reilly.
• T3: Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville, MIT Press.

REFERENCE BOOKS:
• R1: Fundamentals of Deep Learning by Nithin Buduma, Nikhil Buduma and Joe Papa, O'Reilly, Second Edition.
• R2: Deep Learning: A Practitioner's Approach by Josh Patterson and Adam Gibson, O'Reilly.
• R3: Deep Learning for Coders with fastai and PyTorch by Jeremy Howard and Sylvain Gugger, O'Reilly.
• R4: Deep Learning Using Python by S Lovelyn Rose, L Ashok Kumar, D Karthika Renuka, Wiley.
Training CNNs
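1. Weight Initialization

Starting weights that are too large or too small make gradients explode or vanish as they propagate through a deep network. Xavier (Glorot) initialization draws weights uniformly from [-sqrt(6/(fan_in + fan_out)), +sqrt(6/(fan_in + fan_out))], which keeps activation variance roughly constant across layers and suits tanh/sigmoid units; He initialization draws from a normal distribution with standard deviation sqrt(2/fan_in) and suits ReLU units. A NumPy sketch of both: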


import numpy as np

def xavier_init(shape):
    # Xavier/Glorot uniform initialization: keeps activation variance
    # roughly constant across layers (suits tanh/sigmoid units)
    fan_in, fan_out = shape
    limit = np.sqrt(6 / (fan_in + fan_out))
    return np.random.uniform(-limit, limit, shape)

def he_init(shape):
    # He initialization: scales by fan-in only (suits ReLU units)
    fan_in, _ = shape
    std = np.sqrt(2.0 / fan_in)
    return np.random.randn(*shape) * std
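For example, initializing a hypothetical fully connected layer (the 784x256 shape is illustrative):

# Hypothetical layer with 784 inputs and 256 outputs
W_tanh = xavier_init((784, 256))   # for tanh/sigmoid layers
W_relu = he_init((784, 256))       # for ReLU layers
print(W_tanh.std(), W_relu.std())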

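2. Batch Normalization

Batch normalization normalizes each channel of a mini-batch of activations to zero mean and unit variance, then applies a learned scale (gamma) and shift (beta). This stabilizes and speeds up training, permits higher learning rates, and has a mild regularizing effect. A minimal PyTorch sketch of a convolutional block with batch normalization (the channel sizes here are illustrative):

import torch
import torch.nn as nn

class ConvBlock(nn.Module):
    def __init__(self, in_channels=3, out_channels=16):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size=3, padding=1)
        # BatchNorm2d normalizes each channel over the batch, then applies
        # its learned scale (gamma) and shift (beta) parameters
        self.bn = nn.BatchNorm2d(out_channels)

    def forward(self, x):
        return torch.relu(self.bn(self.conv(x)))

x = torch.randn(8, 3, 32, 32)   # a batch of 8 RGB 32x32 images
print(ConvBlock()(x).shape)     # torch.Size([8, 16, 32, 32])

3. Hyperparameter Optimization

Hyperparameters such as the learning rate and batch size are not learned during training and must be searched over. Grid search tries every combination of the candidate values; random search samples combinations at random, which scales better as the number of hyperparameters grows. scikit-learn provides both: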
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

# Candidate values for each hyperparameter to search over
parameters = {'learning_rate': [0.1, 0.01, 0.001], 'batch_size': [16, 32, 64]}
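A minimal runnable sketch using scikit-learn's MLPClassifier as a stand-in estimator (a Keras or PyTorch model would need a scikit-learn-compatible wrapper; the dataset and candidate values here are illustrative):

from sklearn.datasets import load_digits
from sklearn.model_selection import GridSearchCV
from sklearn.neural_network import MLPClassifier

X, y = load_digits(return_X_y=True)

# MLPClassifier exposes the analogous knobs as learning_rate_init and batch_size
param_grid = {'learning_rate_init': [0.1, 0.01, 0.001], 'batch_size': [16, 32, 64]}

search = GridSearchCV(MLPClassifier(max_iter=200), param_grid, cv=3)
search.fit(X, y)
print(search.best_params_, search.best_score_)

RandomizedSearchCV has the same interface but samples a fixed number of combinations (n_iter) instead of trying them all.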


4. Understanding and Visualizing CNNs

Understanding and visualizing what a CNN has learned helps in diagnosing and improving model performance.
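Filter (kernel) visualization: plotting a convolutional layer's learned weights as images shows which low-level patterns (edges, blobs, contrasts) the filters respond to. A sketch for a PyTorch model, assuming a trained model whose first convolutional layer is model.conv1: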


import matplotlib.pyplot as plt

def visualize_filters(layer, n_filters=6):
    # Copy the layer's weights and rescale them to [0, 1] for display
    filters = layer.weight.data.clone()
    filters = filters - filters.min()
    filters = filters / filters.max()
    filters = filters.numpy()

    fig, ax = plt.subplots(1, n_filters, figsize=(20, 5))
    for i in range(n_filters):
        # Show the first input channel of each filter as a grayscale image
        ax[i].imshow(filters[i, 0, :, :], cmap='gray')
        ax[i].axis('off')
    plt.show()

visualize_filters(model.conv1, n_filters=6)
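Feature-map visualization: passing a single image through the network and plotting the intermediate activations shows which parts of the input each layer responds to. A sketch for the same assumed model (with layers conv1, bn1, conv2, bn2):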
import torch
import torch.nn as nn

def visualize_feature_maps(model, image):
    model.eval()            # Use running BN statistics for a single image
    x = image.unsqueeze(0)  # Add a batch dimension
    layers = [model.conv1, model.bn1, model.conv2, model.bn2]
    fig, ax = plt.subplots(len(layers), 1, figsize=(20, 20))

    for i, layer in enumerate(layers):
        x = layer(x)
        # After each batch-norm layer, apply the nonlinearity and pooling
        # so the plotted maps match what the next layer actually receives
        if isinstance(layer, nn.BatchNorm2d):
            x = torch.relu(x)
            x = torch.max_pool2d(x, 2)
        # Plot the first channel of the current feature map
        ax[i].imshow(x[0, 0, :, :].detach().numpy(), cmap='gray')
        ax[i].axis('off')
    plt.show()

visualize_feature_maps(model, example_image)
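Saliency maps: the gradient of the loss with respect to the input image indicates which pixels most influence the prediction; taking the channel-wise maximum of the absolute gradient gives a per-pixel importance map. A sketch, assuming example_image and example_label tensors: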
def saliency_map(model, image, label):
    image.requires_grad_()                       # Track gradients w.r.t. the input
    output = model(image)
    loss = nn.CrossEntropyLoss()(output, label)
    loss.backward()                              # Gradients flow back to the image

    # Per-pixel importance: max absolute gradient across the color channels
    saliency, _ = torch.max(image.grad.data.abs(), dim=1)
    saliency = saliency.squeeze().cpu().numpy()

    plt.imshow(saliency, cmap='hot')
    plt.axis('off')
    plt.show()

saliency_map(model, example_image.unsqueeze(0), example_label.unsqueeze(0))


References
Main text books:
• "Neural Networks: A Comprehensive Foundation", S. Haykin (very good, theoretical)
• "Neural Networks for Pattern Recognition", C. Bishop (very good, accessible)
• "Neural Network Design", Hagan, Demuth and Beale (introductory)
Books emphasizing the practical aspects:
• "Neural Smithing", Reed and Marks
• "Practical Neural Network Recipes in C++", T. Masters
Seminal paper (but now quite old!):
• "Parallel Distributed Processing", Rumelhart and McClelland et al.
Deep Learning books and tutorials:
• http://www.deeplearningbook.org/
• Introduction to Learning Rules in Neural Network - DataFlair (data-flair.training)

Neural Networks Literature
Review articles:
• R. P. Lippmann, "An Introduction to Computing with Neural Nets", IEEE ASSP Magazine, 4-22, April 1987.
• T. Kohonen, "An Introduction to Neural Computing", Neural Networks, 1, 3-16, 1988.
• A. K. Jain, J. Mao, K. Mohiuddin, "Artificial Neural Networks: A Tutorial", IEEE Computer, March 1996, pp. 31-44.
Journals:
• IEEE Transactions on Neural Networks
• Neural Networks
• Neural Computation
• Biological Cybernetics
THANK YOU

For queries
Email: [email protected]
