
Introduction to Neural Network - Week 3

Post Graduate Program in Artificial Intelligence & Machine Learning (AIML)

Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Brain Stormer
Q1. Backpropagation is a learning technique that adjusts the weights in a neural network by propagating weight changes.

a. Forward from input to output
b. Backward from output to input
c. Forward from input to hidden layers
d. Backward from hidden layers to input

Q2. What is the sigmoid activation function in a neural network?


Answer:
A weighted sum of inputs is passed through an activation function, and this output serves as the input to the next layer. When the activation function for a neuron is the sigmoid function, the output of that unit is guaranteed to always lie between 0 and 1.
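A minimal NumPy sketch of this idea (the weights, inputs, and bias are illustrative values, not from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.5, -1.2, 3.0])   # inputs to the neuron
w = np.array([0.4, 0.7, -0.2])   # weights (illustrative)
b = 0.1                          # bias

z = np.dot(w, x) + b             # weighted sum of inputs
a = sigmoid(z)                   # activation passed to the next layer
print(a)                         # ~0.24, always between 0 and 1
```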

Week 3: Introduction to neural networks and deep learning

Learning Objectives
❖ Types of Optimizers
❖ Weight initialization
❖ Regularization
❖ Dropout
❖ Batch Normalization
❖ Types of neural networks
❖ Case study
❖ Questions

Different types of optimizers
1. SGD with Momentum
This method computes an exponentially weighted average of past gradients, so it takes less time to converge than plain stochastic gradient descent.

2. Adagrad (Adaptive Gradient Algorithm)
Adagrad does not use the momentum concept; instead it maintains a separate, adaptive learning rate for each parameter, which makes it simpler than SGD with momentum.

3. RMSProp (Root Mean Square Propagation)
RMSProp also automatically adjusts the learning rate for each parameter, using a moving average of squared gradients.

4. Adam
Adam combines the characteristics of both SGD with momentum and RMSProp.

References: Paperspace, Medium
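A minimal sketch of how these four optimizers are instantiated, assuming PyTorch; the model, dummy data, and learning rates are illustrative, not prescribed by the slides:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)

optimizers = {
    "SGD with momentum": torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9),
    "Adagrad":           torch.optim.Adagrad(model.parameters(), lr=0.01),
    "RMSProp":           torch.optim.RMSprop(model.parameters(), lr=0.001),
    "Adam":              torch.optim.Adam(model.parameters(), lr=0.001),
}

x, y = torch.randn(32, 10), torch.randn(32, 1)   # dummy batch
loss_fn = nn.MSELoss()

opt = optimizers["Adam"]        # pick any one of the four
opt.zero_grad()                 # clear old gradients
loss = loss_fn(model(x), y)     # forward pass
loss.backward()                 # backpropagate
opt.step()                      # optimizer-specific weight update
```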

Weight Initialization

Why Initialize Weights
The aim of weight initialization is to prevent layer activation outputs from exploding or vanishing during the course of a
forward pass through a deep neural network. If either occurs, loss gradients will either be too large or too small to flow
backwards beneficially, and the network will take longer to converge, if it is even able to do so at all.
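A small NumPy sketch of why the scale of the initial weights matters (the depth, width, and scales are illustrative): too small and activations shrink toward zero layer by layer; too large and the tanh units saturate.

```python
import numpy as np

def forward_stats(scale, depth=20, width=256):
    rng = np.random.default_rng(0)
    x = rng.standard_normal((1000, width))
    for _ in range(depth):
        W = rng.standard_normal((width, width)) * scale
        x = np.tanh(x @ W)          # activations after each layer
    return x.std()

print("small init :", forward_stats(0.01))                 # activations vanish toward 0
print("large init :", forward_stats(1.0))                  # tanh saturates at ±1, gradients vanish
print("scaled init:", forward_stats(1.0 / np.sqrt(256)))   # stays in a healthy range
```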

What Happens When W = 0 Initialization Is Used

[Figure: a small network diagram with an input layer, a hidden layer, and an output layer.]

Setting W = 0 serves almost no purpose: every neuron performs the same calculation in each iteration and produces the same output, so all neurons learn the same features. This problem is known as the network failing to break symmetry.

Text source: Medium
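A small NumPy sketch of the symmetry problem (one hidden layer, illustrative sizes): with W = 0, every hidden unit computes the same output and receives the same gradient, so the units can never become different from one another.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((1, 4))        # one input example
t = np.array([[1.0]])                  # target

W1 = np.zeros((4, 8))                  # zero-initialized hidden weights
W2 = np.zeros((8, 1))                  # zero-initialized output weights

h = 1 / (1 + np.exp(-(x @ W1)))        # hidden activations: all exactly 0.5
y = h @ W2                             # output: 0
print(h)                               # every hidden unit is identical

dL_dy = y - t                          # gradient of 0.5 * (y - t)^2
dL_dh = dL_dy @ W2.T                   # all zeros, because W2 is zero
dL_dW1 = x.T @ (dL_dh * h * (1 - h))   # identical (here zero) for every unit
print(dL_dW1)                          # W1 never moves away from symmetry
```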

Initialization Techniques

● Zero initialization
● Random initialization
● Xavier (Glorot) initialization
● He (Kaiming) initialization
● And many more
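A minimal sketch of a few of these schemes, assuming PyTorch; the layer size and the choice of distributions are illustrative defaults:

```python
import torch.nn as nn

layer = nn.Linear(256, 128)

nn.init.zeros_(layer.weight)                       # zero initialization (fails to break symmetry)
nn.init.normal_(layer.weight, mean=0.0, std=0.01)  # plain random initialization
nn.init.xavier_uniform_(layer.weight)              # Xavier/Glorot, suited to tanh/sigmoid
nn.init.kaiming_normal_(layer.weight,
                        nonlinearity="relu")       # He/Kaiming, suited to ReLU
nn.init.zeros_(layer.bias)                         # biases are usually set to zero
```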

Regularization

Data Augmentation

Data augmentation is a technique to artificially create new training data from existing training data.

Image data augmentation is perhaps the most well-known type of data augmentation. It involves creating transformed versions of the images in the training dataset that belong to the same class as the original image.

Why We Should Do Data Augmentation

● We may not have a big dataset, so augmentation creates more training data.
● It helps in regularizing the network.

Data Augmentation Pipeline
[Figure: the augmentation pipeline — load image and label ("Dog") → transform image → CNN → compute loss.]
Data Augmentation Techniques
● Horizontal flips
● Rotation
● Crop/scale
● Color jitter
● Other creative techniques
○ Random mixes/combinations of:
■ Translation (what about a pure ConvNet?)
■ Rotation
■ Stretching
■ Shearing
■ Lens distortions
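A minimal sketch of the pipeline and several of the transforms above, assuming torchvision is available; the specific transform parameters and the random stand-in image are illustrative:

```python
import numpy as np
from PIL import Image
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomHorizontalFlip(p=0.5),        # horizontal flips
    transforms.RandomRotation(degrees=15),         # rotation
    transforms.RandomResizedCrop(size=224),        # crop/scale
    transforms.ColorJitter(brightness=0.2,
                           contrast=0.2),          # color jitter
    transforms.ToTensor(),
])

# Load image and label -> transform image -> CNN -> compute loss
img = Image.fromarray(np.uint8(np.random.rand(256, 256, 3) * 255))  # stand-in for a "dog" photo
x = augment(img).unsqueeze(0)    # a fresh random variant on every pass
# logits = cnn(x); loss = criterion(logits, label)   # as in the pipeline slide
```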

Dropout
Dropout is a regularization method that approximates training a large number of neural networks with different
architectures in parallel.

During training, some number of layer outputs are randomly ignored or "dropped out." This has the effect of making the layer look like, and be treated like, a layer with a different number of nodes and connectivity to the prior layer. In effect, each update to a layer during training is performed with a different "view" of the configured layer.
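A minimal sketch assuming PyTorch (the architecture is illustrative): nn.Dropout randomly zeroes hidden activations during training and is disabled at evaluation time.

```python
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(20, 64),
    nn.ReLU(),
    nn.Dropout(p=0.5),    # each forward pass drops a different 50% of units
    nn.Linear(64, 1),
)

x = torch.randn(8, 20)
model.train()             # dropout active: two passes give different outputs
print(model(x).flatten()[:3], model(x).flatten()[:3])
model.eval()              # dropout off: outputs are deterministic
print(model(x).flatten()[:3])
```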

Dropout – How It Works

Forces the network to have a redundant representation.

[Figure: a "cat score" unit fed by features such as "has an ear", "has a tail", "is furry", "has claws", and "mischievous look"; the features marked X are dropped, so the score cannot rely on any single feature.]

Dropout
Another interpretation:

Dropout is training a large ensemble of models (that share parameters).

Each binary mask is one model, and it gets trained on only ~one batch.
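A small NumPy sketch of this ensemble view (illustrative sizes): each mini-batch samples a fresh binary mask, i.e. a different sub-network over the same shared weights.

```python
import numpy as np

rng = np.random.default_rng(0)
p = 0.5                                   # drop probability
h = rng.standard_normal((4, 6))           # hidden activations for one batch

for batch in range(3):
    mask = (rng.random(h.shape) >= p)     # one binary mask = one "model"
    h_dropped = h * mask / (1 - p)        # inverted dropout keeps the expected scale
    print(f"batch {batch}: kept units per example =", mask.sum(axis=1))
```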

Batch Normalization
• Because of the normalization "layers" inserted between the fully connected layers, the range of the input distribution of each layer stays the same, no matter how the previous layer changes.

• Given inputs x^(k) from the k-th neuron, normalization computes x̂^(k) = (x^(k) − E[x^(k)]) / √(Var[x^(k)]).

• Normalization centers all the inputs around 0, so there is not much change in each layer's input.

Text and image source: Medium
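A small NumPy sketch of the normalization formula above (the batch and feature sizes are illustrative): each feature of a batch is normalized to roughly zero mean and unit variance, then rescaled by learnable parameters.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=3.0, size=(64, 10))   # a batch with shifted, scaled inputs

mean = x.mean(axis=0)                               # E[x^(k)] per neuron k
var = x.var(axis=0)                                 # Var[x^(k)] per neuron k
x_hat = (x - mean) / np.sqrt(var + 1e-5)            # normalized activations

gamma, beta = 1.0, 0.0                              # learnable scale and shift
y = gamma * x_hat + beta

print(x_hat.mean(axis=0).round(3))                  # ~0 for every neuron
print(x_hat.std(axis=0).round(3))                   # ~1 for every neuron
```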

Types of Neural Network

● Feed Forward Neural Network
● Convolutional Neural Network
● Recurrent Neural Network
● LSTM (Long Short-Term Memory)
● Sequence to Sequence Models
Thank you! :)
Questions are always welcome
