Neural Networks (Optional)
Deep Learning

What is a Neural Network?

Housing Price Prediction
[Plot: house price as a function of the size of the house]

Housing Price Prediction
Input features: size ($x_1$), #bedrooms ($x_2$), zip code ($x_3$), wealth ($x_4$); output $y$: price
Introduction to Neural Networks

Why is Deep Learning taking off?
Scale drives deep learning progress
[Plot: performance versus amount of data, for increasingly large neural networks]

Scale drives deep learning progress
• Data
• Computation
• Algorithms
Iteration cycle: Idea → Code → Experiment → Idea
Basics of Neural Network Programming

Binary Classification

Binary Classification
[Image example: the Red, Green, and Blue pixel-intensity values of an input image are unrolled into a feature vector $x$]

Notation
Basics of Neural Network Programming

Logistic Regression
Basics of Neural Network Programming

Logistic Regression Cost Function

Logistic Regression cost function
$\hat{y} = \sigma(w^T x + b)$, where $\sigma(z) = \frac{1}{1 + e^{-z}}$
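A minimal numpy sketch of this prediction step, assuming $x$ and $w$ are column vectors of the same length (the function names are illustrative, not from the slides):

import numpy as np

def sigmoid(z):
    # works for scalars and numpy arrays alike
    return 1.0 / (1.0 + np.exp(-z))

def predict(w, b, x):
    # y_hat = sigma(w^T x + b)
    return sigmoid(np.dot(w.T, x) + b)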
Basics of Neural Network Programming

Gradient Descent

Gradient Descent
Recap: $\hat{y} = \sigma(w^T x + b)$, $\sigma(z) = \frac{1}{1 + e^{-z}}$
$J(w, b) = \frac{1}{m} \sum_{i=1}^{m} \mathcal{L}(\hat{y}^{(i)}, y^{(i)}) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log \hat{y}^{(i)} + (1 - y^{(i)}) \log(1 - \hat{y}^{(i)}) \right]$
[Plot: the convex cost surface $J(w, b)$ over the parameters $w$ and $b$]
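A numpy sketch of the cost $J(w, b)$ and of one descent step (shape assumptions, not stated on the slide: X is (n_x, m) with one example per column, Y is (1, m), w is (n_x, 1), b is a scalar):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(w, b, X, Y):
    m = X.shape[1]
    A = sigmoid(np.dot(w.T, X) + b)   # predictions for all m examples, shape (1, m)
    return -np.sum(Y * np.log(A) + (1 - Y) * np.log(1 - A)) / m

# One gradient descent step, once dw = dJ/dw and db = dJ/db are known:
#   w = w - alpha * dw
#   b = b - alpha * db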
Gradient Descent
Basics of Neural Network Programming

Derivatives

Intuition about derivatives
$f(a) = 3a$
[Plot: the straight line $f(a) = 3a$ against $a$; its slope, and hence its derivative, is 3 everywhere]
Basics of Neural Network Programming

More derivative examples

Intuition about derivatives
$f(a) = a^2$
[Plot: the curve $f(a) = a^2$ against $a$; its slope $\frac{df}{da} = 2a$ changes with $a$]

More derivative examples
Basics of Neural Network Programming

Computation Graph
Basics of Neural Network Programming

Derivatives with a Computation Graph

Computing derivatives
$a = 5,\; b = 3,\; c = 2$
$u = bc = 6, \quad v = a + u = 11, \quad J = 3v = 33$
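A sketch of one forward pass through this graph and the backward (chain-rule) pass that recovers the derivatives, using the same numbers as the slide:

a, b, c = 5.0, 3.0, 2.0

# forward pass
u = b * c            # u = 6
v = a + u            # v = 11
J = 3 * v            # J = 33

# backward pass, right to left
dJ_dv = 3.0          # J = 3v
dJ_du = dJ_dv * 1.0  # v = a + u  =>  dv/du = 1
dJ_da = dJ_dv * 1.0  # dv/da = 1
dJ_db = dJ_du * c    # u = b*c   =>  du/db = c
dJ_dc = dJ_du * b    # du/dc = b

print(J, dJ_da, dJ_db, dJ_dc)   # 33.0 3.0 6.0 9.0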
Basics of Neural Network Programming

Logistic Regression Gradient Descent

Logistic regression recap
$z = w^T x + b$
$\hat{y} = a = \sigma(z)$
$\mathcal{L}(a, y) = -\big(y \log(a) + (1 - y) \log(1 - a)\big)$
Logistic regression derivatives
[Computation graph: inputs $x_1, x_2$ with parameters $w_1, w_2, b$]
$z = w_1 x_1 + w_2 x_2 + b \;\rightarrow\; a = \sigma(z) \;\rightarrow\; \mathcal{L}(a, y)$
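A sketch of the backward pass for a single example of this two-feature model, using the key result $dz = a - y$ (function and variable names are illustrative):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def single_example_grads(w1, w2, b, x1, x2, y):
    z = w1 * x1 + w2 * x2 + b
    a = sigmoid(z)
    dz = a - y       # dL/dz
    dw1 = x1 * dz    # dL/dw1
    dw2 = x2 * dz    # dL/dw2
    db = dz          # dL/db
    return dw1, dw2, db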
Basics of Neural Network Programming

Gradient descent on m examples

Logistic regression on m examples
Basics of Neural Network Programming

Vectorization

What is vectorization?
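A small timing sketch in the spirit of the lecture demo, comparing an explicit for-loop against np.dot for $w^T x$ (the array size is illustrative):

import time
import numpy as np

n = 1_000_000
w = np.random.rand(n)
x = np.random.rand(n)

tic = time.time()
c = np.dot(w, x)                 # vectorized
print("Vectorized:", 1000 * (time.time() - tic), "ms")

tic = time.time()
c = 0.0
for i in range(n):               # explicit for-loop
    c += w[i] * x[i]
print("For loop:  ", 1000 * (time.time() - tic), "ms")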
Basics of Neural Network Programming

More vectorization examples

Neural network programming guideline
Whenever possible, avoid explicit for-loops.
Vectors and matrix valued functions
Say you need to apply the exponential operation on every element of a matrix/vector $v = (v_1, \ldots, v_n)^T$:

import math
import numpy as np

u = np.zeros((n, 1))
for i in range(n):
    u[i] = math.exp(v[i])
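The vectorized alternative replaces the whole loop with one elementwise call (a sketch; the vector here is illustrative):

import numpy as np

v = np.random.rand(1000, 1)
u = np.exp(v)     # elementwise exponential, no explicit loop
# other elementwise ops work the same way: np.log(v), np.abs(v), np.maximum(v, 0), v**2, 1/v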
Logistic regression derivatives
J = 0; dw1 = 0; dw2 = 0; db = 0
for i = 1 to m:
    z(i) = w^T x(i) + b
    a(i) = σ(z(i))
    J += -[ y(i) log a(i) + (1 - y(i)) log(1 - a(i)) ]
    dz(i) = a(i) - y(i)
    dw1 += x1(i) dz(i)
    dw2 += x2(i) dz(i)
    db  += dz(i)
J = J/m; dw1 = dw1/m; dw2 = dw2/m; db = db/m
Basics of Neural Network Programming

Vectorizing Logistic Regression

Vectorizing Logistic Regression
$z^{(1)} = w^T x^{(1)} + b, \quad z^{(2)} = w^T x^{(2)} + b, \quad z^{(3)} = w^T x^{(3)} + b$
$a^{(1)} = \sigma(z^{(1)}), \quad a^{(2)} = \sigma(z^{(2)}), \quad a^{(3)} = \sigma(z^{(3)})$
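Stacking the examples as columns of a matrix X computes every z(i) and a(i) in one shot. A sketch with illustrative sizes:

import numpy as np

n_x, m = 4, 10                   # features, examples
X = np.random.randn(n_x, m)      # one example per column
w = np.random.randn(n_x, 1)
b = 0.0

Z = np.dot(w.T, X) + b           # shape (1, m): all m values of z at once
A = 1.0 / (1.0 + np.exp(-Z))     # shape (1, m): all m activations at once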
Basics of Neural Network Programming

Vectorizing Logistic Regression's Gradient Computation

Vectorizing Logistic Regression
Implementing Logistic Regression
J = 0; dw1 = 0; dw2 = 0; db = 0
for i = 1 to m:
    z(i) = w^T x(i) + b
    a(i) = σ(z(i))
    J += -[ y(i) log a(i) + (1 - y(i)) log(1 - a(i)) ]
    dz(i) = a(i) - y(i)
    dw1 += x1(i) dz(i)
    dw2 += x2(i) dz(i)
    db  += dz(i)
J = J/m; dw1 = dw1/m; dw2 = dw2/m; db = db/m
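The same iteration with both for-loops removed; a sketch with illustrative data (alpha is the learning rate):

import numpy as np

n_x, m, alpha = 4, 100, 0.01
X = np.random.randn(n_x, m)                     # one example per column
Y = (np.random.rand(1, m) > 0.5).astype(float)  # labels in {0, 1}
w, b = np.zeros((n_x, 1)), 0.0

Z = np.dot(w.T, X) + b
A = 1.0 / (1.0 + np.exp(-Z))
dZ = A - Y
dw = np.dot(X, dZ.T) / m
db = np.sum(dZ) / m
w = w - alpha * dw
b = b - alpha * db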
Basics of Neural Network Programming

Broadcasting in Python

Broadcasting example
Calories from Carbs, Proteins, Fats in 100g of different foods:

            Apples     Beef     Eggs   Potatoes
Carb          56.0      0.0      4.4       68.0
Protein        1.2    104.0     52.0        8.0
Fat            1.8    135.0     99.0        0.9

cal = A.sum(axis=0)
percentage = 100 * A / cal.reshape(1, 4)
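The same example end to end, showing how the (3, 4) matrix is divided by a (1, 4) row of column sums via broadcasting (the reshape is redundant here since cal already broadcasts, but it documents the intended shape):

import numpy as np

A = np.array([[56.0,   0.0,  4.4, 68.0],
              [ 1.2, 104.0, 52.0,  8.0],
              [ 1.8, 135.0, 99.0,  0.9]])

cal = A.sum(axis=0)                        # per-food calorie totals, shape (4,)
percentage = 100 * A / cal.reshape(1, 4)   # (3, 4) / (1, 4) broadcasts row-wise
print(percentage)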
Broadcasting example
$\begin{bmatrix} 1 \\ 2 \\ 3 \\ 4 \end{bmatrix} + 100 = \begin{bmatrix} 101 \\ 102 \\ 103 \\ 104 \end{bmatrix}$
A note on python/numpy vectors

Python Demo

Python / numpy vectors
import numpy as np
a = np.random.randn(5)        # rank-1 array, shape (5,): ambiguous, avoid
a = np.random.randn(5, 1)     # column vector, shape (5, 1)
a = np.random.randn(1, 5)     # row vector, shape (1, 5)
assert a.shape == (5, 1)      # use assertions to pin down the shape you expect
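If a rank-1 array does slip through, a reshape restores an unambiguous shape; a small sketch:

import numpy as np

a = np.random.randn(5)     # rank-1 array, shape (5,)
a = a.reshape((5, 1))      # now an explicit column vector
assert a.shape == (5, 1)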
One hidden layer Neural Network

Neural Networks Overview

What is a Neural Network?
[Diagram: a single logistic unit with inputs $x_1, x_2, x_3$ and parameters $w, b$ producing $\hat{y}$, and a two-layer network with the same inputs producing $\hat{y}$]
With parameters $W^{[1]}, b^{[1]}, W^{[2]}, b^{[2]}$:
$z^{[1]} = W^{[1]} x + b^{[1]} \;\rightarrow\; a^{[1]} = \sigma(z^{[1]}) \;\rightarrow\; z^{[2]} = W^{[2]} a^{[1]} + b^{[2]} \;\rightarrow\; a^{[2]} = \sigma(z^{[2]}) \;\rightarrow\; \mathcal{L}(a^{[2]}, y)$
One hidden layer Neural Network

Neural Network Representation

Neural Network Representation
[Diagram: input layer $x_1, x_2, x_3$, one hidden layer, output $\hat{y}$]
One hidden layer Neural Network

Computing a Neural Network's Output

Neural Network Representation
[Diagram: a single sigmoid unit takes $x_1, x_2, x_3$ and computes $z = w^T x + b$, then $a = \sigma(z) = \hat{y}$]
$z = w^T x + b$
$a = \sigma(z)$

Neural Network Representation
[Diagram: each hidden unit of the network repeats the same two-step computation on the inputs $x_1, x_2, x_3$]
$z = w^T x + b$
$a = \sigma(z)$
Neural Network Representation
[Diagram: inputs $x_1, x_2, x_3$ feed four hidden units $a^{[1]}_1, \ldots, a^{[1]}_4$, which feed the output $\hat{y}$]
$z^{[1]}_1 = w^{[1]T}_1 x + b^{[1]}_1, \quad a^{[1]}_1 = \sigma(z^{[1]}_1)$
$z^{[1]}_2 = w^{[1]T}_2 x + b^{[1]}_2, \quad a^{[1]}_2 = \sigma(z^{[1]}_2)$
$z^{[1]}_3 = w^{[1]T}_3 x + b^{[1]}_3, \quad a^{[1]}_3 = \sigma(z^{[1]}_3)$
$z^{[1]}_4 = w^{[1]T}_4 x + b^{[1]}_4, \quad a^{[1]}_4 = \sigma(z^{[1]}_4)$
Neural Network Representation learning
[Diagram: inputs $x_1, x_2, x_3$, hidden activations $a^{[1]}_1, \ldots, a^{[1]}_4$, output $\hat{y}$]
Given input $x$:
$z^{[1]} = W^{[1]} x + b^{[1]}$
$a^{[1]} = \sigma(z^{[1]})$
$z^{[2]} = W^{[2]} a^{[1]} + b^{[2]}$
$a^{[2]} = \sigma(z^{[2]})$
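A sketch of these four lines for a single example, with illustrative layer sizes (3 inputs, 4 hidden units, 1 output):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

n_x, n_h = 3, 4
x  = np.random.randn(n_x, 1)
W1 = np.random.randn(n_h, n_x) * 0.01
b1 = np.zeros((n_h, 1))
W2 = np.random.randn(1, n_h) * 0.01
b2 = np.zeros((1, 1))

z1 = np.dot(W1, x) + b1      # shape (n_h, 1)
a1 = sigmoid(z1)
z2 = np.dot(W2, a1) + b2     # shape (1, 1)
a2 = sigmoid(z2)             # y_hat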
One hidden layer Neural Network

Vectorizing across multiple examples

Vectorizing across multiple examples
[Diagram: inputs $x_1, x_2, x_3$, one hidden layer, output $\hat{y}$]
$z^{[1]} = W^{[1]} x + b^{[1]}$
$a^{[1]} = \sigma(z^{[1]})$
$z^{[2]} = W^{[2]} a^{[1]} + b^{[2]}$
$a^{[2]} = \sigma(z^{[2]})$
Vectorizing across multiple examples
for i = 1 to m:
    $z^{[1](i)} = W^{[1]} x^{(i)} + b^{[1]}$
    $a^{[1](i)} = \sigma(z^{[1](i)})$
    $z^{[2](i)} = W^{[2]} a^{[1](i)} + b^{[2]}$
    $a^{[2](i)} = \sigma(z^{[2](i)})$
One hidden layer Neural Network

Explanation for vectorized implementation

Justification for vectorized implementation
Recap of vectorizing across multiple examples
for i = 1 to m:
    $z^{[1](i)} = W^{[1]} x^{(i)} + b^{[1]}$
    $a^{[1](i)} = \sigma(z^{[1](i)})$
    $z^{[2](i)} = W^{[2]} a^{[1](i)} + b^{[2]}$
    $a^{[2](i)} = \sigma(z^{[2](i)})$

Stacking examples as columns, $X = [x^{(1)} \; x^{(2)} \; \cdots \; x^{(m)}]$ and $A^{[1]} = [a^{[1](1)} \; a^{[1](2)} \; \cdots \; a^{[1](m)}]$:
$Z^{[1]} = W^{[1]} X + b^{[1]}$
$A^{[1]} = \sigma(Z^{[1]})$
$Z^{[2]} = W^{[2]} A^{[1]} + b^{[2]}$
$A^{[2]} = \sigma(Z^{[2]})$
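A sketch of the vectorized forward pass as a function over the whole training set ($b^{[1]}$ and $b^{[2]}$ broadcast across the m columns, as in the broadcasting section):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(X, W1, b1, W2, b2):
    Z1 = np.dot(W1, X) + b1      # shape (n_h, m)
    A1 = sigmoid(Z1)
    Z2 = np.dot(W2, A1) + b2     # shape (1, m)
    A2 = sigmoid(Z2)             # one prediction per column
    return A1, A2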
One hidden layer Neural Network

Activation functions

Activation functions
[Diagram: inputs $x_1, x_2, x_3$, one hidden layer, output $\hat{y}$]
Given $x$:
$z^{[1]} = W^{[1]} x + b^{[1]}$
$a^{[1]} = \sigma(z^{[1]})$
$z^{[2]} = W^{[2]} a^{[1]} + b^{[2]}$
$a^{[2]} = \sigma(z^{[2]})$
Pros and cons of activation functions
[Plots: sigmoid, tanh, ReLU, and Leaky ReLU curves, activation $a$ versus input $z$]
sigmoid: $a = \frac{1}{1 + e^{-z}}$
tanh: $a = \tanh(z)$
ReLU: $a = \max(0, z)$
Leaky ReLU: $a = \max(0.01z, z)$
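The four activations as numpy one-liners (a sketch; the 0.01 leak slope is the conventional default):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    return np.tanh(z)

def relu(z):
    return np.maximum(0, z)

def leaky_relu(z, slope=0.01):
    return np.maximum(slope * z, z)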
One hidden layer Neural Network

Why do you need non-linear activation functions?

Activation function
[Diagram: inputs $x_1, x_2, x_3$, one hidden layer, output $\hat{y}$]
Given $x$:
$z^{[1]} = W^{[1]} x + b^{[1]}$
$a^{[1]} = g^{[1]}(z^{[1]})$
$z^{[2]} = W^{[2]} a^{[1]} + b^{[2]}$
$a^{[2]} = g^{[2]}(z^{[2]})$
(If $g^{[1]}$ and $g^{[2]}$ were linear, the two layers would collapse into a single linear function of $x$, so the hidden layer would add nothing.)
One hidden layer Neural Network

Derivatives of activation functions

Sigmoid activation function
[Plot: sigmoid curve, $a$ versus $z$]
$g(z) = \frac{1}{1 + e^{-z}}$
$g'(z) = g(z)\,(1 - g(z))$
Tanh activation function
[Plot: tanh curve, $a$ versus $z$]
$g(z) = \tanh(z)$
$g'(z) = 1 - \tanh(z)^2$
ReLU and Leaky ReLU
[Plots: ReLU and Leaky ReLU curves, $a$ versus $z$]
ReLU: $g(z) = \max(0, z)$, with $g'(z) = 0$ for $z < 0$ and $1$ for $z > 0$
Leaky ReLU: $g(z) = \max(0.01z, z)$, with $g'(z) = 0.01$ for $z < 0$ and $1$ for $z > 0$
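The same derivatives in numpy form, as they are typically used in backprop (a sketch; the value at exactly z = 0 is chosen arbitrarily):

import numpy as np

def sigmoid_prime(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1 - s)

def tanh_prime(z):
    return 1.0 - np.tanh(z) ** 2

def relu_prime(z):
    return (z > 0).astype(float)

def leaky_relu_prime(z, slope=0.01):
    return np.where(z > 0, 1.0, slope)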
One hidden layer Neural Network

Formulas for computing derivatives
One hidden layer Neural Network

Backpropagation intuition (Optional)

Computing gradients
Logistic regression
[Computation graph: $(x, w, b) \rightarrow z = w^T x + b \rightarrow a = \sigma(z) \rightarrow \mathcal{L}(a, y)$]
Neural network gradients
[Computation graph with parameters $W^{[1]}, b^{[1]}, W^{[2]}, b^{[2]}$]
$z^{[1]} = W^{[1]} x + b^{[1]} \;\rightarrow\; a^{[1]} = \sigma(z^{[1]}) \;\rightarrow\; z^{[2]} = W^{[2]} a^{[1]} + b^{[2]} \;\rightarrow\; a^{[2]} = \sigma(z^{[2]}) \;\rightarrow\; \mathcal{L}(a^{[2]}, y)$
Summary of gradient descent
$dz^{[2]} = a^{[2]} - y$
$dW^{[2]} = dz^{[2]} a^{[1]T}$
$db^{[2]} = dz^{[2]}$
$dz^{[1]} = W^{[2]T} dz^{[2]} \ast g^{[1]\prime}(z^{[1]})$
$dW^{[1]} = dz^{[1]} x^T$
$db^{[1]} = dz^{[1]}$
Summary of gradient descent

Single example:
$dz^{[2]} = a^{[2]} - y$
$dW^{[2]} = dz^{[2]} a^{[1]T}$
$db^{[2]} = dz^{[2]}$
$dz^{[1]} = W^{[2]T} dz^{[2]} \ast g^{[1]\prime}(z^{[1]})$
$dW^{[1]} = dz^{[1]} x^T$
$db^{[1]} = dz^{[1]}$

Vectorized over m examples:
$dZ^{[2]} = A^{[2]} - Y$
$dW^{[2]} = \frac{1}{m} dZ^{[2]} A^{[1]T}$
$db^{[2]} = \frac{1}{m}\,\mathrm{np.sum}(dZ^{[2]}, \mathrm{axis}{=}1, \mathrm{keepdims}{=}\mathrm{True})$
$dZ^{[1]} = W^{[2]T} dZ^{[2]} \ast g^{[1]\prime}(Z^{[1]})$
$dW^{[1]} = \frac{1}{m} dZ^{[1]} X^T$
$db^{[1]} = \frac{1}{m}\,\mathrm{np.sum}(dZ^{[1]}, \mathrm{axis}{=}1, \mathrm{keepdims}{=}\mathrm{True})$
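A sketch of the vectorized backward pass as a function, assuming the hidden layer uses tanh so that $g^{[1]\prime}(Z^{[1]}) = 1 - A^{[1]2}$ (A1 and A2 come from the forward pass; Y holds the labels with shape (1, m)):

import numpy as np

def backward(X, Y, A1, A2, W2):
    m = X.shape[1]
    dZ2 = A2 - Y                                     # (1, m)
    dW2 = np.dot(dZ2, A1.T) / m                      # (1, n_h)
    db2 = np.sum(dZ2, axis=1, keepdims=True) / m     # (1, 1)
    dZ1 = np.dot(W2.T, dZ2) * (1 - A1 ** 2)          # (n_h, m)
    dW1 = np.dot(dZ1, X.T) / m                       # (n_h, n_x)
    db1 = np.sum(dZ1, axis=1, keepdims=True) / m     # (n_h, 1)
    return dW1, db1, dW2, db2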
One hidden layer Neural Network

Random Initialization

What happens if you initialize weights to zero?
[Diagram: inputs $x_1, x_2$, hidden units $a^{[1]}_1, a^{[1]}_2$, output unit $a^{[2]}_1 = \hat{y}$]
(With all weights initialized to zero, both hidden units compute the same function and receive identical gradient updates, so they stay identical; the symmetry is never broken.)
Random initialization
[Diagram: inputs $x_1, x_2$, hidden units $a^{[1]}_1, a^{[1]}_2$, output $\hat{y}$]
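A sketch of random initialization for this small network (2 inputs, 2 hidden units; the 0.01 factor keeps sigmoid/tanh units out of their flat, slow-learning regions, and zero biases are fine):

import numpy as np

n_x, n_h = 2, 2
W1 = np.random.randn(n_h, n_x) * 0.01
b1 = np.zeros((n_h, 1))
W2 = np.random.randn(1, n_h) * 0.01
b2 = np.zeros((1, 1))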
Deep Neural Networks

Deep L-layer Neural network

What is a deep neural network?
Deep Neural Networks

Forward Propagation in a Deep Network

Forward propagation in a deep network
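The general rule for layer $l$ is $z^{[l]} = W^{[l]} a^{[l-1]} + b^{[l]}$, $a^{[l]} = g^{[l]}(z^{[l]})$, with $a^{[0]} = X$. A sketch of that loop, assuming a dict params holding W1, b1, ..., WL, bL, ReLU hidden layers, and a sigmoid output layer:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    return np.maximum(0, z)

def deep_forward(X, params, L):
    A = X                                   # a[0] = X, shape (n_x, m)
    for l in range(1, L + 1):
        W, b = params["W" + str(l)], params["b" + str(l)]
        Z = np.dot(W, A) + b
        A = sigmoid(Z) if l == L else relu(Z)
    return A                                # a[L] = y_hat, shape (1, m)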
Deep Neural Networks

Backward propagation for layer l
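A sketch of the backward step for a single layer $l$, following the same pattern as the layer-2 formulas above generalized to any layer (Z and A_prev come from the forward pass; g_prime is the derivative of layer l's activation):

import numpy as np

def backward_layer(dA, Z, A_prev, W, g_prime, m):
    dZ = dA * g_prime(Z)                       # elementwise
    dW = np.dot(dZ, A_prev.T) / m
    db = np.sum(dZ, axis=1, keepdims=True) / m
    dA_prev = np.dot(W.T, dZ)                  # gradient passed back to layer l-1
    return dA_prev, dW, db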
Summary