
Machine Learning - MT 2016

11 & 12. Neural Networks

Varun Kanade

University of Oxford
November 14 & 16, 2016
Announcements

- Problem Sheet 3 due this Friday by noon

- Practical 2 this week: Compare NBC & LR

- (Optional) Reading a paper
Outline

Today, we’ll study feedforward neural networks

- Multi-layer perceptrons

- Classification or regression settings

- Backpropagation to compute gradients

- Brief introduction to TensorFlow and MNIST
Artificial Neuron : Logistic Regression

[Diagram: a single unit with inputs $x_1, x_2$, weights $w_1, w_2$, a linear function followed by a non-linearity, producing $\hat{y} = \Pr(y = 1 \mid x, w, b)$]

- A unit in a neural network computes a linear function of its input and is then composed with a non-linear activation function

- For logistic regression, the non-linear activation function is the sigmoid
  $$\sigma(z) = \frac{1}{1 + e^{-z}}$$

- The separating surface is linear
Multilayer Perceptron (MLP) : Classification

[Diagram: an MLP with inputs $x_1, x_2$, a hidden layer of two units (weights $w^2_{jk}$, biases $b^2_1, b^2_2$) and a single output unit (weights $w^3_{11}, w^3_{12}$, bias $b^3_1$) producing $\hat{y} = \Pr(y = 1 \mid x, W, b)$]
Multilayer Perceptron (MLP) : Regression

[Diagram: the same MLP architecture, with the output unit now producing $\hat{y} = \mathrm{E}[y \mid x, W, b]$]
A Toy Example

Logistic Regression Fails Badly

Solve using MLP

[Diagram: the same MLP, now labelled with hidden pre-activations and activations $z^2_1 \to a^2_1$, $z^2_2 \to a^2_2$ and output $z^3_1 \to a^3_1$, producing $\hat{y} = \Pr(y = 1 \mid x, W^i, b^i)$]

Let us use the notation:

$$a^1 = z^1 = x$$
$$z^2 = W^2 a^1 + b^2$$
$$a^2 = \tanh(z^2)$$
$$z^3 = W^3 a^2 + b^3$$
$$\hat{y} = a^3 = \sigma(z^3)$$
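To make the notation concrete, here is a minimal NumPy sketch of this forward pass (a sketch only: the parameter values below are arbitrary placeholders, not values from the lecture):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, W2, b2, W3, b3):
    """Forward pass of the 2-2-1 toy MLP: tanh hidden layer, sigmoid output."""
    a1 = x                      # a^1 = z^1 = x
    z2 = W2 @ a1 + b2           # z^2 = W^2 a^1 + b^2
    a2 = np.tanh(z2)            # a^2 = tanh(z^2)
    z3 = W3 @ a2 + b3           # z^3 = W^3 a^2 + b^3
    return sigmoid(z3)          # y_hat = a^3 = sigma(z^3)

# Arbitrary example parameters: 2 inputs, 2 hidden units, 1 output
W2 = np.array([[1.0, -1.0], [-1.0, 1.0]]); b2 = np.array([0.5, 0.5])
W3 = np.array([[1.0, 1.0]]);               b3 = np.array([-0.5])
print(forward(np.array([0.2, -0.7]), W2, b2, W3, b3))
```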
Scatterplot Comparison: $(x_1, x_2)$ vs $(a^2_1, a^2_2)$
Decision Boundary of the Neural Net

Feedforward Neural Networks
[Diagram: a four-layer network: Layer 1 (Input), Layer 2 (Hidden), Layer 3 (Hidden), Layer 4 (Output), with consecutive layers joined by fully connected layers of weights]
Computing Gradients on Toy Example

[Diagram: the toy MLP with hidden units $z^2_1 \to a^2_1$, $z^2_2 \to a^2_2$, output $z^3_1 \to a^3_1$ and loss $\ell(y, a^3_1)$]

Want the derivatives:
$$\frac{\partial \ell}{\partial w^2_{11}},\ \frac{\partial \ell}{\partial w^2_{12}},\ \frac{\partial \ell}{\partial w^2_{21}},\ \frac{\partial \ell}{\partial w^2_{22}},\ \frac{\partial \ell}{\partial w^3_{11}},\ \frac{\partial \ell}{\partial w^3_{12}},\ \frac{\partial \ell}{\partial b^2_{1}},\ \frac{\partial \ell}{\partial b^2_{2}},\ \frac{\partial \ell}{\partial b^3_{1}}$$

Would suffice to compute
$$\frac{\partial \ell}{\partial z^3_{1}},\ \frac{\partial \ell}{\partial z^2_{1}},\ \frac{\partial \ell}{\partial z^2_{2}}$$
Computing Gradients on Toy Example
Let us compute the following:

1. $\displaystyle \frac{\partial \ell}{\partial a^3_1} = -\frac{y}{a^3_1} + \frac{1-y}{1-a^3_1} = \frac{a^3_1 - y}{a^3_1 (1 - a^3_1)}$

2. $\displaystyle \frac{\partial a^3_1}{\partial z^3_1} = a^3_1 \cdot (1 - a^3_1)$

3. $\displaystyle \frac{\partial z^3_1}{\partial a^2} = [w^3_{11},\, w^3_{12}]$

4. $\displaystyle \frac{\partial a^2}{\partial z^2} = \begin{bmatrix} 1 - \tanh^2(z^2_1) & 0 \\ 0 & 1 - \tanh^2(z^2_2) \end{bmatrix}$

Then we can calculate

$$\frac{\partial \ell}{\partial z^3_1} = \frac{\partial \ell}{\partial a^3_1} \cdot \frac{\partial a^3_1}{\partial z^3_1} = a^3_1 - y$$

$$\frac{\partial \ell}{\partial z^2} = \frac{\partial \ell}{\partial a^3_1} \cdot \frac{\partial a^3_1}{\partial z^3_1} \cdot \frac{\partial z^3_1}{\partial a^2} \cdot \frac{\partial a^2}{\partial z^2} = \frac{\partial \ell}{\partial z^3_1} \cdot \frac{\partial z^3_1}{\partial a^2} \cdot \frac{\partial a^2}{\partial z^2}$$
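Continuing the forward-pass sketch above, the derivatives $\partial \ell / \partial z^3_1$ and $\partial \ell / \partial z^2$ can be computed directly from these expressions. This is a hedged sketch, assuming the cross-entropy loss and the tanh/sigmoid activations used here:

```python
import numpy as np

def toy_grads(x, y, W2, b2, W3, b3):
    """Compute dl/dz^3 and dl/dz^2 for the toy MLP with cross-entropy loss."""
    # Forward pass (as in the earlier sketch)
    a1 = x
    z2 = W2 @ a1 + b2
    a2 = np.tanh(z2)
    z3 = W3 @ a2 + b3
    a3 = 1.0 / (1.0 + np.exp(-z3))               # sigmoid output

    # Steps 1-2 combined: dl/dz^3 = a^3 - y
    dL_dz3 = a3 - y
    # Steps 3-4: dl/dz^2 = dl/dz^3 . [w^3_11, w^3_12] . diag(1 - tanh^2(z^2))
    dL_dz2 = (dL_dz3 @ W3) * (1.0 - np.tanh(z2) ** 2)
    return dL_dz3, dL_dz2
```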
[Diagram: a stack of layers from the input $x = a^1$ at the bottom, through layers $2, \ldots, l-1, l, \ldots, L-1, L$, to the loss $\ell$ at the top; the derivatives $\partial \ell / \partial z^l$ are passed downwards]

Each layer consists of a linear function and non-linear activation.

Layer $l$ consists of the following:
$$z^l = W^l a^{l-1} + b^l$$
$$a^l = f_l(z^l)$$
where $f_l$ is the non-linear activation in layer $l$.

If there are $n_l$ units in layer $l$, then $W^l$ is $n_l \times n_{l-1}$.

Backward pass to compute the derivatives $\partial \ell / \partial z^l$.
Forward Equations

(1) $a^1 = x$ (input)

(2) $z^l = W^l a^{l-1} + b^l$

(3) $a^l = f_l(z^l)$

(4) $\ell(a^L, y)$
Output Layer

[Diagram: the output layer $L$ maps $a^{L-1}$ to $z^L \to a^L$]

$$z^L = W^L a^{L-1} + b^L$$
$$a^L = f_L(z^L)$$

Loss: $\ell(y, a^L)$

$$\frac{\partial \ell}{\partial z^L} = \frac{\partial \ell}{\partial a^L} \cdot \frac{\partial a^L}{\partial z^L}$$

If there are $n_L$ (output) units in layer $L$, then $\frac{\partial \ell}{\partial a^L}$ and $\frac{\partial \ell}{\partial z^L}$ are row vectors with $n_L$ elements and $\frac{\partial a^L}{\partial z^L}$ is the $n_L \times n_L$ Jacobian matrix:

$$\frac{\partial a^L}{\partial z^L} =
\begin{bmatrix}
\frac{\partial a^L_1}{\partial z^L_1} & \frac{\partial a^L_1}{\partial z^L_2} & \cdots & \frac{\partial a^L_1}{\partial z^L_{n_L}} \\
\frac{\partial a^L_2}{\partial z^L_1} & \frac{\partial a^L_2}{\partial z^L_2} & \cdots & \frac{\partial a^L_2}{\partial z^L_{n_L}} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial a^L_{n_L}}{\partial z^L_1} & \frac{\partial a^L_{n_L}}{\partial z^L_2} & \cdots & \frac{\partial a^L_{n_L}}{\partial z^L_{n_L}}
\end{bmatrix}$$

If $f_L$ is applied element-wise, e.g., the sigmoid, then this matrix is diagonal.
Back Propagation

[Diagram: layer $l$ ($z^l \to a^l$) receives $a^{l-1}$ from below and passes $a^l$ (the inputs into layer $l+1$) upwards; the derivative $\partial \ell / \partial z^{l+1}$ is passed down from the layer above]

$$z^{l+1} = W^{l+1} a^l + b^{l+1}$$
($w^{l+1}_{j,k}$ is the weight on the connection from the $k$th unit in layer $l$ to the $j$th unit in layer $l+1$)

$$a^l = f(z^l)$$
($f$ is a non-linearity)

$\partial \ell / \partial z^{l+1}$ is the derivative passed from the layer above. Then

$$\frac{\partial \ell}{\partial z^l} = \frac{\partial \ell}{\partial z^{l+1}} \cdot \frac{\partial z^{l+1}}{\partial z^l} = \frac{\partial \ell}{\partial z^{l+1}} \cdot \frac{\partial z^{l+1}}{\partial a^l} \cdot \frac{\partial a^l}{\partial z^l} = \frac{\partial \ell}{\partial z^{l+1}} \cdot W^{l+1} \cdot \frac{\partial a^l}{\partial z^l}$$
Gradients with respect to parameters

[Diagram: layer $l$ ($z^l \to a^l$) receives $a^{l-1}$ from below; $\partial \ell / \partial z^l$ has been obtained using backpropagation]

$$z^l = W^l a^{l-1} + b^l$$
($w^l_{j,k}$ is the weight on the connection from the $k$th unit in layer $l-1$ to the $j$th unit in layer $l$)

Consider
$$\frac{\partial \ell}{\partial w^l_{ij}} = \frac{\partial \ell}{\partial z^l_i} \cdot \frac{\partial z^l_i}{\partial w^l_{ij}} = \frac{\partial \ell}{\partial z^l_i} \cdot a^{l-1}_j$$

$$\frac{\partial \ell}{\partial b^l_i} = \frac{\partial \ell}{\partial z^l_i}$$

More succinctly, we may write:
$$\frac{\partial \ell}{\partial W^l} = \left( a^{l-1} \, \frac{\partial \ell}{\partial z^l} \right)^{T} \qquad \frac{\partial \ell}{\partial b^l} = \frac{\partial \ell}{\partial z^l}$$
Forward Equations

(1) $a^1 = x$ (input)

(2) $z^l = W^l a^{l-1} + b^l$

(3) $a^l = f_l(z^l)$

(4) $\ell(a^L, y)$

Back-propagation Equations

(1) Compute $\displaystyle \frac{\partial \ell}{\partial z^L} = \frac{\partial \ell}{\partial a^L} \cdot \frac{\partial a^L}{\partial z^L}$

(2) $\displaystyle \frac{\partial \ell}{\partial z^l} = \frac{\partial \ell}{\partial z^{l+1}} \cdot W^{l+1} \cdot \frac{\partial a^l}{\partial z^l}$

(3) $\displaystyle \frac{\partial \ell}{\partial W^l} = \left( a^{l-1} \, \frac{\partial \ell}{\partial z^l} \right)^{T}$

(4) $\displaystyle \frac{\partial \ell}{\partial b^l} = \frac{\partial \ell}{\partial z^l}$
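The four forward equations and four back-propagation equations translate almost line for line into code. Below is a minimal NumPy sketch, not the course's reference implementation, assuming tanh hidden layers, a sigmoid output unit and the cross-entropy loss; its gradients can be checked against finite differences of the loss.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward_backward(x, y, Ws, bs):
    """One forward and backward pass for a fully connected network.
    Ws[l], bs[l] are the parameters of layer l+2 in the slide numbering
    (layer 1 is the input). Returns gradients w.r.t. all Ws and bs."""
    L = len(Ws)
    a, zs = [x], []                  # (1) a^1 = x
    for l in range(L):
        z = Ws[l] @ a[-1] + bs[l]    # (2) z^l = W^l a^{l-1} + b^l
        zs.append(z)
        a.append(sigmoid(z) if l == L - 1 else np.tanh(z))   # (3) a^l = f_l(z^l)

    dWs, dbs = [None] * L, [None] * L
    dz = a[-1] - y                   # back-prop (1) for sigmoid + cross-entropy
    for l in reversed(range(L)):
        dWs[l] = np.outer(dz, a[l])  # back-prop (3): (a^{l-1} dl/dz^l)^T
        dbs[l] = dz                  # back-prop (4): dl/db^l = dl/dz^l
        if l > 0:
            # back-prop (2): pass dz through W and the tanh Jacobian
            dz = (dz @ Ws[l]) * (1.0 - np.tanh(zs[l - 1]) ** 2)
    return dWs, dbs
```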
Computational Questions

What is the running time to compute the gradient for a single data point?

- As many matrix multiplications as there are fully connected layers
- Performed twice, during the forward and backward passes

What is the space requirement?

- Need to store the vectors $a^l$, $z^l$, and $\partial \ell / \partial z^l$ for each layer

Can we process multiple examples together?

- Yes, if we minibatch, we perform tensor operations
- Make sure that all parameters fit in GPU memory
Training Deep Neural Networks

- Back-propagation gives the gradient

- Stochastic gradient descent is the method of choice

- Regularisation
  - How do we add $\ell_1$ or $\ell_2$ regularisation? (see the sketch below)
  - Don't regularise the bias terms

- How about convergence?

- What did we learn in the last 10 years that we didn't know in the 80s?
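As a concrete sketch of the regularisation point above (assuming plain SGD; the learning rate and decay strength are illustrative defaults), an $\ell_2$ penalty can be added to the weight matrices while leaving the biases untouched:

```python
def sgd_step(Ws, bs, dWs, dbs, lr=0.1, l2=1e-4):
    """One SGD update; the l2 penalty is applied to weights only, not biases."""
    for l in range(len(Ws)):
        Ws[l] -= lr * (dWs[l] + l2 * Ws[l])   # gradient of (l2/2)*||W||^2 is l2*W
        bs[l] -= lr * dbs[l]                  # bias terms are not regularised
    return Ws, bs
```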
Training Feedforward Deep Networks
[Diagram: the four-layer fully connected network again]

Why do we get a non-convex optimisation problem?

All units in a layer are symmetric, hence the objective is invariant to permutations of the units.
A toy example

[Diagram: a single sigmoid unit with input $x$, weight $w^2_1$ and bias $b^2_1$, computing $z^2_1 \to a^2_1$. Target is $y = \frac{1-x}{2}$, with $x \in \{-1, 1\}$.]

[Plot: the sigmoid $\sigma(z^2_1)$ over $z^2_1 \in [-8, 8]$, flat for large $|z^2_1|$.]

Squared Loss Function

$$\ell(a^2_1, y) = (a^2_1 - y)^2$$
$$\frac{\partial \ell}{\partial z^2_1} = 2(a^2_1 - y) \cdot \frac{\partial a^2_1}{\partial z^2_1} = 2(a^2_1 - y)\,\sigma'(z^2_1)$$

If $x = -1$, $w^2_1 \approx 5$, $b^2_1 \approx 0$, then $\sigma'(z^2_1) \approx 0$.

Cross-Entropy Loss Function

$$\ell(a^2_1, y) = -\big(y \log a^2_1 + (1 - y) \log(1 - a^2_1)\big)$$
$$\frac{\partial \ell}{\partial z^2_1} = \frac{a^2_1 - y}{a^2_1 (1 - a^2_1)} \cdot \frac{\partial a^2_1}{\partial z^2_1} = a^2_1 - y$$
Propagating Gradients Backwards

[Diagram: a chain of three sigmoid units: $x = a^1_1$ feeds through weights $w^2_1, w^3_1, w^4_1$ and biases $b^2_1, b^3_1, b^4_1$ to produce $a^4_1$]

- Cross-entropy loss: $\ell(a^4_1, y) = -\big(y \log a^4_1 + (1 - y) \log(1 - a^4_1)\big)$

- $\displaystyle \frac{\partial \ell}{\partial z^4_1} = a^4_1 - y$

- $\displaystyle \frac{\partial \ell}{\partial z^3_1} = \frac{\partial \ell}{\partial z^4_1} \cdot \frac{\partial z^4_1}{\partial a^3_1} \cdot \frac{\partial a^3_1}{\partial z^3_1} = (a^4_1 - y) \cdot w^4_1 \cdot \sigma'(z^3_1)$

- $\displaystyle \frac{\partial \ell}{\partial z^2_1} = \frac{\partial \ell}{\partial z^3_1} \cdot \frac{\partial z^3_1}{\partial a^2_1} \cdot \frac{\partial a^2_1}{\partial z^2_1} = (a^4_1 - y) \cdot w^4_1 \cdot \sigma'(z^3_1) \cdot w^3_1 \cdot \sigma'(z^2_1)$

- Saturation: when the output of an artificial neuron is in the 'flat' part of the activation, e.g., where $\sigma'(z) \approx 0$ for the sigmoid

- Vanishing Gradient Problem: multiplying several $\sigma'(z^l_i)$ together makes the gradient $\approx 0$ when we have a large number of layers

- For example, when using the sigmoid activation, $\sigma'(z) \in [0, 1/4]$
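A quick numerical illustration of the vanishing gradient problem (my own sketch, not from the slides): since $\sigma'(z) \le 1/4$, a chain of sigmoid units shrinks the gradient geometrically even in the best case $z = 0$ and with weights of size 1.

```python
import numpy as np

def sigmoid_prime(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)

# Product of sigma'(z) * w along a chain of units, best case z = 0, weight w = 1
for depth in [2, 5, 10, 20]:
    factor = np.prod([sigmoid_prime(0.0) * 1.0 for _ in range(depth)])
    print(depth, factor)   # 0.0625, ~9.8e-04, ~9.5e-07, ~9.1e-13
```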
Avoiding Saturation

[Plot: the rectifier $f(z) = \max(0, z)$ over $z \in [-3, 3]$]

Use rectified linear units.

Rectifier non-linearity:
$$f(z) = \max(0, z)$$

Rectified Linear Unit (ReLU):
$$\max(0, a \cdot w + b)$$

You can also use $f(z) = |z|$.

Other variants: leaky ReLUs, parametric ReLUs.
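For reference, a minimal sketch of these activations and their derivatives (the leaky slope of 0.01 is a common default, not a value given in the slides):

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def relu_prime(z):
    return (z > 0).astype(float)          # 0 for z < 0, 1 for z > 0

def leaky_relu(z, alpha=0.01):
    return np.where(z > 0, z, alpha * z)  # small slope instead of exact zero

def leaky_relu_prime(z, alpha=0.01):
    return np.where(z > 0, 1.0, alpha)
```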
Initialising Weights and Biases

Initialising is important when minimising non-convex functions. We may get very different results depending on where we start the optimisation.

Suppose we were using a sigmoid unit; how would you initialise the weights? (A small code sketch follows below.)

- Suppose $z = \sum_{i=1}^{D} w_i a_i$
- E.g., choose $w_i \in \left[-\frac{1}{\sqrt{D}}, \frac{1}{\sqrt{D}}\right]$ at random

What if it were a ReLU unit?

- You can initialise similarly

How about the biases?

- For sigmoid, can use 0 or a random value around 0
- For ReLU, should use a small positive constant
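A hedged sketch of these initialisation rules; the layer sizes and the 0.01 bias constant are illustrative choices, not values prescribed by the course.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_layer(n_out, n_in, activation="sigmoid"):
    """Initialise one fully connected layer of shape (n_out, n_in)."""
    limit = 1.0 / np.sqrt(n_in)                  # w_i in [-1/sqrt(D), 1/sqrt(D)]
    W = rng.uniform(-limit, limit, size=(n_out, n_in))
    if activation == "relu":
        b = 0.01 * np.ones(n_out)                # small positive constant for ReLU
    else:
        b = np.zeros(n_out)                      # zero biases for sigmoid/tanh
    return W, b

W2, b2 = init_layer(100, 784, activation="relu")  # e.g., a hidden layer for MNIST inputs
W3, b3 = init_layer(10, 100)                      # output layer
```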
Avoiding Overfitting

Deep Neural Networks have a lot of parameters

- Fully connected layers with $n_1, n_2, \ldots, n_L$ units have at least $n_1 n_2 + n_2 n_3 + \cdots + n_{L-1} n_L$ parameters

- For Problem Sheet 4, you will be asked to train an MLP for digit recognition with 2 million parameters and only 60,000 training images

- For image detection, one of the most famous models, the neural net used by Krizhevsky, Sutskever, Hinton (2012), has 60 million parameters and 1.2 million training images

- How do we prevent deep neural networks from overfitting?
Early Stopping

Maintain a validation set and stop training when the error on the validation set stops decreasing. (A sketch of such a loop follows below.)

What are the computational costs?

- Need to compute the validation error
- Can do this every few iterations to reduce overhead

What are the advantages?

- If the validation error flattens, or starts increasing, we can stop the optimisation
- Prevents overfitting

See the paper by Hardt, Recht and Singer (2015).
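A sketch of an early-stopping training loop; the `train_step`, `validation_error` and `patience` pieces are illustrative placeholders, not part of the course code.

```python
import copy

def train_with_early_stopping(params, train_step, validation_error,
                              max_iters=10000, check_every=100, patience=5):
    """Stop when the validation error has not improved for `patience` checks."""
    best_err, best_params, bad_checks = float("inf"), copy.deepcopy(params), 0
    for it in range(max_iters):
        params = train_step(params)           # one SGD / minibatch update
        if (it + 1) % check_every == 0:       # validate every few iterations
            err = validation_error(params)
            if err < best_err:
                best_err, best_params, bad_checks = err, copy.deepcopy(params), 0
            else:
                bad_checks += 1
                if bad_checks >= patience:    # validation error stopped improving
                    break
    return best_params
```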
Add Data: Modified Data
Typically, getting additional data is either impossible or expensive

Fake the data!

Images can be translated slightly, rotated slightly, have their brightness changed, etc.

Google Offline Translate trained on entirely fake data!

Google Research Blog


Add Data: Adversarial Training

Take a trained (or partially trained) model.

Create examples by modifications "imperceptible to the human eye", but where the model fails.

Szegedy et al. and Goodfellow et al.
Other Ideas to Reduce Overfitting

Hard constraints on weights

Gradient Clipping

Inject noise into the system

Enforce sparsity in the neural network

Unsupervised Pre-training
(Bengio et al.)

Bagging (Bootstrap Aggregation)

Bagging (Leo Breiman - 1994)

- Given a dataset $D = \langle (x_i, y_i) \rangle_{i=1}^{N}$, sample $D_1, D_2, \ldots, D_k$ of size $N$ from $D$ with replacement

- Train classifiers $f_1, \ldots, f_k$ on $D_1, \ldots, D_k$

- When predicting, use the majority vote (or the average if using regression), as in the sketch below

- Clearly this approach is not practical for deep networks
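A hedged sketch of the bagging procedure for a generic binary classifier; the `train` and `predict` callables are placeholders.

```python
import numpy as np

def bagging_fit(X, y, train, k=10, seed=0):
    """Train k classifiers on bootstrap resamples of the numpy arrays (X, y)."""
    rng = np.random.default_rng(seed)
    N = len(X)
    models = []
    for _ in range(k):
        idx = rng.integers(0, N, size=N)      # sample N points with replacement
        models.append(train(X[idx], y[idx]))
    return models

def bagging_predict(models, X, predict):
    """Majority vote over the k classifiers (labels in {0, 1})."""
    votes = np.stack([predict(m, X) for m in models])   # shape (k, num_examples)
    return (votes.mean(axis=0) >= 0.5).astype(int)
```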
Dropout

- For each input x, drop each hidden unit with probability 1/2, independently

- Every input will have a potentially different mask

- Exponentially many potentially different models, but they share the "same weights"

- After training, the whole network is used, with all the weights halved (see the sketch below)

Srivastava, Hinton, Krizhevsky, 2014
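A minimal sketch of dropout on one hidden layer with drop probability 1/2. At test time the sketch scales the layer's activations by the keep probability, which is equivalent to halving the weights these activations feed into; this is an illustration of the idea, not the implementation from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def hidden_layer(a_prev, W, b, train=True, p_keep=0.5):
    """tanh hidden layer with dropout applied to its output activations."""
    a = np.tanh(W @ a_prev + b)
    if train:
        mask = rng.random(a.shape) < p_keep   # a fresh mask for every input
        return a * mask                       # drop each unit with probability 1/2
    # At test time the whole network is used; scaling by p_keep here is
    # equivalent to halving the weights that these activations feed into.
    return a * p_keep
```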
Errors Made by MLP for Digit Recognition

Avoiding Overfitting

- Use parameter sharing, a.k.a. weight tying, in the model

- Exploit invariances to translation, rotation, etc.

- Exploit locality in images, audio, text, etc.

- Convolutional Neural Networks (convnets)
Convolutional Neural Networks (convnets)

(Fukushima, LeCun, Hinton 1980s)

Image Convolution

Source: L. W. Kheng

Convolution

In general, a convolution filter $f$ is a tensor of dimension $W_f \times H_f \times F_l$, where $F_l$ is the number of channels in the previous layer.

Strides in the $x$ and $y$ directions dictate which convolutions are computed to obtain the next layer.

Zero-padding can be used if required to adjust layer sizes and boundaries.

Typically, a convolution layer will have a large number of filters; the number of channels in the next layer will be the same as the number of filters used.
Source: Krizhevsky, Sutskever, Hinton (2012)

Sources: Krizhevsky, Sutskever, Hinton (2012); Wikipedia

Source: Krizhevsky, Sutskever, Hinton (2012)

Source: Zeiler and Fergus (2013)

Source: Zeiler and Fergus (2013)
Convolutional Layer

Suppose that there is no zero padding and the strides in both directions are 1.

$$z^{l+1}_{i',j',f'} = b_{f'} + \sum_{i=1}^{W_{f'}} \sum_{j=1}^{H_{f'}} \sum_{f=1}^{F_l} a^l_{i'+i-1,\, j'+j-1,\, f}\; w^{l+1,f'}_{i,j,f}$$

$$\frac{\partial z^{l+1}_{i',j',f'}}{\partial w^{l+1,f'}_{i,j,f}} = a^l_{i'+i-1,\, j'+j-1,\, f}$$

$$\frac{\partial \ell}{\partial w^{l+1,f'}_{i,j,f}} = \sum_{i',j'} \frac{\partial \ell}{\partial z^{l+1}_{i',j',f'}} \cdot a^l_{i'+i-1,\, j'+j-1,\, f}$$
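A direct, deliberately naive loop-based NumPy sketch of this forward equation for stride 1 and no padding; the array layout (height, width, channels) is an assumption for illustration.

```python
import numpy as np

def conv_forward(a, w, b):
    """a: (H, W, F_l) input; w: (Hf, Wf, F_l, F_next) filters; b: (F_next,) biases.
    Returns z of shape (H - Hf + 1, W - Wf + 1, F_next): stride 1, no padding."""
    H, W, _ = a.shape
    Hf, Wf, _, F_next = w.shape
    z = np.zeros((H - Hf + 1, W - Wf + 1, F_next))
    for i_out in range(H - Hf + 1):
        for j_out in range(W - Wf + 1):
            patch = a[i_out:i_out + Hf, j_out:j_out + Wf, :]   # the receptive field
            for f_out in range(F_next):
                # sum over filter positions and input channels, plus the bias
                z[i_out, j_out, f_out] = b[f_out] + np.sum(patch * w[..., f_out])
    return z
```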
Convolutional Layer

Suppose that there is no zero padding and the strides in both directions are 1.

$$z^{l+1}_{i',j',f'} = b_{f'} + \sum_{i=1}^{W_{f'}} \sum_{j=1}^{H_{f'}} \sum_{f=1}^{F_l} a^l_{i'+i-1,\, j'+j-1,\, f}\; w^{l+1,f'}_{i,j,f}$$

$$\frac{\partial z^{l+1}_{i',j',f'}}{\partial a^l_{i,j,f}} = w^{l+1,f'}_{i-i'+1,\, j-j'+1,\, f}$$

$$\frac{\partial \ell}{\partial a^l_{i,j,f}} = \sum_{i',j',f'} \frac{\partial \ell}{\partial z^{l+1}_{i',j',f'}} \cdot w^{l+1,f'}_{i-i'+1,\, j-j'+1,\, f}$$
Max-Pooling Layer

Let $\Omega(i', j')$ be the set of $(i, j)$ pairs in the previous layer that are involved in the maxpool.

$$s^{l+1}_{i',j'} = \max_{(i,j) \in \Omega(i',j')} a^l_{i,j}$$

$$\frac{\partial s^{l+1}_{i',j'}}{\partial a^l_{i,j}} = \mathbb{I}\left( (i, j) = \operatorname*{argmax}_{(\tilde{i},\tilde{j}) \in \Omega(i',j')} a^l_{\tilde{i},\tilde{j}} \right)$$
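A minimal NumPy sketch of max-pooling for a single channel; the 2x2 pool with stride 2 is an assumed configuration. The returned mask records the argmax in each window $\Omega(i', j')$, which is exactly the indicator used to route gradients back.

```python
import numpy as np

def maxpool_forward(a, pool=2):
    """a: (H, W) single-channel activations. Returns the pooled map s and a mask
    that is 1 at each pool window's argmax (used to route gradients backwards)."""
    H, W = a.shape
    s = np.zeros((H // pool, W // pool))
    mask = np.zeros_like(a)
    for i in range(0, H - pool + 1, pool):
        for j in range(0, W - pool + 1, pool):
            window = a[i:i + pool, j:j + pool]
            s[i // pool, j // pool] = window.max()
            k = np.unravel_index(window.argmax(), window.shape)  # argmax in Omega(i', j')
            mask[i + k[0], j + k[1]] = 1.0
    return s, mask
```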
Next Week

- The practical will be about training neural networks on the MNIST dataset

- Time permitting, implement one problem on the sheet in TensorFlow

- Start Unsupervised Learning

- Revise eigenvectors and eigenvalues (Problem 4 on Sheet 3)
