Lab09 Assignment
Outline
• 1 - Packages
• 2 - Neural Networks
– 2.1 Problem Statement
– 2.2 Dataset
– 2.3 Model representation
– 2.4 Tensorflow Model Implementation
• Exercise 1
– 2.5 NumPy Model Implementation (Forward Prop in NumPy)
• Exercise 2
– 2.6 Vectorized NumPy Model Implementation (Optional)
• Exercise 3
– 2.7 Congratulations!
– 2.8 NumPy Broadcasting Tutorial (Optional)
NOTE: To prevent errors from the autograder, you are not allowed to edit or delete non-graded
cells in this notebook. Please also refrain from adding any new cells. Once you have passed this
assignment and want to experiment with any of the non-graded code, you may follow the
instructions at the bottom of this notebook.
1 - Packages
First, let's run the cell below to import all the packages that you will need during this
assignment.
import numpy as np
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
import logging
logging.getLogger("tensorflow").setLevel(logging.ERROR)
tf.autograph.set_verbosity(0)
2 - Neural Networks
In Course 1, you implemented logistic regression. This was extended to handle non-linear
decision boundaries by adding polynomial features. For even more complex scenarios such as
image recognition, neural networks are preferred.
This exercise will show you how the methods you have learned can be used for this classification
task.
2.2 Dataset
You will start by loading the dataset for this task.
• The load_data() function shown below loads the data into variables X and y
• The data set contains 1000 training examples of handwritten digits, here limited to zero
and one.
– Each training example is a 20-pixel x 20-pixel grayscale image of the digit.
• Each pixel is represented by a floating-point number indicating the
grayscale intensity at that location.
• The 20 by 20 grid of pixels is “unrolled” into a 400-dimensional vector.
• Each training example becomes a single row in our data matrix X.
• This gives us a 1000 x 400 matrix X where every row is a training example
of a handwritten digit image.
$$X = \begin{pmatrix}
--- \left( x^{(1)} \right) --- \\
--- \left( x^{(2)} \right) --- \\
\vdots \\
--- \left( x^{(m)} \right) ---
\end{pmatrix}$$
• The second part of the training set is a 1000 x 1 dimensional vector y that contains labels
for the training set
– y = 0 if the image is of the digit 0, y = 1 if the image is of the digit 1.
# load dataset
X, y = load_data()
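A quick sanity check of the shapes described above (a minimal sketch, not one of the notebook's graded cells):

print('The shape of X is: ' + str(X.shape))   # expect (1000, 400)
print('The shape of y is: ' + str(y.shape))   # expect (1000, 1)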
• A good place to start is to print out each variable and see what it contains.
• In the cell below, the code randomly selects 64 rows from X, maps each row back to a 20
pixel by 20 pixel grayscale image and displays the images together.
• The label for each image is displayed above the image
import warnings
warnings.simplefilter(action='ignore', category=FutureWarning)
# You do not need to modify anything in this cell
m, n = X.shape
• layer1: The shape of W1 is (400, 25) and the shape of b1 is (25,)
• layer2: The shape of W2 is (25, 15) and the shape of b2 is (15,)
• layer3: The shape of W3 is (15, 1) and the shape of b3 is (1,)
Note: It is also possible to add an input layer that specifies the input dimension of the
first layer. For example:
tf.keras.Input(shape=(400,)), #specify input shape
We will include that here to illuminate some model sizing.
Exercise 1
Below, use the Keras Sequential model and Dense layer with a sigmoid activation to construct the
network described above.
# UNQ_C1
# GRADED CELL: Sequential model

model = Sequential(
    [
        tf.keras.Input(shape=(400,)),    # Specify input size
        # Add the three Dense layers described above here
    ], name="my_model"
)

model.summary()
Model: "my_model"
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Layer (type)              ┃ Output Shape          ┃   Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ dense (Dense)             │ (None, 25)            │    10,025 │
├───────────────────────────┼───────────────────────┼───────────┤
│ dense_1 (Dense)           │ (None, 15)            │       390 │
├───────────────────────────┼───────────────────────┼───────────┤
│ dense_2 (Dense)           │ (None, 1)             │        16 │
└───────────────────────────┴───────────────────────┴───────────┘
Expected Output (Click to Expand)
The model.summary() function displays a useful summary of the model. Because we have
specified an input layer size, the shapes of the weight and bias arrays are determined and the
total number of parameters per layer can be shown. Note, the names of the layers may vary as
they are auto-generated.
Model: "my_model"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
dense (Dense) (None, 25) 10025
_________________________________________________________________
dense_1 (Dense) (None, 15) 390
_________________________________________________________________
dense_2 (Dense) (None, 1) 16
=================================================================
Total params: 10,431
Trainable params: 10,431
Non-trainable params: 0
_________________________________________________________________
model = Sequential(
    [
        tf.keras.Input(shape=(400,)),    # specify input size (optional)
        Dense(25, activation='sigmoid'),
        Dense(15, activation='sigmoid'),
        Dense(1,  activation='sigmoid')
    ], name = "my_model"
)
# UNIT TESTS
from public_tests import *
test_c1(model)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-9-a0cf083b9265> in <cell line: 3>()
      1 # UNIT TESTS
      2 from public_tests import *
----> 3 test_c1(model)

/content/public_tests.py in test_c1(target)
      8     assert len(target.layers) == 3, \
      9         f"Wrong number of layers. Expected 3 but got {len(target.layers)}"
---> 10     assert target.input.shape.as_list() == [None, 400], \
     11         f"Wrong input shape. Expected [None, 400] but got {target.input.shape.as_list()}"
     12     i = 0

/usr/local/lib/python3.10/dist-packages/keras/src/ops/operation.py in input(self)
    252         Input tensor or list of input tensors.
    253         """
--> 254         return self._get_node_attribute_at_index(0, "input_tensors", "input")
    255
    256     @property

/usr/local/lib/python3.10/dist-packages/keras/src/ops/operation.py in _get_node_attribute_at_index(self, node_index, attr, attr_name)
    283         """
    284         if not self._inbound_nodes:
--> 285             raise ValueError(
    286                 f"The layer {self.name} has never been called "
    287                 f"and thus has no defined {attr_name}."

ValueError: The layer my_model has never been called and thus has no defined input.
The parameter counts shown in the summary correspond to the number of elements in the
weight and bias arrays as shown below.
We can examine details of the model by first extracting the layers with model.layers and then
extracting the weights with layerx.get_weights() as shown below.
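As a reference, a minimal sketch of that inspection (unpacking model.layers into layer1, layer2, layer3, the same names used again in the NumPy section below); note how the parameter counts in the summary follow directly from these shapes:

[layer1, layer2, layer3] = model.layers
W1, b1 = layer1.get_weights()   # W1: (400, 25), b1: (25,) -> 400*25 + 25 = 10,025 params
W2, b2 = layer2.get_weights()   # W2: (25, 15),  b2: (15,) -> 25*15 + 15  = 390 params
W3, b3 = layer3.get_weights()   # W3: (15, 1),   b3: (1,)  -> 15*1  + 1   = 16 params
print(f"W1 shape = {W1.shape}, b1 shape = {b1.shape}")
print(f"W2 shape = {W2.shape}, b2 shape = {b2.shape}")
print(f"W3 shape = {W3.shape}, b3 shape = {b3.shape}")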
Expected Output
print(model.layers[2].weights)
The following code will define a loss function and run gradient descent to fit the weights of the
model to the training data. This will be explained in more detail in the following week.
model.compile(
loss=tf.keras.losses.BinaryCrossentropy(),
optimizer=tf.keras.optimizers.Adam(0.001),
)
model.fit(
X,y,
epochs=20
)
Epoch 1/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 2s 2ms/step - loss: 0.4844
Epoch 2/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.1168
Epoch 3/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0355
Epoch 4/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0176
Epoch 5/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 1ms/step - loss: 0.0102
Epoch 6/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0059
Epoch 7/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 1ms/step - loss: 0.0090
Epoch 8/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0048
Epoch 9/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0042
Epoch 10/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0043
Epoch 11/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0024
Epoch 12/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 1ms/step - loss: 0.0067
Epoch 13/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 0.0015
Epoch 14/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 1ms/step - loss: 0.0022
Epoch 15/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 1ms/step - loss: 0.0014
Epoch 16/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 1ms/step - loss: 0.0020
Epoch 17/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 9.1903e-04
Epoch 18/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 8.8541e-04
Epoch 19/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 9.5468e-04
Epoch 20/20
32/32 ━━━━━━━━━━━━━━━━━━━━ 0s 2ms/step - loss: 7.5980e-04
<keras.src.callbacks.history.History at 0x7b8f48a7e9b0>
To run the model on an example to make a prediction, use Keras predict. The input to
predict is an array so the single example is reshaped to be two dimensional.
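A sketch of those two prediction calls (the indices X[0] and X[500] are assumptions, chosen to match the zero and one examples used with the NumPy model later in this notebook):

prediction = model.predict(X[0].reshape(1, 400))    # a zero
print(f" predicting a zero: {prediction}")
prediction = model.predict(X[500].reshape(1, 400))  # a one
print(f" predicting a one:  {prediction}")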
The output of the model is interpreted as a probability. In the first example above, the input is a
zero. The model predicts the probability that the input is a one is nearly zero. In the second
example, the input is a one. The model predicts the probability that the input is a one is nearly
one. As in the case of logistic regression, the probability is compared to a threshold to make a
final prediction.
Let's compare the predictions vs the labels for a random sample of 64 digits. This takes a
moment to run.
import warnings
warnings.simplefilter(action='ignore', category=FutureWarning)
# You do not need to modify anything in this cell
m, n = X.shape
Exercise 2
Below, build a dense layer subroutine. The example in lecture utilized a for loop to visit each unit
(j) in the layer and perform the dot product of the weights for that unit (W[:,j]) and sum the
bias for the unit (b[j]) to form z. An activation function g(z) is then applied to that result. This
section will not utilize some of the matrix operations described in the optional lectures. These
will be explored in a later section.
# UNQ_C2
# GRADED FUNCTION: my_dense
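The body of the graded cell is yours to complete. For reference, a minimal sketch that follows the loop-based description above (one possible implementation, not necessarily the graded solution; sigmoid is assumed to be provided by the lab's helper code):

def my_dense(a_in, W, b, g):
    # a_in: (n,) input to the layer, W: (n, j) weight matrix, b: (j,) bias, g: activation
    units = W.shape[1]
    a_out = np.zeros(units)
    for j in range(units):
        z = np.dot(a_in, W[:, j]) + b[j]   # weighted sum plus bias for unit j
        a_out[j] = g(z)                    # apply the activation
    return a_out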
# Quick Check
x_tst = 0.1*np.arange(1,3,1).reshape(2,)   # (1 example, 2 features)
W_tst = 0.1*np.arange(1,7,1).reshape(2,3)  # (2 input features, 3 output features)
b_tst = 0.1*np.arange(1,4,1).reshape(3,)   # (3 features)
A_tst = my_dense(x_tst, W_tst, b_tst, sigmoid)
print(A_tst)
Expected Output
# UNIT TESTS
test_c2(my_dense)
The following cell builds a three-layer neural network utilizing the my_dense subroutine above.
def my_sequential(x, W1, b1, W2, b2, W3, b3):
a1 = my_dense(x, W1, b1, sigmoid)
a2 = my_dense(a1, W2, b2, sigmoid)
a3 = my_dense(a2, W3, b3, sigmoid)
return(a3)
W1_tmp,b1_tmp = layer1.get_weights()
W2_tmp,b2_tmp = layer2.get_weights()
W3_tmp,b3_tmp = layer3.get_weights()
# make predictions
prediction = my_sequential(X[0], W1_tmp, b1_tmp, W2_tmp, b2_tmp, W3_tmp, b3_tmp)
if prediction >= 0.5:
    yhat = 1
else:
    yhat = 0
print("yhat = ", yhat, " label= ", y[0,0])
prediction = my_sequential(X[500], W1_tmp, b1_tmp, W2_tmp, b2_tmp, W3_tmp, b3_tmp)
if prediction >= 0.5:
    yhat = 1
else:
    yhat = 0
print("yhat = ", yhat, " label= ", y[500,0])
yhat = 0 label= 0
yhat = 0 label= 1
Run the following cell to see predictions from both the Numpy model and the Tensorflow
model. This takes a moment to run.
import warnings
warnings.simplefilter(action='ignore', category=FutureWarning)
# You do not need to modify anything in this cell
m, n = X.shape
We can demonstrate this using the examples X and the W1,b1 parameters above. We use
np.matmul to perform the matrix multiply. Note, the dimensions of x and W must be
compatible as shown in the diagram above.
x = X[0].reshape(-1,1) # column vector (400,1)
z1 = np.matmul(x.T,W1) + b1 # (1,400)(400,25) = (1,25)
a1 = sigmoid(z1)
print(a1.shape)
(1, 25)
You can take this a step further and compute all the units for all examples in one Matrix-Matrix
operation.
The full operation is $Z = XW + b$. This will utilize NumPy broadcasting to expand $b$ to $m$ rows.
If this is unfamiliar, a short tutorial is provided at the end of the notebook.
Exercise 3
Below, compose a new my_dense_v subroutine that performs the layer calculations for a
matrix of examples. This will utilize np.matmul().
Note: This function is not graded because it is discussed in the optional lectures on
vectorization. If you didn't go through them, feel free to click the hints below the expected code
to see the code. You can also submit the notebook even with a blank answer here.
# UNQ_C3
# UNGRADED FUNCTION: my_dense_v
Expected Output
Click for hints: In matrix form, this can be written in one or two lines.
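One possible sketch consistent with $Z = XW + b$ above (an assumption about the intended hint; broadcasting adds b to every row of the (m, j) product):

def my_dense_v(A_in, W, b, g):
    # A_in: (m, n) matrix of examples, W: (n, j) weights, b: (j,) bias, g: activation
    return g(np.matmul(A_in, W) + b)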
# UNIT TESTS
test_c3(my_dense_v)
The following cell builds a three-layer neural network utilizing the my_dense_v subroutine
above.
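As a reference, a sketch of that cell paralleling my_sequential above but operating on the full matrix of examples (the exact contents are an assumption):

def my_sequential_v(X, W1, b1, W2, b2, W3, b3):
    A1 = my_dense_v(X,  W1, b1, sigmoid)
    A2 = my_dense_v(A1, W2, b2, sigmoid)
    A3 = my_dense_v(A2, W3, b3, sigmoid)
    return(A3)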
W1_tmp,b1_tmp = layer1.get_weights()
W2_tmp,b2_tmp = layer2.get_weights()
W3_tmp,b3_tmp = layer3.get_weights()
Let's make a prediction with the new model. This will make a prediction on all of the examples at
once. Note the shape of the output.
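A sketch of that prediction cell (names assumed, matching the parameters extracted above); the shape printed below is its output:

Prediction = my_sequential_v(X, W1_tmp, b1_tmp, W2_tmp, b2_tmp, W3_tmp, b3_tmp)
print(Prediction.shape)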
(1000, 1)
Run the following cell to see predictions. This will use the predictions we just calculated above.
This takes a moment to run.
import warnings
warnings.simplefilter(action='ignore', category=FutureWarning)
# You do not need to modify anything in this cell
m, n = X.shape
for i, ax in enumerate(axes.flat):
    # Select random indices
    random_index = np.random.randint(m)
2.7 Congratulations!
You have successfully built and utilized a neural network.
2.8 NumPy Broadcasting Tutorial (Optional)
When operating on two arrays, NumPy compares their shapes element-wise. It starts with the
trailing (i.e. rightmost) dimensions and works its way left. Two dimensions are compatible when
• they are equal, or
• one of them is 1.
If these conditions are not met, a ValueError: operands could not be broadcast together
exception is thrown, indicating that the arrays have incompatible shapes. The size of the
resulting array is the size that is not 1 along each axis of the inputs.
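A small sketch of the error case (shapes chosen purely for illustration):

try:
    a = np.array([1,2,3,4]).reshape(-1,1)    # (4,1)
    c = np.array([[1,2,3],[4,5,6]])          # (2,3): the leading dims 4 and 2 don't match
    print(a + c)
except ValueError as e:
    print(e)    # operands could not be broadcast together ...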
The graphic below describes expanding dimensions. Note the red text below:
The graphic above shows NumPy expanding the arguments to match before the final operation.
Note that this is a notional description. The actual mechanics of NumPy operation choose the
most efficient implementation.
For each of the following examples, try to guess the size of the result before running the
example.
a = np.array([1,2,3]).reshape(-1,1) #(3,1)
b = 5
print(f"(a + b).shape: {(a + b).shape}, \na + b = \n{a + b}")
a = np.array([1,2,3]).reshape(-1,1) #(3,1)
b = 5
print(f"(a * b).shape: {(a * b).shape}, \na * b = \n{a * b}")
a = np.array([1,2,3,4]).reshape(-1,1)
b = np.array([1,2,3]).reshape(1,-1)
print(a)
print(b)
print(f"(a + b).shape: {(a + b).shape}, \na + b = \n{a + b}")
[[1]
[2]
[3]
[4]]
[[1 2 3]]
(a + b).shape: (4, 3),
a + b =
[[2 3 4]
[3 4 5]
[4 5 6]
[5 6 7]]
This is the scenario in the dense layer you built above: adding a 1-D vector b to an (m,j) matrix.
Matrix + 1-D Vector
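For example, a small sketch of that case (values chosen for illustration):

a = np.array([[1,2,3],[4,5,6]])   # (2,3) matrix
b = np.array([10,20,30])          # (3,) 1-D vector, broadcast across each row
print(f"(a + b).shape: {(a + b).shape}, \na + b = \n{a + b}")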