0% found this document useful (0 votes)

40 views28 pages

Deep Learning

Uploaded by

Utpalika Acharya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views28 pages

Deep Learning

Uploaded by

Utpalika Acharya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

What is deep learning

Deep learning is a subset of machine learning, which is a subset of artificial

intelligence. Artificial intelligence is a general term that refers to
techniques that enable computers to mimic human behavior. Machine
learning represents a set of algorithms trained on data that make all of this
possible. Deep learning is just a type of machine learning, inspired by the
structure of the human brain.

The first advantage of deep learning over machine learning is

the redundancy of the so-called feature extraction.
Here’s how it works: A more and more abstract and
compressed representation of the raw data is produced over
several layers of an artificial neural net. We then use this
compressed representation of the input data to produce the
result. The result can be, for example, the classification of the
input data into different classes.
❖In a machine learning model, to determine if a particular image is
showing a car or not, we humans first need to identify the unique
features of a car (shape, size, windows, wheels, etc.), then extract the
feature and give it to the algorithm as input data. In this way, the
algorithm would perform a classification of the images. That is, in
machine learning, a programmer must intervene directly in the
action for the model to come to a conclusion.

❖In the case of a deep learning model, the feature extraction step is
completely unnecessary. The model would recognize these unique
characteristics of a car and make correct predictions without human
intervention.
TensorFlow
TensorFlow is an open-source platform for machine learning and a
symbolic math library that is used for machine learning applications.

Keras
It is an Open Source Neural Network library that runs on top of Theano or
Tensorflow. It is designed to be fast and easy for the user to use. It is a
useful library to construct any deep learning algorithm of whatever choice
we want.

Difference between TensorFlow and Keras:

S.N
TensorFlow Keras
o

Tensorhigh-performan
1. ceFlow is written in Keras is written in Python.
C++, CUDA, Python.

TensorFlow is used for

large datasets and Keras is usually used for
2.
high performance small datasets.
models.

TensorFlow is a
framework that offers
3. Keras is a high-Level API.
both high and
low-level APIs.

TensorFlow is used for

Keras is used for
4. high-performance
low-performance models.
models.

In Keras framework, there

In TensorFlow
is only minimal
5. performing debugging
requirement for debugging
leads to complexities.
the simple networks.

TensorFlow has a Keras has a simple

6. complex architecture architecture and easy to
and not easy to use. use.

TensorFlow was Keras was developed by

7. developed by the François Chollet while he
Google Brain team. was working on the part of
the research effort of
project ONEIROS.

Processing Power
Deep learning can require significant processing power. Complex models
trained on bigdata datasets can take hours, days or even more to train. The
models we present in this chapter can be trained in minutes to just less
than an hour on computers with conventional CPUs. You’ll need only a
reasonably current personal computer. We’ll discuss the special
high-performance hardware called GPUs (Graphics Processing Units) and
TPUs (Tensor Processing Units) developed by NVIDIA and Google to meet
the extraordinary processing demands of edge-of-the-practice
deep-learning applications.
Keras Built-In Datasets
Here are some of Keras’s datasets (from the module
tensorflow.keras.datasets13) for practicing deep learning.
❖ MNIST database of handwritten digits
Used for classifying handwritten digit images, this dataset contains
28-by-28 grayscale digit images labeled as 0 through 9 with 60,000 images
for training and 10,000 for testing. We use this dataset in Section 16.6,
where we study convolutional neural networks.
❖Fashion-MNIST database of fashion articles
Used for classifying clothing images, this dataset contains 28-by-28
grayscale images of clothing labeled in 10 categories16 with 60,000 for
training and 10,000 for testing.
❖IMDb Movie reviews—Used for sentiment analysis, this dataset contains
reviews labeled as positive (1) or negative (0) sentiment with 25,000
reviews for training and 25,000 for testing.
❖CIFAR1018 small image classification
Used for small-image classification, this dataset contains 32-by-32 color
images labeled in 10 categories with 50,000 images for training and 10,000
for testing.
❖CIFAR10019 small image classification
Also, used for small-image classification, this dataset contains 32-by-32
color images labeled in 100 categories with 50,000 images for training and
10,000 for testing.
Neural Network
A neural network is a computational model inspired by the
human brain, composed of layers of interconnected nodes
(neurons) that learn patterns from data.
Similar to the human brain that has neurons interconnected to
one another, artificial neural networks also have neurons that
are interconnected to one another in various layers of the
networks. These neurons are known as nodes.

The given figure illustrates the typical diagram of Biological Neural Network.

The typical Artificial Neural Network looks something like the given figure.
Dendrites from Biological Neural Network represent inputs in Artificial Neural
Networks, cell nucleus represents Nodes, synapse represents Weights, and
Axon represents Output.

Relationship between Biological neural network and artificial neural network:

Biological Neural Network Artificial Neural Network

Dendrites Inputs

Cell nucleus Nodes

Synapse Weights

Axon Output

Various types of layers available in an artificial neural network.

Artificial Neural Network primarily consists of three layers:

Input Layer:

As the name suggests, it accepts inputs in several different formats provided

by the programmer.

Hidden Layer:
The hidden layer presents in-between input and output layers. It performs all
the calculations to find hidden features and patterns.

Output Layer:

The input goes through a series of transformations using the hidden layer,
which finally results in output that is conveyed using this layer.

The artificial neural network takes input and computes the weighted sum of
the inputs and includes a bias. This computation is represented in the form of
a transfer function.

It determines the weighted total is passed as an input to an activation

function to produce the output. Activation functions choose whether a node
should fire or not. Only those who are fired make it to the output layer. There
are distinctive activation functions available that can be applied upon the sort
of task we are performing.

1. Basic Components
Neuron (Node): Performs a computation on inputs.
Formula:
z = sum(wi * xi) + b
a = Activation(z)
w1*x1+w2*x2+b, which can be represented like Ax1+Bx2+c
Which is a line equation.

Weights (w): Learnable parameters that scale inputs.

Bias (b): Allows shifting the activation function.
Activation Function:
Adds non-linearity (e.g., ReLU, Sigmoid, Tanh,Softmax).

Layers: -
Input Layer: Takes input features.
Hidden Layers: Perform computations.
Output Layer: Produces final prediction

2. Forward Propagation
The process of passing data through the network to make a
prediction.
3. Loss Function
Measures how far the prediction is from the actual value.
Examples:
MSE (Mean Squared Error) for regression.
Cross-Entropy for classification.
4. Forward Propagation
The process of passing data through the network to make a
prediction.

5. Optimization Algorithm
Gradient Descent: Updates weights to minimize loss.
Variants include: SGD, Adam, RMSProp.

6. Training Process
1. Initialize weights randomly.
2. Forward pass.
3. Compute loss.
4. Backward pass.
5. Update weights.
6. Repeat for many epochs.

7. Overfitting & Regularization

Overfitting: Model memorizes training data.
Solutions:
- Dropout
- L2 Regularization
- Early Stopping

8. Popular Architectures
- Feedforward Neural Network (FNN)
- Convolutional Neural Network (CNN) for images
- Recurrent Neural Network (RNN) for sequences
- Transformer for NLP and more
Tensor
The main use of a tensor is to hold and manipulate data in
deep learning and machine learning.

Tensor Dimensions Example Shape

Type

0D Just a single number 5 () (no

Tensor dimensio
(Scalar n)
)
1D A list of numbers [1, 2, 3, 4] (4,)
Tensor
(Vector
)

2D Table of numbers [[1, 2], [3, (3, 2)

Tensor (rows × columns) 4], [5, 6]]
(Matrix
)

3D A "stack" of matrices [[[1,2], (2, 2,

Tensor (like a cube) [3,4]], [[5,6], 2)
[7,8]]]

4D Batch of 3D tensors Imagine a batch (10,

Tensor (common in deep of 10 RGB images, 32, 32,
learning for images) each 32×32 pixels 3)

5D Batch of videos 5 videos, each (5, 20,

Tensor (frames) — each with 20 frames, 64, 64,
video has multiple each frame is 3)
frames, each frame is 64×64 with 3
an image color channels

Quick Visual Feel:

● 0D → Single point

● 1D → Line of numbers

● 2D → Table of numbers (rows and columns)

● 3D → Cube (stack of tables)

● 4D → Collection of cubes (batch of images)

● 5D → Collection of moving cubes (batch of videos)

High-Performance Processors Powerful processors are needed

for real-world deep learning because the size of tensors can be
enormous and large-tensor operations can place crushing
demands on processors. The processors most commonly used
for deep learning are:
• NVIDIA GPUs (Graphics Processing Units)—Originally
developed by companies like NVIDIA for computer gaming,
GPUs are much faster than conventional CPUs for processing
large amounts of data, thus enabling developers to train,
validate and test deep-learning models more efficiently—and
thus experiment with more of them. GPUs are optimized for
the mathematical matrix operations typically performed on
tensors, an essential aspect of how deep learning works “under
the hood.” NVIDIA’s Volta Tensor Cores are specifically
designed for deep learning.31,32 Many NVIDIA GPUs are
compatible with TensorFlow, and hence Keras, and can enhance
the performance of your deep-learning models.33
• Google TPUs (Tensor Processing Units)—Recognizing that
deep learning is crucial to its future, Google developed TPUs
(Tensor Processing Units), which they now use in their Cloud
TPU service, which “can provide up to 11.5 petaflops of
performance in a single pod”34 (that’s 11.5 quadrillion
floating-point operations per second). Also, TPUs are designed
to be especially energy efficient. This is a key concern for
companies like Google with already massive computing clusters
that are growing exponentially and consuming vast amounts of
energy.
ANN
import numpy as np
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.datasets import load_iris
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Input
from tensorflow.keras.layers import Dense
from tensorflow.keras.utils import to_categorical

iris = load_iris()
X = iris.data # Features (4 features: Sepal length, Sepal
width, Petal length, Petal width)
y = iris.target # Labels (0, 1, 2)
# Check data
print("Features shape:", X.shape)
print("Labels shape:", y.shape)
output:
Features shape: (150, 4)
Labels shape: (150,)

# Standardize features (important for ANN)

scaler = StandardScaler()
X = scaler.fit_transform(X)
# Convert labels to one-hot encoding
y = to_categorical(y)
# Split dataset into train and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.2, random_state=42)

# Initialize the model

model = Sequential()

# Add input layer separately

model.add(Input(shape=(4,))) # <-- input defined here
# Add first hidden layer
model.add(Dense(10, activation='relu')) # 10 neurons

# Add second hidden layer

model.add(Dense(8, activation='relu')) # 8 neurons

# Add output layer

model.add(Dense(3, activation='softmax')) # 3 classes
Code Explanation:
Input layer
● This explicitly defines the input shape to the model.
● shape=(4,) means the model expects an input vector of 4 features.
● The Input layer doesn't contain any neurons or weights — it just defines
the expected input format.
1st Hidden layer
● Dense(10) creates a fully connected layer with 10 neurons.
● Each neuron receives all 4 inputs.
● activation='relu' applies the Rectified Linear Unit activation function,
which helps the network learn non-linear patterns:
ReLU(x)=max⁡(0,x)
2nd Hidden Layer
● Adds another Dense layer with 8 neurons, each connected to the 10
outputs from the previous layer.
● Again uses ReLU activation for non-linearity.
Output Layer
● This layer has 3 neurons, suitable for a 3-class classification
problem.
● activation='softmax' turns the outputs into probabilities that sum to
1.
● The neuron with the highest probability is typically selected as the
predicted class.
# Compile the model
model.compile(optimizer='adam',
loss='categorical_crossentropy',
metrics=['accuracy'])

# Train the model

history = model.fit(X_train, y_train,
epochs=50,
batch_size=8,
validation_split=0.1, # use 10% of training data
for validation
verbose=1)
Output:

# Evaluate on test data

loss, accuracy = model.evaluate(X_test, y_test, verbose=0)
print("Test Accuracy: {:.2f}%".format(accuracy * 100))
Output:
Test Accuracy: 100.00%

# Plot training & validation accuracy values

plt.plot(history.history['accuracy'])
plt.plot(history.history['val_accuracy'])
plt.title('Model Accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.legend(['Train', 'Validation'], loc='lower right')
plt.show()

# Plot training & validation loss values

plt.plot(history.history['loss'])
plt.plot(history.history['val_loss'])
plt.title('Model Loss')
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.legend(['Train', 'Validation'], loc='upper right')
plt.show()

Output:
predictions = model.predict(np.array([[5.1, 3.5, 1.4, 0.2]]))
print(predictions)
Output:
1/1 ━━━━━━━━━━━━━━━━━━━━ 0s 35ms/step
[[0.59903073 0.37916926 0.0218 ]]

Convolutional Neural Networks for Vision;

MultiClassification with the MNIST Dataset

In deep learning, particularly in Convolutional Neural

Networks (CNNs), filters (also called kernels) are crucial
components that help detect patterns in data, typically in
images. The convolution layer applies a filter (kernel) over the
input image/tensor to extract features.

Types of Filters in Deep Learning

1. Edge Detection Filters

Used in the early layers of CNNs to detect edges (boundaries)
in images.

● Sobel Filter: Detects edges in horizontal or vertical

directions.

● Horizontal Sobel:

[-1 0 1]
[-2 0 2]
[-1 0 1]

● Vertical Sobel:

[-1 -2 -1]
[ 0 0 0]
[ 1 2 1]

2. Sharpening Filter

● Enhances the edges or fine details in an image.

[ 0 -1 0]
[-1 5 -1]
[ 0 -1 0]

3. Blurring / Smoothing Filter

Reduces noise and detail.

● Gaussian Filter: Uses a Gaussian function to give more

weight to center pixels.

4. Learned Filters

In CNNs, most filters are learned during training, rather than

predefined. For example:
● Early layers might learn filters that detect edges or simple
textures.

● Intermediate layers may detect patterns like corners,

shapes, or parts of objects.

● Deep layers capture high-level features like faces, objects,

or scene elements.

5. Depthwise Filters (in MobileNets)

Each filter works on a single input channel, improving

computational efficiency.

6. Dilated (Atrous) Filters

Spread out across the input with gaps (dilations), useful for
capturing wider context without losing resolution (common in
segmentation tasks).

7. Separable Filters

Decompose a filter into simpler, smaller convolutions (e.g.,

depthwise separable convolutions in Xception), reducing
computational cost.
Inputs:
Input tensor (e.g., 2D image): size 𝐻×𝑊
Filter/kernel: size 𝑓×𝑓
Stride s: how many steps the filter moves
Padding p: how many zeros are added around the input

𝐻+2𝑃−𝑓
Output size formula:Output height/width=⌊ 𝑆
⌋+1

Steps:

1.Position the filter on the input tensor.

2.Compute element-wise multiplication between the filter

and the input slice.

3.Sum the results → this is a single value in the output.

4.Slide the filter based on stride and repeat.

Input image (4×4):

[[1, 2, 0, 1],
[3, 1, 2, 2],
[1, 0, 1, 3],
[2, 1, 2, 1]]

Filter/kernel (2×2):
[[1, 0],
[0, -1]]

Step-by-step:

1. Convolution (stride=1, padding=0)

Slide the 2×2 filter over the input and compute:

conv=∑(element-wise product)
Example at top-left (first 2×2 region):
Input slice:
[[1, 2],
[3, 1]]

Filter:
[[1, 0],
[0, -1]]

= (11 + 20 + 30 + 1(-1)) = 1 + 0 + 0 - 1 = 0

Repeat this for each region → Output will be 3×3.

2. Apply ReLU

Replace all negatives with 0.

3. Apply 2×2 Max Pooling (stride=1) on ReLU output.

Output:
1. Convolution Output:
[[ 0. 0. -2.]
[ 3. 0. -1.]
[ 0. -2. 0.]]

2. After ReLU (Negative values replaced with 0):

[[0. 0. 0.]
[3. 0. 0.]
[0. 0. 0.]]

3. After 2×2 Max Pooling (stride=1):

[[3. 0.]
[3. 0.]]

Program:
import numpy as np

# Input and filter

input_img = np.array([
[1, 2, 0, 1],
[3, 1, 2, 2],
[1, 0, 1, 3],
[2, 1, 2, 1]
])

kernel = np.array([
[1, 0],
[0, -1]
])

# Convolution
def convolve2d(image, kernel, stride=1):
k = kernel.shape[0]
output_dim = (image.shape[0] - k) // stride + 1
output = np.zeros((output_dim, output_dim))
for i in range(0, output_dim):
for j in range(0, output_dim):
region = image[i:i+k, j:j+k]
output[i, j] = np.sum(region * kernel)
return output

# ReLU
def relu(x):
return np.maximum(0, x)

# Max Pooling
def max_pooling(image, pool_size=2, stride=1):
output_dim = (image.shape[0] - pool_size) // stride + 1
output = np.zeros((output_dim, output_dim))
for i in range(0, output_dim):
for j in range(0, output_dim):
region = image[i:i+pool_size, j:j+pool_size]
output[i, j] = np.max(region)
return output

# Run pipeline
conv_output = convolve2d(input_img, kernel)
relu_output = relu(conv_output)
pool_output = max_pooling(relu_output)

# Print results
print("Convolution Output:\n", conv_output)
print("ReLU Output:\n", relu_output)
print("Max Pooling Output:\n", pool_output)
Output:
Convolution Output:
[[ 0. 0. -2.]
[ 3. 0. -1.]
[ 0. -2. 0.]]
ReLU Output:
[[0. 0. 0.]
[3. 0. 0.]
[0. 0. 0.]]
Max Pooling Output:
[[3. 0.]
[3. 0.]]

Convolutional Neural Networks for Vision;

MultiClassification with the MNIST Dataset
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D,
Flatten, Dense, Dropout
from tensorflow.keras.datasets import mnist
from tensorflow.keras.utils import to_categorical

(X_train, y_train), (X_test, y_test) = mnist.load_data()

print(X_train.shape)
print(y_train.shape)
print(X_test.shape)
print(y_test.shape)

Output:
(60000, 28, 28)
(60000,)
(10000, 28, 28)
(10000,)
Visualizing Digits
%matplotlib inline
import matplotlib.pyplot as plt
import seaborn as sns
sns.set(font_scale=2)

import numpy as np
index = np.random.choice(np.arange(len(X_train)), 24,
replace=False)
figure, axes = plt.subplots(nrows=4, ncols=6, figsize=(16, 9))
for item in zip(axes.ravel(), X_train[index], y_train[index]):
axes, image, target = item
axes.imshow(image, cmap=plt.cm.gray_r)
axes.set_xticks([]) # remove x-axis tick marks
axes.set_yticks([]) # remove y-axis tick marks
axes.set_title(target)
plt.tight_layout()
Output:
Data Preprocessing
# Reshape to (num_samples, height, width, channels)
X_train = X_train.reshape(60000, 28, 28, 1)
X_test = X_test.reshape(10000, 28, 28, 1)

print(X_train.shape, X_test.shape)

Output:
(60000, 28, 28, 1) (10000, 28, 28, 1)
#Normalizing the Image Data
X_train = X_train.astype('float32') / 255
X_test = X_test.astype('float32') / 255

# Convert labels to one-hot vectors

y_train = to_categorical(y_train, num_classes=10)
y_test = to_categorical(y_test, num_classes=10)

Build the CNN Model

model = Sequential([
Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28,
1)),
MaxPooling2D(pool_size=(2, 2)),

Conv2D(64, (3, 3), activation='relu'),

MaxPooling2D(pool_size=(2, 2)),

Flatten(),
Dense(128, activation='relu'),
Dropout(0.5),
Dense(10, activation='softmax') # 10 classes for digits 0-9
])
model.summary()
Output:
Model: "sequential"

┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━
━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Layer (type) ┃ Output Shape ┃ Param # ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━
━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ conv2d (Conv2D) │ (None, 26, 26, 32) │ 320 │
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ max_pooling2d (MaxPooling2D) │ (None, 13, 13, 32) │ 0│
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ conv2d_1 (Conv2D) │ (None, 11, 11, 64) │ 18,496 │
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ max_pooling2d_1 (MaxPooling2D) │ (None, 5, 5, 64) │ 0│
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ flatten (Flatten) │ (None, 1600) │ 0│
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ dense (Dense) │ (None, 128) │ 204,928 │
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ dropout (Dropout) │ (None, 128) │ 0│
├─────────────────────────────────┼────────────────────────┼
───────────────┤
│ dense_1 (Dense) │ (None, 10) │ 1,290 │
└─────────────────────────────────┴────────────────────────┴
───────────────┘

Total params: 225,034 (879.04 KB)

Trainable params: 225,034 (879.04 KB)

Non-trainable params: 0 (0.00 B)

Compile the Model

model.compile(optimizer='adam',
loss='categorical_crossentropy', # use
categorical_crossentropy with one-hot labels
metrics=['accuracy'])

Train the Model

model.fit(X_train, y_train, epochs=5, batch_size=64,
validation_split=0.1)

Output:
Epoch 1/5
844/844 ━━━━━━━━━━━━━━━━━━━━ 41s
47ms/step - accuracy: 0.8279 - loss: 0.5348 -
val_accuracy: 0.9837 - val_loss: 0.0565
Epoch 2/5
844/844 ━━━━━━━━━━━━━━━━━━━━ 39s
46ms/step - accuracy: 0.9717 - loss: 0.0967 -
val_accuracy: 0.9888 - val_loss: 0.0415
Epoch 3/5
844/844 ━━━━━━━━━━━━━━━━━━━━ 41s
46ms/step - accuracy: 0.9808 - loss: 0.0645 -
val_accuracy: 0.9893 - val_loss: 0.0332
Epoch 4/5
844/844 ━━━━━━━━━━━━━━━━━━━━ 39s
46ms/step - accuracy: 0.9847 - loss: 0.0521 -
val_accuracy: 0.9910 - val_loss: 0.0319
Epoch 5/5
844/844 ━━━━━━━━━━━━━━━━━━━━ 41s
46ms/step - accuracy: 0.9873 - loss: 0.0428 -
val_accuracy: 0.9917 - val_loss: 0.0327

<keras.src.callbacks.history.History at
0x78845d371910>
Evaluate the Model
import time # Import the time module
t1=time.time()
test_loss, test_acc = model.evaluate(X_test, y_test)
print(f"Test accuracy: {test_acc:.4f}")
t2=time.time()
print(f"Total time taken: {t2-t1:.2f}")
Output:
313/313 ━━━━━━━━━━━━━━━━━━━━ 2s
7ms/step - accuracy: 0.9857 - loss: 0.0390
Test accuracy: 0.9900
Total time taken: 2.63
Make Predictions
predictions = model.predict(X_test)
print(f"Prediction for first test image:
{tf.argmax(predictions[0]).numpy()}")
Output:
313/313 ━━━━━━━━━━━━━━━━━━━━ 2s
7ms/step
Prediction for first test image: 7

Dimensionality Reduction
Dimensionality reduction is the process of reducing the
number of features (dimensions) in a dataset while
preserving as much important information as possible.

Deep Learning Notes
100% (1)
Deep Learning Notes
71 pages
Deep Learning R18 Jntuh Lab Manual
0% (1)
Deep Learning R18 Jntuh Lab Manual
21 pages
Caravaggio in Binary
100% (1)
Caravaggio in Binary
20 pages
Introduction To TensorFlow For Artificial Intelligence
No ratings yet
Introduction To TensorFlow For Artificial Intelligence
41 pages
TensorFlow in 1 Day: Make your own Neural Network
From Everand
TensorFlow in 1 Day: Make your own Neural Network
Krishna Rungta
3.5/5 (10)
Unit - 3
No ratings yet
Unit - 3
42 pages
ML Unit-5
No ratings yet
ML Unit-5
19 pages
Unit-3 Aiml
No ratings yet
Unit-3 Aiml
10 pages
DL Unit-3 (CDS)
No ratings yet
DL Unit-3 (CDS)
32 pages
Tensorflow
No ratings yet
Tensorflow
9 pages
MLT Ese
No ratings yet
MLT Ese
21 pages
Deep Learning UNIT-3
No ratings yet
Deep Learning UNIT-3
20 pages
DLunit 3
No ratings yet
DLunit 3
13 pages
DL Unit 3
No ratings yet
DL Unit 3
21 pages
Class Notes DL Unit 2
No ratings yet
Class Notes DL Unit 2
47 pages
Lec 07 8
No ratings yet
Lec 07 8
40 pages
09 Tensorflow101 Slide
No ratings yet
09 Tensorflow101 Slide
78 pages
Unit3 DLT Material Important Notes
No ratings yet
Unit3 DLT Material Important Notes
33 pages
Unit 5
No ratings yet
Unit 5
10 pages
Deep Learning Basics
No ratings yet
Deep Learning Basics
28 pages
UNIT-1 Foundations of Deep Learning
100% (1)
UNIT-1 Foundations of Deep Learning
51 pages
Tensorflow: Features
No ratings yet
Tensorflow: Features
10 pages
Krishna Rungta - TensorFlow in 1 Day Make Your Own Neural Network (2018) - Trang-1
No ratings yet
Krishna Rungta - TensorFlow in 1 Day Make Your Own Neural Network (2018) - Trang-1
24 pages
ML06 Neural-Network 2024-2025
No ratings yet
ML06 Neural-Network 2024-2025
78 pages
PP&DS 5
No ratings yet
PP&DS 5
31 pages
The First Artificial Neuron
No ratings yet
The First Artificial Neuron
2 pages
2 DeepLearning
No ratings yet
2 DeepLearning
46 pages
Unit - 3 DL
No ratings yet
Unit - 3 DL
17 pages
Deep Learning Lab
No ratings yet
Deep Learning Lab
11 pages
CHP 3
No ratings yet
CHP 3
6 pages
Introduction To Deep Neural Networks - DataCamp
No ratings yet
Introduction To Deep Neural Networks - DataCamp
10 pages
ML Unit 4
No ratings yet
ML Unit 4
16 pages
UNIT - 5 Lecture 2
No ratings yet
UNIT - 5 Lecture 2
26 pages
Sony Ai Content
No ratings yet
Sony Ai Content
26 pages
Introduction To Artificial Neural Networks
No ratings yet
Introduction To Artificial Neural Networks
31 pages
Unit 4 Notes New
No ratings yet
Unit 4 Notes New
49 pages
DL Unit 1
No ratings yet
DL Unit 1
200 pages
Unit-1 and 2 Deep Learning
No ratings yet
Unit-1 and 2 Deep Learning
22 pages
Unit I
No ratings yet
Unit I
48 pages
ML Unit 3 Part1
No ratings yet
ML Unit 3 Part1
30 pages
Deep Learning 1687744660
No ratings yet
Deep Learning 1687744660
26 pages
Unit 1
No ratings yet
Unit 1
20 pages
Chapter 5 Deep Learning
No ratings yet
Chapter 5 Deep Learning
35 pages
Deep Learning Introduction
No ratings yet
Deep Learning Introduction
14 pages
Review On Neural Network and Its Applications
No ratings yet
Review On Neural Network and Its Applications
27 pages
Unit I
No ratings yet
Unit I
10 pages
Keras1-Introduction Two KEras
No ratings yet
Keras1-Introduction Two KEras
6 pages
Chapter 3
No ratings yet
Chapter 3
24 pages
3.2 Preprocessing
No ratings yet
3.2 Preprocessing
10 pages
Deep Learning UNIT 1
No ratings yet
Deep Learning UNIT 1
22 pages
Intro To AI
No ratings yet
Intro To AI
44 pages
Image Classification Using Small Convolutional Neural Network
No ratings yet
Image Classification Using Small Convolutional Neural Network
5 pages
Deep Learning
No ratings yet
Deep Learning
1 page
DL Unit 3 Jntuk r20
100% (1)
DL Unit 3 Jntuk r20
47 pages
Unit 2
No ratings yet
Unit 2
10 pages
Tensorflow: Gpu Vs Tpu
No ratings yet
Tensorflow: Gpu Vs Tpu
5 pages
NN
No ratings yet
NN
2 pages
Unit 4 Part 3
No ratings yet
Unit 4 Part 3
8 pages
106106213
No ratings yet
106106213
637 pages
Lesson 05 TensorFlow
No ratings yet
Lesson 05 TensorFlow
113 pages
Lect 2 Common Architectural Principles of Deep Networks
No ratings yet
Lect 2 Common Architectural Principles of Deep Networks
20 pages
Interview Prep Timetable
No ratings yet
Interview Prep Timetable
2 pages
Real Python Interview Questions American Express
No ratings yet
Real Python Interview Questions American Express
7 pages
Revised PPT Timetable - 2026 - Even Sem - R1.0
No ratings yet
Revised PPT Timetable - 2026 - Even Sem - R1.0
1 page
Lab Experiment - Security (Lab1&2)
No ratings yet
Lab Experiment - Security (Lab1&2)
5 pages
Businessintelligencewithr PDF
100% (1)
Businessintelligencewithr PDF
301 pages
ME02023081
No ratings yet
ME02023081
3 pages
Volume Testing: Identify Whether The Following Is Personal Danger or Danger To Devices
No ratings yet
Volume Testing: Identify Whether The Following Is Personal Danger or Danger To Devices
12 pages
10.C Command Line Argument Questions With Solution
No ratings yet
10.C Command Line Argument Questions With Solution
7 pages
Updating The Ford Bluetooth and Multimedia System: Youtube Download
No ratings yet
Updating The Ford Bluetooth and Multimedia System: Youtube Download
1 page
Asynchronous Counters: Asynchronous 4-Bit UP Counter
No ratings yet
Asynchronous Counters: Asynchronous 4-Bit UP Counter
13 pages
CSE114 Unit2
No ratings yet
CSE114 Unit2
74 pages
Alcatel-Lucent GSM: Merge G2 TC To 9125 TC
No ratings yet
Alcatel-Lucent GSM: Merge G2 TC To 9125 TC
32 pages
Ford MP&L JD - B Tech
No ratings yet
Ford MP&L JD - B Tech
1 page
Class Diagram: Helping Material
No ratings yet
Class Diagram: Helping Material
32 pages
Abdiqadir Hassan DevOps Resume
No ratings yet
Abdiqadir Hassan DevOps Resume
4 pages
Mobile Educational Applications For Children. What Educators and Parents Need To Know
No ratings yet
Mobile Educational Applications For Children. What Educators and Parents Need To Know
23 pages
Package 1. Level 2. Level 3. Level 4. Level Node-Name Image Type Description Instructions Tutorial-Links Comment Last Edit
No ratings yet
Package 1. Level 2. Level 3. Level 4. Level Node-Name Image Type Description Instructions Tutorial-Links Comment Last Edit
21 pages
Design of Rigid Pavements 2 PDF
No ratings yet
Design of Rigid Pavements 2 PDF
5 pages
017 Android Based Digital Notice Board P10
No ratings yet
017 Android Based Digital Notice Board P10
49 pages
Circular Waveguide Case Study Cascwghfen23
No ratings yet
Circular Waveguide Case Study Cascwghfen23
11 pages
03 - Introduction To Trixbox
No ratings yet
03 - Introduction To Trixbox
25 pages
HP 15-Ay Series Bdl50 La-D702p
No ratings yet
HP 15-Ay Series Bdl50 La-D702p
52 pages
Text Book
100% (1)
Text Book
129 pages
Create A Fruit Ninja Inspired Game
No ratings yet
Create A Fruit Ninja Inspired Game
61 pages
MPMD Calibration Roadmap and Faq v9
No ratings yet
MPMD Calibration Roadmap and Faq v9
35 pages
Project Charter Template 20
No ratings yet
Project Charter Template 20
11 pages
2014 TV Firmware Upgrade Instruction T-NT14MDEUC
No ratings yet
2014 TV Firmware Upgrade Instruction T-NT14MDEUC
5 pages
FDA - Registration Form
No ratings yet
FDA - Registration Form
1 page
SMS Gateway Interface
No ratings yet
SMS Gateway Interface
4 pages
Payload Paint Hack
No ratings yet
Payload Paint Hack
2 pages
6416 978-1-5386-7150-4/18/$31.00 ©2018 Ieee Igarss 2018
No ratings yet
6416 978-1-5386-7150-4/18/$31.00 ©2018 Ieee Igarss 2018
4 pages
04-Sm-A107 Evapl 3
0% (1)
04-Sm-A107 Evapl 3
2 pages
Getinge US Customer Letter HCU 30
No ratings yet
Getinge US Customer Letter HCU 30
3 pages

Deep Learning

Uploaded by

Deep Learning

Uploaded by

What is deep learning

Deep learning is a subset of machine learning, which is a subset of artificial

The first advantage of deep learning over machine learning is

Difference between TensorFlow and Keras:

TensorFlow is used for

TensorFlow is used for

In Keras framework, there

TensorFlow has a Keras has a simple

TensorFlow was Keras was developed by

Relationship between Biological neural network and artificial neural network:

Biological Neural Network Artificial Neural Network

Cell nucleus Nodes

Various types of layers available in an artificial neural network.

Artificial Neural Network primarily consists of three layers:

As the name suggests, it accepts inputs in several different formats provided

It determines the weighted total is passed as an input to an activation

Weights (w): Learnable parameters that scale inputs.

7. Overfitting & Regularization

Tensor Dimensions Example Shape

0D Just a single number 5 () (no

2D Table of numbers [[1, 2], [3, (3, 2)

3D A "stack" of matrices [[[1,2], (2, 2,

4D Batch of 3D tensors Imagine a batch (10,

5D Batch of videos 5 videos, each (5, 20,

Quick Visual Feel:

●​ 2D → Table of numbers (rows and columns)​

●​ 3D → Cube (stack of tables)​

●​ 4D → Collection of cubes (batch of images)​

High-Performance Processors Powerful processors are needed

# Standardize features (important for ANN)

# Initialize the model

# Add input layer separately

# Add second hidden layer

# Add output layer

# Train the model

# Evaluate on test data

# Plot training & validation accuracy values

# Plot training & validation loss values

Convolutional Neural Networks for Vision;

In deep learning, particularly in Convolutional Neural

Types of Filters in Deep Learning

1. Edge Detection Filters

●​ Sobel Filter: Detects edges in horizontal or vertical

●​ Enhances the edges or fine details in an image.

3. Blurring / Smoothing Filter

Reduces noise and detail.

●​ Gaussian Filter: Uses a Gaussian function to give more

In CNNs, most filters are learned during training, rather than

●​ Intermediate layers may detect patterns like corners,

●​ Deep layers capture high-level features like faces, objects,

5. Depthwise Filters (in MobileNets)

Each filter works on a single input channel, improving

6. Dilated (Atrous) Filters

Decompose a filter into simpler, smaller convolutions (e.g.,

1.​Position the filter on the input tensor.​

2.​Compute element-wise multiplication between the filter

3.​Sum the results → this is a single value in the output.​

4.​Slide the filter based on stride and repeat.

Input image (4×4):

1. Convolution (stride=1, padding=0)

Slide the 2×2 filter over the input and compute:

= (1*1 + 2*0 + 3*0 + 1*(-1)) = 1 + 0 + 0 - 1 = 0

Repeat this for each region → Output will be 3×3.

Replace all negatives with 0.

3. Apply 2×2 Max Pooling (stride=1) on ReLU output.

2. After ReLU (Negative values replaced with 0):

3. After 2×2 Max Pooling (stride=1):

# Input and filter

Convolutional Neural Networks for Vision;

(X_train, y_train), (X_test, y_test) = mnist.load_data()

# Convert labels to one-hot vectors

Build the CNN Model

Conv2D(64, (3, 3), activation='relu'),

Total params: 225,034 (879.04 KB)

Trainable params: 225,034 (879.04 KB)

Non-trainable params: 0 (0.00 B)

● 2D → Table of numbers (rows and columns)

● 3D → Cube (stack of tables)

● 4D → Collection of cubes (batch of images)

● Sobel Filter: Detects edges in horizontal or vertical

● Enhances the edges or fine details in an image.

● Gaussian Filter: Uses a Gaussian function to give more

● Intermediate layers may detect patterns like corners,

● Deep layers capture high-level features like faces, objects,

1.Position the filter on the input tensor.

2.Compute element-wise multiplication between the filter

3.Sum the results → this is a single value in the output.

4.Slide the filter based on stride and repeat.

= (11 + 20 + 30 + 1(-1)) = 1 + 0 + 0 - 1 = 0