
UNIT 1

FOUNDATIONS
Ms Devibala Subramanian
Assistant Professor
PG & Research Department of Computer Science
Sri Ramakrishna College of Arts and Science
Coimbatore
Deep learning is a subset of machine learning that uses artificial neural networks to model and
process complex patterns in data. It is inspired by the human brain's structure and function,
consisting of multiple layers of interconnected neurons. These networks learn features from data
automatically, with training driven by backpropagation and optimization techniques such as
gradient descent.

Key Features of Deep Learning:

Multi-layered Networks: Includes architectures like Convolutional Neural Networks (CNNs) for
image processing and Recurrent Neural Networks (RNNs) for sequential data.

Feature Learning: Extracts relevant patterns from raw data without manual feature engineering.

Scalability: Works efficiently with large datasets and high computational power.

Applications: Used in fields like image recognition, natural language processing (NLP), speech
recognition, autonomous vehicles, and healthcare.

Deep learning has revolutionized AI by enabling advanced capabilities such as real-time language
translation, medical diagnosis, and self-driving technology.
Deep Learning AI mimics the intricate neural networks of the human brain, enabling computers to
autonomously discover patterns and make decisions from vast amounts of unstructured data.

This transformative field has propelled breakthroughs across various domains, from computer
vision and natural language processing to healthcare diagnostics and autonomous driving.

What is Deep Learning?

Deep learning is the branch of machine learning that is based on artificial neural network
architectures.

An artificial neural network or ANN uses layers of interconnected nodes called neurons that work
together to process and learn from the input data.

In a fully connected deep neural network, there is an input layer and one or more hidden layers
connected one after the other.

Each neuron receives input from the previous layer neurons or the input layer.

The output of one neuron becomes the input to other neurons in the next layer of the network, and
this process continues until the final layer produces the output of the network.
The layers of the neural network transform the input data through a series of nonlinear
transformations, allowing the network to learn complex representations of the input data.

Deep learning can be used for supervised, unsupervised, and reinforcement machine learning, and it
uses a variety of methods to process each.
Functions

A function is one of the fundamental building blocks in both mathematics and deep learning.
Understanding functions is crucial because neural networks are essentially large, complex functions
that map inputs to outputs.

A function is a rule or process that takes an input (or multiple inputs), applies some transformation
to it, and produces an output. Mathematically, a function is written as:

• f(x) = y
Where:
• x is the input (also called the independent variable).
• f is the function that transforms x.
• y is the output (also called the dependent variable).
Example Functions:
Simple Squaring Function
• f(x) = x²
• Input: x = 3
• Output: f(3) = 3² = 9

ReLU (Rectified Linear Unit) Function

• f(x) = max(0, x)
• Input: x = −5, Output: f(−5) = 0
• Input: x = 7, Output: f(7) = 7
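
In code, a minimal NumPy sketch of ReLU (np.maximum applies the comparison element-wise):

import numpy as np

def relu(x: np.ndarray) -> np.ndarray:
    # Element-wise maximum of 0 and x: negative inputs become 0
    return np.maximum(0, x)

x = np.array([-5, 7])
print(relu(x))  # Output: [0 7]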

Functions can be represented in three main ways:


• Mathematical Representation (Equations)
• Graphical Representation (Diagrams)
• Computational Representation (Code)
Types of Function Representations

(A) Mathematical Representation

• Mathematically, functions describe relationships between variables. A common function type
used in deep learning is the linear function:
• Linear Function: f(x) = ax + b
(B) Graphical Representation
• Functions can be visualized as plots in a coordinate system. This helps understand their
behavior.
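For example (an illustrative sketch; matplotlib is assumed here and is not part of the original
slides), the squaring function can be plotted like this:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(-3, 3, 100)          # evenly spaced inputs
plt.plot(x, x ** 2, label="f(x) = x^2")
plt.xlabel("x")
plt.ylabel("f(x)")
plt.legend()
plt.show()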
(C) Computational Representation (Code)
In Python, functions can be implemented using the def keyword. The Deep Learning from Scratch
book uses NumPy to efficiently handle multi-dimensional data.
Example 1: Simple Squaring Function

import numpy as np

def square(x: np.ndarray) -> np.ndarray:
    """
    Computes the square of each element in the input array.
    """
    return np.power(x, 2)

x = np.array([1, 2, 3, 4])
print(square(x))  # Output: [ 1  4  9 16]
3. Composite (Nested) Functions
• In deep learning, functions are often combined to form nested (or composite) functions.
Mathematically, f(g(x)) means apply g first and then f; for example, f(g(x)) = √(x²) = |x|:

def nested_function(x: np.ndarray) -> np.ndarray:
    # sqrt(x^2) equals the absolute value of x
    return np.sqrt(np.power(x, 2))

x = np.array([-3, -2, -1, 0, 1, 2, 3])
print(nested_function(x))  # Output: [3 2 1 0 1 2 3]
Functions and Their Derivatives (Gradients)
• To train deep learning models, we need to compute derivatives of functions.

def derivative(func, x: np.ndarray, delta: float = 0.001) -> np.ndarray:
    # Central-difference approximation of the derivative of func at x
    return (func(x + delta) - func(x - delta)) / (2 * delta)

print(derivative(square, np.array([3])))  # Output: ~6, since f'(x) = 2x

Why do we need derivatives?

• They help compute gradients, which guide optimization in neural networks.
• The chain rule allows us to compute derivatives for composite functions.
• Gradient Descent uses derivatives to update model parameters, as the sketch below illustrates.
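
As a minimal illustration (not from the original slides), here is one gradient descent step on
f(x) = x², reusing the square and derivative functions defined above; the learning rate of 0.1 is
an arbitrary choice:

x = np.array([3.0])
learning_rate = 0.1               # illustrative step size
grad = derivative(square, x)      # approximately f'(3) = 6
x = x - learning_rate * grad      # step against the gradient
print(x)  # ~[2.4], closer to the minimum of x^2 at x = 0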
The chain rule
It is a fundamental concept in calculus that is essential for deep learning. It allows us to compute the
derivative of a function that is composed of multiple functions. Since neural networks consist of
layers of functions applied in sequence, the chain rule plays a crucial role in training them using
backpropagation.
• The chain rule states that if a function f(x) is composed of multiple functions, its derivative can
be computed by multiplying the derivatives of the individual functions.
Mathematical Definition
• If we have two functions:
• y = g(x)
• z = f(y), so that z = f(g(x)),
• then the derivative of z with respect to x is:
dz/dx = f'(g(x)) · g'(x)

To apply the chain rule:
• First, compute the derivative of g(x) (inner function).
• Then, compute the derivative of f(y) (outer function) at y = g(x).
• Multiply these derivatives to get dz/dx.
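
For instance (a worked example added for illustration): if y = g(x) = x² and z = f(y) = sin(y),
then dy/dx = 2x and dz/dy = cos(y), so dz/dx = cos(x²) · 2x.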
Visualizing the Chain Rule

(A) Function Composition as a Computational Graph


Think of a function composition as a series of transformations, represented as a computational
graph.
• Example: f(g(x))
• First, the input x goes through function g to produce an intermediate result y.
• Then, y is passed into function f, which produces the final output z.
• x → [ g(x) ] → y → [ f(y) ] → z.

Each arrow represents a transformation. The derivative of the final output z with respect to x is
computed using the chain rule.
Geometric Intuition (Slope Interpretation)
The derivative of a function at a point is the slope of its graph there, and the chain rule
multiplies the slopes of the inner and outer functions along the composition.

In deep learning, neural networks consist of multiple layers, where each layer applies a function to
the output of the previous layer. Training a neural network requires computing the derivative of a loss
function with respect to each layer's parameters using the chain rule.

Neural Network Perspective

A simple two-layer neural network can be written as a composition of functions:

z = f₂(f₁(x))

Each function represents a layer transformation. To compute the gradient, apply the chain rule:

dz/dx = f₂'(f₁(x)) · f₁'(x)
Example: Computing the Derivative of sigmoid(square(x))

import numpy as np

def square(x):
    return np.power(x, 2)

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def derivative(func, x, delta=0.001):
    return (func(x + delta) - func(x - delta)) / (2 * delta)

x = np.array([1.0, 2.0, 3.0])  # Input values

# Step 1: Compute intermediate value y = square(x)
y = square(x)

# Step 2: Compute gradients
dy_dx = derivative(square, x)   # g'(x) = 2x
dz_dy = derivative(sigmoid, y)  # f'(y) = sigmoid(y) * (1 - sigmoid(y))

# Step 3: Apply Chain Rule
dz_dx = dz_dy * dy_dx  # Multiply gradients

print("Gradient of sigmoid(square(x)) with respect to x:", dz_dx)

Generalizing the Chain Rule

For a longer chain z = fₙ(...f₂(f₁(x))...), the same idea applies: multiply the derivatives of all
the functions in the chain, each evaluated at its own input.

Functions with Multiple Inputs

A function with multiple inputs takes two or more independent variables and maps them to an
output.

Mathematical Definition

A function with two inputs x and y can be written as:


f(x,y)=some operation on x and y

Example Functions with Two Inputs:

Addition: f(x, y) = x + y

Multiplication: f(x, y) = x * y

Weighted Sum (Common in Deep Learning): f(x, y) = w₁x + w₂y

Here, w₁ and w₂ are weights that control the influence of x and y.


Python Implementation of a Function with Two Inputs

Function with Addition

import numpy as np

def add_function(x: np.ndarray, y: np.ndarray) -> np.ndarray:
    return x + y

x = np.array([2, 3])
y = np.array([4, 5])

print(add_function(x, y))  # Output: [6 8]


Functions with Multiple Inputs

Weighted Sum (Linear Combination)

In deep learning, a neuron in a neural network takes multiple inputs, multiplies them by weights, and
sums them.

Mathematical Representation:

f(x, w) = x · w = Σᵢ xᵢwᵢ

def weighted_sum(x: np.ndarray, w: np.ndarray) -> np.ndarray:
    # Dot product: sum of element-wise products of inputs and weights
    return np.dot(x, w)

x = np.array([1, 2, 3])
w = np.array([0.1, 0.2, 0.3])

print(weighted_sum(x, w))  # Output: 1*0.1 + 2*0.2 + 3*0.3 = 1.4


Backpropagation in Neural Networks
In deep learning, functions with multiple inputs appear in each neuron of a neural network.

Example: One Neuron in a Neural Network

A neuron computes a weighted sum of its inputs plus a bias, then applies an activation function:

z = w · x + b,  a = σ(z)
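
A minimal NumPy sketch of such a neuron (the sigmoid activation and the specific weights, inputs,
and bias are illustrative assumptions):

import numpy as np

def neuron(x: np.ndarray, w: np.ndarray, b: float) -> float:
    z = np.dot(w, x) + b           # weighted sum of inputs plus bias
    return 1 / (1 + np.exp(-z))    # sigmoid activation

x = np.array([1.0, 2.0, 3.0])
w = np.array([0.1, 0.2, 0.3])
b = 0.5
print(neuron(x, w, b))  # sigmoid(1.4 + 0.5) = sigmoid(1.9) ≈ 0.87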


Functions with Multiple Vector Inputs

A function with multiple vector inputs takes two or more vectors as inputs and produces an output,
which can be a scalar, vector, or matrix.
Mathematical Representation
If x and y are input vectors, a function with multiple vector inputs can be written as:

f(x,y)=some operation on x and y

Example Functions with Two Vector Inputs:

• Dot product: f(x, y) = x · y (produces a scalar)
• Element-wise multiplication: f(x, y) = x ⊙ y (produces a vector)
Python Implementation of Functions with Multiple Vector Inputs
Example 1: Dot Product Function
The dot product computes a weighted sum of two vectors.

import numpy as np

def dot_product(x: np.ndarray, y: np.ndarray) -> np.ndarray:
    return np.dot(x, y)

x = np.array([1, 2, 3])
y = np.array([4, 5, 6])

print(dot_product(x, y))  # Output: 1*4 + 2*5 + 3*6 = 32

Example 2: Element-wise Multiplication

def elementwise_multiply(x: np.ndarray, y: np.ndarray) -> np.ndarray:
    return x * y

print(elementwise_multiply(x, y))  # Output: [ 4 10 18]


Matrix Functions with Multiple Vector Inputs

In deep learning, most functions operate on matrices, not just vectors.

Example: Matrix Multiplication
If X is an input matrix and W is a weight matrix, then the function
f(X, W) = X · W performs a linear transformation.

Python Implementation

def matrix_multiply(X: np.ndarray, W: np.ndarray) -> np.ndarray:
    return np.dot(X, W)

X = np.array([[1, 2], [3, 4]])       # Shape (2, 2)
W = np.array([[0.5, 1], [1.5, -1]])  # Shape (2, 2)

print(matrix_multiply(X, W))  # Output: [[ 3.5 -1. ]
                              #          [ 7.5 -1. ]]
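
In a neural network layer, each row of X can be read as one input example and each column of W as
the weights of one neuron, so X · W computes every neuron's weighted sum for every example at once.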
Creation of New Features from Existing Features

In a dataset, each feature represents an attribute or characteristic of the data. However, sometimes the
raw features may not be enough to capture useful patterns. By transforming or combining existing
features, we can create new, more informative features that improve learning.

Example in Deep Learning

Suppose we have two features:

x₁ = house size (square feet)
x₂ = number of rooms

A new, more informative feature can be:

Rooms per square foot: x₃ = x₂ / x₁


Example: Predicting House Prices

Suppose we are building a house price prediction model. The dataset contains:
• Size (sq ft)
• Number of rooms
• Age of the house
• Distance to city center

Creating New Features:

import pandas as pd

houses = pd.DataFrame({
    'Size': [1000, 2000, 1500],
    'Rooms': [3, 5, 4],
    'Age': [5, 20, 50],
    'Distance': [2, 10, 5]
})

# Creating new features
houses['Rooms_per_sqft'] = houses['Rooms'] / houses['Size']  # room density
houses['Is_Old'] = (houses['Age'] > 30).astype(int)          # binary flag for old houses
houses['City_Proximity'] = 1 / houses['Distance']            # higher means closer to city

print(houses)
Feature Creation in Deep Learning

Deep learning models automatically learn new features, but feature engineering can still help.

(A) Creating Features for Neural Networks

In deep learning, new features are often created through layers:


1. Convolutional Layers (CNNs) extract new spatial features from images.
2. Recurrent Layers (RNNs) capture temporal patterns from sequences.
3. Autoencoders create compressed representations of inputs.

Example: Creating New Features in a Neural Network

import torch.nn as nn

class FeatureExtractor(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(10, 5)  # Creates 5 new features from 10 inputs

    def forward(self, x):
        return self.fc(x)
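
A brief usage sketch (the random input is purely illustrative):

import torch

model = FeatureExtractor()
x = torch.randn(1, 10)   # one sample with 10 raw input features
features = model(x)      # 5 learned features
print(features.shape)    # torch.Size([1, 5])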
Derivatives of Functions with Multiple Vector Inputs

For a function f(x,y) with two vector inputs x and y, we need to compute the partial derivatives with
respect to each input.
Derivatives help in:

• Gradient Descent: Updating model parameters to minimize loss.
• Backpropagation: Computing gradients efficiently in neural networks.
• Optimization: Using gradients to adjust weights (see the sketch below).
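
As a small worked sketch (added for illustration, reusing the dot product from earlier): for
f(x, y) = x · y, the partial derivative with respect to x is y, and with respect to y is x.

import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([4.0, 5.0, 6.0])

# f(x, y) = x . y  =>  df/dx = y, df/dy = x
grad_x = y  # gradient of the dot product with respect to x
grad_y = x  # gradient of the dot product with respect to y
print(grad_x, grad_y)  # [4. 5. 6.] [1. 2. 3.]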

Computational Graph with 2D Matrix Inputs

A computational graph is a directed graph where:

• Nodes represent mathematical operations or variables.
• Edges represent dependencies between operations.

Example (Scalar Computation):

If z = (x + y)², the computational graph is:

x → [ + ] → a → [ ^2 ] → z
y → [ + ]

This helps track computations for derivatives (gradients).
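
A minimal sketch (added for illustration) of evaluating this graph forward and then walking it
backward with the chain rule:

x, y = 2.0, 3.0

# Forward pass through the graph
a = x + y      # a = 5.0
z = a ** 2     # z = 25.0

# Backward pass: chain rule along the edges
dz_da = 2 * a  # derivative of a^2 with respect to a
da_dx = 1.0    # derivative of x + y with respect to x
dz_dx = dz_da * da_dx
print(z, dz_dx)  # 25.0 10.0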
