Module 4 Lab 2
Gradient Descent is an optimization algorithm used to find the best parameters (like weights in a
model) that minimize a loss function (a measure of how wrong your model is).
How it works:
1. Start with random guesses for your parameters.
2. Calculate how “bad” your guess is using a loss function (like Mean Squared Error).
3. Compute the gradient (slope) of the loss with respect to each parameter—this tells you
which way to move to reduce the loss.
4. Update each parameter a little bit in the direction that reduces the loss.
5. Repeat until the loss stops getting smaller.
Update formula:
θ_new = θ_old − α · gradient
θ = a parameter (like a, b, or c)
α = learning rate (step size)
gradient = partial derivative of the loss with respect to θ
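To see the update rule in action before touching the lab's quadratic model, here is a minimal sketch on a made-up one-parameter loss, L(θ) = (θ − 3)², whose minimum is at θ = 3. The starting guess of 0.0, the learning rate of 0.1, and the 50 steps are arbitrary choices for illustration.

theta = 0.0                        # start with an arbitrary guess
alpha = 0.1                        # learning rate (step size)

def toy_loss(theta):
    return (theta - 3) ** 2        # how "bad" the current guess is

def toy_gradient(theta):
    return 2 * (theta - 3)         # slope of the toy loss at theta

for step in range(50):
    theta = theta - alpha * toy_gradient(theta)   # step downhill a little

print(theta)                       # ends up very close to 3, the minimum

Each pass applies exactly the formula above: subtract the learning rate times the gradient, so the guess slides toward the value that minimizes the loss.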
🔍 Section 2 — Importing Required Libraries
import numpy as np
import matplotlib.pyplot as plt
import random
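# Fix the random seeds so results are reproducible from run to run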
random.seed(42)
np.random.seed(42)
We want to find the best coefficients a, b, and c for the quadratic equation y = a·x² + b·x + c that fits our noisy data.
MSE = (1/n) · Σ (yᵢ − ŷᵢ)²
yᵢ = actual value
ŷᵢ = predicted value from our current guess
Lower MSE = better fit.
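To make this concrete, the sketch below builds a noisy quadratic dataset and small helpers for prediction and MSE. The "true" coefficients (2, -1, 0.5), the noise level, and the number of points are made-up values for illustration, not the lab's prescribed ones; the numpy import and seed come from Section 2.

# Noisy data from a hidden quadratic (illustrative values, not the lab's)
x = np.linspace(-3, 3, 100)
y = 2 * x**2 - 1 * x + 0.5 + np.random.normal(0, 2, size=x.shape)

def predict(a, b, c, x):
    # Prediction from the current guess: y_hat = a*x^2 + b*x + c
    return a * x**2 + b * x + c

def mse(y, y_hat):
    # Mean Squared Error: average squared gap between actual and predicted values
    return np.mean((y - y_hat) ** 2)

print(mse(y, predict(0.0, 0.0, 0.0, x)))   # large loss for a poor first guess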
🔍 Section 6 — How Does Gradient Descent Improve Our Guess?
B. Make Predictions
For each x, compute ŷ = a·x² + b·x + c using the current guesses for a, b, and c.
Example (with made-up numbers):
If a = 1, b = 2, c = 0 and x = 3, then ŷ = 1·(3²) + 2·3 + 0 = 15.
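With numpy, the "for each x" step is a single vectorized line. This reuses the x array and predict() helper from the sketch above; the coefficient values are arbitrary.

a, b, c = 1.0, 2.0, 0.0
y_hat = predict(a, b, c, x)    # ŷ for every data point at once, no explicit loop
print(y_hat[:5])               # first few predictions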
E. Repeat
Keep repeating the steps (predict, compute loss, compute gradients, update) for many
iterations (epochs).
The loss should get smaller each time.
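Putting the whole loop together, here is one possible full-batch version for the quadratic fit, reusing x, y, predict, and mse from the earlier sketch. The zero starting guesses, the learning rate of 0.01, and the 2000 epochs are illustrative choices, not prescribed values.

a, b, c = 0.0, 0.0, 0.0            # start with (arbitrary) guesses
alpha = 0.01                       # learning rate
loss_history = []                  # record the loss so we can plot it later

for epoch in range(2000):
    y_hat = predict(a, b, c, x)            # predict
    loss_history.append(mse(y, y_hat))     # compute loss
    error = y_hat - y
    grad_a = 2 * np.mean(error * x**2)     # gradients of MSE w.r.t. a, b, c
    grad_b = 2 * np.mean(error * x)
    grad_c = 2 * np.mean(error)
    a -= alpha * grad_a                    # update each parameter a little
    b -= alpha * grad_b
    c -= alpha * grad_c

print(a, b, c, loss_history[-1])   # the coefficients drift toward the data's true ones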
Partial derivatives tell us how much the loss will change if we change just one parameter.
They point in the direction of steepest increase; moving in the opposite direction reduces
the loss.
This is how gradient descent “knows” which way to step for each parameter.
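For this lab's model (ŷ = a·x² + b·x + c) and MSE loss, the three partial derivatives work out to ∂L/∂a = (2/n)·Σ(ŷᵢ − yᵢ)·xᵢ², ∂L/∂b = (2/n)·Σ(ŷᵢ − yᵢ)·xᵢ, and ∂L/∂c = (2/n)·Σ(ŷᵢ − yᵢ). As a quick sanity check, the sketch below compares the analytic ∂L/∂a with a finite-difference estimate; it reuses x, y, predict, and mse from the earlier sketches, and the test point and step size eps are arbitrary.

a, b, c = 1.0, 0.0, 0.0                        # any test point works
error = predict(a, b, c, x) - y
analytic = 2 * np.mean(error * x**2)           # analytic dLoss/da
eps = 1e-6                                     # small step for the numerical estimate
numeric = (mse(y, predict(a + eps, b, c, x)) - mse(y, predict(a - eps, b, c, x))) / (2 * eps)
print(analytic, numeric)                       # the two estimates should agree closely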
- Full Batch: uses all the data to compute the gradient in each update. No need to shuffle the data; order doesn't matter.
- Mini-Batch: uses small, randomly selected subsets (mini-batches) to compute the gradient and update the parameters. Shuffling is important to avoid biased batches (e.g., all one class in a batch); a small sketch follows below.
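For contrast, here is a minimal mini-batch version of the same loop, reusing x, y, and predict from the earlier sketches. The batch size of 16, the learning rate, and the epoch count are arbitrary choices; reshuffling the indices each epoch is what keeps every batch an unbiased random sample.

a, b, c = 0.0, 0.0, 0.0
alpha = 0.01
batch_size = 16

for epoch in range(500):
    order = np.random.permutation(len(x))           # shuffle so batches aren't biased
    for start in range(0, len(x), batch_size):
        idx = order[start:start + batch_size]
        xb, yb = x[idx], y[idx]                      # one small random batch
        error = predict(a, b, c, xb) - yb
        a -= alpha * 2 * np.mean(error * xb**2)      # update from this batch only
        b -= alpha * 2 * np.mean(error * xb)
        c -= alpha * 2 * np.mean(error)

print(a, b, c)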
- Gradient Descent uses partial derivatives to update each parameter in the direction that reduces the loss.
- Learning rate controls the size of each step.
- Loss function (like MSE) measures how well the model fits the data.
- Batch size (full vs. mini) affects how updates are computed and whether shuffling is needed.
- Visualization of the loss helps you see if training is working (sketched below).
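One way to do that visualization, assuming the loss_history list recorded in the full-batch loop above (matplotlib was imported in Section 2):

plt.plot(loss_history)             # loss per epoch
plt.xlabel("Epoch")
plt.ylabel("MSE loss")
plt.title("Loss during gradient descent")
plt.show()                         # a steadily falling curve means training is working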
Simple Analogy
Gradient descent is like finding the bottom of a valley (minimum loss) by feeling the slope (partial
derivatives) and always stepping downhill (negative gradient), adjusting your direction for each
parameter (a, b, c) separately.