
National University of Computer and Emerging Sciences

Probabilistic Models

AI-4009 Generative AI

Dr. Akhtar Jamil


Department of Computer Science



Goals
• Review of Previous Lecture
• Today’s Lecture
– Bayesian Networks
– Terminologies: loss functions, linear regression, gradient descent,
overfitting, underfitting, generalization, regularization, cross-validation



Review of Previous Lecture



Discriminative vs Generative Models
• Generative models learn the joint probability distribution P(X, Y), where
X is the input data and Y is the output label.
• Discriminative models learn the conditional probability P(Y | X),
which is the probability of the output label Y given the input data X.



What are Generative Models?
Generative machine learning algorithms model complex, high-dimensional objects.

(Figure: Discriminative Models vs. Generative Models)



Learning a Generative Model
We are given a training set of examples, e.g., images of dogs.

We want to learn a probability distribution p(x) over images x such that:
• Generation: If we sample xnew ∼ p(x), xnew should look like a dog (sampling)
• Representation learning: We should be able to learn what these images have in common, e.g., ears, tail, etc. (features)

First step: how to represent p(x)
Learning a Generative Model
• Defining Probabilistic Models of the Data
• Examples of Probabilistic Models
– The Curse of Dimensionality
• Parameter-Efficient Models through Conditional
Independence
– Bayesian Networks: An Example of Shallow Generative Models
Probabilistic Models: Basic Discrete Distributions
• Bernoulli distribution: (biased) coin flip
– Domain: {Heads, Tails}
– Specify P(X = Heads) = p. Then P(X = Tails) = 1 − p.
– Write: X ∼ Ber(p): only one parameter p
– Sampling: flip a (biased) coin
• Categorical distribution: (biased) m-sided die
– Domain: {1, · · · , m}
– Specify P(Y = i) = pi, such that Σi pi = 1
– Write: Y ∼ Cat(p1, · · · , pm): m − 1 parameters
– Sampling: roll a (biased) die
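
As a quick illustration, a minimal Python sketch of sampling from these two distributions (the probability values below are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)

# Bernoulli: a single parameter p
p = 0.7                                  # P(X = Heads)
coin = rng.random() < p                  # flip a (biased) coin
print("Heads" if coin else "Tails")

# Categorical: m - 1 free parameters (the probabilities must sum to 1)
probs = [0.1, 0.2, 0.3, 0.4]             # a biased 4-sided die
side = rng.choice(len(probs), p=probs)   # roll the (biased) die
print("rolled side", side + 1)
```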
Probabilistic Models: A Multi-Variate Joint Distribution
• Suppose we want to define a distribution over one pixel in an image. We use three discrete random variables:
– Red Channel R. Val(R) = {0, · · · , 255}
– Green Channel G. Val(G) = {0, · · · , 255}
– Blue Channel B. Val(B) = {0, · · · , 255}
• Sampling from the joint distribution (r, g, b) ∼ p(R, G, B) randomly generates a color for the pixel.
• How many parameters do we need to specify the joint distribution p(R = r, G = g, B = b)?
256 · 256 · 256 − 1
The Curse of Dimensionality in Probabilistic Models
• Suppose we want to model a BW image of a digit with n = 28 · 28 pixels.
• Pixels X1, . . . , Xn are modeled as binary (Bernoulli) random variables, i.e., Val(Xi) = {0, 1} = {Black, White}.
• How many possible states?
2 × 2 × · · · × 2 (n times) = 2^n
• Sampling from p(x1, . . . , xn) generates an image.
• How many parameters to specify the joint distribution p(x1, . . . , xn) over n binary pixels?
2^n − 1 (exponential) ⇒ curse of dimensionality
Parameter-Efficient Models Through Independence
• If X1, . . . , Xn are independent, then
p(x1, . . . , xn) = p(x1) p(x2) · · · p(xn)
• How many possible states? 2^n
• How many parameters to specify the joint distribution p(x1, . . . , xn)? n
• How many to specify the marginal distribution p(x1)? 1
• So 2^n entries can be described by just n numbers (if |Val(Xi)| = 2)!
• However, the independence assumption is too strong, so the model is unlikely to be useful: for example, each pixel is chosen independently of the others when we sample from it.
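
To make the parameter counts concrete, a small sketch using the 28 × 28 binary-pixel example above:

```python
import numpy as np

n = 28 * 28                          # number of binary pixels

# Full joint distribution: 2**n - 1 free parameters
full_joint_params = 2 ** n - 1
# Fully independent (factorized) model: n free parameters
independent_params = n
print(f"full joint:  {float(full_joint_params):.3e} parameters")
print(f"independent: {independent_params} parameters")

# Sampling from the independent model: each pixel is drawn on its own,
# so samples look like noise rather than digits.
rng = np.random.default_rng(0)
p = np.full(n, 0.5)                  # one Bernoulli parameter per pixel
image = (rng.random(n) < p).astype(int).reshape(28, 28)
```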
Key Notion: Conditional Independence
• Two events A, B are conditionally independent given event C if
p(A ∩ B | C) = p(A | C) p(B | C)
• Random variables X, Y are conditionally independent given Z if for all values x ∈ Val(X), y ∈ Val(Y), z ∈ Val(Z)
p(X = x ∩ Y = y | Z = z) = p(X = x | Z = z) p(Y = y | Z = z)
• We will also write p(X, Y | Z) = p(X | Z) p(Y | Z). Note the more compact notation.
• Equivalent definition: p(X | Y, Z) = p(X | Z). We write X ⊥ Y | Z.
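
A tiny numeric check of the definition, using a hand-picked joint distribution over three binary variables that is built so X ⊥ Y | Z holds:

```python
import numpy as np

# Build a joint p(x, y, z) that factorizes as p(z) p(x|z) p(y|z),
# so X and Y are conditionally independent given Z (all variables binary).
p_z = np.array([0.6, 0.4])
p_x_given_z = np.array([[0.9, 0.1],    # p(x | z = 0)
                        [0.2, 0.8]])   # p(x | z = 1)
p_y_given_z = np.array([[0.7, 0.3],
                        [0.5, 0.5]])

joint = np.einsum('z,zx,zy->xyz', p_z, p_x_given_z, p_y_given_z)

# Check the definition: p(x, y | z) == p(x | z) * p(y | z) for every z
p_xy_given_z = joint / joint.sum(axis=(0, 1), keepdims=True)   # normalize over x, y
for z in (0, 1):
    lhs = p_xy_given_z[:, :, z]
    rhs = np.outer(p_x_given_z[z], p_y_given_z[z])
    assert np.allclose(lhs, rhs)
```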
Today’s Lecture



Two Important Rules in Probability
1. Chain rule: Let S1, . . . , Sn be events, p(Si) > 0.
p(S1 ∩ S2 ∩ · · · ∩ Sn) = p(S1) p(S2 | S1) · · · p(Sn | S1 ∩ · · · ∩ Sn−1)
2. Bayes' rule: Let S1, S2 be events, p(S1) > 0 and p(S2) > 0.
p(S1 | S2) = p(S1 ∩ S2) / p(S2) = p(S2 | S1) p(S1) / p(S2)
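
A worked numeric instance of Bayes' rule (the event probabilities below are made up purely for illustration):

```python
# Bayes' rule on a small example: S1 = "has condition", S2 = "test positive".
p_s1 = 0.01                 # prior p(S1)
p_s2_given_s1 = 0.95        # p(S2 | S1)
p_s2_given_not_s1 = 0.05    # p(S2 | not S1)

# p(S2) by the law of total probability
p_s2 = p_s2_given_s1 * p_s1 + p_s2_given_not_s1 * (1 - p_s1)

# Posterior p(S1 | S2) = p(S2 | S1) p(S1) / p(S2)
p_s1_given_s2 = p_s2_given_s1 * p_s1 / p_s2
print(round(p_s1_given_s2, 3))   # ≈ 0.161
```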
Assumption with conditional independence



Bayesian Networks: General Idea
• Use conditional parameterization (instead of joint parameterization).
• For each random variable Xi, specify p(xi | xAi) for a set XAi of random variables.
• Then get the joint parameterization as the product of these conditionals:
p(x1, . . . , xn) = ∏i p(xi | xAi)
• This is a Bayesian Network.
• It is a classical approach for data generation.
• Need to guarantee it is a valid probability distribution.
• Choosing those variable sets is important. How?
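
A minimal sketch of this conditional parameterization, using a chain-structured network over three binary variables (the probability tables are illustrative):

```python
import numpy as np

# p(x1, x2, x3) = p(x1) p(x2 | x1) p(x3 | x2)
p_x1 = np.array([0.7, 0.3])
p_x2_given_x1 = np.array([[0.9, 0.1],
                          [0.4, 0.6]])   # rows indexed by x1
p_x3_given_x2 = np.array([[0.8, 0.2],
                          [0.3, 0.7]])   # rows indexed by x2

def joint(x1, x2, x3):
    return p_x1[x1] * p_x2_given_x1[x1, x2] * p_x3_given_x2[x2, x3]

# Sanity check: the factorized model is a valid probability distribution (sums to 1)
total = sum(joint(a, b, c) for a in (0, 1) for b in (0, 1) for c in (0, 1))
assert abs(total - 1.0) < 1e-12
```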
Bayesian Networks: Formal Definition

What is a Directed Acyclic Graph?
• DAG stands for Directed Acyclic Graph: a directed graph that contains no directed cycles.
Bayesian Networks: An Example

Graph Structure Encodes Conditional Independencies

Bayesian Networks: An Example 2
• Consider a Bayesian Network with five variables.
– Exercise (E): Whether the person exercises regularly (Yes or No).
– Diet (D): Whether the person has a healthy diet (Yes or No).
– Body Weight (BW): Categorized as Underweight, Normal, Overweight.
– Blood Pressure (BP): Categorized as Low, Normal, High.
– Heart Disease Risk (HR): Risk level of heart disease, categorized as Low,
Medium, High.



Bayesian Networks: An Example 2
• We'll assume the following dependencies:
– Exercise (E) and Diet (D) are independent variables.
– Body Weight (BW) depends on both Exercise (E) and Diet (D).
– Blood Pressure (BP) is influenced by Body Weight (BW).
– Heart Disease Risk (HR) is influenced by Blood Pressure (BP) and directly by Body Weight (BW).
• Draw a possible Bayesian network (one way to encode this structure is sketched below).
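
A minimal sketch of one possible structure; the conditional probability tables themselves are left unspecified:

```python
# Parents of each node in the DAG, following the stated dependencies.
parents = {
    "E": [],            # Exercise
    "D": [],            # Diet
    "BW": ["E", "D"],   # Body Weight depends on Exercise and Diet
    "BP": ["BW"],       # Blood Pressure depends on Body Weight
    "HR": ["BP", "BW"], # Heart Disease Risk depends on Blood Pressure and Body Weight
}

# The joint then factorizes as a product of one conditional per node.
factorization = " ".join(
    f"p({node})" if not pa else f"p({node} | {', '.join(pa)})"
    for node, pa in parents.items()
)
print(factorization)  # p(E) p(D) p(BW | E, D) p(BP | BW) p(HR | BP, BW)
```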



Naive Bayes: A Generative Classification Algorithm



Naive Bayes: A Generative Classification Algorithm

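As a rough sketch of Naive Bayes used as a generative classifier (binary features, two classes; the toy data and Laplace smoothing constant are illustrative, not taken from the slides):

```python
import numpy as np

# Naive Bayes models p(y) and p(x_j | y), then classifies with Bayes' rule:
# argmax_y  p(y) * prod_j p(x_j | y)
def fit(X, y, alpha=1.0):
    classes = np.unique(y)
    priors = np.array([np.mean(y == c) for c in classes])
    # Laplace-smoothed estimates of p(x_j = 1 | y = c)
    likelihoods = np.array([(X[y == c].sum(axis=0) + alpha) / ((y == c).sum() + 2 * alpha)
                            for c in classes])
    return classes, priors, likelihoods

def predict(X, classes, priors, likelihoods):
    # log p(y) + sum_j log p(x_j | y), evaluated for every class
    log_post = (np.log(priors)
                + X @ np.log(likelihoods).T
                + (1 - X) @ np.log(1 - likelihoods).T)
    return classes[np.argmax(log_post, axis=1)]

X = np.array([[1, 0, 1], [1, 1, 1], [0, 0, 1], [0, 1, 0]])
y = np.array([1, 1, 0, 0])
print(predict(X, *fit(X, y)))   # recovers the training labels on this tiny example
```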


Discriminative Models



Machine Learning Fundamentals



Workflow of ML tasks



Hyperparameters vs Parameters
• Hyperparameters and parameters are both essential components of a
machine learning model.
– They serve different purposes and have distinct characteristics.
• Parameters:
– Parameters are the internal variables of a machine learning model that are
learned during the training process.
– The model adjusts them to fit the training data and capture the relationships in the data.
– For example, in a linear regression model, the parameters are the coefficients
assigned to each feature; in a neural network, the parameters include the
weights and biases of the network's neurons.
– Training keeps updating these parameters iteratively to minimize a chosen loss function.



Training, Validation and Testing Data



Train, Test and Evaluate Model
• Cross-Validation
– Set aside some portion of the data for validation and train on the rest of it.
• LOOCV (Leave-One-Out Cross-Validation)
– Perform training on the whole training data set but leave out only
one sample for validation.
• K-Fold Cross-Validation
– The data set is split into k subsets (folds).
– Perform training on k − 1 of the subsets and leave one fold out for validation.
– Iterate over all folds.
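
A minimal sketch of k-fold splitting; the model-fitting step is left as a placeholder comment:

```python
import numpy as np

def k_fold_indices(n_samples, k, seed=0):
    """Split sample indices into k folds; yield (train_idx, val_idx) pairs."""
    rng = np.random.default_rng(seed)
    indices = rng.permutation(n_samples)
    folds = np.array_split(indices, k)
    for i in range(k):
        val_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train_idx, val_idx

# Usage: train on k-1 folds, validate on the held-out fold, then average the scores.
X, y = np.random.rand(20, 3), np.random.rand(20)
scores = []
for train_idx, val_idx in k_fold_indices(len(X), k=5):
    X_train, y_train = X[train_idx], y[train_idx]
    X_val, y_val = X[val_idx], y[val_idx]
    # model.fit(X_train, y_train); scores.append(model.score(X_val, y_val))
    scores.append(len(val_idx))          # placeholder "score" so the sketch runs as-is
print(scores)
```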
Cost function
• The cost function helps find optimal model parameters
– e.g., the best-fit line for the data points.
• Searching for these parameters is a minimization problem
– Find the model with minimum error between the predicted
value and the actual value.
• One such cost function is the Mean Squared Error (MSE):
MSE = (1/n) Σi (yi − ŷi)²
• ŷi: predicted label
• yi: original label
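
The same cost function written out in a few lines of Python (the sample values are arbitrary):

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean Squared Error between original labels and predicted labels."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.mean((y_true - y_pred) ** 2)

print(mse([3.0, 5.0, 2.5], [2.5, 5.0, 3.0]))   # (0.25 + 0 + 0.25) / 3 ≈ 0.167
```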
Gradient Descent
• Gradient descent is an optimization algorithm.
• It helps search for the optimal model parameters.
• Parameters are updated according to the gradient values.
• A gradient measures how much the output of a function changes if
you change the parameter values.



Gradient Descent
• Initialize w (e.g., randomly).
• Update the values of w based on the gradient:
w := w − α ∂J(w)/∂w
• where α is the learning rate.
• To find the gradient, take the derivative of the cost function J(w) with respect to w.



Gradient Descent
• To find the gradient, take the derivative of the cost function with respect to each parameter.
• For a best-fit line ŷi = m·xi + b with the MSE cost, solving for the two parameters gives:
∂J/∂m = −(2/n) Σi xi (yi − ŷi)
∂J/∂b = −(2/n) Σi (yi − ŷi)
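Putting the update rule together for the two line parameters, a small sketch on synthetic data (the learning rate and step count are arbitrary choices):

```python
import numpy as np

# Gradient descent for a best-fit line y ≈ m*x + b under the MSE cost.
rng = np.random.default_rng(0)
x = rng.uniform(0, 1, size=100)
y = 2.0 * x + 1.0 + 0.05 * rng.normal(size=100)   # synthetic data, true m = 2, b = 1

m, b = 0.0, 0.0
alpha = 0.1                                        # learning rate
for _ in range(2000):
    y_pred = m * x + b
    grad_m = -(2 / len(x)) * np.sum(x * (y - y_pred))   # dJ/dm
    grad_b = -(2 / len(x)) * np.sum(y - y_pred)          # dJ/db
    m -= alpha * grad_m
    b -= alpha * grad_b

print(round(m, 2), round(b, 2))   # should land close to 2.0 and 1.0
```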


Gradient Descent



Gradient Descent



Gradient Descent



Thought Provoking Question
• How can we evaluate the performance on the test data set when
we can observe only the training set?



References
• Ian Goodfellow, Yoshua Bengio, Aaron Courville, Deep Learning, Chapter 20, MIT Press.
• Lecture slides from https://www.cs.cornell.edu/~kuleshov/



Thank You!

