Chapter 4 - Anatomy of a Learning Algorithm
2. An optimization criterion
3. An optimization routine
Gradient Descent
Gradient descent is an iterative optimization algorithm for finding the minimum of a
function. To find a local minimum of a function using gradient descent, one starts at some
random point and takes steps proportional to the negative of the gradient (or approximate
gradient) of the function at the current point.
Gradient descent can be used to find optimal parameters for linear and logistic regression,
SVM and also neural networks which we consider later. For many models, such as logistic
regression or SVM, the optimization criterion is convex. Convex functions have only one
minimum, which is global. Optimization criteria for neural networks are not convex, but in
practice even finding a local minimum suffices.
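As an illustration, gradient descent on a simple one-dimensional convex function can be sketched in a few lines. The function g(w) = (w − 3)², its gradient 2(w − 3), the starting point, and the learning rate below are illustrative choices, not from the text:

```python
# A minimal sketch of gradient descent on the convex function
# g(w) = (w - 3)^2, whose gradient is 2(w - 3). Starting point and
# learning rate are illustrative choices.

def gradient_descent(grad, w0, alpha=0.1, steps=100):
    """Repeatedly step in the direction of the negative gradient."""
    w = w0
    for _ in range(steps):
        w = w - alpha * grad(w)
    return w

w_min = gradient_descent(lambda w: 2 * (w - 3), w0=0.0)
print(round(w_min, 4))  # converges toward the minimum at w = 3
```

Because g is convex, the iterates approach the single global minimum regardless of the starting point, provided the learning rate is small enough.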
How Gradient Descent Works
The linear regression model looks like f(x) = wx + b, where w is called the weight and b is called the bias. To obtain the optimal model, we have to find the optimal values for both w and b: we look for the values of w and b that minimize the mean squared error:

l = \frac{1}{N} \sum_{i=1}^{N} (y_i - (wx_i + b))^2.
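The mean squared error translates directly into code; the toy dataset and candidate parameters below are illustrative, not from the text:

```python
# Mean squared error l for a candidate (w, b) on a toy dataset.
def mse(w, b, xs, ys):
    N = len(xs)
    return sum((y - (w * x + b)) ** 2 for x, y in zip(xs, ys)) / N

xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]       # generated by y = 2x, so w = 2, b = 0 is optimal
print(mse(2.0, 0.0, xs, ys))  # 0.0: the line y = 2x fits exactly
print(mse(1.0, 0.0, xs, ys))  # a worse candidate gives a larger error
```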
Gradient descent starts by calculating the partial derivative of the criterion with respect to every parameter:
\frac{\partial l}{\partial w} = \frac{1}{N} \sum_{i=1}^{N} -2x_i (y_i - (wx_i + b));

\frac{\partial l}{\partial b} = \frac{1}{N} \sum_{i=1}^{N} -2(y_i - (wx_i + b)).
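These two partial derivatives translate directly into code; the dataset values below are illustrative:

```python
# A direct translation of the two partial derivatives of the mean
# squared error with respect to w and b.
def gradients(w, b, xs, ys):
    N = len(xs)
    dl_dw = sum(-2 * x * (y - (w * x + b)) for x, y in zip(xs, ys)) / N
    dl_db = sum(-2 * (y - (w * x + b)) for x, y in zip(xs, ys)) / N
    return dl_dw, dl_db

xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]
dw, db = gradients(0.0, 0.0, xs, ys)
print(dw, db)  # both negative: the loss decreases as w and b increase
```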
To find the partial derivative of the term (y_i - (wx_i + b))^2 with respect to w, we applied the chain rule. Here we have the chain f = f_2(f_1), where f_1 = y_i - (wx_i + b) and f_2 = f_1^2. We first find the derivative of the outer function f_2 with respect to its argument f_1, which is equal to 2(y_i - (wx_i + b)), and then we multiply it by the partial derivative of f_1 with respect to w, which is equal to -x_i. So overall,

\frac{\partial l}{\partial w} = \frac{1}{N} \sum_{i=1}^{N} -2x_i (y_i - (wx_i + b)).
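One way to sanity-check the chain-rule result is a finite-difference comparison: the analytic gradient of l with respect to w should agree with the central difference (l(w + h) − l(w − h)) / 2h up to floating-point error. The parameter and data values below are illustrative:

```python
# Finite-difference check of the analytic gradient dl/dw.
def loss(w, b, xs, ys):
    N = len(xs)
    return sum((y - (w * x + b)) ** 2 for x, y in zip(xs, ys)) / N

def dl_dw(w, b, xs, ys):
    N = len(xs)
    return sum(-2 * x * (y - (w * x + b)) for x, y in zip(xs, ys)) / N

xs, ys = [1.0, 2.0, 3.0], [2.1, 3.9, 6.2]
w, b, h = 0.5, 0.1, 1e-6
numeric = (loss(w + h, b, xs, ys) - loss(w - h, b, xs, ys)) / (2 * h)
print(abs(numeric - dl_dw(w, b, xs, ys)) < 1e-6)  # True
```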
We initialize w_0 = 0 and b_0 = 0 and then iterate through the training examples, updating w and b after each example using the partial derivatives. The learning rate \alpha controls the size of an update:

w_i \leftarrow w_{i-1} - \alpha \frac{\partial l}{\partial w}; \quad b_i \leftarrow b_{i-1} - \alpha \frac{\partial l}{\partial b},

where w_i and b_i denote the values of w and b after using the example (x_i, y_i) for the update.
One pass through all training examples is called an epoch.
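The per-example update loop described above can be sketched as follows; the learning rate, epoch count, and toy dataset are illustrative choices (the data are generated by y = 2x + 1, so the loop should recover w close to 2 and b close to 1):

```python
# Per-example gradient descent for linear regression: w and b start at
# zero and are updated after each training example; one pass over the
# data is one epoch. Hyperparameters here are illustrative choices.
def train_sgd(xs, ys, alpha=0.01, epochs=1000):
    w, b = 0.0, 0.0
    for _ in range(epochs):              # one pass over the data = one epoch
        for x, y in zip(xs, ys):
            err = y - (w * x + b)
            w = w + alpha * 2 * err * x  # w <- w - alpha * dl/dw
            b = b + alpha * 2 * err      # b <- b - alpha * dl/db
    return w, b

xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]   # generated by y = 2x + 1
w, b = train_sgd(xs, ys)
print(round(w, 2), round(b, 2))  # converges toward w = 2, b = 1
```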
How Machine Learning Engineers Work
Machine learning engineers use libraries instead of implementing learning algorithms themselves. The most frequently used open-source library is scikit-learn:
def train(x, y):
    from sklearn.linear_model import LinearRegression
    # fit expects x with shape (n_samples, n_features)
    model = LinearRegression().fit(x, y)
    return model

model = train(x, y)

x_new = [[23.0]]   # predict also expects a 2D array
y_new = model.predict(x_new)
print(y_new)