Lesson02-Python Calculus Maths
Activation Functions
Advantages and Drawbacks
1. Linear and Nonlinear Functions
• Sigmoid Activation Function:
• Output range: (0, 1)
• Not zero-centered
• Involves an exponential operation (computationally expensive)
• Hyperbolic Tangent Activation Function (tanh):
• Output range: (-1, 1)
• Zero-centered
• Rectified Linear Unit Activation Function (ReLU):
• Does not saturate (in the positive region)
• Converges faster in practice than saturating activations such as sigmoid and tanh (a small NumPy sketch of these three functions follows below)
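As a quick illustration (not from the original slides), here is a minimal NumPy sketch of the three activations above; the function names are my own:

```python
# Minimal NumPy sketch of sigmoid, tanh and ReLU (illustration only).
import numpy as np

def sigmoid(x):
    # Squashes inputs into (0, 1); not zero-centered, uses an exponential.
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    # Squashes inputs into (-1, 1); zero-centered.
    return np.tanh(x)

def relu(x):
    # max(0, x); does not saturate for positive inputs.
    return np.maximum(0.0, x)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(sigmoid(x))   # values strictly between 0 and 1
print(tanh(x))      # values between -1 and 1, symmetric around 0
print(relu(x))      # negative inputs clipped to 0
```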
• Leaky ReLU:
• An improvement over the ReLU activation function
• Has all the beneficial properties of ReLU
• Never suffers from the dead ReLU problem, since negative inputs keep a small non-zero gradient
• Maxout:
• Piecewise linear, so it keeps the benefits of linearity
• Never saturates and never dies
• But it is expensive, as it doubles the number of parameters per neuron
• ELU (Exponential Linear Units):
• No dead ReLU situation
• Outputs are closer to zero mean than with Leaky ReLU
• More computation because of the exponential function (a sketch of Leaky ReLU and ELU follows below)
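A similar sketch (illustration only; the alpha values below are common defaults, not taken from the slides) for Leaky ReLU and ELU:

```python
# Leaky ReLU and ELU sketches in NumPy (illustration only).
import numpy as np

def leaky_relu(x, alpha=0.01):
    # Small non-zero slope for x < 0 avoids the dead ReLU problem.
    return np.where(x > 0, x, alpha * x)

def elu(x, alpha=1.0):
    # Smooth exponential curve for x < 0; outputs are closer to zero mean.
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))

x = np.array([-3.0, -1.0, 0.0, 1.0, 3.0])
print(leaky_relu(x))   # [-0.03 -0.01  0.    1.    3.  ]
print(elu(x))          # approximately [-0.95 -0.63  0.    1.    3.  ]
```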
2. Derivatives and Finding Extreme Points
• Suppose we have a function y = f(x) that depends on x. The derivative of this function is the rate at which the value y changes with a change in x.
• In geometry, slope represents the steepness of a line. It answers the question: how much does y (or f(x)) change for a given change in x?
• Using this definition we can easily calculate the slope between two points. But what if, instead of the slope between two points, we want the slope at a single point on the line? In this case there isn’t any obvious “rise over run” to calculate. Derivatives help us answer this question (a small numerical sketch follows below).
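To make this concrete, here is a small sketch (not from the slides) that approximates the slope at a single point with a central finite difference:

```python
# Approximate the derivative (slope at a single point) numerically.
def derivative(f, x, h=1e-5):
    # Central difference: rise over a very small run centred at x.
    return (f(x + h) - f(x - h)) / (2 * h)

f = lambda x: x ** 2           # example function y = x^2
print(derivative(f, 3.0))      # ~6.0, matching the analytic derivative 2x = 6
```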
Finding Extreme Points
• A function can have a local maximum or minimum only where its derivative is zero; the sign of the second derivative tells which one it is.
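A hedged sketch of this idea (assumes SymPy is installed; the function f(x) = x^3 - 3x is just an example):

```python
# Find extreme points of f(x) = x^3 - 3x by solving f'(x) = 0,
# then classify them with the second derivative.
import sympy as sp

x = sp.symbols("x")
f = x**3 - 3*x
critical_points = sp.solve(sp.diff(f, x), x)      # where the slope is zero
for c in critical_points:
    second = sp.diff(f, x, 2).subs(x, c)
    kind = "minimum" if second > 0 else "maximum"
    print(c, kind)                                # x=-1 maximum, x=1 minimum
```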
Partial Derivative
• The partial derivative of a multivariable function is its derivative with respect to one variable while the other variables are held constant.
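A small numerical sketch (illustration only) of partial derivatives for f(x, y) = x^2 * y, differentiating one variable at a time:

```python
# Numerical partial derivatives via finite differences.
def partial_x(f, x, y, h=1e-5):
    # Vary x while y is held fixed.
    return (f(x + h, y) - f(x - h, y)) / (2 * h)

def partial_y(f, x, y, h=1e-5):
    # Vary y while x is held fixed.
    return (f(x, y + h) - f(x, y - h)) / (2 * h)

f = lambda x, y: x**2 * y
print(partial_x(f, 2.0, 3.0))   # ~12.0 (analytic: 2xy)
print(partial_y(f, 2.0, 3.0))   # ~4.0  (analytic: x^2)
```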
3. Gradient Descent
• A gradient is a vector that stores the partial derivatives of a multivariable function. It lets us calculate the slope at a specific point for functions with multiple independent variables.
• The gradient vector is orthogonal to the tangent hyperplane and points in the direction of steepest ascent. To descend, you take the opposite of this vector (hence “descent”) and multiply it by the learning rate lr.
• The projection of this vector onto the parameter space (here: the x-axis) gives you the new (updated) parameter. You then repeat this operation several times to move down the cost (error) function, with the goal of reaching a value of w where the cost is minimal.
• The parameter is thus updated as follows at each step (see the sketch below):
parameter <-- parameter - lr*gradient
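A minimal sketch of this update rule; the cost function J(w) = (w - 4)^2, the learning rate and the starting point are illustrative choices, not values from the slides:

```python
# Gradient descent on J(w) = (w - 4)^2 using the update rule above.
def gradient(w):
    # dJ/dw for J(w) = (w - 4)^2
    return 2 * (w - 4)

w = 0.0        # initial parameter
lr = 0.1       # learning rate
for step in range(100):
    w = w - lr * gradient(w)   # parameter <-- parameter - lr*gradient
print(w)       # converges toward 4, where the cost is minimal
```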
4. Loss Function
• Let’s say you are at the top of a hill and need to climb down. How do you decide which way to walk? Here’s what I would do:
• Look around to see all the possible paths
• Reject the ones going up, because these paths would cost me more energy and make my task even more difficult
• Finally, take the path with the steepest downhill slope
• A loss function maps decisions to their associated costs (a small example follows below).
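As a concrete example (illustration only, not one of the losses covered in the slides below), mean squared error maps a set of predictions to a single cost:

```python
# Mean squared error: each prediction is scored by its squared distance
# from the true value; the loss is the average of those costs.
import numpy as np

def mse(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)

y_true = np.array([3.0, -0.5, 2.0])
print(mse(y_true, np.array([2.5, 0.0, 2.0])))   # small cost: good predictions
print(mse(y_true, np.array([0.0, 0.0, 0.0])))   # larger cost: poor predictions
```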
Log Loss
Log Loss = -(1/N) * Σ_{i=1..N} Σ_{j=1..M} y_ij * log(p_ij)
where,
N : number of samples
M : number of classes
y_ij : indicates whether the i-th sample belongs to the j-th class or not
p_ij : predicted probability of the i-th sample belonging to the j-th class
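A NumPy sketch of this formula (illustration only; scikit-learn's sklearn.metrics.log_loss computes the same quantity):

```python
# Multi-class log loss for one-hot labels y (N x M) and probabilities p (N x M).
import numpy as np

def log_loss(y, p, eps=1e-15):
    p = np.clip(p, eps, 1 - eps)              # avoid log(0)
    return -np.mean(np.sum(y * np.log(p), axis=1))

y = np.array([[1, 0, 0], [0, 1, 0]])                 # two samples, three classes
p = np.array([[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]])     # predicted probabilities
print(log_loss(y, p))                                # ~0.29
```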
Focal Loss
FL(p_t) = -(1 - p_t)^γ * log(p_t), where p_t is the predicted probability of the true class and γ ≥ 0 down-weights easy examples
Exponential Loss
L = exp(-y * f(x)), with labels y in {-1, +1} (used e.g. in AdaBoost)
Hinge Loss
L = max(0, 1 - y * f(x)), with labels y in {-1, +1} (used e.g. in SVMs)
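Sketches of these three losses for a single prediction (illustration only; hinge and exponential loss assume labels in {-1, +1}, focal loss takes the predicted probability of the true class):

```python
# Hinge, exponential and focal loss for a single prediction.
import numpy as np

def hinge_loss(y, score):
    return np.maximum(0.0, 1.0 - y * score)

def exponential_loss(y, score):
    return np.exp(-y * score)

def focal_loss(p_t, gamma=2.0):
    return -((1.0 - p_t) ** gamma) * np.log(p_t)

print(hinge_loss(1, 0.3))         # 0.7: correct side of the boundary, but inside the margin
print(exponential_loss(-1, 0.3))  # ~1.35: wrong-sign score is penalised
print(focal_loss(0.9))            # ~0.001: easy example is heavily down-weighted
```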
Cross Entropy Loss
CE = -Σ_j y_j * log(p_j); for binary classification this reduces to -(y * log(p) + (1 - y) * log(1 - p)), and averaging over samples gives the Log Loss above.
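A short sketch of the binary case (illustration only):

```python
# Binary cross entropy: the two-class special case of the Log Loss above.
import numpy as np

def binary_cross_entropy(y, p, eps=1e-15):
    p = np.clip(p, eps, 1 - eps)
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

y = np.array([1, 0, 1, 1])
p = np.array([0.9, 0.1, 0.8, 0.4])
print(binary_cross_entropy(y, p))   # ~0.34
```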