Optimization
A function is a mathematical rule that takes an input value, processes it according to a specific
formula or set of instructions, and produces a unique output value. In other words, a function
is a relationship between input and output values where each input is connected to exactly
one output.
In mathematics, parameters of a function are the variables that are used to define the
behaviour of the function. The parameters influence the function's output by determining how
the input values are processed.
The parameters are the constants or coefficients that appear in the function's formula. For
example, in the quadratic function f(x) = ax^2 + bx + c, 'a', 'b', and 'c' are the parameters of the
function. By changing the values of these parameters, you can modify the shape and position
of the parabola represented by the function.
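To make this concrete, here is a minimal Python sketch (the function name quadratic and the sample values are chosen purely for illustration): the parameters a, b, and c are fixed when the function is defined, while the input x varies from call to call.

    def quadratic(x, a=1.0, b=0.0, c=0.0):
        # f(x) = a*x^2 + b*x + c. a, b, c are parameters that shape the
        # parabola; x is the input the function is applied to.
        return a * x ** 2 + b * x + c

    # Same input, different parameters -> different outputs.
    print(quadratic(2.0))                 # 4.0, since f(x) = x^2 here
    print(quadratic(2.0, a=3.0, b=-1.0))  # 10.0, since f(x) = 3x^2 - x

Changing a, b, or c selects a different parabola without changing how the function maps each input to exactly one output.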
A loss function, also known as a cost function or objective function, is a mathematical function
that measures the difference between the predicted output and the actual target values in a
machine learning model. The primary goal of training a machine learning model is to minimize
the value of the loss function, which corresponds to improving the model's performance on
the given task.
Loss functions play a crucial role in the optimization process, guiding the learning algorithm to
adjust the model's parameters to achieve better predictions.
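As a minimal sketch (pure Python; the function name mse and the toy values are illustrative), mean squared error, one common loss function, measures the average squared difference between predictions and targets:

    def mse(y_true, y_pred):
        # Average of squared differences between targets and predictions.
        return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

    # Smaller values mean the predictions are closer to the targets.
    print(mse([1.0, 2.0, 3.0], [1.1, 1.9, 3.2]))  # ~0.02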
When selecting a loss function, several factors should guide the choice:
1. Problem type: The choice of a loss function depends on the type of problem you are solving. For example, in regression tasks, mean squared error (MSE) or mean absolute error (MAE) are commonly used. For binary classification, cross-entropy loss or hinge loss can be employed. For multi-class classification, categorical cross-entropy or multi-class hinge loss can be used. Choose a loss function that aligns with the objectives of the specific problem you are addressing.
2. Robustness to outliers: Some loss functions, like mean squared error, are more sensitive to outliers, which can lead to a model that is overly influenced by extreme values. If your dataset contains outliers or is prone to noise, consider using a loss function that is more robust to outliers, such as mean absolute error (MAE) or Huber loss; a sketch comparing these losses on data with an outlier follows this list.
3. Interpretability and ease of use: A good loss function should be interpretable and easy to
implement. Simple loss functions like mean squared error or cross-entropy loss are widely
used because they are easy to understand, compute, and differentiate. When possible, opt
for a loss function that is easy to work with and can be easily incorporated into your
optimization process.
4. Differentiability: Most optimization algorithms, like gradient descent, require the loss
function to be differentiable. Choose a loss function that has continuous first-order
derivatives, which makes it easier to compute the gradients needed for optimization.
5. Compatibility with the model: Ensure that the chosen loss function is compatible with the model architecture you are using. Some models have specific requirements or assumptions about the loss function. For example, linear regression assumes Gaussian noise, under which minimizing mean squared error coincides with maximum likelihood estimation, making it the natural loss function in that case.
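To make the robustness point in item 2 concrete, here is a minimal sketch comparing three regression losses on the same residuals, the last of which is an outlier (pure Python; the helper names and the delta threshold of 1.0 are illustrative choices):

    def mse(errors):
        return sum(e ** 2 for e in errors) / len(errors)

    def mae(errors):
        return sum(abs(e) for e in errors) / len(errors)

    def huber(errors, delta=1.0):
        # Quadratic for small errors, linear beyond delta.
        return sum(
            0.5 * e ** 2 if abs(e) <= delta else delta * (abs(e) - 0.5 * delta)
            for e in errors
        ) / len(errors)

    errors = [0.1, -0.2, 0.1, 10.0]  # the last residual is an outlier
    print(mse(errors))    # ~25.0: dominated by the outlier
    print(mae(errors))    # ~2.6:  grows only linearly with it
    print(huber(errors))  # ~2.4:  quadratic near zero, linear in the tail

Because the squared term makes MSE grow quadratically with the outlier while MAE and Huber grow only linearly, a model trained with the latter two is pulled less toward extreme values.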
In principle, a loss function could be minimized analytically by setting its gradient to zero and solving for the parameters. In practice, this approach often fails for several reasons:
1. Non-convexity: The loss function may not be convex, meaning it can have multiple local minima and maxima. In such cases, setting the gradient to zero yields a stationary point, which may be a local minimum, a local maximum, or a saddle point rather than the global minimum (the optimal solution).
2. Complexity: For some models, the loss function can be highly complex, and finding the
analytical solution by setting the gradient to zero might be computationally expensive or
even impossible. This is particularly true for deep learning models, where the loss functions
involve a large number of parameters and complex relationships between them.
3. Scalability: In large-scale machine learning problems with massive amounts of data or high-
dimensional feature spaces, computing the analytical solution by setting the gradient to
zero can be computationally prohibitive due to the high cost of processing and storing the
data.
4. Online learning and streaming data: In some applications, the data is not available all at once but arrives in a continuous stream. In these scenarios, models need to be updated incrementally as new data arrives, and an analytical solution would not be practical. Gradient descent and its variants, such as stochastic gradient descent, are well suited for online learning and handling streaming data; a minimal sketch follows this list.
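The following is a minimal sketch of this iterative alternative (plain Python; the data, learning rate, and iteration count are illustrative). Instead of solving for the parameter in closed form, gradient descent repeatedly steps against the gradient of the MSE; the stochastic variant applies the same update one sample at a time, which is what makes it suitable for streaming data:

    # Fit y ~ w * x by minimizing MSE with gradient descent.
    xs = [1.0, 2.0, 3.0, 4.0]
    ys = [2.1, 3.9, 6.2, 8.1]  # roughly y = 2x

    w, lr = 0.0, 0.01
    for _ in range(500):
        # d(MSE)/dw = (2/n) * sum((w*x - y) * x)
        grad = 2 * sum((w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        w -= lr * grad
    print(w)  # ~2.03, the least-squares solution

    # Stochastic variant: one update per incoming sample (online learning).
    w = 0.0
    for x, y in zip(xs, ys):           # imagine these arriving as a stream
        w -= lr * 2 * (w * x - y) * x  # gradient on a single example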
Even when iterative methods are used, optimizing the loss function poses several challenges:
1. Non-convexity: For many machine learning models, such as artificial neural networks, the loss function is non-convex, which means it has a complex landscape with multiple local minima, maxima, and saddle points. This makes it difficult for optimization algorithms to find the global minimum and can result in suboptimal solutions.
2. Ill-conditioning: The loss function may be ill-conditioned, meaning it curves much more steeply in some directions of parameter space than in others. This can cause gradient-based optimization algorithms, such as gradient descent, to oscillate across the steep directions while converging slowly along the flat ones.
3. Vanishing and exploding gradients: In deep neural networks, the gradients can become very small (vanish) or very large (explode) as they propagate through the layers. This can lead to slow convergence or unstable training dynamics, making it difficult to optimize the loss function; a numeric illustration follows this list.
4. Overfitting: When optimizing the loss function, the algorithm may overfit the training data,
resulting in a model that performs poorly on unseen data. This occurs when the model is
too complex and learns the noise in the training data instead of the underlying patterns.
5. Scalability: For large-scale problems with a high number of features, instances, or model parameters, optimizing the loss function can be computationally expensive and time-consuming. This can limit the applicability of certain optimization techniques or require significant computational resources.
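As a numeric illustration of the vanishing-gradient problem from item 3 (a minimal sketch; the depth of 20 layers and the zero pre-activations are made-up, best-case values), note that the sigmoid's derivative never exceeds 0.25, so backpropagation through many sigmoid layers multiplies the gradient by a factor of at most 0.25 per layer:

    import math

    def sigmoid_deriv(z):
        s = 1.0 / (1.0 + math.exp(-z))
        return s * (1.0 - s)  # maximum value 0.25, attained at z = 0

    # Backpropagation multiplies one such factor per layer; even at the
    # maximum, the product shrinks geometrically with depth.
    grad = 1.0
    for layer in range(20):
        grad *= sigmoid_deriv(0.0)  # 0.25 per layer, the best case
    print(grad)  # 0.25**20 ~ 9.1e-13: the gradient has effectively vanished

Even in this best case, the gradient reaching the earliest layers is on the order of 1e-12, far too small to drive meaningful parameter updates.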