
Foundations of Data Science, Fall 2024

Introduction to Data Science for Doctoral Students, Fall 2024

10. Convex Optimisation

Dr. Haozhe Zhang

October 21, 2024

MSc: https://lms.uzh.ch/url/RepositoryEntry/17589469505
PhD: https://lms.uzh.ch/url/RepositoryEntry/17589469506
Solving Machine Learning Problems

Most machine learning methods can be cast as optimisation problems.

• So far in the course: closed-form solutions,
  e.g., minimisation of the least squares and ridge regression objectives

• Most interesting learning problems do not admit closed-form solutions :(

Two approaches to solving problems beyond closed-form solutions:

1. Frame the objective of the ML problem as a standard mathematical problem and
   use an existing black-box solver. This works whenever the objective can be
   formulated as a convex optimisation problem.

2. Gradient-based optimisation methods. These are not black-box: the choice of
   optimisation hyper-parameters affects performance.
A Crash Course in Optimisation

Today:

• Convex optimisation

Next time:

• Recap: Gradients, Hessians

• Gradient Descent

• Stochastic Gradient Descent

• Constrained optimisation

Most machine learning packages, e.g., scikit-learn, tensorflow, octave, torch,
have optimisation methods readily implemented.

You need to understand the basics of optimisation to use them effectively.
Convex Sets
A set C ⊆ R^D is convex if for any x, y ∈ C and λ ∈ [0, 1], it holds that
λ x + (1 − λ) y ∈ C.

[Figure: examples of sets, one convex and two nonconvex]
Examples of Convex Sets

• The set R^D itself
  λ x + (1 − λ) y ∈ R^D for all x, y ∈ R^D and λ ∈ [0, 1]

• Intersections of convex sets
  Given convex sets C1, . . . , Cn, the set ∩_{i=1}^{n} Ci is convex

• Norm balls
  For any norm || · ||, the set B = {x ∈ R^D : ||x|| ≤ 1} is convex

• Polyhedra
  Given A ∈ R^{m×n} and b ∈ R^m, the polyhedron {x ∈ R^n : A x ≤ b} is convex

• Positive semidefinite cone
  The set S^D_+ of positive semidefinite matrices is convex
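A minimal numerical sanity check of the last point (a spot-check, not a proof), assuming NumPy is available: a convex combination of two randomly generated PSD matrices should again be PSD.

```python
import numpy as np

rng = np.random.default_rng(0)
G1, G2 = rng.standard_normal((2, 4, 4))
A, B = G1 @ G1.T, G2 @ G2.T            # Gram matrices are PSD by construction
lam = 0.3
C = lam * A + (1 - lam) * B            # convex combination of PSD matrices
print(np.linalg.eigvalsh(C).min())     # smallest eigenvalue; >= 0 up to rounding
```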
Showing the Set of PSD Matrices is Convex

Proof sketch: Let A, B ∈ S^D_+ and λ ∈ [0, 1]. For any z ∈ R^D,
z^T (λ A + (1 − λ) B) z = λ z^T A z + (1 − λ) z^T B z ≥ 0,
since both terms are nonnegative. Hence λ A + (1 − λ) B ∈ S^D_+.
Showing the Norm Balls Form Convex Sets

Proof sketch: Let x, y ∈ B = {x ∈ R^D : ||x|| ≤ 1} and λ ∈ [0, 1]. By the
triangle inequality and absolute homogeneity of norms,
||λ x + (1 − λ) y|| ≤ λ ||x|| + (1 − λ) ||y|| ≤ λ + (1 − λ) = 1,
so λ x + (1 − λ) y ∈ B.
Showing the Polyhedron is Convex + Example

Given A ∈ R^{m×n} and b ∈ R^m, the polyhedron P = {x ∈ R^n : A x ≤ b} is convex.

Proof sketch: Let x, y ∈ P and λ ∈ [0, 1]. By linearity,
A (λ x + (1 − λ) y) = λ A x + (1 − λ) A y ≤ λ b + (1 − λ) b = b,
so λ x + (1 − λ) y ∈ P. Example: the feasible region of a linear program,
e.g., {x ∈ R^2 : x1 + x2 ≤ 1, x1 ≥ 0, x2 ≥ 0}, is a polyhedron.
Convex Functions

A function f : R^D → R defined on a convex domain is convex if,

for all x, y where f is defined and 0 ≤ λ ≤ 1,

f (λ · x + (1 − λ) · y) ≤ λ · f (x) + (1 − λ) · f (y)
Examples of Convex Functions

• Affine functions: f (x) = b^T x + c

• Quadratic functions: f (x) = 1/2 · x^T A x + b^T x + c,
  where A is symmetric positive semidefinite

• Nonnegative weighted sums of convex functions: given convex functions
  f1, . . . , fn and weights w1, . . . , wn ∈ R≥0, the function
  f (x) = Σ_{i=1}^{n} wi · fi (x) is convex

• Norms: ∥ · ∥p for p ≥ 1 (the so-called ℓ0 "norm" is not a true norm and is
  not convex)
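To make the definition concrete, here is an illustrative NumPy sketch that spot-checks the convexity inequality for a quadratic f built from a randomly generated symmetric PSD matrix (the data here are hypothetical, chosen only for the check).

```python
import numpy as np

rng = np.random.default_rng(1)
G = rng.standard_normal((5, 5))
A = G @ G.T                                  # symmetric PSD matrix
b, c = rng.standard_normal(5), 0.7

def f(x):
    # quadratic function f(x) = 1/2 x^T A x + b^T x + c
    return 0.5 * x @ A @ x + b @ x + c

x, y = rng.standard_normal((2, 5))
for lam in np.linspace(0.0, 1.0, 11):
    lhs = f(lam * x + (1 - lam) * y)
    rhs = lam * f(x) + (1 - lam) * f(y)
    assert lhs <= rhs + 1e-9                 # convexity inequality holds
```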
Convex Optimisation

Given convex functions f, g1, . . . , gm and affine functions h1, . . . , hn,

a convex optimisation problem has the form:

minimise    f (x)
subject to  gi (x) ≤ 0,   i ∈ [m]
            hj (x) = 0,   j ∈ [n]

The goal is to find the optimal value v* of a convex optimisation problem:

S = {f (x) : gi (x) ≤ 0, i ∈ [m], hj (x) = 0, j ∈ [n]}   objective values over the feasible set
v* = inf S                                               optimal value of the objective
x* is a feasible point with f (x*) = v*                  optimal point (not necessarily unique)

Infeasible and unbounded instances

• v* := +∞ for infeasible instances (feasible = fulfils all constraints gi and hj)
• v* := −∞ for unbounded instances (unbounded = the objective values over the
  feasible set have no finite infimum)
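As an illustration of these notions, a small convex problem can be handed to a modelling tool. The sketch below assumes the cvxpy package is available (it is not mentioned in the lecture itself); the solver returns the optimal value v* and an optimal point x*.

```python
import cvxpy as cp
import numpy as np

x = cp.Variable(2)
objective = cp.Minimize(cp.sum_squares(x - np.array([3.0, 1.0])))  # convex f
constraints = [x >= 0, cp.sum(x) <= 2]                             # g_i(x) <= 0 form
problem = cp.Problem(objective, constraints)

v_star = problem.solve()       # optimal value v*
print(v_star, x.value)         # x.value is an optimal point x* (here unique)
```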
Local Optima are Global Optima for Convex Optimisation Problems

x is locally optimal if:

• x is feasible and
• There is B > 0 s.t. f (x) ≤ f (y) for all feasible y with ||x − y||_2 ≤ B.

x is globally optimal if:

• x is feasible and
• f (x) ≤ f (y) for all feasible y.

Theorem: For any convex optimisation problem, all locally optimal points are
globally optimal.

Local Optima are Global Optima for Convex Optimisation Problems: Proof

Proof sketch: Suppose x is locally optimal (with radius B) but not globally
optimal, i.e., there is a feasible y with f (y) < f (x). For λ ∈ (0, 1), the
point z = λ y + (1 − λ) x is feasible, since the feasible set is convex, and by
convexity of f,
f (z) ≤ λ f (y) + (1 − λ) f (x) < f (x).
Choosing λ small enough that ||x − z||_2 ≤ B contradicts local optimality of x.
Local Optima are Global Optima for Convex Optimisation Problems: Figure

[Figure: a convex function f with points x, z, y and ball radius B, showing
f (z) < f (x) for z between x and a better feasible point y]
Classes of Convex Optimisation Problems

Linear Programming:

minimise    c^T x + d
subject to  A x ≤ e
            B x = f

Quadratically Constrained Quadratic Programming:

minimise    1/2 · x^T B x + c^T x + d
subject to  1/2 · x^T Qi x + ri^T x + si ≤ 0,   i ∈ [m]
            A x = b

Semidefinite Programming:

minimise    tr(C X)
subject to  tr(Ai X) = bi,   i ∈ [m]
            X positive semidefinite

For a matrix B, tr(B) is the trace of B.
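For instance, the semidefinite program above can be written almost verbatim with cvxpy (an assumption here, together with an SDP-capable solver such as SCS); the data C, A1, b1 are illustrative.

```python
import cvxpy as cp
import numpy as np

n = 3
C = np.eye(n)                                 # objective matrix
A1, b1 = np.ones((n, n)), 1.0                 # single equality constraint

X = cp.Variable((n, n), PSD=True)             # X restricted to the PSD cone
problem = cp.Problem(cp.Minimize(cp.trace(C @ X)),
                     [cp.trace(A1 @ X) == b1])
problem.solve()
print(problem.value, X.value)
```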
Linear Programming

Looking for solutions x ∈ R^n to the following optimisation problem:

minimise    c^T x + d
subject to  A x ≤ e
            B x = f

• No closed-form solution
• Efficient algorithms exist, both in theory and practice
  (for tens of thousands of variables)
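A minimal sketch of solving such an LP with SciPy's linprog, assuming SciPy is installed and the data below are illustrative. Note that linprog imposes x ≥ 0 by default, which this example keeps; pass explicit bounds otherwise.

```python
import numpy as np
from scipy.optimize import linprog

c = np.array([-1.0, -2.0])                 # minimise c^T x (d only shifts the value)
A = np.array([[1.0, 1.0], [-1.0, 1.0]])    # A x <= e
e = np.array([3.0, 1.0])

res = linprog(c, A_ub=A, b_ub=e)           # default bounds keep x >= 0
print(res.fun, res.x)                      # optimal value -5 at x = (1, 2)
```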
Linear Model with Absolute Loss

Suppose we have data (X, y) and that we want to minimise the objective:

L(w) = Σ_{i=1}^{N} |w^T xi − yi|

We would like to transform this optimisation problem into a linear program.

We introduce one variable ζi for each datapoint.

The linear program in the D + N variables w1, . . . , wD, ζ1, . . . , ζN:

minimise    Σ_{i=1}^{N} ζi
subject to  w^T xi − yi ≤ ζi,   i ∈ [N]
            yi − w^T xi ≤ ζi,   i ∈ [N]

The solution to this linear program gives w that minimises the objective L.
Linear Model with Absolute Loss via Linear Programming (1/2)
L(w) = Σ_{i=1}^{N} |w^T xi − yi|

minimise    Σ_{i=1}^{N} ζi
subject to  w^T xi − yi ≤ ζi,   i ∈ [N]
            yi − w^T xi ≤ ζi,   i ∈ [N]

Claim: The solution to this linear program gives w that minimises the objective L.
Linear Model with Absolute Loss via Linear Programming (2/2)

Proof sketch: For any fixed w, the two constraints say ζi ≥ w^T xi − yi and
ζi ≥ yi − w^T xi, so the smallest feasible ζi is |w^T xi − yi|. At an optimum
each ζi takes this smallest value (otherwise the objective could be decreased),
so the LP objective equals Σ_{i=1}^{N} |w^T xi − yi| = L(w). Minimising over
(w, ζ) therefore yields a w that minimises L.
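Putting the pieces together, here is an illustrative implementation of this LP with scipy.optimize.linprog (SciPy assumed; the data X, y are synthetic). The variable vector stacks w and the ζ's.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(2)
N, D = 50, 3
X = rng.standard_normal((N, D))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.standard_normal(N)

c = np.concatenate([np.zeros(D), np.ones(N)])    # objective: sum of the zetas
I = np.eye(N)
A_ub = np.block([[X, -I],                        # w^T x_i - y_i <= zeta_i
                 [-X, -I]])                      # y_i - w^T x_i <= zeta_i
b_ub = np.concatenate([y, -y])

res = linprog(c, A_ub=A_ub, b_ub=b_ub,
              bounds=[(None, None)] * (D + N))   # all variables free in sign
w_hat = res.x[:D]                                # weights minimising the absolute loss
print(w_hat)
```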
Recall: Likelihood of Linear Regression (Gaussian Noise Model)

Likelihood:

p(y | X, w, σ) = (1 / (2πσ^2))^{N/2} · exp( −(1 / (2σ^2)) · (Xw − y)^T (Xw − y) )

Maximise Likelihood = Maximise Log-Likelihood (log : R+ → R is increasing)

LL(y | X, w, σ) = −(N/2) · log(2πσ^2) − (1 / (2σ^2)) · (Xw − y)^T (Xw − y)

Maximise Log-Likelihood = Minimise Negative Log-Likelihood

NLL(y | X, w, σ) = (N/2) · log(2πσ^2) + (1 / (2σ^2)) · (Xw − y)^T (Xw − y)
                 = (N/2) · log(2πσ^2) + (1 / (2σ^2)) · ( w^T X^T X w − 2 y^T X w + y^T y )

Here w^T X^T X w has the form w^T B w, −2 y^T X w has the form c^T w, and
y^T y and the log term are constants in w.

This is a convex quadratic optimisation problem with no constraints!
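Since the NLL is an unconstrained convex quadratic in w, its minimiser coincides with the least-squares solution; a short NumPy sketch on synthetic data:

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.standard_normal((100, 4))
w_true = np.array([2.0, 0.0, -1.0, 3.0])
y = X @ w_true + 0.05 * rng.standard_normal(100)

# lstsq minimises ||Xw - y||^2, i.e., the w-dependent part of the NLL
w_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
print(w_hat)                                   # close to w_true
```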
Minimising the Lasso Objective

For the Lasso objective, i.e., a linear model with ℓ1-regularisation, we have

Llasso(w) = Σ_{i=1}^{N} (w^T xi − yi)^2 + λ · Σ_{i=1}^{D} |wi|
          = w^T X^T X w − 2 y^T X w + y^T y + λ · Σ_{i=1}^{D} |wi|

• The quadratic part of the loss function cannot be framed as a linear program

• Lasso regularisation does not allow for closed-form solutions

• The problem can be rephrased as a quadratic programming problem

• Alternatively, resort to general optimisation methods
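One way to minimise the Lasso objective exactly as written, again assuming the cvxpy package is available (the data and λ below are illustrative):

```python
import cvxpy as cp
import numpy as np

rng = np.random.default_rng(4)
X = rng.standard_normal((100, 5))
y = X @ np.array([1.5, 0.0, 0.0, -2.0, 0.0]) + 0.1 * rng.standard_normal(100)
lam = 1.0

w = cp.Variable(5)
objective = cp.Minimize(cp.sum_squares(X @ w - y) + lam * cp.norm(w, 1))
cp.Problem(objective).solve()                  # unconstrained convex problem
print(w.value)                                 # some coordinates driven toward zero
```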
