Lecture 8: Applications in ML

Nicholas Ruozzi
University of Texas at Dallas
Function Fitting: ML Applications
• A wide variety of machine learning problems can be cast as function fitting problems

• Given data observations with corresponding “labels”, find the function that is the best fit for the data

• We saw an example of this on homework 1, the least squares regression problem

  $\min_{a,b} \sum_{i=1}^{M} (a x_i + b - y_i)^2$

• Here the function being fit is parameterized by two numbers $a$ and $b$, e.g., $f(x) = a x + b$
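As a concrete illustration (not part of the original slides), here is a minimal numpy sketch of the least squares fit $f(x) = ax + b$; the data are made up for the example.

```python
import numpy as np

# Toy data (made up for illustration): noisy observations of y = 2x + 1
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=50)
y = 2.0 * x + 1.0 + 0.1 * rng.standard_normal(50)

# Least squares: minimize sum_i (a*x_i + b - y_i)^2.
# Stack a column of ones so the parameter vector is [a, b].
X = np.column_stack([x, np.ones_like(x)])
(a, b), *_ = np.linalg.lstsq(X, y, rcond=None)
print(f"fitted a={a:.3f}, b={b:.3f}")
```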
L1 Regression
• Suppose that, instead of the squared error, we wanted to minimize the absolute error

  $\min_{a,b} \sum_{i=1}^{M} |a x_i + b - y_i|$

• What optimization procedure should we use?
L1 Regression
• Suppose that, instead of the squared error, we wanted to minimize the absolute error

• We can reformulate this problem to make it differentiable:

  $\min_{a,b,t} \sum_{i=1}^{M} t_i$
  subject to $-t_i \le a x_i + b - y_i \le t_i$ for all $i$

  (apply existing LP solvers!)
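A sketch of how the LP reformulation above could be handed to an off-the-shelf solver, assuming scipy.optimize.linprog; the data and the variable ordering $[a, b, t_1, \dots, t_M]$ are my own choices for illustration.

```python
import numpy as np
from scipy.optimize import linprog

# Toy data (made up for illustration)
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=30)
y = 2.0 * x + 1.0 + 0.1 * rng.standard_normal(30)
M = len(x)

# Variables: z = [a, b, t_1, ..., t_M]; minimize sum_i t_i
c = np.concatenate([np.zeros(2), np.ones(M)])

# Constraints:  a*x_i + b - y_i <= t_i   and   -(a*x_i + b - y_i) <= t_i
A_ub = np.zeros((2 * M, 2 + M))
A_ub[:M, 0] = x;  A_ub[:M, 1] = 1.0;  A_ub[:M, 2:] = -np.eye(M)
A_ub[M:, 0] = -x; A_ub[M:, 1] = -1.0; A_ub[M:, 2:] = -np.eye(M)
b_ub = np.concatenate([y, -y])

# a and b are free; the t_i are nonnegative
bounds = [(None, None), (None, None)] + [(0, None)] * M
res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
a, b = res.x[:2]
print(f"L1 fit: a={a:.3f}, b={b:.3f}")
```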
Sparse Least Squares Regression

• Sometimes we might prefer a solution vector that has a small number of nonzero entries – such vectors are called sparse

• This kind of preference (directly penalizing the number of nonzero entries) does not yield a convex optimization problem

• Instead, it can be shown that, under certain assumptions, the $\ell_1$-norm is a good surrogate

• Can incorporate this as either a constraint or a penalty
Sparse Least Squares Regression

  $\min_{w} \; \|Xw - y\|_2^2 + \lambda \|w\|_1$

• Called the LASSO (least absolute shrinkage and selection operator) optimization problem

• Here, $\lambda \ge 0$ is a constant that controls the trade-off between a solution that achieves a low squared error and one that minimizes the $\ell_1$-norm

• Which optimization procedure should we use to solve this problem?
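The slide leaves the choice of solver open; one common answer is proximal gradient descent (ISTA), since the $\ell_1$ term is non-differentiable but has a cheap proximal operator (soft-thresholding). A minimal numpy sketch under that assumption, with made-up data:

```python
import numpy as np

def lasso_ista(X, y, lam, iters=500):
    """Minimize ||Xw - y||_2^2 + lam * ||w||_1 by proximal gradient (ISTA)."""
    # Step size 1/L, where L is the Lipschitz constant of the smooth part's gradient
    L = 2 * np.linalg.norm(X, 2) ** 2
    w = np.zeros(X.shape[1])
    for _ in range(iters):
        grad = 2 * X.T @ (X @ w - y)                             # gradient of the squared error
        z = w - grad / L                                         # gradient step on the smooth part
        w = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)    # soft-thresholding (prox of the l1 term)
    return w

# Toy data (made up): only the first two features matter
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 10))
y = X[:, 0] - 2 * X[:, 1] + 0.05 * rng.standard_normal(100)
print(np.round(lasso_ista(X, y, lam=5.0), 3))
```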
Sparse Least Squares Regression

  $\min_{w} \; \|Xw - y\|_2^2$
  subject to $\|w\|_1 \le c$

• We could also add this penalty as a hard constraint, where $c \ge 0$ controls how large of an $\ell_1$-norm is allowed

• Which optimization procedure should we apply here?
Maximum Likelihood Estimation
• When fitting a statistical model to data, the principle of maximum likelihood estimation posits that the best fit model is the one that generates the data with highest probability

• Example: suppose that you roll a biased 6-sided die 100 times and observe a sequence of outcomes, e.g., $x^{(1)}, \dots, x^{(100)} \in \{1, \dots, 6\}$

• A biased die is described by 6 numbers $\theta_1, \dots, \theta_6 \ge 0$ such that $\theta_1 + \dots + \theta_6 = 1$
Maximum Likelihood Estimation
• Example: suppose that you roll a biased 6-sided die 100 times and observe a sequence of outcomes, e.g., $x^{(1)}, \dots, x^{(100)}$

• A biased die is described by 6 numbers $\theta_1, \dots, \theta_6 \ge 0$ such that $\theta_1 + \dots + \theta_6 = 1$

• Let $n_k$ be equal to the number of data observations that were equal to $k$

• The probability of seeing these observations is then $\prod_{k=1}^{6} \theta_k^{n_k}$
Maximum Likelihood Estimation

  $\max_{\theta} \; \prod_{k=1}^{6} \theta_k^{n_k}$ (equivalently, maximize the log-likelihood $\sum_{k=1}^{6} n_k \log \theta_k$)
  subject to $\theta_1 + \dots + \theta_6 = 1$, $\theta_k \ge 0$

Has a closed form solution ($\theta_k = n_k / 100$), but...

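An illustrative sketch (not from the slides) of the closed-form MLE for the biased die, with simulated rolls:

```python
import numpy as np

# Simulate 100 rolls of a made-up biased die (for illustration only)
rng = np.random.default_rng(0)
true_theta = np.array([0.3, 0.1, 0.1, 0.1, 0.1, 0.3])
rolls = rng.choice(6, size=100, p=true_theta)   # outcomes coded 0..5

# n_k = number of observations equal to k; the MLE is theta_k = n_k / 100,
# which maximizes sum_k n_k * log(theta_k) subject to the thetas summing to 1
counts = np.bincount(rolls, minlength=6)
theta_hat = counts / counts.sum()
print(theta_hat)
```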
Stochastic Gradient Descent
• These types of problems often can be written as minimizing a sum of a, perhaps large, number of terms:

  $\min_{w} \; f(w) = \sum_{m=1}^{M} f_m(w)$

• Approximate the gradient of the sum by sampling a few indices (as few as one) uniformly at random and averaging:

  $\nabla f(w) \approx \frac{M}{K} \sum_{k=1}^{K} \nabla f_{m_k}(w)$

  Each $m_k$ is sampled uniformly at random from $\{1, \dots, M\}$, so the approximation equals the true gradient in expectation (taken over the random samples)

• Stochastic gradient descent converges to the global optimum under certain assumptions on the step size
SGD for Least Squares

  $\min_{w} \; \sum_{i=1}^{M} (w^\top x_i - y_i)^2$

• Select an index $i$ uniformly at random

• Update $w^{(t+1)} = w^{(t)} - \gamma_t \cdot 2 (w^{(t)\top} x_i - y_i) \, x_i$
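A minimal sketch of the update above with made-up data; the decaying step size is one common choice (the slides only say "certain assumptions on the step size"):

```python
import numpy as np

# Toy data (made up for illustration)
rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 5))
w_true = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ w_true + 0.01 * rng.standard_normal(1000)

w = np.zeros(5)
for t in range(1, 20001):
    i = rng.integers(len(y))                          # pick one index uniformly at random
    gamma = 0.1 / np.sqrt(t)                          # decaying step size (one common choice)
    w = w - gamma * 2 * (w @ X[i] - y[i]) * X[i]      # gradient of the i-th term only
print(np.round(w, 3))
```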
Stochastic Gradient Descent

• Often, SGD is simply implemented as a round-robin procedure, i.e., instead of picking indices randomly, you just iterate through all of the indices $1, \dots, M$ in a cyclic fashion

• One pass from $1$ to $M$ is equivalent, in terms of computation time, to computing the entire gradient once

• What is the terminating condition for stochastic gradient descent?
Logistic Regression
Given $x_1, \dots, x_M \in \mathbb{R}^n$ and $y_1, \dots, y_M \in \{-1, +1\}$,

  $\min_{w} \; \sum_{m=1}^{M} \log\left(1 + \exp(-y_m \, w^\top x_m)\right)$

What optimization strategies can we apply?
Logistic Regression
Given $x_1, \dots, x_M \in \mathbb{R}^n$ and $y_1, \dots, y_M \in \{-1, +1\}$,

  $\min_{w} \; f(w) = \sum_{m=1}^{M} \log\left(1 + \exp(-y_m \, w^\top x_m)\right)$

Newton's Method:

  $w^{(t+1)} = w^{(t)} - \left[\nabla^2 f(w^{(t)})\right]^{-1} \nabla f(w^{(t)})$
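A minimal sketch of Newton's method on the logistic loss with labels in $\{-1, +1\}$ and made-up data; the small ridge added to the Hessian is purely for numerical stability and is my addition, not from the slides:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Toy data (made up): labels in {-1, +1}, noisy enough that the MLE is finite
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3))
w_true = np.array([2.0, -1.0, 0.5])
y = np.where(X @ w_true + rng.standard_normal(200) > 0, 1.0, -1.0)

w = np.zeros(3)
for _ in range(10):                              # Newton iterations
    s = sigmoid(-y * (X @ w))                    # sigma(-y_m w^T x_m)
    grad = -X.T @ (y * s)                        # gradient of sum_m log(1 + exp(-y_m w^T x_m))
    H = X.T @ (X * (s * (1 - s))[:, None])       # Hessian: sum_m s_m (1 - s_m) x_m x_m^T
    H += 1e-6 * np.eye(3)                        # tiny ridge for numerical stability (my addition)
    w = w - np.linalg.solve(H, grad)             # w <- w - H^{-1} grad
print(np.round(w, 3))
```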
Solving Linear Systems

• Solving linear systems, e.g., find a solution $x$ to $Ax = b$ or determine that there is no such $x$, can be cast as a convex optimization problem

• If $A$ is positive semidefinite, then we can write this as an unconstrained minimization problem

  $\min_{x} \; \frac{1}{2} x^\top A x - b^\top x$

What optimization strategies can we apply?
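One possible answer to the question is plain gradient descent on the quadratic (conjugate gradient is the classic refinement); a minimal sketch with a made-up positive definite matrix:

```python
import numpy as np

# Made-up system: A = B^T B + I is positive definite, so the quadratic is strictly convex
rng = np.random.default_rng(0)
B = rng.standard_normal((5, 5))
A = B.T @ B + np.eye(5)
b = rng.standard_normal(5)

# Gradient descent on f(x) = 0.5 x^T A x - b^T x; the minimizer satisfies Ax = b
x = np.zeros(5)
step = 1.0 / np.linalg.norm(A, 2)     # step size based on the largest eigenvalue of A
for _ in range(2000):
    x = x - step * (A @ x - b)        # gradient of f is Ax - b
print(np.allclose(A @ x, b, atol=1e-6))
```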
Solving Linear Systems

• Solving linear systems, e.g., find a solution $x$ to $Ax = b$ or determine that there is no such $x$, can be cast as a convex optimization problem

• For general $A$, we can write this as a constrained minimization problem, e.g., the minimum-norm formulation

  $\min_{x} \; \|x\|_2^2$
  subject to $Ax = b$

What optimization strategies can we apply?
Sparse Linear Systems

• Solving linear systems, e.g., find a solution $x$ to $Ax = b$ or determine that there is no such $x$, can be cast as a convex optimization problem

• If we are interested in sparse solutions...

  $\min_{x} \; \|x\|_1$
  subject to $Ax = b$

Called the basis pursuit problem
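After splitting $x$ into its positive and negative parts, basis pursuit is itself an LP; a minimal sketch assuming scipy.optimize.linprog and a made-up underdetermined system:

```python
import numpy as np
from scipy.optimize import linprog

# Made-up underdetermined system with a sparse solution
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 50))
x_true = np.zeros(50)
x_true[[3, 17, 31]] = [1.0, -2.0, 0.5]
b = A @ x_true

# Basis pursuit: min ||x||_1 subject to Ax = b.
# Split x = u - v with u, v >= 0, so ||x||_1 = sum(u) + sum(v) at the optimum.
n = A.shape[1]
c = np.ones(2 * n)
A_eq = np.hstack([A, -A])
res = linprog(c, A_eq=A_eq, b_eq=b, bounds=[(0, None)] * (2 * n))
x_hat = res.x[:n] - res.x[n:]
print(np.flatnonzero(np.abs(x_hat) > 1e-6))   # indices of the recovered nonzeros
```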
