Theory Note 1
Disclaimer: These notes aggregate content from several texts and have not been subjected to the usual
scrutiny deserved by formal publications. If you find errors, please bring to the notice of the Instructor.
In this lecture, we discuss what optimization essentially is and where it is used, and we look at a very
basic optimization technique called linear programming.
Broadly, optimization problems can be classified into two classes: discrete optimization problems and continuous optimization problems.
The knapsack problem is a classic discrete optimization problem: we are given a collection of objects with
specified weights and a knapsack (a backpack, for instance) that can carry at most some fixed weight.
We want to fill the knapsack up to the highest permissible weight. There is no partial inclusion
of an item in the knapsack: an item is either in the knapsack or not. No polynomial-time algorithm is known for the most
general knapsack problem. One way to solve it is to brute-force all possible combinations of items and
pick the best feasible one, but this is not efficient, since there are 2^n subsets of n items.
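A minimal brute-force sketch of this enumeration in Python (the item weights and capacity are made up for illustration):

```python
from itertools import combinations

def knapsack_brute_force(weights, capacity):
    """Enumerate every subset of items and keep the heaviest one that fits.

    There are 2^n subsets, so this is exponential-time and only practical
    for very small n.
    """
    best_weight, best_subset = 0, ()
    n = len(weights)
    for r in range(n + 1):
        for subset in combinations(range(n), r):
            total = sum(weights[i] for i in subset)
            if total <= capacity and total > best_weight:
                best_weight, best_subset = total, subset
    return best_weight, best_subset

# Toy instance: item weights 4, 5, 6, 3 and capacity 10.
print(knapsack_brute_force([4, 5, 6, 3], capacity=10))  # (10, (0, 2))
```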
In continuous optimization problems, even though the solution space is infinite, it is not very difficult to find
the optimal solution. We will be talking about continuous optimization problems.
There is a significant trend in AI of formulating problems in terms of optimization problems.
There are two components of an optimization problem: the objective function, which we want to minimize
(or maximize; maximizing f is the same as minimizing −f), and the constraint set. Let f(x) be
the objective function and C be the constraint set. Then, the optimization problem is written as
$$\min_{x \in C} f(x).$$
Example. Minimize (x − 2)^2 subject to the constraint that x ∈ [0, 1] ∪ [4, 7].
Here, the objective function is f(x) = (x − 2)^2 and the constraint set is C = [0, 1] ∪ [4, 7]. This is a function of one
variable, and we can easily see from the plot in Figure 1.1a that x* = 1 is optimal: since 2 ∉ C, the minimizer is the feasible point closest to 2, and of the candidates x = 1 (distance 1) and x = 4 (distance 2), the former gives the smaller value f(1) = 1.
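A quick numerical sanity check of this example (a sketch using a coarse grid over the two feasible intervals):

```python
import numpy as np

# Grid-search each feasible interval separately and report the best point.
xs = np.concatenate([np.linspace(0, 1, 1001), np.linspace(4, 7, 3001)])
fs = (xs - 2) ** 2
print(xs[np.argmin(fs)], fs.min())  # prints 1.0 and 1.0, agreeing with x* = 1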
Example (Geometry). Suppose we have a map of a country with cities represented by the points y1 , . . . , yn
in two dimensions. We want to set up a supply chain that delivers to all of these cities, and we want to build a
warehouse such that the sum of the distances from it to all the cities is minimized (assuming goods are
transported along straight lines, i.e., Euclidean distances).
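Written as an optimization problem over the warehouse location x ∈ R^2, the objective is the sum of Euclidean distances:
$$\min_{x \in \mathbb{R}^2} \; \sum_{i=1}^{n} \lVert x - y_i \rVert_2 .$$
A minimal numerical sketch (the city coordinates are made up, and using a generic solver such as scipy.optimize.minimize is an assumption of convenience, not the method of these notes):

```python
import numpy as np
from scipy.optimize import minimize

# Made-up city coordinates y_1, ..., y_n in the plane.
cities = np.array([[0.0, 0.0], [4.0, 0.0], [2.0, 3.0], [5.0, 5.0]])

def total_distance(x):
    # Sum of Euclidean distances from a candidate warehouse x to all cities.
    return np.linalg.norm(cities - x, axis=1).sum()

result = minimize(total_distance, x0=cities.mean(axis=0))
print(result.x, result.fun)  # warehouse location and minimized total distance
```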
Figure 1.1: (a) A plot of f(x) = (x − 2)^2, with the feasible intervals [0, 1] and [4, 7] on the x-axis. (b) An example of curve-fitting.
[Figure: the cities y1 , . . . , yn and the warehouse location x in the plane.]
(In formulations of this kind arising in image processing, such as image denoising, λ and k are hyperparameters; λ penalizes a large difference in intensity between adjacent pixels.)
Example (Machine learning). Suppose we have data points (xi , yi ) for i ∈ [n], and we are trying to fit a curve
through these points. Suppose we hypothesize the curve to be a polynomial hθ (x) = w0 + w1 x + w2 x^2,
where θ = (w0 , w1 , w2 ) is the vector of parameters. For each point xi , we want the predicted value
hθ (xi ) to be close to yi . Let ℓ(u, v) = (u − v)^2 be a (very simple) loss function. Our objective is to minimize
the sum of the loss over all the observed points.
The optimization problem is
$$\min_{\theta \in \mathbb{R}^3} \; \sum_{i=1}^{n} \ell\big(h_\theta(x_i), y_i\big).$$
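For this particular squared loss, the problem is ordinary least squares and has a closed-form solution; a minimal sketch with synthetic (made-up) data:

```python
import numpy as np

# Synthetic data roughly following the quadratic 1 - 0.5*x + 2*x^2 plus noise.
rng = np.random.default_rng(0)
x = np.linspace(-2, 2, 30)
y = 1.0 - 0.5 * x + 2.0 * x**2 + rng.normal(scale=0.3, size=x.shape)

# Design matrix for h_theta(x) = w0 + w1*x + w2*x^2.
X = np.column_stack([np.ones_like(x), x, x**2])

# Least squares minimizes sum_i (h_theta(x_i) - y_i)^2 over theta = (w0, w1, w2).
theta, *_ = np.linalg.lstsq(X, y, rcond=None)
print(theta)  # close to [1.0, -0.5, 2.0]
```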
Linear programming is another type of optimization, in which we have to find a value of the vector x that
maximizes a linear function while satisfying some linear constraints. Let x be a vector of size n, and let
A (an m × n matrix), c (a vector of size n), and b (a vector of size m) be given. Then the problem is:
$$\max_{x} \; c^\top x \quad \text{subject to} \quad Ax \le b \ \text{ and } \ x \ge 0.$$
The above form is the standard form of a linear program. If x = (x1 , x2 , x3 , . . .) and c = (c1 , c2 , c3 , . . .),
then the objective function to be maximized is
$$c^\top x = c_1 x_1 + c_2 x_2 + c_3 x_3 + \cdots$$
Example (Political campaign). Suppose a political party wants to win a majority in each of three classes of the population by spending money on four issues A, B, C, and D. The table below gives the number of votes gained (possibly negative) in each class per unit of money spent on each issue, together with the population of each class and the number of votes needed for a majority.

Issue          Class 1     Class 2     Class 3
A                  −2           5           3
B                   8           2          −5
C                   0           0          10
D                  10           0           2
Population    100 000     200 000      50 000
Majority       50 000     100 000      25 000
The aim of the political party is to minimize the total amount of money it invests while still obtaining the
required majority in each class. Let x1 , x2 , x3 , x4 be the amounts of money the party invests in issues A,
B, C, and D, respectively. Then we have the following optimization problem:
$$\min_{x_1, x_2, x_3, x_4} \; x_1 + x_2 + x_3 + x_4$$
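Reading the columns of the table, with each entry interpreted as votes gained per unit of money, the majority requirements become the linear constraints
$$-2x_1 + 8x_2 + 10x_4 \ge 50\,000, \qquad 5x_1 + 2x_2 \ge 100\,000, \qquad 3x_1 - 5x_2 + 10x_3 + 2x_4 \ge 25\,000,$$
together with x1 , x2 , x3 , x4 ≥ 0.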
Let x ∈ R^n be the vector containing the variables to optimize and c ∈ R^n be the vector of constants:
$$x = \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix}, \qquad c = \begin{pmatrix} c_1 \\ \vdots \\ c_n \end{pmatrix}.$$
Then, we can write the standard form of a linear program as $\max_{x} c^\top x$ subject to the constraints
$$A_{m \times n}\, x_{n \times 1} \le b_{m \times 1} = \begin{pmatrix} b_1 \\ \vdots \\ b_m \end{pmatrix} \quad \text{and} \quad x \ge 0.$$
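As an illustration, the political-campaign LP above can be solved numerically; this is a sketch (scipy.optimize.linprog is an assumption of convenience), and since linprog minimizes with ≤ constraints, the ≥ majority constraints are negated:

```python
import numpy as np
from scipy.optimize import linprog

# Objective: minimize x1 + x2 + x3 + x4 (total money invested).
c = np.ones(4)

# Majority constraints G @ x >= h from the table, one row per class.
G = np.array([[-2.0,  8.0,  0.0, 10.0],   # Class 1
              [ 5.0,  2.0,  0.0,  0.0],   # Class 2
              [ 3.0, -5.0, 10.0,  2.0]])  # Class 3
h = np.array([50_000.0, 100_000.0, 25_000.0])

# linprog expects A_ub @ x <= b_ub, so multiply the >= constraints by -1.
res = linprog(c, A_ub=-G, b_ub=-h, bounds=(0, None))
print(res.x, res.fun)  # optimal spending per issue and minimum total money
```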
The linear program above is commonly referred to as the primal problem, and it is accompanied by a
corresponding dual problem:
$$\text{(Primal)} \quad \max_{x} \; c^\top x \;\; \text{s.t.} \;\; Ax \le b, \; x \ge 0; \qquad\qquad \text{(Dual)} \quad \min_{y} \; b^\top y \;\; \text{s.t.} \;\; A^\top y \ge c, \; y \ge 0.$$
We will look at two famous and ubiquitous duality principles, weak duality and strong duality, and we
provide a proof of the first in this scribe.
Theorem 1.1 (Weak Duality Principle) Let x and y represent feasible solutions, i.e., solutions that
satisfy all the constraints, for the primal and dual problems, respectively. Then,
b⊤ y ≥ c⊤ x.
Proof: Since x is a feasible solution of the primal problem,
Ax ≤ b,
and hence, taking transposes,
x⊤A⊤ ≤ b⊤.
Multiplying on the right by y preserves the inequality because y ≥ 0, so
x⊤A⊤y ≤ b⊤y.
Similarly, since y is a feasible solution of the dual problem, A⊤y ≥ c; multiplying on the left by x⊤ preserves the inequality because x ≥ 0, so
x⊤A⊤y ≥ x⊤c.
Combining the two inequalities gives
b⊤y ≥ x⊤A⊤y ≥ x⊤c,
which can also be written as b⊤y ≥ c⊤x. Thus, the weak duality principle provides a relation between feasible
solutions of the primal and dual problems.
Theorem 1.2 (Strong Duality Principle) For any optimal solution x∗ of the primal problem
and any optimal solution y∗ of the dual problem, the two optimal values are equal:
b⊤y∗ = c⊤x∗.
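As a numerical illustration of the two duality principles (a sketch with a small made-up LP; scipy.optimize.linprog is an assumption of convenience), one can solve a primal and its dual and compare the optimal values:

```python
import numpy as np
from scipy.optimize import linprog

# A small made-up LP in standard form: max c^T x  s.t.  Ax <= b, x >= 0.
A = np.array([[1.0, 2.0],
              [3.0, 1.0]])
b = np.array([4.0, 6.0])
c = np.array([3.0, 2.0])

# Primal: linprog minimizes, so negate c to maximize c^T x.
primal = linprog(-c, A_ub=A, b_ub=b, bounds=(0, None))

# Dual: min b^T y  s.t.  A^T y >= c, y >= 0, i.e. (-A^T) y <= -c.
dual = linprog(b, A_ub=-A.T, b_ub=-c, bounds=(0, None))

print(-primal.fun, dual.fun)  # both equal 7.2: c^T x* = b^T y* (strong duality)
```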