Optim Problems From AI
B.T. Kien*
1 Introduction
Optimization problems are fundamental in artificial intelligence (AI) because they help improve
performance in various tasks, from training machine learning models to planning and decision-
making. Based on responses from ChatGPT, we list some key optimization problems in AI and the contexts in which they arise.
2 List of problems
2.1 Training Neural Networks (Deep Learning)
In deep learning, the goal is to minimize a loss function that quantifies how far a neural network’s
predictions are from the true values. This problem is often solved using Stochastic Gradient
Descent (SGD) and its variants.
Optimization Problem:
min_θ E_{(X,Y)} [L(f(θ, X), Y)], (2.1)
where
θ are the parameters of the neural network (weights, biases),
f(θ, X) is the output of the neural network for input X,
Y is the true label,
L is the loss function, such as mean squared error (MSE) or cross-entropy.
Challenges:
-Non-convexity: Neural networks often have non-convex loss landscapes, which means there
are many local minima.
-High dimensionality: The number of parameters (weights and biases) can be in the millions
for modern networks.
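To make problem (2.1) concrete, the following Python sketch minimizes a mean-squared-error loss with mini-batch SGD; the linear model f(θ, X) = Xθ, the synthetic data, and the step size are illustrative assumptions rather than part of the problem statement.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression data (illustrative assumption): Y = X @ theta_true + noise.
N, d = 1000, 5
X = rng.normal(size=(N, d))
theta_true = rng.normal(size=d)
Y = X @ theta_true + 0.1 * rng.normal(size=N)

def mse_loss_and_grad(theta, Xb, Yb):
    """MSE loss L(f(theta, X), Y) and its gradient for f(theta, X) = X @ theta."""
    residual = Xb @ theta - Yb
    loss = np.mean(residual ** 2)
    grad = 2.0 * Xb.T @ residual / len(Yb)
    return loss, grad

theta = np.zeros(d)
lr, batch_size = 0.05, 32
for step in range(2000):
    idx = rng.integers(0, N, size=batch_size)   # sample a mini-batch
    loss, grad = mse_loss_and_grad(theta, X[idx], Y[idx])
    theta -= lr * grad                          # SGD update: theta <- theta - lr * grad

print("final parameter error:", np.linalg.norm(theta - theta_true))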
2.2 Reinforcement Learning
In reinforcement learning (RL), an agent interacts with an environment and learns a policy that maximizes the expected cumulative discounted reward.
Optimization Problem:
max_π Eπ [ Σ_{t=0}^{T} γ^t rt(st, at) ], (2.2)
where
π is the policy,
st, at are the state and action at time t,
rt(st, at) is the reward received at time t,
γ ∈ [0, 1] is a discount factor that weighs future rewards.
Challenges:
-Exploration vs. exploitation: Balancing the exploration of new strategies against the exploitation of known good strategies.
- High-dimensional state/action spaces: Environments like robotic control or games (e.g.,
Go) have massive state/action spaces.
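As an illustration of (2.2), the sketch below runs tabular Q-learning, one standard way to approximate the optimal policy, on a toy chain MDP; the environment, learning rate, and epsilon-greedy schedule are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

# Toy MDP (illustrative assumption): 5 states in a chain, 2 actions (left/right);
# reaching the last state gives reward 1 and ends the episode.
n_states, n_actions, gamma = 5, 2, 0.9

def step(s, a):
    s_next = max(0, s - 1) if a == 0 else min(n_states - 1, s + 1)
    reward = 1.0 if s_next == n_states - 1 else 0.0
    done = s_next == n_states - 1
    return s_next, reward, done

Q = np.zeros((n_states, n_actions))
alpha, epsilon = 0.1, 0.1
for episode in range(500):
    s = 0
    for t in range(50):
        # epsilon-greedy action: explore a random action vs. exploit the current best
        a = rng.integers(n_actions) if rng.random() < epsilon else int(np.argmax(Q[s]))
        s_next, r, done = step(s, a)
        # Q-learning update toward r + gamma * max_a' Q(s', a')
        Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) * (not done) - Q[s, a])
        s = s_next
        if done:
            break

print("greedy policy:", np.argmax(Q, axis=1))  # expect action 1 (right) in every non-terminal state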
2.3 Support Vector Machines (SVMs)
In a support vector machine, the goal is to find the separating hyperplane with maximum margin between two classes.
Optimization Problem:
min_{W,b} (1/2)∥W∥² s.t. yi(W · Xi + b) ≥ 1, i = 1, 2, ..., n, (2.3)
where
W is the weight vector (defining the hyperplane),
b is the bias term,
Xi is the i-th training example and yi ∈ {−1, 1} is its label.
Challenges:
-Non-linearity: When data is not linearly separable, kernel functions (e.g., RBF) are used,
leading to a more complex optimization.
-Scalability: For very large datasets, solving the quadratic programming problem can become
computationally expensive.
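The sketch below solves a soft-margin relaxation of (2.3) by subgradient descent on the hinge loss rather than by quadratic programming; the two-blob synthetic data, the penalty C, and the step size are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

# Toy linearly separable data (illustrative assumption): two Gaussian blobs, labels in {-1, +1}.
n = 200
X = np.vstack([rng.normal(loc=-2.0, size=(n // 2, 2)), rng.normal(loc=+2.0, size=(n // 2, 2))])
y = np.hstack([-np.ones(n // 2), np.ones(n // 2)])

# Minimize (1/2)||W||^2 + C * sum_i max(0, 1 - y_i (W . X_i + b)) by subgradient descent.
W, b = np.zeros(2), 0.0
C, lr = 1.0, 0.01
for epoch in range(200):
    margins = y * (X @ W + b)
    violated = margins < 1.0                   # points inside the margin or misclassified
    grad_W = W - C * (y[violated, None] * X[violated]).sum(axis=0)
    grad_b = -C * y[violated].sum()
    W -= lr * grad_W
    b -= lr * grad_b

print("margin violations:", int((y * (X @ W + b) < 1.0).sum()))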
2.4 Generative Adversarial Networks (GANs)
A generative adversarial network trains a generator and a discriminator against each other: the generator tries to produce samples that the discriminator cannot distinguish from real data.
Optimization Problem:
min_G max_D E_{x∼pdata}[log D(x)] + E_{z∼pz}[log(1 − D(G(z)))], (2.4)
where
D(x) is the discriminator’s output for real data,
G(z) is the generator’s output for random noise z,
pdata is the real data distribution,
pz is the noise distribution.
Challenges:
-Mode collapse: The generator may collapse to generating only a few types of data points
(or even a single point).
- Training instability: GAN training is often unstable and requires careful balancing between
the generator and discriminator.
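The following PyTorch sketch illustrates the alternating updates behind (2.4) on a one-dimensional toy distribution; the tiny architectures and hyperparameters are illustrative assumptions, and the generator uses the common non-saturating variant (maximizing log D(G(z))) instead of minimizing log(1 − D(G(z))) directly.

import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy target distribution (illustrative assumption): real data x ~ N(3, 0.5).
def sample_real(batch):
    return 3.0 + 0.5 * torch.randn(batch, 1)

G = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1))                # generator G(z)
D = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1), nn.Sigmoid())  # discriminator D(x)
opt_G = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_D = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCELoss()
batch = 64

for step in range(2000):
    # Discriminator step: minimizing BCE here maximizes log D(x_real) + log(1 - D(G(z))).
    x_real = sample_real(batch)
    x_fake = G(torch.randn(batch, 1)).detach()
    loss_D = bce(D(x_real), torch.ones(batch, 1)) + bce(D(x_fake), torch.zeros(batch, 1))
    opt_D.zero_grad()
    loss_D.backward()
    opt_D.step()

    # Generator step (non-saturating variant): push D(G(z)) toward 1.
    x_fake = G(torch.randn(batch, 1))
    loss_G = bce(D(x_fake), torch.ones(batch, 1))
    opt_G.zero_grad()
    loss_G.backward()
    opt_G.step()

print("mean of generated samples:", G(torch.randn(1000, 1)).mean().item())  # should approach 3.0

The two optimizers are kept separate precisely because of the balancing issue noted above: each step updates only one player while holding the other fixed.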
2.5 Planning and Control
In planning and optimal control (e.g., robotics or autonomous driving), the goal is to choose a sequence of actions that minimizes a cost while satisfying the constraints of the system.
Optimization Problem:
min_x f(x) s.t. gi(x) ≤ 0, i = 1, ..., m, (2.5)
where
x represents a sequence of actions or control inputs,
f (x) is a cost function, such as minimizing energy consumption or time,
gi (x) are constraints representing system dynamics, safety, or feasibility.
Challenges:
- Nonlinear constraints: Often, the constraints are nonlinear, leading to a complex optimiza-
tion problem.
-Real-time constraints: Optimization has to be solved quickly in real-time applications like
autonomous driving or robotic control.
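A small SciPy sketch of a problem in the form of (2.5): the control sequence x minimizes a quadratic cost subject to a goal constraint and actuator bounds; the specific cost, constraint, and bounds are toy assumptions.

import numpy as np
from scipy.optimize import minimize

# Toy instance of (2.5) (illustrative assumption): choose 5 control inputs x that
# minimize total "energy" ||x||^2 while reaching a target total displacement.
target = 1.0

def cost(x):
    return np.sum(x ** 2)                 # f(x): energy of the control sequence

constraints = [
    {"type": "eq", "fun": lambda x: np.sum(x) - target},  # goal/dynamics constraint g(x) = 0
]
bounds = [(-0.3, 0.3)] * 5                # actuator limits on each control input

x0 = np.zeros(5)
res = minimize(cost, x0, method="SLSQP", bounds=bounds, constraints=constraints)
print(res.x, res.fun)                     # expect the effort spread evenly: x_t = 0.2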
2.6 Natural Language Processing (NLP)
In natural language processing, models such as translators or chatbots are trained by minimizing the average loss over a dataset of input-output pairs.
Optimization Problem:
min_θ (1/N) Σ_{i=1}^{N} L(yi, f(xi, θ)), (2.6)
where
xi is the input text,
yi is the target output (e.g., the translated sentence),
f (xi , θ) is the model (e.g., a neural network like BERT or GPT),
L is the loss function (e.g., cross-entropy loss for classification tasks).
Challenges:
-Sequence-to-sequence modeling: Optimizing models that generate sequences (e.g., transla-
tions, dialogue) is complex because of the dependencies between words in the sequence.
-Large-scale datasets: NLP models often require huge amounts of data and computational
power to optimize.
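The sketch below minimizes the empirical loss (2.6) with cross-entropy for a tiny bag-of-words sentiment classifier in PyTorch; the vocabulary, data, and linear model are illustrative assumptions and far simpler than models like BERT or GPT.

import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy sentiment data (illustrative assumption): tiny vocabulary, bag-of-words features.
vocab = {"good": 0, "great": 1, "bad": 2, "awful": 3, "movie": 4}
texts = [("good movie", 1), ("great great movie", 1), ("bad movie", 0), ("awful bad movie", 0)]

def featurize(text):
    v = torch.zeros(len(vocab))
    for w in text.split():
        v[vocab[w]] += 1.0
    return v

X = torch.stack([featurize(t) for t, _ in texts])
y = torch.tensor([label for _, label in texts])

model = nn.Linear(len(vocab), 2)                 # f(x_i, theta): logits over 2 classes
loss_fn = nn.CrossEntropyLoss()                  # L in (2.6)
opt = torch.optim.SGD(model.parameters(), lr=0.5)

for epoch in range(200):
    loss = loss_fn(model(X), y)                  # (1/N) sum_i L(y_i, f(x_i, theta))
    opt.zero_grad()
    loss.backward()
    opt.step()

print("predictions:", model(X).argmax(dim=1).tolist())  # expect [1, 1, 0, 0]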
2.7 Hyperparameter Optimization
Hyperparameter optimization searches for the hyperparameter configuration that minimizes the validation error of the trained model.
Optimization Problem:
min_{λ∈Λ} Lval(λ), (2.7)
where
λ represents the hyperparameters (e.g., learning rate, number of layers),
Λ is the hyperparameter search space,
Lval(λ) is the validation error (the error on a held-out dataset) of the model trained with hyperparameters λ.
Method:
-Grid search: Evaluate all combinations of hyperparameters in a predefined grid.
-Random search: Randomly sample hyperparameters from the search space.
-Bayesian optimization: Build a probabilistic model of the objective function and use it to
select the most promising hyperparameters to evaluate next.
Challenges:
-High-dimensional search space: The number of hyperparameters can be large, leading to a
high-dimensional optimization problem.
-Computational cost: Each evaluation of the objective function (e.g., training a model) can
be computationally expensive.
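The sketch below performs random search over a two-dimensional hyperparameter space; the function validation_error is a cheap synthetic stand-in for the expensive train-and-validate step and is purely an illustrative assumption.

import numpy as np

rng = np.random.default_rng(0)

def validation_error(lr, n_layers):
    """Stand-in for training a model with these hyperparameters and measuring held-out
    error; in practice this is the expensive step (synthetic function, for illustration only)."""
    return (np.log10(lr) + 2.5) ** 2 + 0.1 * (n_layers - 3) ** 2 + 0.01 * rng.normal()

# Random search over Lambda: lr in [1e-4, 1e-1] (log-uniform), number of layers in {1, ..., 6}.
best = None
for trial in range(50):
    lr = 10 ** rng.uniform(-4, -1)
    n_layers = rng.integers(1, 7)
    err = validation_error(lr, n_layers)
    if best is None or err < best[0]:
        best = (err, lr, n_layers)

print("best validation error %.4f at lr=%.4g, layers=%d" % best)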
2.8 Bayesian Optimization
Bayesian optimization is designed for objectives that are expensive to evaluate, such as the validation error in hyperparameter tuning.
Optimization Problem:
min_x f(x), (2.8)
where
f (x) is expensive to evaluate (e.g., training a neural network),
A probabilistic model (e.g., a Gaussian process) is built to approximate f(x), and the optimization algorithm iteratively updates this model to choose the next point to evaluate and find the optimal x.
Challenges:
-Exploration vs. exploitation: Balancing exploration of unknown regions of the search space
with exploitation of regions that seem promising.
-Scalability: Bayesian optimization typically does not scale well to high-dimensional spaces.
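The sketch below runs a simple Bayesian optimization loop with a Gaussian-process surrogate (scikit-learn) and a lower-confidence-bound acquisition rule on a one-dimensional toy objective; the objective, kernel, and acquisition choice are illustrative assumptions.

import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

rng = np.random.default_rng(0)

def f(x):
    """Stand-in for an expensive objective, e.g. validation error as a function of one
    hyperparameter (illustrative assumption)."""
    return np.sin(3 * x) + 0.3 * x ** 2

# Candidate grid over the search space and a few initial evaluations.
candidates = np.linspace(-3, 3, 300).reshape(-1, 1)
X_obs = rng.uniform(-3, 3, size=(3, 1))
y_obs = f(X_obs).ravel()

gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)
for it in range(15):
    gp.fit(X_obs, y_obs)                               # update the surrogate model of f
    mu, sigma = gp.predict(candidates, return_std=True)
    # Lower-confidence-bound acquisition: trade off exploitation (low mu) and exploration (high sigma).
    acq = mu - 2.0 * sigma
    x_next = candidates[np.argmin(acq)].reshape(1, 1)
    X_obs = np.vstack([X_obs, x_next])
    y_obs = np.append(y_obs, f(x_next).ravel())

best = np.argmin(y_obs)
print("best x = %.3f, f(x) = %.3f" % (X_obs[best, 0], y_obs[best]))

Each iteration refits the surrogate and queries the true objective only once, which is the point of the method when f is costly.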
3 Conclusion
Optimization is at the heart of many AI problems, from training machine learning models to
solving real-time control problems in robotics. Techniques like stochastic gradient descent, rein-
forcement learning, and Bayesian optimization play a critical role in solving these optimization
problems. However, the challenges of non-convexity, high dimensionality, and computational cost make optimization in AI a complex and fascinating field of study.