JDSE18 M.Outahar
1 Introduction
In the last few years, neuroevolution [1] has gained interest in the research com-
munity. It has been shown to outperform reinforcement learning algorithms in
situations where the search space is non-convex and noisy or where the gradient
is not available [2]. Neuroevolution refers to the optimization of neural networks
with evolutionary algorithms, an approach that is highly parallelizable and
scalable [3]. This document presents a method to automatically tune the PID
controller of a car-like mobile robot using neuroevolution. The CMA-ES algorithm
(Covariance Matrix Adaptation Evolution Strategy, sometimes abbreviated CMA)
was chosen to optimize the neural network. Evolution strategies are a family of
algorithms loosely based on biological evolution, hence the name. Many
evolutionary algorithms exist, and they all share the same basic steps of
population generation, evaluation, selection and reproduction. CMA-ES has
outperformed many algorithms on black-box optimization problems [4], which is
why it has been used to tune PID controllers in several works with promising
results [5, 6].
As seen in figure 1, the system is composed of a robot controlled by a PID
controller. The state of the robot is estimated by an extended Kalman filter
(EKF), which provides the state x̂ and the corresponding covariance matrix P.
The core idea of this document is to use the covariance matrix, together with
the corresponding error, as inputs to a neural network that outputs the
parameters of the controller in real time. Both CMA-ES blocks are used to define
and optimize the neural network in order to adapt the behavior of the robot to
the level of uncertainty in the measurements.
The goal is to find the optimal parameters KP , KI and KD to control the robot
while taking into account the error and the covariance matrix of the EKF. A
neural network is used because it offers both adaptability and efficiency.
2.1 PID controller
PID controllers are widely used in industry. This is due to their reliability
and simplicity. PID control has shown good performance in multiple cases [7].
The general formula for the PID controller is as follows:
$C(t) = K_P\, e(t) + K_I \int_0^t e(\tau)\, d\tau + K_D \frac{d}{dt} e(t)$  (1)
with $e(t) = \text{actual}(t) - \text{target}(t)$.
$K_P$, $K_I$ and $K_D$ are the proportional, integral and derivative gains respectively.
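In discrete time, equation (1) is commonly implemented by accumulating the error for the integral term and differencing consecutive errors for the derivative term. The following is a minimal sketch (the class name, gains and time step are illustrative, not from the original work):

```python
class PID:
    """Minimal discrete-time PID controller (illustrative sketch)."""

    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = None

    def update(self, actual, target):
        error = actual - target            # e(t) = actual(t) - target(t)
        self.integral += error * self.dt   # rectangle approximation of the integral
        # finite-difference approximation of the derivative (zero on first call)
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative
```

A call such as `PID(1.0, 0.0, 0.0, 0.1).update(2.0, 1.0)` returns the pure proportional response to an error of 1.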
Even though the controller is easy to implement, tuning its parameters is not
a simple task and is a large area of research [8]. The proportional, integral
and derivative actions have different and sometimes conflicting effects. For
example, the proportional term decreases the rise time while the derivative
term increases it, yet both are essential for the stability of complex systems.
This is why finding optimal gains is difficult.
2.2 Neural network
Neural networks are highly connected systems that are used to model complex,
non-linear functions. In a simple representation of a neural network, the
outputs of each layer are multiplied by the weights, summed together with the
biases, and passed through the activation functions. The activation functions
are what makes the system capable of modeling non-linear behavior. Graphically,
the network can be represented as layers of interconnected neurons.
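The layer-by-layer computation described above can be sketched as a plain forward pass. The layer sizes, random weights and tanh activation below are illustrative choices, not the architecture used in the original work:

```python
import numpy as np

def forward(x, weights, biases):
    """Forward pass: each layer multiplies by the weights, adds the
    biases, and applies a non-linear activation (tanh here)."""
    for w, b in zip(weights, biases):
        x = np.tanh(w @ x + b)
    return x

# Illustrative 2-layer network: 3 inputs -> 4 hidden units -> 3 outputs
rng = np.random.default_rng(0)
weights = [rng.normal(size=(4, 3)), rng.normal(size=(3, 4))]
biases = [np.zeros(4), np.zeros(3)]
gains = forward(np.array([0.1, 0.2, 0.3]), weights, biases)
```

With a tanh output layer, each output is bounded in (-1, 1), so in practice the raw outputs would be rescaled to a useful gain range.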
A large part of the progress in this area is due to the backpropagation
algorithm, which allows a neural network to learn patterns and desired
behaviors. However, backpropagation uses the gradient to optimize the neural
network. In our case, the gradient is not available, therefore the
backpropagation algorithm cannot be used.
3 Neuroevolution
A neural network is used to find the optimal parameters to control the robot
efficiently, even in the presence of uncertainty. In traditional neural
networks, the backpropagation algorithm is used to update the weights and
biases; here, an evolutionary algorithm is used instead. This choice was made
because our problem requires exploration and because neuroevolution is a
gradient-free method whose parallelism can reduce wall-clock training time by
orders of magnitude [2].
3.1 CMA-ES
CMA-ES is an evolutionary algorithm [9]. It was chosen because it has
outperformed most black-box optimization algorithms. The algorithm starts by
generating a population of candidates, which are evaluated and ranked by
fitness. A percentage of the top-performing candidates is selected to
regenerate the new population. The new population is evaluated in turn, and
the cycle continues until a termination condition is met, typically based on
the number of generations or on the resemblance between parents and offspring.
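The generate, evaluate, select, regenerate cycle described above can be sketched with a simplified (mu, lambda) evolution strategy. This sketch deliberately omits the covariance matrix adaptation that gives CMA-ES its name; Hansen's reference implementation [10] provides the full algorithm. All names and hyperparameters here are illustrative:

```python
import numpy as np

def simple_es(objective, x0, sigma=0.5, pop_size=20, n_parents=5,
              generations=50, seed=0):
    """Simplified (mu, lambda) evolution strategy illustrating the
    generate -> evaluate -> select -> regenerate cycle (no covariance
    adaptation, unlike full CMA-ES)."""
    rng = np.random.default_rng(seed)
    mean = np.asarray(x0, dtype=float)
    for _ in range(generations):
        # generate a population of candidates around the current mean
        pop = mean + sigma * rng.normal(size=(pop_size, mean.size))
        # evaluate the candidates and rank them (lower objective = fitter)
        fitness = np.array([objective(c) for c in pop])
        parents = pop[np.argsort(fitness)[:n_parents]]
        # regenerate: new mean from the top-performing candidates
        mean = parents.mean(axis=0)
        sigma *= 0.95  # shrink the search step over the generations
    return mean

# Toy objective with its minimum at (1, 2)
best = simple_es(lambda x: (x[0] - 1.0) ** 2 + (x[1] - 2.0) ** 2,
                 x0=[0.0, 0.0])
```

On this toy quadratic the returned mean lands close to the optimum; full CMA-ES additionally learns the shape of the search distribution, which matters on ill-conditioned problems.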
The CMA-ES algorithm takes an objective function as an input and produces the
neural network parameters as outputs. The objective function is critical to the
performance of the optimization. In our case it is set to take into
consideration the absolute error between the noise-free signals and the
reference. In other words, CMA-ES tweaks the neural network parameters in
order to minimize the influence of the noise on the system. This forces the
neural network to learn to control the system based on the level of noise
(the EKF's covariance matrix).
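The objective seen by CMA-ES could then look like the following sketch: a candidate parameter vector is loaded, the closed loop is simulated, and the accumulated absolute tracking error is returned for minimization. The first-order plant, the step reference, and the direct encoding of the gains in the parameter vector are toy stand-ins for the robot simulation and the neural network of the original work:

```python
def fitness(params, n_steps=200, dt=0.05):
    """Candidate fitness: run the closed loop and return the accumulated
    absolute tracking error (to be minimized by CMA-ES).  Toy stand-in:
    params directly encode (KP, KI, KD) instead of network weights."""
    kp, ki, kd = params
    state, integral, prev_err = 0.0, 0.0, 0.0
    reference = 1.0                       # step reference to track
    total_abs_error = 0.0
    for _ in range(n_steps):
        err = state - reference           # e(t) = actual(t) - target(t)
        integral += err * dt
        # negative feedback, since the error is defined as actual - target
        control = -(kp * err + ki * integral + kd * (err - prev_err) / dt)
        prev_err = err
        state += dt * (-state + control)  # first-order toy plant
        total_abs_error += abs(err) * dt
    return total_abs_error
```

With zero gains the plant never moves and the error integrates to its worst-case value, while any reasonable gains reduce the fitness, which is exactly the signal the evolutionary search exploits.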
4 Results and perspectives
Multiple implementations, varying in complexity, were realized for this work.
At first, a fixed PID controller was optimized with CMA-ES to adapt to
fluctuations in the precision of the perception. After this initial phase, a
neural network, itself optimized by CMA-ES, was used to tune a PID controller
online. The architecture of the neural network was chosen by the user. One of
the latest implementations describes the complete system, where both CMA-ES
blocks work together to obtain an optimal system.
Fig. 2. Evolution of the objective function across generations. The size of the step
between generations is displayed in green, the change in the objective function in cyan
and the minimum objective function of each generation in blue. The red asterisk is the
overall minimum objective function found by CMA-ES [10].
References
1. K. O. Stanley and R. Miikkulainen, “Evolving neural networks through augmenting
topologies,” Evolutionary Computation, vol. 10, no. 2, pp. 99–127, 2002.
2. T. Salimans, J. Ho, X. Chen, S. Sidor, and I. Sutskever, “Evolution Strategies as
a Scalable Alternative to Reinforcement Learning,” ArXiv e-prints, 2017.
3. X. Zhang, J. Clune, and K. O. Stanley, “On the relationship between the openai
evolution strategy and stochastic gradient descent,” CoRR, vol. abs/1712.06564,
2017.
4. I. Loshchilov, “CMA-ES with restarts for solving CEC 2013 benchmark problems,”
June 2013.
5. M. S. Saad, H. Jamaluddin, and I. Mat Darus, “PID controller tuning using
evolutionary algorithms,” vol. 7, pp. 139–149, January 2012.
6. K. Marova, “Using CMA-ES for tuning coupled PID controllers within models of
combustion engines,” CoRR, vol. abs/1609.06741, 2016.
7. Y. Wakasa, S. Kanagawa, K. Tanaka, and Y. Nishimura, “PID Controller Tuning
Based on the Covariance Matrix Adaptation Evolution Strategy,” IEEJ Transac-
tions on Electronics, Information and Systems, vol. 130, pp. 737–742, 2010.
8. B. Doicin, M. Popescu, and C. Patrascioiu, “PID controller optimal tuning,” 2016
8th International Conference on Electronics, Computers and Artificial Intelligence
(ECAI), June 2016.
9. N. Hansen, “The CMA evolution strategy: A tutorial,” 2010.
10. N. Hansen, “CMA-ES source code.”