Sample-Efficient Model Predictive Control Design of Soft Robotics by Bayesian Optimization
Sample-Efficient Model Predictive Control Design of Soft Robotics by Bayesian Optimization
Bayesian Optimization
Anuj Pal, Tianyi He∗ , Wenpeng Wei
Abstract— This paper presents a sample-efficient data-driven With the appropriate models, different control techniques
method to design model predictive control (MPC) for cable- can be applied. Model predictive control can effectively ad-
actuated soft robotics using Bayesian optimization. Instead of dress the constraints of the states and inputs. This technique
modeling the complex dynamics of the soft robots, the proposed
approach uses Bayesian optimization to search the best-guessed has been used in a pneumatic humanoid robot [8], [9]. To
account for the model uncertainty, robust H∞ control is used
arXiv:2210.08780v1 [cs.RO] 17 Oct 2022
x1
0
The Bayesian optimization is conducted using the
Bayesian Optimization Toolbox bayesopt by MATLAB. The -0.005
soft robot in SoRoSim and the Bayesian optimization are
-0.01
co-simulated with the sampling time 50 ms in simulation 0 5 10 15 20
time of 20 seconds. The soft material of the robot body is
0.04
selected as PDMS. The outputs x1 and x2 displacement are
constrained by [−0.1, 0.1]. The control input is constrained
x2
0.02
by [−10, 10]N. The parameters of robot and Bayesian opti-
mization are summerized in the Table I.
0
TABLE I
0 5 10 15 20
S UMMARY OF PARAMETERS IN THE SIMULATION
Time/s
BO parameter value soft robot values
Fig. 4. Position tracking of the origin under perturbation.
white noise variance σ 2 0.012 density 1000 kg/m3
maximum iteration kmax 100 Young’s modulus 1e6 N/m2
number of seed point 10 Poisson’s ratio 0.5 2) Scenario 2: tracking a circular shape: The reference
GPA active set size 300 length 0.6 m trajectory to be tracked is a circle with a radius of 0.05
prediction horizon Tp 10 radius 0.1 m m. The equation is expressed as x12 + x22 = 0.052 . The con-
weighting matrix Q 103 I3 actuation cable 3
weighting matrix R I3 damping 0.11e5 Pa.s vergence of the performance relative to iterations is plotted
in Fig. 5. Although it takes longer iterations to achieve the
optimal, the sub-optimal can be obtained in a few iterations.
B. Simulation results The responses of the positions x1 and x2 are plotted in
1) Scenario 1: position initialization to origin: In this Fig.6. The red curve is the response from the actual soft
simulation scenario, the soft robot is driven to position to robot, and the blue curve is the desired circular shape. In
the origin point to initialize the system. The tracking target the time horizon of the tracking task, the soft robot starts
is the constant reference trajectory at origin point (0, 0). from the origin point and converges to the desired circle.
Gradually, the position curve matches the desired trajectory.
12
10 12
Current point
8 Current best point
Overall best point 10 Current point
6 Current best point
8 Overall best point
4
6
2
0 4
-2 2
-4
0 20 40 60 80 100 0
-2
Fig. 3. Performance index in iterations of Bayesian optimizaion. 0 20 40 60 80 100
The performance index J at the iterations is plotted in Fig. 5. Performance index in iterations of Bayesian optimization.
the Fig.3. The best performance point is achieved at the
green point of the 42th iteration. At the early stage of
convergence, a sub-optimal solution can be achieved by V. C ONCLUSIONS AND FUTURE WORK
around 10 iterations. This indicates that a good controller can This paper presents a systematic framework for approx-
be tuned by the Bayesian optimization in a few iterations. imating the complex dynamics of a soft robot using the
0.05
[9] P. Hyatt and M. D. Killpack, “Real-time nonlinear model predictive
0.04 control of robots using a graphics processing unit,” IEEE Robotics and
Real trajectory
Automation Letters, vol. 5, no. 2, pp. 1468–1475, 2020.
0.03 Desired trajectory [10] A. Doroudchi, S. Shivakumar, R. E. Fisher, H. Marvi, D. Aukes, X. He,
S. Berman, and M. M. Peet, “Decentralized control of distributed
0.02 actuation in a segmented soft robot arm,” in 2018 IEEE Conference
on Decision and Control (CDC). IEEE, 2018, pp. 7002–7009.
0.01 [11] C. Della Santina, C. Duriez, and D. Rus, “Model based control of soft
robots: A survey of the state of the art and open challenges,” arXiv
x2
0
preprint arXiv:2110.01358, 2021.
-0.01 [12] D. Kim, S.-H. Kim, T. Kim, B. B. Kang, M. Lee, W. Park, S. Ku,
D. Kim, J. Kwon, H. Lee et al., “Review of machine learning methods
-0.02 in soft robotics,” Plos one, vol. 16, no. 2, p. e0246102, 2021.
[13] Z. Q. Tang, H. L. Heung, K. Y. Tong, and Z. Li, “A novel iterative
-0.03 learning model predictive control method for soft bending actuators,”
in 2019 International Conference on Robotics and Automation (ICRA),
-0.04 2019, pp. 4004–4010.
[14] D. Bruder, C. D. Remy, and R. Vasudevan, “Nonlinear system iden-
-0.05
-0.05 0 0.05 tification of soft robot dynamics using koopman operator theory,” in
x1 2019 International Conference on Robotics and Automation (ICRA).
IEEE, 2019, pp. 6244–6250.
[15] D. Bruder, X. Fu, R. B. Gillespie, C. D. Remy, and R. Vasudevan,
Fig. 6. Position trajectory tracking of the desired circular shape trajectory. “Data-driven control of soft robots using koopman operator theory,”
IEEE Transactions on Robotics, vol. 37, no. 3, pp. 948–961, 2020.
[16] A. Pal, Y. Wang, L. Zhu, and G. G. Zhu, “Engine calibration
optimization based on its surrogate models,” in Dynamic Systems and
Bayesian optimization (BO) framework. The study integrates Control Conference, vol. 59155. American Society of Mechanical
the framework to identify the low-dimensional linear dy- Engineers, 2019, p. V002T12A002.
namic model, which is then used with a linear MPC con- [17] L. Zhu, Y. Wang, A. Pal, and G. Zhu, “Engine calibration using
global optimization methods with customization,” SAE Technical
troller to achieve the desired tracking performance. For BO, Paper 2020-01-0270, Tech. Rep., 2020.
the Gaussian process is used for model development, and [18] A. Pal, L. Zhu, Y. Wang, and G. G. Zhu, “Constrained surrogate-
expected improvement (EI) is used as an acquisition function based engine calibration using lower confidence bound,” IEEE/ASME
Transactions on Mechatronics, vol. 26, no. 6, pp. 3116–3127, 2021.
to perform exploration and exploitation of the search/design [19] A. Pal, Y. Wang, L. Zhu, and G. G. Zhu, “Multi-objective surrogate-
space to obtain optimal solutions. The results demonstrate assisted stochastic optimization for engine calibration,” Journal of
superior performance using our proposed approach in iden- Dynamic Systems, Measurement, and Control, vol. 143, no. 10, p.
101004, 2021.
tifying the approximate system dynamics, which is then used [20] A. Pal, L. Zhu, Y. Wang, and G. G. Zhu, “Multi-objective stochastic
with linear MPC to achieve desired system performance. bayesian optimization for iterative engine calibration,” in 2020 Amer-
The future work includes investigating the robustness of the ican Control Conference (ACC), 2020, pp. 4893–4898.
[21] T. Gutjahr, T. Kruse, and T. Huber, “Advanced modeling and optimiza-
proposed method and experimental validations by real-time tion for virtual calibration of internal combustion engines,” in NDIA
implementations. Ground Vehicle Systems Engineering and Technology Symposium,
2017.
[22] J. Tang, A. Pal, W. Dai, C. Archer, J. Yi, and G. Zhu, “Stochastic
R EFERENCES bayesian optimization for predicting borderline knock,” International
Journal of Engine Research, p. 14680874211065237, 2021.
[1] M. Runciman, A. Darzi, and G. P. Mylonas, “Soft robotics in min- [23] W. Lyu, F. Yang, C. Yan, D. Zhou, and X. Zeng, “Multi-objective
imally invasive surgery,” Soft robotics, vol. 6, no. 4, pp. 423–443, bayesian optimization for analog/rf circuit synthesis,” in Proceedings
2019. of the 55th Annual Design Automation Conference, 2018, pp. 1–6.
[2] P. Polygerinos, N. Correll, S. A. Morin, B. Mosadegh, C. D. Onal, [24] J. Zhou, X. Su, and G. Cui, “An adaptive kriging surrogate method for
K. Petersen, M. Cianchetti, M. T. Tolley, and R. F. Shepherd, “Soft efficient joint estimation of hydraulic and biochemical parameters in
robotics: Review of fluid-driven intrinsically soft devices; manufac- reactive transport modeling,” Journal of contaminant hydrology, vol.
turing, sensing, control, and applications in human-robot interaction,” 216, pp. 50–57, 2018.
Advanced Engineering Materials, vol. 19, no. 12, p. 1700016, 2017. [25] B. Liu, V. Grout, and A. Nikolaeva, “Efficient global optimization of
[3] D. Trivedi, A. Lotfi, and C. D. Rahn, “Geometrically exact models for actuator based on a surrogate model assisted hybrid algorithm,” IEEE
soft robotic manipulators,” IEEE Transactions on Robotics, vol. 24, Transactions on Industrial Electronics, vol. 65, no. 7, pp. 5712–5721,
no. 4, pp. 773–780, 2008. 2017.
[4] C. Della Santina, A. Bicchi, and D. Rus, “On an improved state [26] W. Ali, M. S. Khan, M. A. Qyyum, and M. Lee, “Surrogate-assisted
parametrization for soft robots with piecewise constant curvature and modeling and optimization of a natural-gas liquefaction plant,” Com-
its use in model based control,” IEEE Robotics and Automation Letters, puters & Chemical Engineering, vol. 118, pp. 132–142, 2018.
vol. 5, no. 2, pp. 1001–1008, 2020. [27] M. Neumann-Brosig, A. Marco, D. Schwarzmann, and S. Trimpe,
“Data-efficient autotuning with bayesian optimization: An industrial
[5] F. Boyer, V. Lebastard, F. Candelier, and F. Renda, “Dynamics of
control study,” IEEE Transactions on Control Systems Technology,
continuum and soft robots: A strain parameterization based approach,”
vol. 28, no. 3, pp. 730–740, 2019.
IEEE Transactions on Robotics, vol. 37, no. 3, pp. 847–863, 2020.
[28] C. K. Williams and C. E. Rasmussen, Gaussian processes for machine
[6] S. Grazioso, G. Di Gironimo, and B. Siciliano, “A geometrically exact
learning. MIT press Cambridge, MA, 2006, vol. 2, no. 3.
model for soft continuum robots: The finite element deformation space
[29] D. R. Jones, M. Schonlau, and W. J. Welch, “Efficient global optimiza-
formulation,” Soft robotics, vol. 6, no. 6, pp. 790–811, 2019.
tion of expensive black-box functions,” Journal of Global optimization,
[7] S. Sadati, S. E. Naghibi, L. Da Cruz, and C. Bergeles, “Reduced-order vol. 13, no. 4, pp. 455–492, 1998.
modeling and model order reduction for soft robots,” 2021. [30] F. Renda, C. Armanini, V. Lebastard, F. Candelier, and F. Boyer,
[8] C. M. Best, M. T. Gillespie, P. Hyatt, L. Rupert, V. Sherrod, and M. D. “A geometric variable-strain approach for static modeling of soft
Killpack, “A new soft robot control method: Using model predictive manipulators with tendon and fluidic actuation,” IEEE Robotics and
control for a pneumatically actuated humanoid,” IEEE Robotics & Automation Letters, vol. 5, no. 3, pp. 4006–4013, 2020.
Automation Magazine, vol. 23, no. 3, pp. 75–84, 2016.