chen2017
chen2017
chen2017
Abstract— Lower exoskeleton has gained considerable inter- patient to walk with a reference gait. Most of these control
ests in rehabilitation and health care applications, since it has methods are combined with Finite State Machines (FSM),
the potential to help the paraplegia patients to walk again. which contains several gait trajectories in different walking
On the control of these lower exoskeletons, the predefined gait
control methods are commonly used, which requires the pilot patterns, such as in ATLAS[4], ReWalk[5], elegs[6], etc.
to adjust his/her movements to follow the predefined motion These methods achieve efficient results in walking assistance
trajectories. In the meanwhile, the desired trajectories are of paraplegic patients. However, there are two significant
predefined from healthy persons or extrapolated from clinical disadvantages of these methods which with predefined gait
gait analysis (CGA) datasets. However, in individual walking trajectories. Firstly for different patients, the gait trajectories
assistance situations, the exoskeleton should have the ability
to adapt with variant motions of the pilot, since the pilot of the exoskeleton should be corrected manually through trial
will change his motion in different walking situations. This and error. Secondly during normal walking, the patient can
paper presents a novel step length adaptation method to adapt not adjust his/her gait motion even slightly to adapt to the
the pilot’s motion for walking assistance exoskeletons. In this environment, e.g., before sitting down or stepping up/down
paper, the exoskeleton robot is model as a special Hybrid stairs. Moreover, the stability of the exoskeleton and the
Human-Exoskeleton Agent (HHEA), which considers both the
exoskeleton and the pilot. The Dynamic Movement Primitives energy cost of the patient also needs to be considered in
(DMP) is utilized to model the exoskeleton gait trajectories the gait trajectories.
dynamically, in which the relationship between the gait length In this paper, we present a novel step length adaptation
and Center of Mass (CoM) of HHEA is considered. In the method with dynamic gait models to model the exoskeleton
training process, a Reinforcement Learning (RL) method is
employed to update the parameters of dynamic gait model
gait flexibly, which enables the exoskeleton with the ability to
online. We demonstrate the efficiency of the proposed step learn/adapt the patient’s motion online. As a human-coupled
length adaptation method in simulation environment as well system, we modeled the exoskeleton with the pilot as a
as a lower limb exoskeleton system named as AssItive DEvice Hybrid Human-Exoskeleton Agent (HHEA). The Dynamic
for paRaplegics (AIDER). Experimental results shows that the Movement Primitive (DMP) architecture is employed to
proposed step length adaptation method is able to adapt variant
motions of the pilot during walking.
model the exoskeleton gait, which combining the relationship
Index Terms - Step Length Adaptation, Walking Assis- between the Center of Mass (CoM) of the HHEA and the
tance, Dynamic Movement Primitives, Reinforcement Learning, gait length. In order to learn the dynamic gait models online,
AIDER. a Reinforcement Learning (RL) method is utilized to update
the parameters of the gait models, in which the energy cost
I. INTRODUCTION
of the patient and the stability of the HHEA are considered
Many orthoses and lower exoskeletons have been devel- as cost terms. The paper’s contribution can be summarized
oped to help the paraplegic and quadriplegic people to regain as follows:
their locomotion abilities in lower limbs [1][4][5][6].
1) A novel step length adaptation method is proposed to
In recent years lower limb exoskeletons have gained
adapt to the patient’s motion, in which the exoskeleton
considerable interests since they can not only be utilized
with the pilot is modeled as a HHEA;
in clinical rehabilitation [2], also have the ability to serve
2) We employ a DMP architecture to model the exoskele-
the patients’ daily lives [1]. Lower limb exoskeletons have
ton gait flexibly, which combining the CoM of the
progressed from the stuff of science to commercialized
HHEA and the gait length;
products over last few decades [7]. However, lower limb
3) Parameters of the gait models are updated online with
exoskeletons also face challenge problems when serving
RL method.
patients in their daily lives. Gait planning is one of the critical
issues in walking assistance since the gait of the exoskeleton We first validated the proposed step length adaptation
will affect the patient’s movements directly. method in a simulation environment, and then test it on
The predefined gait trajectory control method is commonly a walking assistance exoskeleton system which named as
used in lower exoskeletons for walking assistance. With AssIstive DEvice for paRaplegics (AIDER). Experimental
predefined gait trajectories, the exoskeleton will force the results show that the advantages of the proposed step length
adaptation method over the traditional predefined gait tra-
1 Qiming Chen, Hong Cheng, ChunFeng Yue, Rui Huang, Hongliang
jectory control method, especially with consideration of the
Guo are with Center for Robotics, School of Automation Engineering,
University of Electronic Science and Technology of China, Chengdu, China
patient’s energy cost and the stability of the HHEA.
kipochen@163.com The structure of this paper is organized as follows. We
978-1-5090-6759-6/17/$31.00 ©2017 IEEE 644
first introduce the dynamic gait models for step length the patient and ‘hinder legs’ are actuated by the exoskeleton.
adaptation in Section II, in which the HHEA and DMP Fig. 1 shows the schematic diagram of the HHEA. As same
are also introduced. Section III lay down the reinforcement as most human-exoskeleton systems, the HHEA model has
learning algorithm for online adaptation of gait parameters. only four active joints on hips and knees. In order to simplify
Experimental results on both simulation environment and the the model, the DOF of the patient’s wrist is ignored, so the
AIDER system are presented and analyzed in Section IV. ‘front legs’ only have two DOFs in each shoulder joint and
This paper ends with conclusion and future work in Section one DOF in each elbow joint.
V. During normal walking of HHEA, a whole gait cycle
can be separated into two phases: double support phase and
II. DYNAMIC G AIT M ODELS FOR S TEP L ENGTH triple support phase. Fig. 2 illustrate normal walking gait of
A DAPTATION HHEA. In double support phase, both of feet is contacting
This section presents the methodology details of the with the ground and the patient move two sticks forward.
proposed dynamic gait models for step length adaptation. After two sticks touch the ground, one of the exoskeleton’s
Before introducing the models of dynamic gait, we regard leg swings front with the support of other three ‘stance legs’.
the assistance exoskeleton system as a Hybrid Human-
Exoskeleton Agent (HHEA) in Section II-A. In Section II- LF
Hip Trajectory
Z
Y
X
Foot Trajectory
Fig. 1. The schematic diagram of the HHEA (joints in blue belong to the
pilot and joints in black belong to the exoskeleton).
where x and v indicate the position and velocity of the Back-Right Front-Left
Input: θinit , δ , N, K
arm
Output: θ
1. while the system cost R is not converged do
2. Create K roll-outs of the system from the same start with
thigh noise εk ∼ N(0, δ 2 ).
3. for k = 1 → K do
4. for i = 1 → N do
5. Update the stochastic cost S through (12)
stick 6. Update the probability P through (13)
7. end for
shank 8. end for
9. for i = 1 → N do
10. Update δ θti for each time step through (14)
z 11. end for
12. Update δ θ through (15)
x
13. Update θ ← θ + δ θ
X CoM 14. Create one noiseless roll-out to evaluate the trajectory cost
X sp R of updated θ .
15. end while
16. return θ
Fig. 5. The relationship between the CoM’s projection and the step length
of HHEA.
The parameter λ in (13) is set automatically within (0, 1].
After obtaining the updated parameters θ of dynamic gait
control matrix, and f (xt ,t) is the passive dynamics of the models, the HHEA should take a noiseless roll-out which
control system. without exploration noise εt to compute the cost R. With the
In the implementation of HHEA, the immediate cost reinforcement learning process , the parameters of dynamic
function is defined based on the patient energy cost: gait models can be updated online to adapt the patient’s gait,
�t
f dt which aiming to reduce the energy cost of the patient as well
0
rt =
, (10) as stable walking.
t
where f is ground reaction forces of sticks. For each roll- IV. E XPERIMENTS AND D ISCUSSIONS
out in reinforcement learning process, the immediate cost
In this section, experiments are carried out to validate the
will be calculated in each step of the patient. The total cost
proposed step length adaptation method on both simulation
of each roll-out is combined with the immediate cost and a
platform and real-time exoskeleton system. Experimental
final cost. In this paper, we utilize the balance of HHEA as
setup, results and discussions are provided in the following
the final cost of each roll-out. Hence, the cost function of
two subsections.
each roll-out can be defined as:
N −m N A. Dynamic Simulation
R = λ1 + λ2 ∑ ri , (11)
N i=1 1) HHEA in Simulation Platform: In the simulation en-
vironment, the exoskeleton with the patient are modeled as
where λ1 and λ2 are weighted parameters. N indicate the
HHEA which described as Fig. 1. The HHEA is modeled as
number of steps for each roll-out and m means the HHEA
a special quadruped robot which has total 18 DOFs. As same
fall down after m steps. Here the discrete form of immediate
with the exoskeleton in real-time applications, the HHEA in
cost of each step is utilized to calculate the total cost.
simulation environment has 10 active joints, in which 4 active
After defining the policies and cost function, the learning
joints are in exoskeleton hips and knees, and rest 6 active
process is shown in Tab. I, with equations (12) to (15).
joints are aiming to actuate the ‘front legs’ of HHEA. The
1 N−1 simulation platform is based on Gazebo environment, Fig. 6
S(i, k) = Rk + ∑ (θ + εk )2 ,
2 j=1
(12)
shows the HHEA model in simulation environment.
The HHEA’s motion in simulation environment is planned
1
e− λ S(i,k) separately in two parts. The first part is the motion planners
P(i, k) = 1 , (13) for exoskeleton legs, which include motion trajectories of
∑Kk=1 [e− λ S(i,k) ]
knee joints and hip joints. Another part is the planners of
K the patient, which aiming to move the sticks (‘front legs’ of
δ θti = ∑ [P(i, k)εk ], (14) HHEA) to given goal places.
k=1
In order to obtain some basic information of HHEA, three
∑N−1
i=1 (N − i)δ θti kinds of sensors are utilized to measure the feedback which
δθ = . (15)
∑N−1
i=1 (N − i)
can be employed in the proposed step length adaptation
647
the learned dynamic gait.
9000
Initial Param: A
8000 Initial Param: B
7000
Cost
6000
5000
4000
3000
0 2 4 6 8 10 12 14 16
Updates
(a) Cost during learning process
Step Number
8 Initial Param: B
method. The Inertial Measurement Unit (IMU) sensor at-
tached on the upper body is utilized to estimate whether the 6
HHEA is fall down or not. On each stick, pressure sensors are
installed to measure the ground reaction force on sticks. For 4
each joint in HHEA, a joint encoder is embedded to obtain
2
joint states, and also can be utilized to calculate joint’s power 0 2 4 6 8 10 12 14 16
of the pilot. Updates
2) Experimental Setup: In the dynamic simulation exper- (b) Average stable steps of one roll-out
iments, the pilot’s intention (goal places of sticks) during
Fig. 7. Experimental results with different initial parameters in simulation
normal walking is set as a noisy parameter with Gaussian environment.
white noise, which makes each step of the pilot is not
fixed in the simulation environment. The dynamic gait of
HHEA is modeled through DMP with CoM compensation, After obtaining optimal parameters of dynamic gait mod-
which introduced in Section II. In the reinforcement learning els, we compare the performance of proposed step length
process, the parameters of dynamic gait models are updated adaptation method with traditional fixed step length methods.
every 5 roll-outs. During each roll-out, the HHEA takes 10 In the experiment, the torque of HHEA’s upper body is
gait cycles with the updated parameters including exploration utilized to validate the efficiency of the proposed dynamic
noises. gait, which indicate that the strength from the patient. Fig. 8
illustrates the performances of proposed step length adapta-
TABLE II
tion method with fixed step length methods, the results show
I NITIAL PARAMETERS IN SIMULATION EXPERIMENTS .
that the proposed method can reduce the provided torque of
Parameters X̂CoM α
the patient after learning.
Initial parameters A 0.6 -1.0
Initial parameters B 0.9 -0.4 Dynamic Gait
Torque of Upper Body (N.M)
2 3
0.12m
450
0.14m 2002, pp 1499-1504
0.16m [4] D. Sanz Merodio, M. Cestari Soto, J.C. Arevalo, et al., “Control
400 0.18m Motion Approach of a Lower Limb Orthosis to Reduce Energy
0.2m Consumption, International Journal of Advanced Robotic Systems,
350 Dynamic Gait 2012, 9:1-18
[5] M. Talaty, A. Esquenazi, J.E. Briceno, “Differentiating Ability in Users
of the ReWalk TM Powered Exoskeleton: An Analysis of Walking
300 Kinematics”, In IEEE International Conference on Rehabilitation
Robotics (ICORR), 2013, pp 1-5
250 [6] K.A. Strausser, H. Kazerooni, “The Development and Testing of a
Human Machine Interface for a Mobile Medical Exoskeleton”, In
0 100 200 300 400 IEEE International Conference on Intelligent Robots and Systems
Ticks (20ms) (IROS), 2011, pp 4911-4916
(a) Measured pressure of sticks [7] T. Yan, M. Cempini, C.M. Oddo, et al., “Review of Assistive Strategies
in Powered Lower-Limb Orthoses and Exoskeletons”, Robotics and
Autonomous Systems, 2015, 64:120-136
10
[8] S. Wang, L. Wang, C. Meijneke, et al., “Design and Control of the
MINDWALKER Exoskeleton”, IEEE transactions on neural systems
8 and rehabilitation engineering, 2015, 23(2): 277-286
Step Number
650