State Estimation in Nonlinear System Using Sequential Evolutionary Filter
State Estimation in Nonlinear System Using Sequential Evolutionary Filter
6, JUNE 2016
Abstract—As a commonly encountered problem in the and UKF has been demonstrated in many literatures and lots
particle filters (PFs), the particle impoverishment is caused of the improved versions have been proposed [10], [12], [13].
partially by the reduction of particle diversity after resam- However, these nonlinear filters have the difficulties in dealing
pling. In this paper, a novel particle filtering technique
named sequential evolutionary filter (SEF) is introduced, by with the arbitrary distributions in the considered system.
which the particle impoverishment problem can be effec- The particle filter (PF) provides another effective tool for
tively mitigated. SEF is proposed based on the genetic the state estimation problem of nonlinear systems. Under the
algorithm (GA). A GA-inspired strategy is designed and Bayesian framework, the Monte Carlo simulation method is
incorporated in SEF. With this strategy, the resampling incorporated in PF. The posterior distribution is approximated
used in most of the existing PFs is not necessary, and
the particle diversity can be maintained. The experimental by a number of weighted samples (or particles). Compared
results also demonstrate the effectiveness of SEF. with EKF and UKF, PF offers several superior performances.
For example, it can be used in the system with non-Gaussian
Index Terms—Genetic algorithm (GA), nonlinear system,
distributions. And for the highly nonlinear system, it can gen-
particle filter (PF), sequential evolutionary filter (SEF), state
estimation. erate more accurate state estimations than EKF and UKF. In
the past two decades, PF has found a wide variety of applica-
I. I NTRODUCTION tions in the practical problems. In [14], both the unconstrained
and constrained state estimation problems using PF have been
with the generic PF, although the particle impoverishment {x0 , . . . , xk } and y 1:k = {y 1 , . . . , y k }, respectively. As
problem has been mitigated to some degree, most of these aforementioned, PF approximates the posterior distribution of
improved PFs have the more complex strategies and suffer the state xk with a number of particles, and the approximated
from higher computational burdens. Besides the GA-based PF, form of the joint posterior distribution p(x0:k |y 1:k ) is
there are also some other evolutionary inspired PFs. More N
information can be found in [6] and [25]. p(x0:k |y 1:k ) ≈ wki δ(x0:k − xi0:k ) (2)
In this paper, a novel filter named sequential evolutionary i=1
filter (SEF) is proposed for mitigating the particle impover-
in which δ(·) denotes the Dirac delta function, N is the particle
ishment. It is inspired from the idea of GA. The reduction of
number, and wki is the normalized importance weight (or parti-
particle diversity introduced by resampling can be attributed to
cle weight) for the ith particle. With state xi0:k and weight wki ,
two reasons. The first one is the existence of too many repli-
the ith particle can be formulated as {xi0:k , wki }.
cates of large-weight particles. Second, if no particles located
The definition of the importance weight wk is
in the high probability regions of the posterior distribution, no
resampled particles will be generated in these regions. Based p(x0:k |y 1:k )
wk ∝ (3)
on these two reasons, a GA-inspired strategy is designed in q(x0:k |y 1:k )
SEF. With this strategy, the particle diversity can be maintained. where ∝ means to be proportional to. q(x0:k |y 1:k ) is the impor-
The small-weight particles can be modified into the particles tance distribution. In practice, it is usually difficult to directly
assigned with larger weights. And at the same time, these mod- sample from p(x0:k |y 1:k ) for its attributes such as multivari-
ified small-weight particles will not overlap the large-weight ate and nonstandard. The importance distribution q(x0:k |y 1:k )
ones. Different from most of the existing GA-based PFs, the is introduced to overcome this problem, and compared with
resampling procedure is not necessary in SEF and is replaced by p(x0:k |y 1:k ), it can be easily sampled.
the proposed GA-inspired strategy. The effectiveness of SEF is To derive the iterative formulation of the importance weight
demonstrated in the numerical example and three-tank system. wk , q(x0:k |y 1:k ) is chosen to satisfy
This paper is organized as follows. In Section II, the
brief introductions of the generic PF and GA are presented. q(x0:k |y 1:k ) = q(xk |x0:k−1 , y 1:k )q(x0:k−1 |y 1:k−1 ) (4)
Section III presents the detailed descriptions of the proposed
in which q(xk |x0:k−1 , x1:k ) is regarded as the marginal distri-
SEF. The results of the experiments on the numerical exam-
bution of q(x0:k |x1:k ).
ple and three-tank system are reported in Section IV. Finally,
Under the Bayesian framework, the joint posterior distribu-
Section V concludes this paper.
tion p(x0:k |x1:k ) can be written as
p(xk |x1:k−1 , x0:k )p(xk |x0:k−1 )
II. P RELIMINARIES O N G ENERIC PF (GPF) AND GA p(x0:k |x1:k ) =
p(xk |x1:k−1 )
In this paper, the considered dynamic state-space model is × p(x0:k−1 |y 1:k−1 ). (5)
xk = f(xk−1 , uk , wk ) The denominator part of (5) can be reformulated to
(1)
y k = h(xk , uk , νk ), k = 1, 2, . . .
p(y k |y 1:k−1 ) = p(y k |xk )p(xk |y 1:k−1 )dxk (6)
where
xk ∈ Rnx and yk ∈ Rny denote the state and measure- in which
ment vectors;
uk ∈ Rnu is the system input; p(xk |y 1:k−1 ) = p(xk |xk−1 )p(xk−1 |y 1:k−1 )dxk−1 . (7)
wk ∈ Rnw and νk ∈ Rnν denote the white process and
measurement noises. Since p(y k |xk ) and p(xk |xk−1 ) are known from the system
The functions f : Rnx × Rnu × Rnw → Rnx and h : model (1), and p(xk−1 |y 1:k−1 ) is the marginal posterior distri-
R × Rnu × Rnν → Rny are the transition and measure-
nx bution at (k − 1)th time step, p(y k |y 1:k−1 ) can be regarded as a
ment functions. The system model (1) can also be formulated constant factor. Considering the assumptions made on the states
in the statistical form, where the functions f and h, respectively, xk and measurements y k (k = 1, 2, . . .), the joint posterior
correspond to the transition and likelihood distributions, i.e., distribution p(x0:k |y 1:k−1 ) can be further simplified to
p(xk |xk−1 ) and p(y k |xk ). The initial distribution of the state p(x0:k |y 1:k ) ∝ p(y k |xk )p(xk |xk−1 )p(x0:k−1 |y 1:k−1 ). (8)
x0 is known as p(x0 ). In general, the assumptions are made
that xk (k = 1, 2, . . .) subject to the first-order Markov process, By substituting (4) and (8) into (3), a recursive formulation of
and y k (k = 1, 2, . . .) are conditionally independent given the the importance weight wk can be derived as
states. p(y k |xk )p(xk |xk−1 )p(x0:k−1 |y 1:k−1 )
wk ∝ (9)
q(xk |x0:k−1 , y 1:k )q(x0:k−1 |y 1:k−1 )
A. GPF
p(y k |xk )p(xk |xk−1 )
In order to give a brief introduction of GPF, the sequences = wk−1 . (10)
of the states and measurements are denoted as x0:k = q(xk |x0:k−1 , y 1:k )
Authorized licensed use limited to: UNIVERSITY OF SOUTHAMPTON. Downloaded on December 06,2023 at 15:47:57 UTC from IEEE Xplore. Restrictions apply.
3788 IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 63, NO. 6, JUNE 2016
TABLE I
M ULTINOMIAL R ESAMPLING
inspired from the idea of GA. An effective strategy, i.e., GA- χ̃L = {x̃1kL , w̃kL
1
}, . . . , {x̃m m
kL , w̃kL }
inspired strategy, is implemented in SEF. In the following, this . (21)
strategy will be first introduced, and then the full procedure of χ̃S = {x̃1kS , w̃kS
1 N −m
}, . . . , {x̃kS N −m
, w̃kS }
SEF will be presented.
The state of the moved particle is represented by xjkO . Then,
the crossover operator in GA-inspired strategy is
A. GA-Inspired Strategy
x̃jkO = αj x̃jkS + (1 − αj )x̃lkL . (22)
The GA-inspired strategy is designed to maintain the parti-
cle diversity, and finally, mitigate the particle impoverishment Similar to the arithmetic crossover, αj is also a random number,
problem encountered in GPF. Similar to GA, three operators, but it is drawn from the uniform distribution over [0, ᾱ], i.e.,
i.e., separation, crossover, and mutation, constitute the main αj ∼ U (0, ᾱ). ᾱ is the upper bound of αj , and in every step, it
part of the GA-inspired strategy. But, these three operators are can be calculated as
different from the genetic operators in GA. They are developed
according to the attributes of PF. ᾱ = 1 − Neff /N. (23)
Assume that the particles have been obtained and they are
formulated as {xik , wki } (i = 1, 2, . . . , N ). The separation The value of αj determines how much information will be
operator aims to separate the particle population into two cate- transferred from x̃lkL to x̃jkO . The smaller the value of αj is the
gories. One is composed of the small-weight particles and the more information of x̃lkL will be transferred. For each x̃jkS , x̃lkL
other is composed of the large-weight particles. To achieve this is randomly selected from χ̃L . Different from the arithmetic
separation, all the particles are sorted first according to their crossover, the modifications are only made on the small-weight
weights particles, and only one offspring particle, i.e., x̃jkO , is created
by the crossover in the GA-inspired strategy. For the large-
χ̃ = Sortdesc. {x1k , wk1 }, . . . , {xik , wkN } (15) weight particles, no modifications are made on them, since
1 1 i
they are important for approximating the posterior distribution
= {x̃k , w̃k }, . . . , {x̃k , w̃kN } (16)
p(xk |y 1:k ).
where Sortdesc. { · } is the operation used to sort the particles in In order to further improve the diversity of particle popu-
descending order according to their weights wki . {x̃ik , w̃ki } rep- lation, the mutation operator is designed to extend the state-
resents the sorted particle. The separation is performed on these search space of particles. The definition is
sorted particles l
2x̃kL − x̃jkO , if rj ≤ pM
j
x̃kM = (24)
χ̃L = {x̃1k , w̃k1 }, . . . , {x̃m m
k , w̃k } x̃jkO , if rj > pM
(17)
χ̃S = {x̃m+1 k , w̃km+1 }, . . . , {x̃N N
k , w̃k }
where x̃lkL and x̃jkO are the particle states in (23). rj ∼
in which χ̃L is the category including large-weight particles,
U (0, 1), and pM ∈ [0, 1] is the mutation probability. When
and χ̃S is the category including the small-weight particles. m
rj ≤ pM , the mutation will be performed on x̃jkO .
is a integer, and
After performing the GA-inspired strategy, the small-weight
m ≤ γNeff < m + 1. (18) particles will be modified into the ones with the large weights.
In the next time step, all particles will be treated equally. Based
where Neff is the effective sample size calculated by (14). γ ∈ on this idea, the particle weights will be reset to 1/N after the
(0, 1] is a parameter that is set in advance. GA-inspired strategy as
In the real-coded GAs, a commonly used crossover is the
χ = { x̃1kL , 1/N }, . . . , {x̃m
kL , 1/N }
arithmetic crossover [32]. Suppose that two of the selected par-
(N −m)
ent potential solutions are represented as x1par and x2par , and {x̃1kM , 1/N }, . . . , {x̃kM , 1/N }} (25)
their offsprings are represented as x1off and x2off . The arithmetic
= { x1k , 1/N }, . . . , {xN
k , 1/N } . (26)
crossover is represented as
To intuitively show the GA-inspired strategy, a diagram under
x1off = ax1par + (1 − a)x2par (19) the one dimension situation is presented in Fig. 2. The sizes
x2off = ax2par + (1 − a)x1par (20) of the circles represent the particle weights. Different colored
circles denote the particles obtained from different stages. The
where a is a random number drawn from the interval [0, 1]. circles filled with navy blue correspond to the parent particles in
The crossover operator used in the GA-inspired strategy is Fig. 1. The circles filled with red and orange denote the small-
modified from the arithmetic crossover. It is designed to mod- and large-weight particles, respectively. The circles filled with
ify the small-weight particles into the particles with the large light blue are the particle created by mutation. And the circle
weights. Denote the state of the large-weight particle as x̃lkL , filled with no color is the final particle, i.e., {xik , 1/N }, created
and the state of the small-weight particle as x̃jkS , in which by GA-inspired strategy.
l = 1, . . . , m and j = 1, . . . , N − m. Hence, the χ̃L and χ̃S Different from the resampling used in GPF, which only repli-
can be reformulated as cates the large-weight particles to replace the small-weight
Authorized licensed use limited to: UNIVERSITY OF SOUTHAMPTON. Downloaded on December 06,2023 at 15:47:57 UTC from IEEE Xplore. Restrictions apply.
3790 IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 63, NO. 6, JUNE 2016
TABLE II
I MPLEMENTATION P ROCEDURE OF SEF
The choice of MMSE or MAP depends on different estimation where ωk ∼ N (0, σω2 ) and νk ∼ N (0, σν2 ) represent the pro-
problems. In the experimental part of this work, the MMSE is cess and measurement noises, respectively.
√ In the experiment,
used to estimate the system states. the variances are set as σω = 1, σν = 2. xk is the system state
In the GA-inspired strategy, a simpler formulation of the that needs to be estimated. Its initial value is set to x0 = 0.1.
crossover and mutation operators can be derived. Assume that The initial particles {xi0 , 1/N } (i = 1, . . . , N ) are drawn from
we have obtained the separated particles, i.e., {x̃lkL , wkL
l
}∈ a Gaussian distribution, i.e., xi0 ∼ N (0.1, 2).
j j In this experiment, the particle number is set to N = 500.
χ̃L and {x̃kS , wkS } ∈ χ̃S . Then, the simpler formulation can
Both the thresholds of Neff for GPF and RPF are set to
be presented as
NT = 0.5N . The constant tuning parameter K in RGPF is
(αj + 1)x̃lkL − αj x̃jkS , if rj ≤ pM set to K = 0.2 [19]. The parameters in SEF are set as pM =
j
x̃kM = (29) 0.5 and γ = 1. A metric, i.e., the absolute error of state, is
αj x̃jkS + (1 − αj )x̃lkL , if rj > pM defined as
n
which combines the crossover and mutation operators. 1
Compared with the GPF, SEF replaces the resampling with e= |xk − x̂k | (32)
n
k=1
the designed GA-inspired strategy, with which the particle
diversity is maintained and the particle impoverishment in where n is the number of the time steps. The experiment is
GPF is mitigated. A brief implementation procedure of SEF is repeatedly performed for 30 times. To evaluate the performance
presented in Table II. of these four PFs, the averaged absolute error ūe and variance
of the absolute error Var are calculated. To estimate the com-
putational burdens of four PFs, the averaged running time ūt is
IV. E XPERIMENTS recorded.
In this section, two experimental systems are used. The first The experiment results are presented in Table III. As shown
one includes a commonly used numerical example [4], [20]. in the table, four PFs give the comparative estimation accura-
The highly nonlinear and bimodal characteristics of this exam- cies which can be reflected by the average absolute error ūe .
ple bring challenges for state estimation issues. The other one The variances Var of four PFs are all very small. This means
is the three-tank system that always acts as the benchmark in that these four PFs can offer stable state estimation results.
Authorized licensed use limited to: UNIVERSITY OF SOUTHAMPTON. Downloaded on December 06,2023 at 15:47:57 UTC from IEEE Xplore. Restrictions apply.
YIN et al.: STATE ESTIMATION IN NONLINEAR SYSTEM USING SEF 3791
Fig. 4. State estimation results of SEF. Fig. 7. State estimation results of RGAPF.
TABLE V
AVERAGED A BSOLUTE E RRORS OF SEF AND GPF
in GPF is mitigated. GPF and RPF offer the similar state esti-
mation results. The worst state estimation results belong to the
RGAPF. The simulations are performed for 10 times, and the
averaged absolute errors of four filters are presented in Table V.
The table also shows that SEF offers the more accurate state
Fig. 5. State estimation results of GPF.
estimation results than GPF, RPF, and RGAPF. In the exper-
iment, we find that the variances of the absolute errors of all
four filters for 10 times experiments approximate to 0. This
means that all these four filters can offer stable state estima-
tion results. The averaged running time ūt for SEF, GPF, RPF,
and RGAPF are 14.573, 11.362, 14.591, and 15.053 s, respec-
tively. As the averaged running time recorded in the numerical
example, the ūt of RPF and RGAPF are larger than GPF’s.
But, for this MIMO system, the ūt of SEF is also greater than
GPF’s.
V. C ONCLUSION
In this paper, an SEF technique, which is based on the PF
and GA, has been introduced. It replaces the resampling in GPF
with a GA-inspired strategy, by which the particle impoverish-
ment encountered in GPF is mitigated. The main idea of the
Fig. 6. State estimation results of RPF.
GA-inspired strategy is to modify the small-weight particles
into the particles with large weights, and at the same time, main-
estimated and real states. From these figures, it can be observed tain the diversity of the particle population. The performances
that SEF provides the best state estimation. This mainly owes to of SEF are evaluated by two experiments. The experimental
the GA-inspired strategy. With this strategy, the particle diver- results show that compared with three other filters, SEF offers
sity is maintained, and the particle impoverishment encountered the better state estimation results.
Authorized licensed use limited to: UNIVERSITY OF SOUTHAMPTON. Downloaded on December 06,2023 at 15:47:57 UTC from IEEE Xplore. Restrictions apply.
YIN et al.: STATE ESTIMATION IN NONLINEAR SYSTEM USING SEF 3793
Authorized licensed use limited to: UNIVERSITY OF SOUTHAMPTON. Downloaded on December 06,2023 at 15:47:57 UTC from IEEE Xplore. Restrictions apply.
3794 IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 63, NO. 6, JUNE 2016
Authorized licensed use limited to: UNIVERSITY OF SOUTHAMPTON. Downloaded on December 06,2023 at 15:47:57 UTC from IEEE Xplore. Restrictions apply.