A gradient boosting decision tree based GPS signal reception classification algorithm
A gradient boosting decision tree based GPS signal reception classification algorithm
article info a b s t r a c t
Article history: In urban areas, GPS signals are often reflected or blocked by buildings, which causes multipath
Received 15 May 2019 effects and non-line-of-sight (NLOS) reception respectively consequently degrading GPS positioning
Received in revised form 14 October 2019 performance. While improved receiver design can reduce the effect of multipath to some extent, it
Accepted 15 November 2019
cannot deal with NLOS. Modelling methods based on measurements have shown promise to reduce
Available online xxxx
the effect of NLOS signal reception. However, this depends on their ability to accurately and reliably
Keywords: classify line-of-sight (LOS), multipath and NLOS signals. The traditional method is based on one feature
GPS using signal strength as measured by the carrier to noise ratio, C/N0 . However, this feature is ineffective
GBDT in capturing the characteristics of multipath and NLOS in all environments. In this paper, to improve
Urban canyon the accuracy of signal reception classification, we are using the three features of C/N0 , pseudorange
Multipath
residuals and satellite elevation angle with a gradient boosting decision tree (GBDT) based classification
NLOS
algorithm. Experiments are carried out to compare the proposed algorithm with classifiers based on
decision tree, distance weighted k-nearest neighbour (KNN) and the adaptive network-based fuzzy
inference system (ANFIS). Test results from static receivers in urban environments, show that the GBDT
based algorithm achieves a classification accuracy of 100%, 82% and 86% for LOS, multipath and NLOS
signals, respectively. This is superior to the other three algorithms with the corresponding results of
100%, 82% and 84% for the Distance-Weighted KNN, 99%, 70% and 65% for the ANFIS and 98%, 35% and
95% for the traditional decision tree. With the NLOS detection and exclusion, the proposed GBDT with
multi-feature based method can provide a positioning accuracy improvement of 34.1% compared to
the traditional C/N0 based method.
© 2019 Elsevier B.V. All rights reserved.
https://doi.org/10.1016/j.asoc.2019.105942
1568-4946/© 2019 Elsevier B.V. All rights reserved.
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
2 R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx
effects [7]. Subsequently, multipath estimating delay lock loops used in recent years to improve the accuracy of GNSS positioning
(MEDLL) [8], vision correlators and strobe correlators have been and signal reception classification. Phan et al. used elevation
developed [9–13]. These methods, referred to as signal processing and azimuth angles as the key features of support vector ma-
based multipath mitigation, assume that both direct and reflected chines (SVM) to mitigate the multipath effect [37]. Yozevitch
signals are able to reach the receiver, thus a signal processing et al. used C/N0 , elevation and other observations as features
technique can be applied to filter the reflected signal. Unfortu- of decision trees to classify LOS and NLOS [32]. Monsak et al.
nately, these methods do not mitigate the effect of NLOS signal proposed a method using machine learning to detect NLOS signals
reception as it comprises of the reflected signal only. in a collaborative vehicle environment. The results are com-
Measurement-based modelling refers to using satellite ob- pared with several different machine learning algorithms [38].
servations and related information to mitigate NLOS reception Hsu et al. applied SVM to distinguish received GPS signal types,
using four features, including C/N0 , temporal difference of C/N0 ,
and multipath effects. GPS measurements can be augmented by
pseudorange residual and pseudorange rate, extracted from raw
complementary additional sensors (such as inertial measurement
measurements [36]. Quan et al. proposed a convolutional neu-
units) to improve performance in urban canyons. However, this
ral network (CNN) based multipath detection method with the
is highly dependent on the performance of the inertial measure-
sparse auto-encoder (SAE) for feature extraction [39]. Guermaha
ments and their mechanizations [14–21]. Integrating GNSS with
et al. proposed a GNSS signal classifier system based on the
vision sensors can be effective in reducing multipath error and satellite elevation and the difference of C/N0 value, provided by
resulting in the improvement of positioning accuracy. However, right-hand circular polarized (RHCP) and left-hand circular polar-
the performance is affected by weather and environment fea- ized (LHCP) antennas, with the decision tree [40]. Sun et al. used
tures [22]. In addition, the expense associated with such sensors nine variables, including the C/N0 , temporal difference of C/N0 ,
precludes their use in location-based services. HDOP, VDOP, satellite elevation angle and azimuth angle, pseu-
Another relatively low-cost approach to mitigate multipath dorange residual, consistency between delta pseudorange and
errors is to use a 3D city model as an information source. The pseudorange rate and number of visible satellites, derived from
model can be used to classify the type of signal reception and thus the raw GPS measurements together with an algorithm based on
exclude NLOS signals during positioning [23,24]. However, this is an ANFIS to classify LOS, multipath and NLOS measurements from
very likely to decrease the number of available satellites, leading GPS [41]. These studies have shown that machine learning is a po-
both to a weaker geometry and lower redundancy required for tentially effective method for GPS signal reception classification.
the reduction in random errors and integrity monitoring. A more To date, this potential is still to be exploited, because it is difficult
reasonable approach is to use the NLOS signal for positioning to obtain a trade-off between high classification accuracy and low
with the shadow matching method. Shadow matching utilizes 3D computational cost with an increasing number of input features.
city models to predict satellite visibility and then compares this Therefore, in this paper we are aiming to use a number of in-
prediction with the measured satellite visibility to determine the put features with various machine learning algorithms to improve
position [25–27]. In addition, a research team at the University classification accuracy with a high computational efficiency. The
of Tokyo used a 3D city model to simulate signal propagation main known features from GPS raw measurements include C/N0 ,
path and computed simulated pseudorange measurements, be- HDOP, VDOP, satellite elevation angle, azimuth angle, pseudor-
ange residual, pseudorange rate and number of visible satellites.
fore weighting each candidate based on the similarity between
This paper proposes a signal reception classifier based on the gra-
the simulated and actual pseudorange measurements. The esti-
dient boosting decision tree (GBDT) algorithm using 3 variables
mated position was then obtained by the weighted average of the
as features: C/N0 , pseudorange residuals, and satellite elevation
candidate positions [28–30]. The performance of this method de-
angle. The reason for choosing these 3 features among the known
pends on the correct and reliable classification of LOS, multipath
features are that C/N0 , pseudorange residuals, and satellite eleva-
and NLOS signals. tion angle are highly related to the types of signal reception, and
The traditional method for signal classification is to define a their ability to distinguish the signal reception type have been
threshold value of C/N0 with higher C/N0 classified as LOS, while proved by previous studies [31–40]. For the other features, such
those with lower C/N0 classified as NLOS. Although Yozevitch as HDOP, VDOP and GDOP, represent the geometric distribution
et al. have shown that under conditions of no interference, C/N0 of visible satellites in a single epoch without a direct relationship
can serve as an excellent classifier [31,32], in practice LOS signals to signal reception type. The pseudorange rate only represents
can be detected with low C/N0 for various reasons (e.g., the an instantaneous change of the signal and therefore, has a weak
antenna location, momentary blocks, etc.) and NLOS signals can relationship to signal reception type. The contributions of this
be detected with relatively high C/N0 . Hence, additional signal paper are summarized as follows.
features are required for reception type classification. Therefore,
in addition to C/N0 , satellite elevation can be used as a feature for • A robust GBDT based GPS signal reception classification
the classification. In general, the higher the elevation, the more algorithm is proposed by using C/N0 , pseudorange residuals
likely it is for a signal to be LOS, but as the elevation decreases, the and satellite elevation angle as the input features to improve
satellite is more likely to be blocked by buildings and other man- the performance of GPS signal reception classification.
made obstacles, with the corresponding signal being NLOS. Deng • A sensitivity analysis process is designed and used to select
proposed a satellite selection algorithm based on satellite eleva- the most appropriate iteration number of the algorithm
to ensure the classification accuracy, while avoiding over
tion angle and geometric dilution precision (GDOP). The influence
fitting of the GBDT algorithm
of elevation angle on positioning accuracy is analysed [33]. Wang
et al. also use C/N0 and satellite elevation angle to evaluate the
• The raw GPS measurements are collected in various urban
canyon environments to evaluate the performance of the
possibility of LOS [34]. In addition, larger pseudorange residuals
designed algorithm
could point to a higher probability of NLOS or multipath, and
therefore, this can also be used to classify signal reception [35]. 2. Algorithm design
The other features that could be used for classification are pseu-
dorange change rate, horizontal dilution of precision (HDOP) and The framework of the algorithm (Section 2.1) consists of the
vertical dilution of precision (VDOP) [32,36]. three functions of data labelling (Section 2.2), feature selection
Machine learning has the advantages of high speed and ac- (Section 2.3) and the GBDT based signal classification algorithm
curacy in dealing with various types of features and has been (Section 2.4).
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx 3
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
4 R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx
segments to connect the point of reflection and the satellite, and decreases, however, there is a more likelihood of signal blockage
the point of reflection and the receiver. If both line segments are by buildings and other obstacles. Elevation angle can, therefore,
not blocked by some other structure, they are considered as a be used as a feature in signal reception classification. The satellite
reflected path. If the intersection is outside the surface, a reflected elevation angle θ can be calculated as (11):
path does not exist. If the receiver only receives a reflected signal
from a satellite, the measurements are labelled as NLOS. If the
θ (i) = − arcsin(u(Di) ) (11)
receiver receives both direct and reflected signals from a satellite,
2.4. Signal classification algorithm based on GBDT
the measurements are labelled as multipath.
GBDT is a supervised learning algorithm [45], also known
2.3. Feature selection
as gradient boost regression tree (GBRT) and multiple additive
regression tree (MART). It combines regression trees using a gra-
Currently, the signal features can be easily obtained from
dient boosting technique and has been widely applied in various
modern GNSS receivers. The features selected in this paper are:
disciplines, such as credit risk assessment [46], transport crash
C/N0 , pseudorange residuals and satellite elevation angle.
prediction [47] and fault prognosis in electronic circuits [48]. It
(1) C/N0 : the signal strength is measured in terms of the C/N0 ,
replaces the difficult function minimization problem by using
which is the ratio of carrier power to noise power per unit of
least-squares function minimization, followed by only a single
bandwidth in decibel-hertz (dB-Hz). Usually, the C/N0 of an NLOS
parameter optimization based on the original criterion. Therefore,
signal is smaller than that of a LOS signal. Therefore, it is the most
this advantage could potentially facilitate the achievement of high
commonly used feature. However, since both low and high C/N0
accuracy GPS signal reception classification [45].
values are possible for NLOS in urban canyons due to the different
In the designed GBDT based algorithm, each sample in the
reflection surface materials, classification simply based on C/N0 training set is represented as xi = (C /N0i , ηi , θi ), where i =
can be unreliable and therefore, necessitates the consideration of 1, 2, 3, . . . , N indicating the sequence number of the sample, and
additional features. N is the number of samples. The labelled training dataset can
(2) Pseudorange residuals, η: the pseudorange ρ is computed be expressed as T = {(x1 , y1 ) , (x2 , y2 ) , (x3 , y3 ) , . . . , (xN , yN )},
as the time ∆T from the signal being transmitted from the satel- where yi ∈ {−1, 0, 1}, is the label of each sample, −1, 0, 1 rep-
lite to the detection of the signal in the receiver multiplied by the resent the NLOS, Multipath and LOS signals, respectively. GBDT
speed of light c plus the clock synchronization error t multiplied minimizes the expected value of loss function L (yi , f (xi )) by
by the speed of light c, which can be expressed in (7): iteratively creating a weak learner ht (xi ; a) that points in the
ρ = ∆T × c + t × c (7) steepest-descent direction, i.e., the negative gradient direction.
The weak learner ht (xi ; a) is a classification tree, the parameters
The satellite position can be resolved from the broadcast a are the splitting variables, split locations and the terminal node
ephemeris, so the positioning solution can be calculated by solv- means of the individual trees. The square loss function (12) is
ing the pseudorange equations using least square estimation in used in this paper:
(8):
1
)−1 L (yi , f (xi )) = (yi − f (xi ))2 (12)
r = GT G GT ρ
(
(8) 2
The input to GBDT is the labelled training dataset T , with M
where r is the receiver state, including the three-dimensional
as the number of iterations. The GBDT based GPS signal reception
position and the receiver clock offset. G denotes the design ma-
(i) (i) (i) classification algorithm flow is as follows:
trix consisting of the unit LOS vector (uN , uE , uD ) between the
1. Initialize a weak learner f0 (x) for the training data:
satellite and receiver in (9):
N
(1) (1) (1)
⎡ ⎤ ∑
uN uE uD −1 f0 (x) = argmin L(yi , γ ) (13)
⎢ (2) (2) (2) ⎥ γ
⎢ uN uE uD −1⎥ i=1
G=⎢
⎢ .. .. .. .. ⎥
⎥ (9)
⎣ . f0 (x) is a regression tree consisting of only one root node. Since
. . . ⎦
(i) (i) (i) L is selected to be the square loss function, f0 (x) becomes:
uN uE uD −1
f0 (x) = y (14)
Once the positioning solution is calculated, the distance be-
tween the receiver and satellite can be obtained. The difference 2. For m = 1 to M:
between this distance and the pseudorange is called the pseudo- 2.1 Compute the negative gradient
range residual, expressed as η, which can be calculated in (10): [
∂ L (yi , f (xi ))
]
ỹi = − (15)
∂ f (xi ) f (x)=fm−1 (x)
η =ρ−G·r (10)
2.2 Replace the label{(yi of the
) training ) dataset )with ỹi(to obtain
Pseudorange residuals are important for LOS/Multipath/NLOS x1 , ỹ1 , x2 , ỹ2 , x3 , ỹ3 , . . . , xN , ỹN ,
( ( )}
a new dataset Tm =
signal reception classification [43]. In theory, the absolute value of and create a new regression tree hm (xi ; am ) by training the new
pseudorange residuals and the probability of NLOS are positively dataset Tm :
related. This phenomenon becomes more obvious when only a N
small portion of the signals are NLOS [44]. Hsu et al. showed ∑
at = argmin (ỹi − hm (xi ; a))2 (16)
that the pseudorange residuals could be used as an indicator to a
i=1
classify signal reception type if the number of measurements is
sufficient [35]. 2.3 Update the strong learner:
(3) Satellite elevation, θ : there is a significant positive cor-
fm (x) = fm−1 (x) + ρ hm (x; am ) (17)
relation between satellite elevation and probability of LOS. In
general, signals from satellites with higher elevation angles are where ρ is the learning rate, usually chosen to be a value between
less likely to be blocked or reflected by buildings. As the elevation 0∼1 to prevent the overfitting.
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx 5
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
6 R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx
Table 1
Summary of the datasets.
Dataset D0 D1 D2 D3 D4
Total samples 96 992 24 000 24 000 11 615 25 039
LOS (labelled as 1) 25 987 8 000 8 000 0 0
Multipath (labelled as 0) 18 164 8 000 8 000 3 114 8 831
NLOS (labelled as −1) 52 841 8 000 8 000 8 501 16 208
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx 7
Table 2
Confusion matrix of LOS (Noted as 1), Multipath (0) and NLOS (−1) classification results using
different algorithms based on a single feature (C/N0 ) for Testing Dataset D2.
Algorithms GBDT Decision tree
Label ID −1 0 1 −1 0 1
−1 6365 1635 0 3729 4018 253
No of samples 0 4197 3803 0 1836 4611 1553
1 0 391 7609 0 182 7818
Accuracy (%) 74.1 67.3
Category accuracy (%) 79.7 47.5 95.1 46.6 57.6 97.7
Algorithms Distance-weighted KNN ANFIS
Label ID −1 0 1 −1 0 1
−1 15 7735 250 6054 1691 255
No. of samples 0 0 6170 1830 3934 2329 1737
1 0 22 7978 0 365 7635
Accuracy (%) 59 66.7
Category accuracy (%) 69.4 29 99.9 75.7 29.1 95.4
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
8 R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx 9
Table 3
Confusion matrix of LOS (1), Multipath (0) and NLOS (−1) classification results using different
algorithms based on multiple features (C/N0 , Pseudorange Residuals and Satellite Elevation Angle)
for Testing Dataset D2.
Algorithms GBDT Decision tree
Label ID −1 0 1 −1 0 1
−1 6858 1134 8 7583 395 22
No of samples 0 1322 6522 156 5006 2795 199
1 0 14 7986 133 5 7862
Accuracy (%) 89 76
Training Time (s) 47.6 3.1
Category accuracy (%) 85.7 81.5 99.8 94.8 34.9 98.3
Algorithms Distance-weighted KNN ANFIS
Label ID −1 0 1 −1 0 1
−1 6521 1472 7 6256 1716 28
No. of samples 0 1266 6724 10 2237 5634 129
1 0 1 7999 0 47 7953
Accuracy (%) 88.5 82.7
Training time (s) 1.2 105
Category accuracy (%) 81.5 84.1 100 64.9 70.4 99.4
Table 4
Confusion matrix of LOS (1), Multipath (0) and NLOS (−1) classification results using different
algorithms based on multiple features (C/N0 , Pseudorange Residuals and Satellite Elevation Angle)
for Testing Dataset D3.
Algorithms GBDT Decision tree
Label ID −1 0 1 −1 0 1
−1 6239 2262 0 8320 168 13
No of samples 0 299 2726 89 1321 1681 112
1 0 0 0 0 0 0
Accuracy (%) 77.2 86.1
Category accuracy (%) 73.4 87.5 97.9 54
Algorithms Distance-weighted KNN ANFIS
Label ID −1 0 1 −1 0 1
−1 6217 2284 0 7047 1436 18
No. of samples 0 1024 1687 403 868 1252 994
1 0 0 0 0 0 0
Accuracy (%) 68 71.5
Category accuracy (%) 73.1 54.2 82.9 40.2
Table 5
Confusion matrix of LOS (1), Multipath (0) and NLOS (−1) classification results using different
algorithms based on multiple features (C/N0 , Pseudorange Residuals and Satellite Elevation Angle)
for Testing Dataset D4.
Algorithms GBDT Decision tree
Label ID −1 0 1 −1 0 1
−1 8830 7378 0 14 485 1723 22
No of samples 0 3694 5025 112 7 782 851 198
1 0 0 0 0 0 0
Accuracy (%) 55.3 61.3
Category accuracy (%) 54.6 56.6 89.4 9.6
Algorithms Distance-weighted KNN ANFIS
Label ID −1 0 1 −1 0 1
−1 9217 6991 0 9385 6663 160
No. of samples 0 2970 5775 86 3141 5374 316
1 0 0 0 0 0 0
Accuracy (%) 60 59
Category accuracy (%) 56.9 65.4 57.9 60.9
In summary, the classification rules extracted from the GBDT 4. Conclusions and future work
algorithm are applicable to environments with largely similar
spatial and material characteristics (i.e. testing dataset D2 and D3) This paper has presented a GBDT based algorithm, using C/N0 ,
pseudorange residual and elevation angle as the features, to clas-
but with low adaptively to datasets with different characteristics
sify GPS signal reception as LOS, multipath and NLOS, mean-
(i.e. testing dataset D4). Further work is exploring further the while, the static positioning solutions are also calculated with the
issue of adaptively and the development of real-time on-line detected NLOS eliminations. The signal reception classification
training algorithms. results of the testing dataset D2 (from Location A), from the
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
10 R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx
Table 6
Comparisons of RMSE for Location A, B and C.
RMSE (m) E N U 3D 2D
C/N0 Based NLOS 40.92 17.9 79.01 90.76 44.67
Location A Elimination
GBDT with 26.19 17.02 51.02 59.82 31.23
Multi-Feature-Based NLOS
Elimination
Improvement (%) 36.0 4.9 35.4 34.1 30.1
C/N0 Based NLOS 20.13 45.41 63.72 80.80 49.67
Location B Elimination
GBDT with 18.35 35.61 50.89 64.77 40.06
Multi-Feature-Based NLOS
Elimination
Improvement (%) 8.8 21.6 20.1 19.8 19.4
C/N0 Based NLOS 25.4 29.5 127.67 133.37 38.59
Location C Elimination
GBDT with 25.07 32.27 123.83 130.39 40.86
Multi-Feature-Based NLOS
Elimination
Improvement (%) 1.3 −9.4 3.0 2.2 −5.9
Fig. 11. Positioning results based on the elimination of the NLOS signals detected from the proposed GBDT with multi-feature-based method and single C/N0 based
method in Location A.
Fig. 12. Positioning results based on the elimination of the NLOS signals detected from the proposed GBDT with multi-feature-based method and single C/N0 based
method in Location B.
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx 11
Fig. 13. Positioning results based on the elimination of the NLOS signals detected from the proposed GBDT with multi-feature-based method and single C/N0 based
method in Location C.
same environment as the urban training dataset, have shown positioning accuracy to some extent. However, simple elimina-
that the overall accuracy of the multi-feature-based classification tion cannot meet the requirements of high-precision positioning.
algorithm (i.e. 89% for the static data) is much higher than that of The proposed method is a complement to the existing positioning
the single-feature-based C/N0 classification algorithm (i.e.74.1%). methods based on 3D city model such as shadow matching. In fu-
Furthermore, for the correct classification of signals in the cat- ture studies, we will combine the proposed method with 3D map
egories of NLOS and multipath, the local accuracies are 85.7% to achieve better static positioning accuracy and therefore, could
and 81.5% respectively, better than the decision tree, distance- be used for the civil engineering applications such as building
weighted KNN and ANFIS algorithms. For the testing dataset D3 maintenance, and structural integrity/deformation monitoring in
(from Location B), from the similar environment as the training the urban canyons. For dynamic positioning, research is ongoing
dataset, the algorithm achieves an overall classification accuracy based on a framework of grid of reference points from which data
of 77.2%. In particular, the detection accuracies are 73.4% and will be captured for training. Users will then automatically obtain
87.5% for the NLOS and multipath respectively, better than the the classification rules of nearby reference points for accurate
decision tree, distance-weighted KNN and ANFIS algorithms. The satellite signal reception type classification, to improve position-
computation time for the GBDT is higher than the decision tree ing. In addition, we will develop online data training for use with
and distance-weighted KNN, however, this could be resolved by the GBDT algorithm for real-time applications such as ground
higher computing processing power. Overall, considering compu- vehicles, pedestrians and unmanned aerial vehicles (UAVs).
tation time and classification accuracy, it is shown that GBDT is
the best of the algorithms investigated, for GPS signal reception Declaration of competing interest
classification. It should be noted that, for some datasets with
different characteristics from the training dataset, such as D4 No author associated with this paper has disclosed any po-
(from Location B), the classification performance is degraded due tential or pertinent conflicts which may be perceived to have
to the inapplicability of the rules extracted from the training impending conflict with this work. For full disclosure statements
environment. refer to https://doi.org/10.1016/j.asoc.2019.105942.
Based on the GBDT with multi-feature-based classification re-
sults, the application for the static positioning results are further Acknowledgements
analysed with the detected NLOS elimination. For the Location
A, a positioning accuracy improvement of 34.1% (3D RMSE) has This work is supported by the National Natural Science Foun-
been achieved compared to the single C/N0 based method. For dation of China (Grant No. 41704022, No. 41974033), Natu-
the Location B, the proposed method could also provide an im- ral Science Foundation of Jiangsu Province, China (Grant No.
provement of the positioning accuracy with 19.8% (3D RMSE), BK20170780), China Postdoctoral Science Foundation funded
lower than that for the Location A. While for the Location C, the Project (Grant No. 2017M623360) and Foundation of Graduate
proposed method is unable to improve the positioning accuracy Innovation Center in NUAA, China (Grant No. KFJJ20180719) and
due to the different spatial and material characteristics. Therefore, Specialized Research Fund for Shandong Provincial Key Labora-
environmental sensitivity is a key issue in the application of tory, China (Grant No. KLWH201813).
classification algorithms. This could be addressed by developing
spatio-temporally dynamic algorithms, consideration of more sig- References
nal related features and training data from a number of different
locations in future work. [1] P. Misra, P. Enge, GPS Measurements and error sources, in: Global Po-
For static positioning, the proposed method can be used to sitioning System: Signals, Measurements and Performance, second ed.,
Ganga-Jamuna Press, Lincoln, MA, USA, 2011, pp. 174–177.
detect the NLOS and multipath signals, which could then be [2] P.W. Ward, J.W. Betz, C.J. Hegarty, Interference, multipath, and scintillation,
used in data pre-processing. The experimental results show that in: Understanding GPS: Principles and Applications, second ed., Artech
removing NLOS based on the proposed method can improve the House, Norwood, MA, USA, 2006, pp. 279–292.
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.
12 R. Sun, G. Wang, W. Zhang et al. / Applied Soft Computing Journal xxx (xxxx) xxx
[3] G. MacGougan, G. Lachapelle, R. Klukas, K. Siu, L. Garin, J. Shewfelt, G. Cox, [27] L. Wang, P.D. Groves, M.K. Ziebart, GNSS Shadow matching: improving
Performance analysis of a stand-alone high-sensitivity receiver, GPS Solut. urban positioning accuracy using a 3D city model with optimized visibility
6 (3) (2002) 179–195, http://dx.doi.org/10.1007/s10291-002-0029-z. scoring scheme, Navigation 60 (3) (2013) 195–207, http://dx.doi.org/10.
[4] J.M. Tranquilla, J.P. Carr, H.M. Al-Rizzo, Analysis of a choke ring ground- 1002/navi.38.
plane for multipath control in global positioning system (GPS) applications, [28] S. Miura, L.T. Hsu, F. Chen, S. Kamijo, GPS Error correction with pseu-
IEEE Trans. Antennas Propag. 42 (7) (1994) 905–911, http://dx.doi.org/10. dorange evaluation using three-dimensional maps, IEEE Trans. Intell.
1109/8.299591. Transp. Syst. 16 (6) (2015) 3104–3115, http://dx.doi.org/10.1109/TITS.2015.
[5] Z. Jiang, P.D. Groves, NLOS GPS Signal detection using a dual-polarisation 2432122.
antenna, GPS Solut. 18 (1) (2014) 15–26, http://dx.doi.org/10.1007/s10291- [29] L.T. Hsu, Y. Gu, S. Kamijo, 3D Building model-based pedestrian positioning
012-0305-5. method using GPS/GLONASS/QZSS and its reliability calculation, GPS Solut.
[6] H. Liang, T. Walter, P. Enge, and, G.X. Gao, GNSS Multipath and jamming 20 (3) (2016) 413–428, http://dx.doi.org/10.1007/s10291-015-0451-7.
mitigation using high-mask-angle antennas and multiple constellations, [30] Y. Gu, L.T. Hsu, S. Kamijo, GNSS/Onboard inertial sensor integration with
IEEE Trans. Intell. Transp. Syst. 16 (2) (2015) 741–750, http://dx.doi.org/ the aid of 3-d building map for lane-level vehicle self-localization in
10.1109/tits.2014.2342200.
urban canyon, IEEE Trans. Veh. Technol. 65 (6) (2016) 4274–4287, http:
[7] V.A. Dierendonck, P. Fenton, T. Ford, Theory and performance of narrow
//dx.doi.org/10.1109/TVT.2015.2497001.
correlator spacing in a GPS receiver, Navigation 39 (3) (1992) 265–283,
[31] R. Yozevitch, B.B. Moshe, H. Levy, Breaking the 1meter accuracy bound
http://dx.doi.org/10.1002/j.2161-4296.1992.tb02276.x.
in commercial GNSS devices, in: 27th Convention of the Electrical and
[8] R.D.J.V. Nee, J. Siereveld, P.C. Fenton, B.R. Townsend, The multipath
Electronics Engineers in Israel (IEEEI), Eilat, Israel, 2012, pp. 1-5.
estimating delay lock loop: approaching theoretical accuracy limits, in:
[32] R. Yozevitch, B.B. Moshe, A. Weissman, A robust GNSS LOS/NLOS signal
IEEE/ION Position, Location and Navigation Symposium, Las Vegas, NV,
classifier, Navigation 63 (4pp) (2016) 429–442.
USA, 1994, pp. 246-251, http://dx.doi.org/10.1109/ISSSTA.1992.665623.
[33] G. Deng, GPS Satellite selection algorithm based on satellite elevation angle
[9] L. Garin, J.M. Rousseau, Enhanced strobe correlator multipath rejection for
code & carrier, in: ION GPS, Kansas, MO, USA, 1997, pp. 559-568. and GDOP, Digit. Commun. 37 (2) (2010) 47–50, (in Chinese).
[10] B.R. Townsend, P.C. Fenton, A practical approach to the reduction of [34] L. Wang, P.D. Groves, M. Ziebart, Smartphone shadow matching for better
pseudorange multipath errors in a Ll GPS receiver, in: International cross-street GNSS positioning in urban environments, J. Navig. 68 (3)
Technical Meeting of the Satellite Division of the Institute of Navigation, (2015) 411–433, http://dx.doi.org/10.1017/S0373463314000836.
Salt Lake City, UT, USA, 1994, pp. 143-148. [35] L.T. Hsu, H. Tokura, N. Kubo, Y. Gu, S. Kamijo, Multiple faulty GNSS mea-
[11] M.S. Braasch, Performance comparison of multipath mitigating receiver surement exclusion based on consistency check in urban canyons, Sensors
architectures, in: IEEE Aerospace Conference, Big Sky, MT, USA, 2001, pp. 17 (6) (2017) 1909–1917, http://dx.doi.org/10.1109/JSEN.2017.2654359.
3/1309-3/1315. [36] L.T. Hsu, GNSS multipath detection using a machine learning approach, in:
[12] B.R. Townsend, P.C. Fenton, K.J.V. Dierendonck, R.D.J.V. Nee, Performance International conference on intelligent transportation systems, Yokohama,
evaluation of the multipath estimating delay lock loop, Navigation 42 (3) Japan, 2017, pp. 1-6. http://dx.doi.org/10.1109/ITSC.2017.8317700.
(1995) 502–514, http://dx.doi.org/10.1002/j.2161-4296.1995.tb01903.x. [37] Q. Phan, S. Tan, I.V. Mcloughlin, L.D. Vu, A unified framework for GPS
[13] P.C. Fenton, J. Jones, The theory and performance of NovAtel Inc.’s vision code and carrier-phase multipath mitigation using support vector regres-
correlator, in: ION GNSS, Long Beach, CA, USA, 2005, pp. 2178-2186. sion, Adv. Artif. Neural Syst. (2013) 1–14, http://dx.doi.org/10.1155/2013/
[14] F. Wang, J. Chen, S. Gao, K. Tang, X. Meng, Development and sea trial of 240564.
real-time offshore pipeline installation monitoring system, Ocean Eng. 146 [38] M. Socharoentum, H.A. Karimi, Y. Deng, A machine learning approach to
(2017) 468–476, http://dx.doi.org/10.1016/j.oceaneng.2017.09.016. detect non-line-of-sight GNSS signals in Nav2Nav, in: presented at the 23rd
[15] M.G. Petovello, C. O’Driscoll, G. Lachapelle, Weak signal carrier tracking of ITS World Congress, Melbourne, Australia, 10–14 October, 2016.
weak using coherent integration with an ultra-tight GNSS/IMU receiver, [39] Y. Quan, L. Lau, G.W. Roberts, X. Meng, C. Zhang, Convolutional neural
in: presented at the European Navigation Conference, Toulouse, Lauragais, network based multipath detection method for static and kinematic GPS
France, Apr. 23-25, 2008. high precision positioning, Remote Sens. 10 (12) (2018) 2052, http://dx.
[16] M.G. Petovello, G. Lachapelle, Comparison of vector-based software re- doi.org/10.3390/rs10122052.
ceiver implementations with application to ultra-tight GPS/INS integration, [40] B. Guermah, H.E. Ghazi, T. Sadiki, H. Guermah, A robust GNSS
in: ION GNSS, Fort Worth, Texas, 2006, pp. 2977-2989. LOS/multipath signal classifier based on the fusion of information and
[17] A. Soloviev, C. Toth, D. Grejner-Brzezinska, Performance of deeply inte- machine learning for intelligent transportation systems, in: 2018 IEEE
grated GPS/ins in dense forestry areas, J. Appl. Geodesy 6 (1) (2012) 3–13, International Conference on Technology Management, Operations and
http://dx.doi.org/10.1515/jag-2011-0005. Decisions (ICTMOD), Marrakech, Morocco, 2018, pp. 94-100. http://dx.doi.
[18] A. Soloviev, F.V. Graas, Use of deeply integrated GPS/INS architecture org/10.1109/ITMC.2018.8691272.
and laser scanners for the identification of multipath reflections in urban
[41] R. Sun, L.T. Hsu, D. Xue, G. Zhang, W.Y. Ochieng, GPS Signal reception
environments, IEEE J. Sel. Top. Sign. Proces. 3 (5) (2009) 786–797, http:
classification using adaptive neuro-Fuzzy inference system, J. Navig. (2018)
//dx.doi.org/10.1109/JSTSP.2009.2027796.
1–17, http://dx.doi.org/10.1017/S0373463318000899.
[19] V. Milanes, J.E. Naranjo, C. Gonzalez, J.A. Ruiz, T. Pedro, Autonomous vehicle
[42] L. Lau, P.A. Cross, Development and testing of a new ray-tracing approach
based in cooperative GPS and inertial systems, Robotica 26 (5) (2008)
to GNSS carrier-phase multipath modelling, J. Geod. 81 (11) (2007)
627–633, http://dx.doi.org/10.1017/S0263574708004232.
713–732, http://dx.doi.org/10.1007/s00190-007-0139-z.
[20] S. Zhao, Y. Chen, J.A. Farrell, High-precision vehicle navigation in urban
[43] P.D. Groves, Z. Jiang, L. Wang, M.K. Ziebart, Intelligent urban positioning
environments using an mem’s IMU and single-frequency GPS receiver, IEEE
using multi-constellation GNSS with 3D mapping and NLOS signal detec-
Trans. Intell. Transp. Syst. 17 (10) (2016) 2854–2867, http://dx.doi.org/10.
1109/TITS.2016.2529000. tion, in: International Technical Meeting of the Satellite Division of the
[21] K.W. Chiang, Y.W. Huang, An intelligent navigator for seamless INS/GPS Institute of Navigation, Nashville, TN, USA, Sep. 2012, pp. 458-472.
integrated land vehicle navigation applications, Appl. Soft Comput. 8 (1) [44] P.D. Groves, Z. Jiang, Height aiding, C/N0 weighting and consistency
(2008) 722–733, http://dx.doi.org/10.1016/j.asoc.2007.05.010. checking for GNSS NLOS and multipath mitigation in urban areas, J. Navig.
[22] J.I. Meguro, T. Murata, J.I. Takiguchi, Y. Amano, T. Hashizume, GPS Multi- 66 (5) (2013) 653–669, http://dx.doi.org/10.1017/s0373463313000350.
path mitigation for urban area using omnidirectional infrared camera, IEEE [45] J.H. Friedman, Greedy function approximation: a gradient boosting ma-
Trans. Intell. Transp. Syst. 10 (1) (2009) http://dx.doi.org/10.1109/tits.2008. chine, Ann. Statist. 29 (5) (2011) 1189–1232, http://dx.doi.org/10.2307/
2011688. 2699986.
[23] S. Peyraud, D. Bétaille, S. Renault, M. Ortiz, F. Mougel, D. Meizel, F. [46] L. Zhou, GBDT-SVM Credit risk assessment model and empirical analysis of
Peyret, About non-line-of-sight satellite detection and exclusion in a 3D peer-to-peer borrowers under consideration of audit information, Open J.
map-aided localization algorithm, Sensors 13 (1) (2013) 829–847, http: Bus. Manag. 6 (2018) 362–372, http://dx.doi.org/10.4236/ojbm.2018.62026.
//dx.doi.org/10.3390/s130100829. [47] H. Park, A. Haghani, S. Samuel, A.K. Michael, Real-time prediction and
[24] F. Peyret, D. Betaille, M. Ortiz, S. Miquel, L. Fontenay, How to improve avoidance of secondary crashes under unexpected traffic congestion, Accid.
GNSS positioning quality of service for demanding ITS in city environments Anal. Prev. 112 (2018) 39–49, http://dx.doi.org/10.1016/j.aap.2017.11.025.
by using 3D digital maps, in: presented at the 19th ITS World Congress, [48] L. Wang, D. Zhou, H. Zhang, W. Zhang, J. Chen, Application of relative
Vienna, Austria, Oct. 22-26, 2012. entropy and gradient boosting decision tree to fault prognosis in elec-
[25] P.D. Groves, Shadow matching: a new GNSS positioning technique for tronic circuits, Symmetry 10 (10) (2018) 495, http://dx.doi.org/10.3390/
urban canyons, J. Navig. 63 (3) (2011) 417–430, http://dx.doi.org/10.1017/ sym1010049.
S0373463311000087. [49] J.R. Jang, ANFIS: adaptive-network-based fuzzy inference system, Syst. Man
[26] L. Wang, P. Groves, M. Ziebart, Urban positioning on a smartphone: real- Cybern. 23 (3) (1993) 665–685, http://dx.doi.org/10.1109/21.256541.
time shadow matching using GNSS and 3D city models, in: ION GNSS,
Nashville, Tennessee, USA, 2013, pp. 1606-1619.
Please cite this article as: R. Sun, G. Wang, W. Zhang et al., A gradient boosting decision tree based GPS signal reception classification algorithm, Applied Soft Computing
Journal (2019) 105942, https://doi.org/10.1016/j.asoc.2019.105942.