Non-Line-of-Sight Multipath Classification Method For BDS Using Convolutional Sparse Autoencoder With LSTM
Non-Line-of-Sight Multipath Classification Method For BDS Using Convolutional Sparse Autoencoder With LSTM
Yahang Qin, Zhenni Li*, Shengli Xie, Bo Li, Ming Liu, and Victor Kuzin
Abstract: Multipath signal recognition is crucial to the ability to provide high-precision absolute-position services
by the BeiDou Navigation Satellite System (BDS). However, most existing approaches to this issue involve
supervised machine learning (ML) methods, and it is difficult to move to unsupervised multipath signal
recognition because of the limitations in signal labeling. Inspired by an autoencoder with powerful unsupervised
feature extraction, we propose a new deep learning (DL) model for BDS signal recognition that places a long
short-term memory (LSTM) module in series with a convolutional sparse autoencoder to create a new
autoencoder structure. First, we propose to capture the temporal correlations in long-duration BeiDou satellite
time-series signals by using the LSTM module to mine the temporal change patterns in the time series.
Second, we develop a convolutional sparse autoencoder method that learns a compressed representation of
the input data, which then enables downscaled and unsupervised feature extraction from long-duration BeiDou
satellite series signals. Finally, we add an l1/2 regularizer to the objective function of our DL model to remove
redundant neurons from the neural network while ensuring recognition accuracy. We tested our proposed
approach on a real urban canyon dataset, and the results demonstrated that our algorithm could achieve better
classification performance than two ML-based methods (e.g., 11% better than a support vector machine) and
two existing DL-based methods (e.g., 7.26% better than convolutional neural networks).
Key words: convolutional sparse autoencoder; BeiDou Navigation Satellite System (BDS); long short-term
memory (LSTM); multipath classification
© The author(s) 2025. The articles published in this open access journal are distributed under the terms of the
Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
Yahang Qin et al.: Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse... 69
for autonomous driving, intelligent transportation, which not only increases the size of the receiving
robot navigation, and other related fields[1−3]. Usually, device, but also increases its cost. Meanwhile, sidereal
BDS can reach centimeter-level positioning accuracy filtering (SF)[14], multipath hemispherical maps
in open areas[2]. However, multipath interference (MHM)[15], 3D city models[16], and other software
dramatically reduces the positioning accuracy of BDS methods have been proposed to mitigate multipath
in the so-called “urban canyon”[4]. The multipath interference. However, the SF method requires
interference depends on the satellite elevation angle precalculation of the satellite operational period.
(ELE), the environment around the receiving antenna, Because the operational period of each satellite varies
the distance from the reflecting surface to the receiver, slowly over time, there is a consequent decrease in the
the reflection coefficient of the reflecting source, and multipath mitigation effect. The MHM method requires
the antenna characteristics. This leads to multipath a very large matrix of multipath models, which
interference becoming the main error source in the increases the computational load and does not establish
field of BDS and GNSS high-accuracy positioning[5−8]. a fully accurate multipath model. In addition, real-time
In the urban canyon environment, satellite signals updating of 3D city models is difficult.
develop multiple propagation paths by specular Artificial intelligence (AI) is a rapidly developing
reflection and bypassing. These extra signals interfere technology that can learn multipath signals from
with the direct signals from the satellite at the receiver sufficient data without the need to build complex
antenna, generating the multipath effect[9]. For the mathematical models and shows great potential for
BDS, there are two types of signals causing multipath identifying multipath in urban canyon environments.
errors: line-of-sight (LOS) multipath signals and non- These multipath signal-recognition methods can be
line-of-sight (NLOS) signals. NLOS-based errors tend divided into supervised and unsupervised cases.
to be worse than LOS-based errors. Examples of LOS Supervised algorithms for recognizing multipath
and NLOS signals are shown in Fig. 1. The LOS signals include decision tree (DT)[17], support vector
contains only the one direct signal emitted by the machine (SVM)[18], k-nearest neighbor (KNN)[19], and
satellite, and the NLOS contains only the one signal neural network (NN)[20]. They have own advantages,
and which algorithm to choose for a specific task
reflected by the environment near the receiver[10]. LOS
should be considered according to the data
multipath can easily reduce the GNSS accuracy to tens
characteristics. For example, SVM has strong
of meters. NLOS positioning errors can be several
generalization ability for small samples, but it is
hundred meters above[11]. Therefore, some current
relatively slow in processing large data sets. Overall,
studies[11, 12] are focusing on the issue of multipath
supervised learning has two main limitations: the
interference with the aim of mitigating its influence on
construction of databases containing a large number of
satellite positioning accuracy.
labels is difficult, and the recognition accuracy of the
Exploring how to cope with the satellite positioning
models needs to be improved. However, some recent
offset problem caused by multipath interference has
works have attempted to explore the use of
received wide attention. Proposals have mainly been in
unsupervised multipath-signal-recognition methods.
the areas of hardware upgrading and software
For example, the generative adversarial network
processing. Hardware devices such as anti-multipath
considers unsupervised domain adaptation (UDA)-
antennas[13] are used to filter multipath interference,
based models[21] to reduce the discrepancy between
real and simulated data. In Ref. [22], a k-means
LOS
clustering approach was used to achieve unlabeled
NLOS
satellite satellite multipath signal recognition. Unfortunately, this
process includes the problems of high-dimensional data
redundancy and low classification accuracy.
In this paper, we propose an unsupervised
convolutional sparse autoencoder containing a long
short-term memory (LSTM) module to improve the
Direct signal Reflection signal Blocked signal BDS antenna
accuracy of NLOS multipath recognition. First, to
Fig. 1 Two types of BDS signal: LOS and NLOS multipath. tackle the twin issues of the high dimensionality of the
70 Tsinghua Science and Technology, February 2025, 30(1): 68−86
training samples and the need for a large number of provided and analyzed in section 6. Furthermore, we
labels, we use a convolutional autoencoder to realize present a discussion in this section. Finally,
descending and unsupervised signal feature extraction. conclusions are drawn in Section 7.
Then, to learn more fully the information related to the
time series of BDS signals, we employ multiple LSTM 2 Related Work
modules to capture the time series variation patterns of Numerous works have been studied in the field of
BDS signal features such as carrier-to-noise ratio multipath identification. The research includes three
(C/ N0 ), ELE, and the pseudorange residual (PR). This main kinds of methods: 3D, machine learning (ML),
enables the model to learn the underlying features of and DL-based methods. In the following, we review
the signal. Furthermore, to deal with the high- the closely related works and highlight the novelty of
dimensional data redundancy in convolutional our contributions.
autoencoder models, we propose to add an l1/2
regularizer to the objective function of the 2.1 3D-based methods
unsupervised deep learning (DL) model of a The 3D city models have been widely used in the field
convolutional autoencoder with an LSTM module, of multipath environmental positioning. Typically, a
which can remove redundant neurons from the NN combination of shadow matching and 3D mapping
while obtaining high classification accuracy. Finally, aided (3DMA) was presented in Ref. [23], and the
the performance of our model has been evaluated using method can identify satellite signal types by building
a real urban canyon dataset, demonstrating superior boundaries. The sky and building contours are
performance to that of existing models. estimated by fitting data curves to a smooth spline
The main contributions of this paper can be model, and building boundaries are calculated using an
summarized as follows: adaptive weighting scheme. Reference [24] introduced
(1) We develop a convolutional autoencoder method a shadow-matching classifier and a confidence check to
that learns a compressed representation of the input improve the NLOS classifier accuracy. Moreover,
data for downscaling and unsupervised feature exploring how to build applications over the NLOS
extraction of long-duration time-series signals also achieved wide attention. For example, applying a
containing multipath. 3D model to correct positions so that the NLOS
(2) We employ an LSTM module to capture the contributes to the absolute positioning[25]. Some works
temporal correlation of the long-duration time-series use 3D maps and ray tracing to correct the
signals from BeiDou satellites and thereby mine a localization[26]. However, 3DMA-based methods rely
temporal change law for the time series. on the timely updating of 3D models. This poses a
(3) We add an l1/2 regularizer to the objective great challenge in data-intensive computations. To
function of an unsupervised DL model of a avoid the limitations of this 3D city model, ML was
convolutional autoencoder with an LSTM module to adopted.
achieve the removal of redundant neurons from the NN
while obtaining higher sparsity and recognition 2.2 Machine learning-based methods
accuracy. Due to the complex and variable architectural structure
(4) We develop a novel convolutional sparse of the urban environment in which the receiver is
autoencoder and LSTM network model for NLOS located. Making the establishment of accurate physical
multipath classification. models is difficult. Nevertheless, ML can mine the
The rest of this paper is organized as follows. Section underlying features of large amounts of data. The
2 reviews related work. Section 3 illustrates the BDS gradient boosted the decision tree (GBDT)[17] was used
features with different signal types. Section 4 describes to classify LOS, multipath, and NLOS with three
the proposed DL networks and is explained in detail. In features: C/ N0 , PR, and ELE. Reference [19] compared
section 5, we present the environment where the data various ML algorithms such as KNN, NN, SVM, and
set is collected, followed by signal type labeling DT. LOS/NLOS classifiers based on the signal-to-noise
method using the combination of sky mask and code ratio (SNR), number of received satellites (NRS), ELE,
pseudorange double difference observable. Then, the PR, pseudorange residual percentage (PRP), and
experimental setup and classification results are normalized pseudorange residual (NRP) are
Yahang Qin et al.: Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse... 71
established. In particular, the SVM[18, 20] has played a accuracy, we used the LSTM method to extract the
vital role in multipath detection, where they are used to relevant information from the time series. To solve the
learn the features of national marine electronics problem of high dimensional data redundancy, we use
association (NMEA), receiver independent exchange l1/2 regularizer to maintain the classification
(RINEX), and correlator levels. However, traditional performance while removing redundant neurons and
ML methods can only process information within one increasing the model sparsity. Finally, the NLOS signal
epoch, and it is easy to lose the data features in the time was classified by the constructed convolutional sparse
domain. Thus, some researchers have considered autoencoder with the LSTM model.
applying 3D techniques to identify multipath.
3 Features Selection and Analysis of
2.3 Deep learning-based methods Collected Data
Recently, with the continuous development of AI In this section, we analyze the collected data. By
techniques, the popular DL techniques have been comparing the features of BDS signals, C/ N0 , ELE,
widely adopted in various fields and have achieve and PR[14] are selected as the input data. At the same
outstanding performances. Meanwhile, many works are
time, we provide a data normalization method.
using the DL techniques to identify NLOS. For
example, the fully connected neural networks 3.1 Carrier to noise ratio (C/N0)
(FCNNs)[27] and LSTM[28] were applied to the The C/ N0 is the ratio of the received carrier signal
classification of LOS and NLOS. In Ref. [29], a deep power to the noise signal and is an important indicator
convolutional neural network (CNN) was used to of satellite signal quality. Usually, the receiver in an
detect multipath and further improve the final unobstructed environment receives mostly LOS signals
localisation accuracy. Considering the successful with a high C/ N0 . In the complex urban environment,
application of CNN in the image field. The mapping of the signal is blocked by high-rise buildings as well as
the correlator output signal to a two-dimensional input reflections, making the signal change propagation
image is proposed for classification. A deep CNN was direction and go through multiple paths to reach the
used to detect the correlator output multipath[30], and receiver, so that the C/ N0 of the NLOS is usually
this deep CNN method outperformed the SVM method. lower.
DL can be applied not only for signal classification but
also for positioning correction. Deep Neural Networks 3.2 Elevation angle (ELE)
(DNN) is used to learn a set of PR and the satellite The ELE is the magnitude of the angle between the
LOS vector as the position correction, which receiver’s horizontal plane and the satellite in the sky
outperforms the weighted least square (WLS) baseline and is an important indicator of satellite visibility.
on real data[31]. In contrast to traditional model-based Generally speaking, in the same environment, the
methods, DL has shown great potential in solving satellite signal interference is usually greater at low
complex multipath interference problems. However, angles, so the low angle is more likely to be NLOS.
the training database for multipath studies based on DL Thus, satellites with high ELE are not easily blocked
is difficult to construct and requires a large amount of by high-rise buildings and are usually LOS. The ELE θ
labeled data. In addition, the problems of high can be calculated as
dimensional data redundancy and low classification ( )
accuracy exist in the process of multipath signal θ(i) = − arcsin uiD /r̂ (1)
recognition. where uiD is the component of the distance between the
To tackle the above challenge, in this paper, we satellite and the receiver in the local Cartesian
proposed an unsupervised convolutional sparse coordinates coordinate system in the “Up” direction. r̂
autoencoder with LSTM structures to recognize the is the estimated value of the distance between the
NLOS multipath. To address the difficulty of receiver and the satellite.
constructing a large number of labeled multipath
databases, we proposed an unsupervised deep feature 3.3 Pseudorange residual (PR)
extraction method with a convolutional autoencoder. The PR is the part of the observation after subtracting
Also considering the problem of low classification the useful information, and it is a relatively objective
72 Tsinghua Science and Technology, February 2025, 30(1): 68−86
standard for evaluating the quality of the signal. widely. Normalization can eliminate the influence of
Normally, the PR of LOS is around zero with less the different ranges of data to ensure the stable
jitter. This is due to the fact that LOS is the signal from convergence of the weights and deviations. In addition,
the satellite that arrives at the receiver via a direct path, data without normalization will slow down the training
without reflections from redundant paths. Conversely, speed of the network. Therefore, the input data features
the NLOS signal quality is poor and the PR is generally C/ N0 , ELE and PR need to be normalized so that they
larger. The position of the receiver is related to the PR remain between [0−1]. We adopt the min-max
measurements and the satellite position. normalization method, which can be expressed as:
Satellite positions can be obtained directly from the { }
xi − min x j
broadcast ephemeris. The state of the receiver can be 0⩽ j⩽n
xi∗ = { } (5)
calculated using a PR equation estimated by the least max {xk } − min x j
0⩽k⩽n 0⩽ j⩽n
squares method, which can be expressed as
−1 where xi is any value in the input sample, max {xk } and
γ = (GTG) GT ρ (2) { } 0⩽k⩽n
min x j denote the maximum and minimum values of
where γ is the receiver state. ρ denotes pseudorange. G 0⩽ j⩽n
denotes the matrix consisting of the unit LOS vector the sample, respectively.
between the satellite and receiver, which can be 4 Non-Line-of-Sight Multipath
expressed as Classification Method for BDS Using
(1)
uN u(1) u(1) −1 Convolutional Sparse Autoencoder with
E D
(2) LSTM
uN u(2) u(2) −1
E D
G = . .. .. .. (3) In this section, we first introduce the convolutional
.
. . . .
sparse autoencoder network model with an LSTM
u(i)
N u(i)
E u(i)
D −1 module for identifying BDS signals. Next, we describe
Once the pseudorange and satellite states are the objective function of the convolutional sparse
obtained, the PR can be expressed as autoencoder. Furthermore, We also present the details
of this training process of the proposed algorithm.
η = ρ−G ·γ (4)
4.1 Convolutional sparse autoencoder with LSTM
In addition to the above features, the raw data
neural network architecture
consists of the horizontal dilution of precision (HDOP),
vertical dilution of precision (VDOP), azimuth angle In this part, we first present the overall process of the
(AZ), consistency between delta pseudorange and proposed method. Meanwhile, each process of our
pseudorange rate ( ζ ) and Number of visible satellites method from the original signal to the BDS signal
(NS), geometric dilution of precision and the doppler classification is described in detail. Then, the basic
shift frequency in the RINEX format. We explore principles of convolutional sparse autoencoder with
which features contribute more to deep learning based LSTM module are presented.
classifier. When more features are used, the accuracy 4.1.1 Proposed method
decreases greatly for most models. This may be due to The identification of single-moment BDS signal type
the fact that these features are not strongly associated makes it easy to lose time correlation information, and
with the type of signal, e.g., sky occlusion varies in the extraction of BDS signal time series features is of
different environments, and it is not possible to great significance for signal identification. DL usually
determine the type of signal from a high or low requires a large amount of labeled data, which is a
azimuth angle. difficult task. In contrast, unsupervised learning
networks require less labeled data. Autoencoder
3.4 Normalization of features networks, as an important unsupervised learning
Due to we adopted a Convolutional AutoEncoder network can achieve downscaling and feature
(CAE) to extract features, the inputs and outputs of this extraction of signals using only unlabeled data.
structure should have the same range of values. However, the training process of autoencoder networks
Meanwhile, the range of the selected features varies is prone to overfitting and there is a possibility that the
Yahang Qin et al.: Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse... 73
output layer simply copies the input layer. comprises 16 filters, the subsequent convolutional layer
To address the above problems, a convolutional comprises 32 filters, and the final convolutional layer
sparse autoencoder with an LSTM module is proposed comprises 64 filters. The kernel size of all
for the recognition of BDS signals. The proposed convolutional layers is set to 1 × 2.
method structure is shown in Fig. 2. First, a The decoder part of the convolutional sparse
convolutional sparse autoencoder is used to learn the autoencoder replaces the original pooling layers with
compressed representation of the input data. Second, inverse pooling layers and consists of three one-
The output feature data of the encoder is fed to the dimensional convolutional layers and three upsampling
LSTM layer to learn the sequential dependencies layers. The number of filters is the opposite of the
between the feature data. Then, an l1/2 regularizer encoder part, 64, 32, and 16, respectively. The kernels
sparsity constraint is added to the objective function of of all convolutional layers are also set to 1 × 2. The 1D
the method to further reduce the redundancy of the convolutional layers are mainly responsible for
neural network. Unsupervised high-level feature extracting data features from the time series. Maximum
extraction of BDS signals can be achieved with this pooling is a down-sampling operation that retains the
network. Finally, to further improve the classification maximum value in each channel of the output of the
accuracy of the proposed method, we replace the previous layer, which can effectively avoid over-
softmax layer of the last layer of the autoencoder fitting. The maximum pooling means that only the
module with a decision tree to form a hierarchical strongest of these features are retained while discarding
network, which in turn outputs the NLOS/LOS class of other weak features. The network parameters can be
BDS signals. In Fig. 2, the encoder part of the one- effectively reduced to prevent over-fitting of the
dimensional convolutional sparse autoencoder network model. This pyramidal architecture can eliminate
used in this paper consists of three convolutional layers redundant features while learning a compressed
and three maximum pooling layers. Each convolutional representation of the BeiDou satellite series signal.
layer uses Rectified Linear Unit (ReLU) as the 4.1.2 Convolutional autoencoder
activation function. The first convolutional layer The autoencoder is a typical representation learning
BDS
u-blox F9 GNSS
receiver
Data
preprocessing Root node Data
RINEX classification
Decision Decision
C/N0 , ELE , and PR node node
Convolutional decoder
LOS satellite NLOS satellite Direct signal Reflection signal Blocked signal BDS antenna
Sparse weights with l 1/2
Regularizer
Fig. 2 Convolutional sparse autoencoder with LSTM module based BDS series NLOS recognition.
74 Tsinghua Science and Technology, February 2025, 30(1): 68−86
algorithm that can achieve unsupervised feature can approximately replicate the input data from the
extraction from data. An autoencoder consists of an encoder.
input layer, a hidden layer, and an output layer forming 4.1.3 LSTM module
an encoder and a decoder. The network is trained by a LSTM can effectively solve the problems of gradient
backpropagation algorithm to make the output equal to explosion, gradient disappearance, and the inability to
the input[32]. In the encoder part of a recognition preserve historical information for a long time during
network, the hidden layer is responsible for learning the training process[33]. Therefore, we adopt LSTM to
the compressed representation of the input layer data. solve the problem of long distance dependence of time
In the decoder part of a generative network, the output series data. As shown in Fig. 4, each module contains
layer is responsible for reconstructing the input data the previous module’s hidden state h and cell state C .
from the hidden layer. The structure of the autoencoder Among them, the hidden state h can achieve short-term
is shown in Fig. 3. memory and the cell state C can achieve long-term
The process of reconstructing the input data by the memory. Satellite features have a high degree of
autoencoder can be briefly described as follows. First, temporal correlation, and LSTM can effectively extract
for the input data x ∈ Rn , the hidden layer feature the temporal information of the features, thus
h ∈ Rk (k < n) can be obtained by the encoder. Then the improving the recognition capability. The internal
output layer in the decoder reconstructs the output structure of the LSTM neural network is shown in Fig.
x̂ ∈ Rn from the hidden layer. However, autoencoder 4. The retention and forgetting of historical information
often leads to the degradation of feature learning is achieved by three gating units. The forget gate
performance when dealing with high-dimensional data determines which information needs to be deleted from
due to the excessive amount of parameters. CNN has the cell state. The input gate determines which new
the features of local connectivity and weight sharing, information is added to the cell state. The output gate
which can reduce the network parameters and speed up determines which state features of the cell are output.
the computation of the network. And the unsupervised The computational process of LSTM can be expressed
learning feature of the autoencoder can prevent the as follows:
network from overfitting. Moreover, there are local
ft = σ(W f · [ht−1 , xt ] + b f ) (6)
correlations in continuous data. CNN can perceive
richer local features of the signal using convolutional it = σ(Wi · [ht−1 , xt ] + bi ) (7)
kernels. Convolutional sparse autoencoder combined
with operations such as convolution and pooling in C̃t = tanh(WC · [ht−1 , xt ] + bC ) (8)
CNN can achieve unsupervised training. One-
Ct = ft ∗ Ct−1 + it ∗ C̃t (9)
dimensional convolution is processed on BeiDou
satellite time series data to achieve feature extraction to ot = σ(Wo · [ht−1 , xt ] + bo ) (10)
generate new feature maps. The convolutional sparse
autoencoder network is trained many times so that the ht = ot ∗ tanh(Ct ) (11)
Beidou satellite signal data output from the decoder In the method proposed in this paper, the LSTM
memory module in the forgetting gate ft obtains the
Encoder Decoder
BDS signals at time t . The information that can reflect
x1 w Bottleneck w’ x̂1
ht-1 ht ht+1
x2 h1 x̂2
Input
x3 h2 x̂ 3 Ct-1 Forget gate Input gate Output gate Ct
Error
× +
......
x4 x̂4 × tanh
ft it ~
Ct ot
h LSTM σ σ σ LSTM
......
......
tanh ×
xn
m
+1 x̂n
block block
+ + + +
+1 h(x)
ht-1 LSTM block ht
x x ≈ xˆ x̂
xt-1 xt xt+1
Fig. 3 Network structure of autoencoder. Fig. 4 Structure of the LSTM neural network.
Yahang Qin et al.: Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse... 75
training process are summarized in Algorithm 1. First, 11 Let the loss J be defined as (using Eq. (13))
a convolutional sparse autocodeer with LSTM is 1 ∑N 1
12 Ji ←− minω i=1 (y − x)2 + λ ∥ω∥ 21
trained with the original input data x . This allows the N 2
encoder fθ to output hlj and obtain the parameters θi . 13 Compute stochastic gradient of the loss w.r.t each ω
Next, the LSTM learns the temporal correlation of the 14 using l1/2 regularizer trick (Eq. (20)).
( )
hlj generated by the encoder and outputs the 1
15 ∇J̄ = ∇θ,ϕ ∑
dimensionality-reduced data ht . Furthermore, ht |M Ji |
obtains the reconstructed data y after going through the 16 Update network parameters by backpropagation
decoder fϕ function and updating the parameters ϕi . 17 (θ, ϕ) ←− Stochastic gradient descent optimizer
Then, we use the backpropagation algorithm to 18 for h j = [h1 , h2 , . . . , h M ] ∈ M do
l l
compute the gradient of the loss function Ji of the
l
weights and deviation vectors and optimize the 19 To classification on the data h j using DT
autoencoder with a stochastic gradient descent 20 return The type of the BDS signal , θ , ϕ
algorithm. Thereby each sample x can be mapped to a
new feature space ht . Finally, the DT model is used to antennas, two u-blox F9 GNSS modules, a laptop, two
classify and output the BDS signal type. rechargeable batteries, and a small portable foldable
table. Above the base, the fisheye camera can be
5 Experiment installed on the tripod, and other devices can be placed
In this section, we first describe the BDS signal on the table. Due to the double differential approach
collection scheme. Second, we illustrate the urban that can get more accurate positioning results, we use
complex environment of BDS signal acquisition. Then, two GNSS antennas for capturing the BDS signals, one
we depicted the BDS signal labeling method. Finally, of which is used as a base station and the other as a
the data preprocessing is given in Section 5.3. mobile station. The u-blox F9 GNSS module is used
for high-precision positioning, and it requires a
5.1 Data collection
rechargeable battery for continuous power supply. In
Figure 5 shows the BDS signal collected device, which addition, the laptop is used to store the raw BDS data.
consists of a fisheye camera, a tripod, a pair of GNSS The BDS signals were collected at four different
Yahang Qin et al.: Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse... 77
Fisheye camera Antennas Laptop sky occlusion cases in urban canyons, and effectively
avoids the problem of poor model generalisation ability
brought by a single data distribution. Location A
corresponds to a situation where one side is blocked by
a building and the other three sides are open. Similarly,
Location B corresponds to a situation where two sides
u-blox F9
are blocked. Location C corresponds to a situation
where three sides are blocked by the building and the
other sides are open. Location D represents a deep
urban environment blocked by four sides. This location
contains more NLOS due to building blockage.
5.2 Data labeling
Fig. 5 Data collection equipment.
Labeling the true signal type on BDS data collected for
locations in Tianhe, Guangzhou. This environment is DL is necessary. In this study, a fisheye camera is used
densely populated with high-rise buildings, and many to assist in data labeling, and the labeling is achieved
of them are higher than 80 meters. Therefore, we can by the method of oprojections. The flow of data
collect a large number of NLOS. From October 10 to labeling is shown in Fig. 7. First, a sky image with
13 in 2022, we collected data from four Locations, A, building boundaries and sky regions is captured by a
B, C, and D, using the equipment in Fig. 5, and each fisheye camera. Moreover, the sky image is segmented
location collected 4 hours of static data of BDS L1 into sky area and non-sky area using an image
signals in RINEX format with the sampling frequency segmentation algorithm, where the non-sky area
set to 1 s. Thus, four BDS signal datasets (A, B, C, and corresponds to the part obscured by buildings. Finally,
D) were constructed. the ELE and AZ of the satellites acquired by the
Figure 6b shows a sky image of the data collection receiver are projected onto the fisheye camera sky
location, where grey is the area shaded by buildings image using an isotropic projection method, and the
and white is the sky area in the figure. Our data include satellites falling in the non-sky regions are NLOS
four different sky obscuration scenarios. Generally satellites and those falling in the sky region are LOS
speaking, the more sky occlusion area, the easier the satellites.
satellites are occluded by buildings, and there are fewer An example of the method shown above is given in
LOS signals in the dataset. One side, two sides, three Figs. 6b and 6c, where the original sky image taken by
sides and four sides of the sky occlusion cases are the fisheye camera is shown in Fig. 6b, and the sky
selected in the experiment, which includes the typical image after satellite projection is shown on Fig. 6c. The
Location A Location B
Location C Location D
Fig. 6 BDS data collection in an urban environments. (a) Real environment; (b) Fish-eye image; (c) Sky plot.
78 Tsinghua Science and Technology, February 2025, 30(1): 68−86
Fig. 14 Average classification performance of different Fig. 15 Sparsity performance of l1/2 regularizer.
methods.
classification, sparsity is ensured. This indicates that
addition of the LSTM module can further enhance the the algorithm is capable of acquiring more effective
method's performance. It is worth noting that SVM features of BDS signals.
approaches achieve such poor performance compared
6.6 Discussions
to other methods. The main reason may be that SVM
can map low-dimensional data into high-dimensional In our experiments, we investigate the performance of
space by kernel function, and then improve the our proposed convolutional sparse autoencoder with
classification accuracy. Therefore, SVM is more the LSTM method in real environments with a variety
suitable for classification problems with low- of sky occlusion scenarios. In contrast to ML methods
dimensional small-sample data. such as SVM and DT, we propose a convolution-based
DL method. In addition, unlike CNN and convolutional
6.5 Sparsity analysis of the weights with the l1/2
sparse autocoder methods, our method incorporates
regularizer layer
LSTM into the method to extract important timing
To analyze the sparsity of the l1/2 regularizer, we use information. We also propose adding an l1/2 regularizer
the sparsity evaluation metric proposed by Hoyer’s to the objective function to induce strong sparsity
sparsity[39] to calculate the sparsity of the feature among the parameters. Specific performance
representation learned by the proposed method. And comparisons of our method against existing methods
this method uses the difference between the l1 and l2 are listed as follows.
parameters of the vector to evaluate the sparsity of the (1) From the detailed comparisons, as shown in
vector. The sparsity can be expressed as Tables 2–5, it is clear that the proposed method is
√ (∑n √∑ ) superior to ML methods such as SVM and DT for all
n − i=1 |xi | / n
x
i=1 i
2
Furthermore, by checking the average values for through seeking deep sparse representations during
accuracy, F1-score, precision, and recall, we note that propagation, the proposed method can improve
all metrics for the method proposed in this paper classification performance by 1 % –2 % . Furthermore,
exceed 80 % , which is not achieved by any of the other the proposed l1/2 regularizer can largely remove
methods, as shown in Fig. 14. unnecessary weights while retaining the classification
(3) In Fig. 11, the performances of various performance. In particular, it can reduce over 20 % of
regularizers when using the corresponding optimal the parameters while maintaining a significantly high
parameters are summarized. Here, the l1/2 regularizer classification accuracy for all urban canyon
achieves the best classification performance across all environments. The main reason may be that the method
datasets, which indicates that the l1/2 regularizer is we proposed method that learns a compressed
robust and can improve sparsity. A comparison of representation of the input data for downscaling and
Hoyer’s sparsity is shown in Fig. 15, where the l1/2 unsupervised feature extraction of long-duration time-
regularizer pruned unnecessary weights from the NN series signals containing multipath. The cited methods
(e.g., around 72 % of the total across the four datasets). also uses an unsupervised approach, but the cited
Furthermore, the l1/2 regularizer could converge faster methods uses a machine learning approach, and the
with fewer training epochs. recognition accuracy is significantly lower than the
deep learning unsupervised feature extraction method
7 Conclusion
used in this paper.
In this paper, we have presented an efficient However, there are two main limitations of our
convolutional sparse autoencoder with an LSTM algorithm: (1) The proposed algorithm only considers
training framework to address the NLOS signal urban canyon; (2) The classification generalisation
classification problem. Specifically, to prevent high- performance of the proposed algorithm needs to be
dimensional data redundancy in DL methods, we improved. In order to address the above problem, in the
implemented dimension reduction and feature future work, we will continue our studies of multipath
extraction through a convolutional sparse autoencoder signal-recognition-based unsupervised DL methods in
module structure. To capture the temporal correlations more diverse and complex urban environments such as
in long-duration time-series signals, we proposed an urban forests, overpasses, and viaducts. We note that
LSTM module that uses an input gate, a forget gate, our method loses information about the satellite
and an output gate to obtain information about the environment, so we are considering learning the
evolution of the BDS signal over time. Then, to reduce representation of indirect environmental information
unnecessary weights, we applied a regularizer to the using a transformer. We are also working on inducing
weights of the hidden layers, which offered results sparsity in DL algorithms that involve a CNN
beyond those for the LOG and MCP regularizers. architecture, which should be more efficient for more
Finally, the trained and simplified DL could provide complex DL environments.
accurate classification of BDS NLOS signals in real
urban canyon environments. Acknowledgment
Experiments with classification and pruning were This research was supported in part by the National
conducted on four datasets with different sky occlusion Natural Science Foundations of China (Nos. 62273106,
scenarios. Compared with ML methods such as DT and 62203122, 62320106008, 62373114, 62203123, and
SVM, our DL method, with its convolutional sparse 62073086), in part by Guangdong Basic and Applied
autoencoder and LSTM module, demonstrated superior Basic Research Foundation (Nos. 2023A1515011480
performance. Moreover, compared with existing DL- and 2023A1515011159), and in part by China
based methods such as CNNs and SCAEs, the Postdoctoral Science Foundation funded project (No.
proposed method achieved 7.26 % and 2.27 % 2022M720840).
improvement in accuracy, respectively. Compared with
the LOG and MCP regularizers, the proposed l1/2 References
regularizer used for NN pruning enabled our proposed [1] C. Ma, S. Pan, W. Gao, F. Ye, L. Liu, and H. Wang,
method to induce strong sparsity in the parameters. The Improving GNSS/INS tightly coupled positioning by using
results demonstrate that, across all environments, BDS-3 four-frequency observations in urban
84 Tsinghua Science and Technology, February 2025, 30(1): 68−86
environments, Remote Sens., vol. 14, no. 3, p. 615, 2022. and C. Yu, Multipath mitigation in GNSS precise point
[2] F. Li, R. Tu, J. Hong, S. Zhang, M. Liu, and X. Lu, positioning based on trend-surface analysis and multipath
Performance analysis of BDS–5G combined precise point hemispherical map, GPS Solut., vol. 25, no. 3, p. 119,
positioning, Remote Sens., vol. 14, no. 13, p. 3006, 2022. 2021.
[3] Y. Xu, Y. Yang, and J. Li, Performance evaluation of [16] A. El-Mowafy, B. Xu, and L. T. Hsu, Integrity monitoring
BDS-3 PPP-B2b precise point positioning service, GPS using multi-GNSS pseudorange observations in the urban
Solut., vol. 25, no. 4, p. 142, 2021. environment combining ARAIM and 3D city models, J.
[4] H. Zhao, Z. Li, C. Chen, L. Wang, K. Xie, and S. Xie, Spatial Sci., vol. 67, no. 1, pp. 91–110, 2022.
Fusing vehicle trajectories and GNSS measurements to [17] R. Sun, G. Wang, W. Zhang, L. T. Hsu, and W. Y.
improve GNSS positioning correction based on actor-critic Ochieng, A gradient boosting decision tree based GPS
learning, in Proc. 2023 Int. Technical Meeting of The signal reception classification algorithm, Appl. Soft
Institute of Navigation, Long Beach, CA, USA, 2023, pp. Comput., vol. 86, p. 105942, 2020.
82–94. [18] Y. Qin, Z. Li, S. Xie, R. Yuan, and J. Xie, BDS multipath
[5] W. Qiu, Q. Zeng, R. Xu, J. Liu, J. Shi, and Q. Meng, A signal classification using support vector machine, in
multipath mitigation algorithm for GNSS signals based on Proc. 4th Int. Conf. Industrial Artificial Intelligence (IAI),
the steepest descent approach, Satell. Navig., vol. 3, no. 1, Shenyang, China, 2022, pp. 1–6.
p. 14, 2022. [19] G. Zhang, B. Xu, and L. T. Hsu, GNSS shadow matching
[6] P. R. R. Strode and P. D. Groves, GNSS multipath based on intelligent LOS/NLOS classifier, in Proc. 16th
detection using three-frequency signal-to-noise IAIN World Congr., Chiba, Japan, pp. 1–7, 2018.
measurements, GPS Solut., vol. 20, no. 3, pp. 399–412, [20] T. Suzuki and Y. Amano, NLOS multipath classification
2016. of GNSS signal correlation output using machine learning,
[7] Groves, P. D, Jiang, Z, Rudi, M, and Strode, P, A portfolio Sensors, vol. 21, no. 7, p. 2503, 2021.
approach to NLOS and multipath mitigation in dense [21] R. Zawislak, M. Greiff, K. J. Kim, K. Berntorp, S. Di
urban areas, in Proc. the 26th Int. Technical Meeting of the Cairano, M. Konishi, K. Parsons, P. V. Orlik, and Y. Sato,
Satellite Division of The Institute of Navigation, Nashville, GNSS multipath detection aided by unsupervised domain
TN, USA, 2013, pp. 3231–3247. adaptation, in Proc. the 35th Int. Technical Meeting of the
[8] L. T. Hsu, GNSS multipath detection using a machine Satellite Division of The Institute of Navigation, Denver,
learning approach, in Proc. IEEE 20th Int. Conf. CO, USA, pp. 2127–2137, 2022.
Intelligent Transportation Systems (ITSC), Yokohama, [22] H. Wang, S. Pan, W. Gao, Y. Xia, and C. Ma,
Japan, 2017, pp. 1–6. Multipath/NLOS detection based on K-means clustering
[9] Z. Li, K. Zeng, L. Wang, X. Kan, R. Yuan, and S. Xie, for GNSS/INS tightly coupled system in urban areas,
NLOS/LOS classification by constructing indirect Micromachines, vol. 13, no. 7, p. 1128, 2022.
environment interaction from GNSS receiver [23] H. Xu, L. T. Hsu, D. Lu, and B. Cai, Sky visibility
measurements using a transformer-based deep learning estimation based on GNSS satellite visibility: An approach
model, in Proc. 2023 Int. Technical Meeting of The of GNSS-based context awareness, GPS Solut., vol. 24,
Institute of Navigation, Long Beach, CA, USA, 2023, pp. no. 2, p. 59, 2020.
859–870. [24] H. F. Ng, G. Zhang, and L. T. Hsu, Robust GNSS shadow
[10] I. Smolyakov, M. Rezaee, and R. B. Langley, Resilient matching for smartphones in urban canyons, IEEE Sens.
multipath prediction and detection architecture for low- J., vol. 21, no. 16, pp. 18307–18317, 2021.
cost navigation in challenging urban areas, NAVIGATION, [25] S. Zhang, S. Lo, Y. H. Chen, T. Walter, and P. Enge,
vol. 67, no. 2, pp. 397–409, 2020. GNSS multipath detection in urban environment using 3D
[11] Z. Lyu and Y. Gao, An SVM based weight scheme for building model, in Proc. IEEE/ION Position, Location and
improving kinematic GNSS positioning accuracy with Navigation Symp. (PLANS), Monterey, CA, USA, 2018,
low-cost GNSS receiver in urban environments, Sensors, pp. 1053–1058.
vol. 20, no. 24, p. 7265, 2020. [26] S. Miura, L. T. Hsu, F. Chen, and S. Kamijo, GPS error
[12] J. Zidan, E. I. Adegoke, E. Kampert, S. A. Birrell, C. R. correction with pseudorange evaluation using three-
Ford, and M. D. Higgins, GNSS vulnerabilities and dimensional maps, IEEE Trans. Intell. Transport. Syst.,
existing solutions: A review of the literature, IEEE Access, vol. 16, no. 6, pp. 3104–3115, 2015.
vol. 9, pp. 153960–153976, 2021. [27] G. Zhang, P. Xu, H. Xu, and L. T. Hsu, Prediction on the
[13] J. Chen, H. Shi, Z. Fang, C. Yuan and Y. Xu, Performance urban GNSS measurement uncertainty based on deep
analysis of the GNSS instantaneous ambiguity resolution learning networks with long short-term memory, IEEE
method using three collinear antennas, IEEE Sens. J., pp. Sens. J., vol. 21, no. 18, pp. 20563–20577, 2021.
11936–11945, 2023. [28] H. Su, B. Wu, and X. Mao, Non-line-of-sight multipath
[14] G. Chang, C. Chen, Y. Yang, and T. Xu, Tikhonov detection method for BDS/GPS fusion system based on
regularization based modeling and sidereal filtering deep learning, J. Shanghai Jiaotong Univ. Sci., vol. 27, no.
mitigation of GNSS multipath errors, Remote. Sens., vol. 6, pp. 844–854, 2022.
10, no. 11, p. 1801, 2018. [29] Y. Quan, L. Lau, G. W. Roberts, X. Meng, and C. Zhang,
[15] R. Lu, W. Chen, D. Dong, Z. Wang, C. Zhang, Y. Peng, Convolutional neural network based multipath detection
Yahang Qin et al.: Non-Line-of-Sight Multipath Classification Method for BDS Using Convolutional Sparse... 85
method for static and kinematic GPS high precision pp. 1013–1027, 2012.
positioning, Remote. Sens., vol. 10, no. 12, p. 2052, 2018. [35] Z. Li, S. Ding, W. Chen, Z. Yang, and S. Xie, Proximal
[30] A. Blais, N. Couellan, and E. Munin, A novel image alternating minimization for analysis dictionary learning
representation of GNSS correlation for deep learning and convergence analysis, IEEE Trans. Emerg. Top.
multipath detection, Array, vol. 14, p. 100167, 2022. Comput. Intell., vol. 2, no. 6, pp. 439–449, 2018.
[31] A. V. Kanhere, S. Gupta, A. Shetty, and G. Gao, [36] H. Zhao, J. Wu, Z. Li, W. Chen, and Z. Zheng, Double
Improving gnss positioning using neural-network-based sparse deep reinforcement learning via multilayer sparse
corrections, NAVIGATION-US, vol. 69, no. 4, p. 548, coding and nonconvex regularized pruning, IEEE Trans.
2022. Cybern., vol. 53, no. 2, pp. 765–778, 2023.
[32] Z. He, H. Shao, Z. Ding, H. Jiang, and J. Cheng, Modified [37] Z. Li, H. Zhao, Y. Guo, Z. Yang, and S. Xie, Accelerated
deep autoencoder driven by multisource parameters for Log-regularized convolutional transform learning and its
fault transfer prognosis of aeroengine, IEEE Trans. Ind. convergence guarantee, IEEE Trans. Cybern., vol. 52, no.
Electron., vol. 69, no. 1, pp. 845–855, 2022. 10, pp. 10785–10799, 2022.
[33] K. Greff, R. K. Srivastava, J. Koutnik, B. R. Steunebrink, [38] Z. Li, Z. Yang, H. Zhao, and S. Xie, Direct-optimization-
and J. Schmidhuber, LSTM: A search space odyssey, based DC dictionary learning with the MCP regularizer,
IEEE Trans. Neural Netw. Learn. Syst., vol. 28, no. 10, pp. IEEE Trans. Neural Netw. Learning Syst., vol. 34, no. 7,
2222–2232, 2017. pp. 3568–3579, 2023.
[34] Z. Xu, X. Chang, F. Xu, and H. Zhang, L1/2 regularization: [39] P. O. Hoyer, Nonnegative matrix factorization with
A thresholding representation theory and a fast solver, sparseness constraints, J. Mach. Learn. Res., vol. 5, pp.
IEEE Trans. Neural Netw. Learning Syst., vol. 23, no. 7, 1457–1469, 2004.
Yahang Qin received the MS degree from Zhenni Li received the BSc degree from
Guangxi University of Science and School of Physical Science and
Technology, Liuzhou, China, in 2017. She Electronics, Shanxi Datong University, in
is currently pursuing the PhD degree in 2009, the MSc degree from School of
control science and engineering with Physics and Optoelectronic, Dalian
School of Automation, Guangdong University of Technology, in 2012, and
University of Technology, and also with received the PhD degree from School of
Guangdong-Hong Kong-Macao Joint Computer Science and Engineering,
Laboratory for Smart Discrete Manufacturing, Guangdong University of Aizu, Japan. She is currently an associate
University of Technology. Her research interests include deep professor with School of Automation, Guangdong University of
learning and BDS positioning in challenging environments. Technology, and also with Guangdong-Hong Kong-Macao Joint
Laboratory for Smart Discrete Manufacturing, Guangdong
Shengli Xie received the BS degree in University of Technology. Her research interests include deep
mathematics from Jilin University, China learning, reinforcement learning, and high-precious position.
in 1983, the MS degree in mathematics
from Central China Normal University, Bo Li received the MS degree in
China in 1995, and the PhD degree in mathematics from Central China Normal
control theory and applications from South University, Wuhan, China, in 1992, and
China University of Technology, China in the PhD degree in control theory and
1997. He is currently a full professor in applications from South China University
School of Automation, Guangdong University of Technology, of Technology, Guangzhou, China, in
the head of the 111 Center for Intelligent Batch Manufacturing 1997. He is currently the director of
Based on IoT Technology (GDUT), and also with the Key Laboratory for Intelligent Information
Laboratory of Intelligent Information Processing and System Processing and a full professor with the School of Automation,
Integration of IoT (GDUT), Ministry of Education. He has Guangdong University of Technology, and also with Guangdong
coauthored two books and over 150 research papers in refered Key Laboratory of IoT Information Technology (GDUT). He
journals and conference proceedings, and was awarded Highly has authored or coauthored two monographs, a dozen patents,
Cited Researcher in 2020. His research interests include blind and over 80 papers in journals and conference proceedings. His
signal processing, machine learning, and Internet of Things. He current research interests include automatic controls, signal
was awarded the Second Prize of National Natural Science processing, blind signal processing, and image processing.
Award of China in 2009. He is an associate editor of IEEE
TSMC.
86 Tsinghua Science and Technology, February 2025, 30(1): 68−86
Victor Kuzin is a leading expert in the Ming Liu received the PhD degree in
field of physical and technical control of robotics from ETH Zurich. He is currently
mining products, physical processes, and the director of Intelligent Autonomous
quality management. He has published Driving Center of HKUST. He published
over 300 articles. He is the author of 18 over 300 articles. His research interests
monographs and booklets, 3 of which were include dynamic environment modeling, 3-
published abroad. He is a full member of D mapping, machine learning, and visual
Academician of Russian Engineering control. He received various IEEE awards,
Academy, Academician of Russian Academy of Natural including the IEEE IROS Young Professional Award in 2018
Sciences, International Academy of Mineral Resources and 15 best paper awards or finalist awards. He is particularly
Sciences, Academician of Academy of Technological Sciences interested in the investigation of novel, real-time online
of the Russian Federation, and Academician of International approaches in solving autonomous driving, mobile robot
Academy of Mineral Resources. He is an honorary doctor and mapping, and navigation. His team developed the first
professor at several universities in Europe and China. In 1979, autonomous vehicle in Hong Kong in 2017. During COVID-19,
he became a recipient of the Lenin Komsomol Prize. He was his autonomous vehicles served several cities and were reported
awarded the Russian Government Prize for Science and by more than 100 media, including IEEE Spectrum and Xinhua.
Technology in 2011 and the Russian Prize for Science and
Technology in 2013.