Zhang 2020
Zhang 2020
Fuzzy Forests
Le Zhang† , Zhenghua Chen† , Wei Cui∗ , Bing Li, Cen Chen, Zhiguang Cao, Kaizhou Gao
Abstract—Addressing the positioning problem of a mobile robot visual SLAM may suffer from various aspects, ranging from
remains challenging to date despite many years of research. the algorithmic point of view, such as the issues of camera cal-
Indoor robot positioning strategies developed in the literature ibration, data association, system initialization, and algorithms
either rely on sophisticated computer vision techniques to handle to compensate for camera distortions to hardware problems
visual inputs or require strong domain knowledge for non- including occlusions in the visual input. Moreover, the popular
visual sensors. Although some systems have been deployed, the
former may be lacking due to the intrinsic limitation of cameras
feature-based visual SLAM methods like [5] may fail when
(such as calibration, data association, system initialization, etc.) the scenes have blank background, e.g. indoor white walls.
and the latter usually only works under certain environment As an alternative, determining the robot’s location using non-
layouts and additional equipment. To cope with those issues, visual sensors such as radio frequency (RF) signals from radio
we design a lightweight indoor robot positioning system which sources is becoming popular. Examples include but are not
operates on cost-effective WiFi based Received Signal Strength limited to WiFi [6], [7], radio frequency identification device
(RSS) and could be readily pluggable into any existing WiFi (RFID) [8], [9], and ultra-wideband (UWB) [10]. Among those
network infrastructures. Moreover, a novel deep fuzzy forest approaches, a common practice is to use Received Signal
is proposed to inherit the merits of decision trees and deep Strength (RSS) measurements from radio sources in localizing
neural networks within an end-to-end trainable architecture. a mobile robot in the indoor/outdoor environment.
Real-world indoor localization experiments are conducted and
results demonstrate the superiority of the proposed method over In this paper, we study robots positioning using WiFi based
the existing approaches. signal strength. We choose WiFi due to its popularization in
residential buildings, academic areas, hospitals, and industry
Keywords—indoor robot positioning; WiFi; Deep fuzzy forests regions. More importantly, in comparison with other wireless
technologies, WiFi-based indoor localization is cost-effective
I. I NTRODUCTION because the system could be readily pluggable into any existing
Rapid developments in artificial intelligence have spurred WiFi network infrastructures including access points (APs),
the rise in robots service, in which robots positioning [1] cell phones, laptops, etc. We also focus on indoor scenarios
plays an inevitable role. For instance, using robots to perform because they are more complex. The line-of-sight (LOS)
programming tasks in harsh damaged environments, work as transmission channels between satellites and receivers is often
a pioneer in non-human living conditions such as the moon, invalid compared with their outdoor positioning counterparts
Mars, and other planets, take care of people in a nursing home in which mature localization systems such as global navigation
and deliver products in an indoor warehouse. However, robot satellite system (GNSS) has been developed successfully [11].
localization remains a challenging problem in practice despite Existing algorithms for RSS-based positioning can be typi-
many years of research. cally classified into two categories: 1) geometric related tech-
There are a plethora of research in the literature on using niques and 2) fingerprinting-based techniques [12]. Geometric
visual sensors to solve the simultaneous localization (“Posi- related techniques use theoretical and/or empirical path loss
tioning” is also named as “Localization” in the literature.) and models to characterize the RSS. Then, position estimation
mapping (SLAM) [2], [3], [4] problem of robots. However, of a node is performed using a trilateration or modeling
algorithm. One apparent drawback of these approaches, from
Copyright (c) 2020 IEEE. Personal use of this material is permitted. a practical point of view, is that in cluttered environments the
However, permission to use this material for any other purposes must be
obtained from the IEEE by sending a request to [email protected].
distance estimation is inaccurate because the attenuation of the
This work was supported by the National Natural Science Foundation of signal is poorly correlated with distance, especially under non-
China (61903231,61803104), Shandong Province Natural Science Foundation line-of-sight (NLOS) propagation [13]. The fingerprint-based
(ZR2018PF011), Singapore National Research Foundation (NRF-RSS2016- methods are becoming the dominating approach for indoor
004) and partially supported by the A*STAR Industrial Internet of Things positioning because wireless signal variances can be captured
Research Program under the RIE2020 IAF-PP Grant A1788a0023.
Le Zhang, Zhenghua Chen, Wei Cui and Cen Chen are with the Institute more accurately thanks to its transmissivity in complex indoor
for Infocomm Research, Agency for Science, Technology and Research spaces [14].
(A*STAR) Singapore. As illustrated in Fig. 1, a typical fingerprinting-based system
Bing Li is with the University of New South Wales, Australia. can be broken down into the offline phase and the online
Zhiguang Cao is with the National University of Singapore.
Kaizhou Gao is with the Macau University of Science and Technolog, P.R.C. phase. In the offline phase, an area of interest is calibrated
† indicates equal contribution and Wei Cui ([email protected]) is the by surveying the site with a mobile device (MD). The fin-
corresponding author. gerprints (i.e., the RSS values) from different predetermined
reference points (RPs) are collected as a priori knowledge Offline training phase
and recorded in a map database. During the online phase, RP1(x,y) RSS1,RSS2,…,RSSn
when an MD at unknown position sends its instant vector RP2(x,y) RSS1,RSS2,…,RSSn
of WiFi RSS values from surrounding APs, its current po- RP3(x,y) RSS1,RSS2,…,RSSn
sition will be estimated utilizing the information from the … … Training Database
map database. Compared with geometric related techniques, RPm(x,y) RSS1,RSS2,…,RSSn
fingerprinting-based methods have been widely regarded to be
more accurate. Hence, they have been heavily applied in real
life[15]. However, there are two major challenges of WiFi
fingerprinting-based technology for large-scale implementa-
RSS1,RSS2,…,RSSn Algorithm (x,y)
tion. One is to shrink the workload of the offline site survey
as the process is extremely laborious and time consuming.
The crowdsourcing based schemes have been proposed to Online localization phase
solve this problem. Instead of labeling the training database
manually, crowdsourcing could build the radio map with less Fig. 1: The offline phase and the online phase of a typical
maintenance effort by leveraging the unconscious participatory fingerprinting-based positioning.
sensing of mobile users [16], [17]. However, how to implement
crowdsourcing for the radio map is still an open problem
because the participant’s smartphones are usually different.
Another big issue is the vulnerability of RSS to environmental during the online phase are represented as a probability distri-
dynamics including variation in humidity, light, temperature, bution, and the algorithm estimates the position of the target
weather conditions[18], occupancy distribution, and human by calculating the highest probability of the mobile node’s
movement [19]. Hence the final performance of the system can location [27]. For instance, The authors in [28] proposed a
be dramatically degraded under a dynamic environment. This new algorithm based on Bayesian regression to estimate the
naturally opens a path towards designing more robust machine RSS posterior. Deterministic methods adopt machine learn-
learning techniques which can generalize well in practice. ing techniques to match the observed fingerprints with the
In this paper, we aim to solve the second aforementioned radio map database. More specifically, in [29], the authors
issue and develop a new light-weight indoor robot positioning proposed an indoor positioning system based on the smallest
system that uses low-cost WiFi devices which can be applied M-vertex polygon (SMP). Support Vector Machine/Regression
for various purposes, such as localization and navigation in (SVM/SVR) also has been widely applied for fingerprinting-
areas hard to reach by humans, navigational assistance for based position system. For example, Sun et al. [30] proposed
the blind, and tour guide robots. Motivated by the recent an indoor localization algorithm based on least-squares SVM
successes of random forest [20], [21], [22] and ensemble deep (LS-SVM). The work of [31] applied Support vector classi-
learning [23], [24] in different domains, a deep fuzzy forest is fication (SVC) for the indoor positioning problem. Wu et al.
designed which unifies decision trees with the representation [32] proposed to use the SVR method for position estima-
learning functionality known from deep neural networks in tion. Nerguizian [33] introduced the neural network model
an end-to-end trainable fashion. For simplicity, we adopt the and applied it for fingerprint based indoor positioning. A
commonly used deep fully-connected feed-forward network randomized network was introduced in [34]. As for deep
whose details are elaborated in Section V. The trees in our learning, a four-layer deep neural network was employed in
system work in a fuzzy manner to avoid wrong partitionings [35] for localization. However, it requires a relatively large
in internal nodes, which usually happens in conventional forest number of APs, which is less practical for indoor scenarios.
methods, and generalizes well in real-life scenarios. A deep semi-supervised reinforcement learning approach was
The rest of this paper is organized as follows. We discuss introduced [36] for indoor localization based on Bluetooth
the background and related works in Section II. Section III Low Energy (BLE) signal strength. Deep learning based indoor
describes the working process and localization scheme of fingerprinting systems using Channel State Information (CSI)
the proposed positioning system. Section IV explains the were proposed to achieve high localization precision [37], [38].
proposed deep fuzzy forest. The experimental evaluations are However, the transmitters in these system need to be equipped
demonstrated in Section V. Conclusion remarks are given in with CSI tools.
Section VI. The advantage of deterministic approaches is allowing for
automatically learning the potential relationship between the
RSS values and the locations, and most importantly, does not
II. R ELATED WORK
depend on knowing the entire environment layout upfront.
Nowdays, indoor localization based on smartphones has Due to similar reasons, machine learning based solutions saw
become a research focus due to its minimum hardware re- heavy usage across a wide range of fields and tasks, examples
quirement [25], [26]. Generally speaking, fingerprinting based including: computer vision [24], speech recognition [39], re-
on RSS techniques can be generally categorized into two inforcement learning [40] and affective computing [41]. The
groups: probabilistic approaches and deterministic approaches. unifying idea behind all of the above is deep learning, the
For probabilistic approaches, observed signal strength vectors utilisation of neural networks with many hidden layers, for
the purposes of learning complex feature representations from
raw data, rather than relying on handcrafted feature extraction.
In addition, recent works on ensemble learning [23], [22] also
show that using multiple models could effectively improve
the generalization ability of the learning system. Motivated
by those work, a deep fuzzy forest is proposed in this paper
to inherit the merits of ensemble methods and deep neural
networks within an end-to-end trainable architecture. Extensive
real-world indoor localization experiments are conducted and
results demonstrate the superiority of the proposed method
over the existing approaches. Fig. 3: Block diagram of hardware mounted on the robot.
It consists of a development board with an ARM processor,
III. S YSTEM D ESIGN microcontrollers, a wireless network card and a motor driver.
A. System Architecture
The whole system architecture of the proposed positioning
system for the robot is illustrated in Fig. 2. The mobile robot B. Hardware Design of the robot
is equipped with a commercial radio transceiver to extract RSS
The hardware mounted on the robot with its detailed com-
data from all APs. The RSS signals from each AP are encoded
ponents is presented in Fig. 3. It consists of a development
as a data packet and further forwarded to the back-end server
board with a 1GHz Samsung S5P V 210 ARM Cortex − A8
for both radio map construction and learning. In this paper,
processor, an ST M 32F 103 chip (a version of ST M 32 micro-
we only focus on the localization part and its key components
controllers with an ARM Cortex − M 3 core and a maximum
are elaborated in the following section.
frequency of 72M Hz), an RT 3070 wireless network card and
the L298N motor driver. The RT 3070 card is used to com-
municate with the server for sending/receiving collected RSS
packets. The ST M 32F 103 chip is employed to interact with
users for motion control through the universal asynchronous
receiver/transmitter (UART) protocol. The weakest detectable
RSS is set to be −95dBm.
Tree
DNN
……
Tree
S1(s1,s2 ,...,sn)
P1(x1,y1)
S2(s1,s2 ,...,sn), P2(x2,y2)
.
.
.
.
.
.
Sm(s1,s2 ,...,sn) Pm(x1,ym)
……
Tree
Fig. 4: System overview of the proposed deep fuzzy forest. Each tree received highly abstract features from a deep network
and divided the data samples recursively until certain criteria are met. More specifically, considering a single tree, each decision
node is implemented with a fully-connected layer with a Sigmoid activation function and its output denotes the corresponding
probability of assigning the current datum to the left and right child node, respectively. Please refer to Section V for more details
on the network structure. The final results are obtained by weighted averaging.
2
RM SE = sqrt(avg((P̃ − P ) )
4m
where avg(·) is the average over all the test samples.
We coin our method as DeepFuzzy and compare it with 10m
other widely used regression methods, including (1) k nearest
neighbours [46], (2) Random Forest [47], (3) SVR [48], 8m
4m
10m
8m
2m
11m 3m 10m 2m 6m
Learning Curves
50 10m
Training Loss
Testing Loss 8m
40
30 2m
MSE
11m 3m 10m 2m 6m
20
TP-LINK WDR6500 Router Testing Positions Predicted Testing Positions
10
Fig. 9: The predicted trajectories of training points and testing
0 points with trained DeepFuzzy.
0 20 40 60 80 100
Epoch