Reducing Computation Requirements For Autonomous Mobile Robots On Low Powered Embedded Systems
Reducing Computation Requirements For Autonomous Mobile Robots On Low Powered Embedded Systems
Embedded Systems
Abhinav Pathak Aditya Singhal Adit Tewari
Department of Computer Science, Department of Computer Science, Birla Department of Electrical and
Birla Institute of Technology and Institute of Technology and Science Electronics Engineering, Birla Institute
Science Pilani, Dubai Campus, Pilani, Dubai Campus, of Technology and Science
Dubai, UAE Dubai, UAE Pilani, Dubai Campus,
[email protected] [email protected] Dubai, UAE
[email protected]
Aadish Jain V. Kalaichelvi
Department of Computer Science, Birla Department of Electrical and
Institute of Technology and Science Electronics Engineering, Birla Institute
Pilani, Dubai Campus, of Technology and Science Pilani,
Dubai, UAE Dubai Campus,
[email protected] [email protected].
Abstract— Fully autonomous mobile robots have the potential balance has motivated researchers and engineers to seek
to revolutionize various industries, from warehouse management innovative solutions that can enhance the efficiency and
to hospital logistics and last-mile deliveries. However, a significant independence of these robots.
obstacle to achieving reliable autonomy lies in the high
computational and energy requirements. In response to this One of the key challenges in autonomous driving is the
challenge, our paper introduces two innovative algorithms: the development of algorithms that can accurately detect and track
Pure Image Segmentation Approach (PISA) and the UNet Based objects in the environment, including other vehicles,
Approach to Semantic Segmentation (UBASS). PISA leverages pedestrians, and obstacles. This requires a combination of
classical computer vision techniques, offering a fresh perspective computer vision techniques, such as object detection and
on solving crucial tasks such as object detection, object avoidance, tracking, as well as machine learning algorithms that can learn
and lane detection. In contrast, UBASS harnesses the power of to recognize and classify different objects in the environment.
deep learning algorithms for semantic segmentation, unlocking
new capabilities in robot perception. Our experiments showcase For lane detection specifically, there are several
the effectiveness of these algorithms, demonstrating their algorithms that are commonly used in autonomous driving
accuracy and computational efficiency. Notably, PISA and systems. One of the most popular methods for
UBASS outperform or match traditional techniques, including computationally inexpensive lane detection is [3] canny edge
End-to-End Deep Learning and Canny Edge Detection, in terms detection, which works by identifying the edges in an image
of both task performance and resource utilization. This research and using those edges to define the boundaries of the lane.
contributes to the advancement of autonomous mobile robotics by However, canny edge detection has several limitations, such
offering practical and efficient solutions for navigation and as struggling with noisy images or low contrast environments,
perception challenges. By combining classic and contemporary and it may produce false positives if there are other edges in
approaches, we aim to inspire further research in the field, the image that are not part of the lane. Additionally, canny
ultimately paving the way for more accessible and dependable edge detection is only able to identify the lane, and a separate
autonomous mobile robots. algorithm is required to manage object detection, which can
Keywords--Autonomous Mobile Vehicle, Computer Vision, add complexity to the system.
UBASS. PISSA. Low Powered Embedded Systems, UNET, To improve the accuracy of canny edge detection, some
Convolution Neural Networks researchers have combined it with other techniques like the
Hough transform and deep learning. These methods can be
I. INTRODUCTION
more computationally expensive, but they can help to reduce
In today's fast-paced technological landscape, fully false positives and improve lane detection in challenging
autonomous mobile robots have emerged as catalysts for environments. However, these methods still require a separate
profound changes across various industries. They offer algorithm for object detection, and they may struggle with
potential improvements in warehouse management, hospital real-time performance and scalability.
logistics, and last-mile deliveries by automating tasks.
However, a significant obstacle hampers their widespread Another approach that has gained popularity in recent
adoption: the substantial demands they place on years is end-to-end deep learning. This method uses a neural
computational power and energy. network to directly map the input image to the steering
commands needed to control the car. This approach doesn't
Achieving dependable autonomy in mobile robotics has require a separate algorithm for object detection, which can
long been a complex balancing act between harnessing simplify the system, and it has been shown to be effective in
advanced technology and conserving resources. This delicate somecases. However, end-to-end deep learning requires a
lot of
281
979-8-3503-9341-5/23/$31.00
Authorized ©2023
licensed use limited to: Birla IEEE
Inst of Technology and Science Pilani Dubai. Downloaded on March 20,2024 at 10:06:53 UTC from IEEE Xplore. Restrictions apply.
training data, and it can be very computationally expensive. of important road shape data, while a dynamic ROI would
Additionally, this method is hard to debug, and it is not require significant computing resources. Therefore, a simpler
predictive in nature, meaning that it may struggle to handle approach would be to split the image into two halves, since
unexpected situations that were not seen during training. most of the irrelevant noise tends to occur in the upper half of
the image.
In response to these critical challenges, our research paper
introduces two novel algorithms: the Pure Image
Segmentation Approach (PISA) and the UNet Based
Approach to Semantic Segmentation (UBASS). These
algorithms are meticulously designed to address specific
aspects of a mobile robot's ability to understand its Fig. 2. Reducing noise via simple cropping
surroundings and navigate effectively. PISA draws inspiration
from traditional computer vision techniques and reimagines The subsequent step in the pipeline involves the
how robots detect objects, avoid obstacles, and follow lanes. computation of the steering angle. Given the high frame rate
In contrast, UBASS takes advantage of deep learning, of input images, it is not pragmatic to calculate the exact
leveraging its transformative capabilities to enhance a robot's steering angle for each frame. Instead, it can be viewed as a
perception. rate of angle at which the vehicle must steer in order to remain
Our experiments with PISA, UBASS and End-to-End on the road while [2,3] evading obstacles. To achieve this, one
Deep Learning on a mobile robot equipped with Nvidia Jetson can capitalize on the fact that in the binary map, all road pixels
Nano, 4gb RAM, MPCore dual core CPU and a Logitech are represented in black. Thus, the objective is to compute a
Webcam yielded valuable insights, demonstrating not only steering angle that maximizes the presence of black pixels in
their precision but also their efficient use of computational the image, which corresponds to staying on the road. A
resources. Perhaps most notably, these algorithms often straightforward method of accomplishing this is to divide the
outperform established methods like End-to-End Deep image into left and right halves and compute the total number
Learning and the conventional Canny Edge Detection, both in of black pixels in each half. By comparing these [3] pixel
task performance and responsible resource usage. counts, the direction of steering can be ascertained. Finally,
the magnitude of the steering angle (S) can be calculated using
This paper aims to advance the field of autonomous the following equation:
mobile robotics. By providing practical and efficient solutions
to the intricate challenges of navigation and perception, we 𝑆 = (𝜌𝐿 − 𝜌𝑅) ∗ 𝜃𝑚𝑎𝑥 (1)
aspire to empower today's robots and inspire future research Here 𝜌L represents the black pixel density in the left half
endeavors. With a blend of timeless principles and cutting- of the input image, while 𝜌R represents the black pixel density
edge techniques, our journey beckons further exploration and in the right half of the input image. 𝜃 represents the maximum
discovery, ultimately paving the way for more accessible and steering angle that can be mobile robot is expected to steer in
dependable autonomous mobile robots. each instant, usually this value is set at 90 degrees. However,
depending on the use case, this value can be changed to some
II. METHODOLOGY
other value.
This paper proposes two algorithms, each tailored to suit
different use cases depending on the anticipated driving This algorithm has the added advantage of obstacle
conditions and the available computing resources. avoidance, as any obstacles in the environment would be
represented as white pixels. The steering angle would then be
A. Pure Image Segmentation Approach (PISA) calculated based on the location of these pixels, allowing the
This algorithm is specifically designed for deployment in robot to avoid obstacles while maintaining its position within
extremely low-powered embedded systems, thereby the lane. This can be visualized using the accompanying
necessitating its inherent simplicity.[5] The algorithm utilizes figure.
a two-stage image processing technique involving the
application of Gaussian blur to the image, succeeded by
adaptive thresholding to binarize the blurred image.
This strategy effectively mitigates noise and enhances
image smoothness, thereby producing a binary map output.
Fig. 3. Image processing pipeline, showing howobstacle is
ignored in the map.
282
Authorized licensed use limited to: Birla Inst of Technology and Science Pilani Dubai. Downloaded on March 20,2024 at 10:06:53 UTC from IEEE Xplore. Restrictions apply.
degrees. Despite these limitations, the proposed algorithm
succeeds in achieving the fundamental objective of realizing
autonomous navigation even in resource-constrained
environments.
283
Authorized licensed use limited to: Birla Inst of Technology and Science Pilani Dubai. Downloaded on March 20,2024 at 10:06:53 UTC from IEEE Xplore. Restrictions apply.
The current study proposes two novel algorithms for output frame rate, CPU usage, and RAM usage. These
autonomous mobile navigation, which aim to optimize the measurements are summarized in the table below.
navigation process through the incorporation of distinct
decision-making strategies. To validate the effectiveness of
these algorithms, a comparative analysis is conducted,
whereby an end-to-end deep learning approach is
implemented and evaluated alongside the proposed
algorithms. In this regard, the experimental setup for this study
involves the implementation of a rubber mat road track that
simulates real-world environments by integrating diverse Fig. 7. Object Avoidance Using PISA
obstacles, including cones and green boxes. This setup enables
the evaluation of the proposed algorithms and the end-to-end
deep learning approach against challenging navigation
conditions and varying levels of complexity. The end-to-end
deep learning approach, a promising technique in the field of
mobile robot navigation, involves training a navigation system
solely on raw sensory data, this methodology bypasses the
traditional pipeline of intermediate processing steps, leading
to a more streamlined and efficient navigation system. Thus, Fig. 8. Object Avoidance Using UBASS
comparing the proposed algorithms with the end-to-end deep
learning approach presents an opportunity to evaluate the The figure presented provides an illustration of the
efficacy of the different techniques in addressing the successful implementation of lane detection and object
challenges associated with mobile robot navigation. Overall, avoidance systems in our custom-built mobile robot using the
the results obtained from this study will provide insights into PISA and UBASS algorithms. Notably, the UBASS algorithm
the strengths and weaknesses of the proposed algorithms and surpasses the PISA algorithm in its ability to detect and steer
the end-to-end deep learning approach, enabling researchers away from obstacles. This superiority can be attributed to the
to make informed decisions regarding the optimal navigation use of the UNet architecture in the UBASS algorithm, which
strategy for autonomous mobile robots. facilitates enhanced road extraction. Additionally, the UBASS
The proposed experiment aims to capture and analyses algorithm's superior white pixel avoidance system enables it
multiple performance metrics to evaluate the efficiency and to detect and avoid obstacles at an earlier stage than the PISA
effectiveness of the experimental approach. Specifically, the algorithm.
experiment seeks to record and compare various matrix
TABLE I. EXPERIMENTAL DATA FOR COMPUTATIONAL EXPENSE
parameters, including the central processing unit (CPU) IN TABULAR FORM
consumption, random access memory (RAM) consumption,
and frame rate of the processed video stream. Measuring the S. No Algorithm CPU% RAM% FPS
CPU and RAM consumption will provide a comprehensive 1. Pure Image Segmentation
65.24% 43.55%
30.00
understanding of the system's resource utilization, Approach (PISA)
highlighting any bottlenecks or inefficiencies that may be 2. UNet Based Approach to 13.87
Semantic Segmentation 72.52% 66.34%
present. Additionally, the analysis of the frame rate of the (UBASS)
processed video stream will provide insights into the system's 3. End to End Deep Learning 7.35
responsiveness and real-time processing capabilities. Along 97.9% 70.32%
Approach
with this, we have implemented a system for monitoring
instances where the robot deviates from its intended path, 100.00%
makes minor contact with obstacles, or experiences a full
collision with an obstacle. This system allows us to maintain
80.00%
a comprehensive record of the robot's performance and its
ability to navigate its environment with precision and
accuracy. To ensure complete testing of the proposed 60.00%
algorithms, we have run the test 60 times while varying the
road complexity and obstacle for the targeted problem domain. 40.00%
284
Authorized licensed use limited to: Birla Inst of Technology and Science Pilani Dubai. Downloaded on March 20,2024 at 10:06:53 UTC from IEEE Xplore. Restrictions apply.
TABLE II. EXPERIMENTAL DATA FOR REAL WORLD PERFORMACE
No. of runs with No. of runs with No. of runs with slight No. of perfect
S. No Algorithm Lighting Condition
derailment complete collision obstacle grazing runs
Low 23 16 37 8
Pure Image Segmentation
1. Medium 19 13 19 15
Approach (PISA)
Ideal 17 9 10 19
Low 15 7 5 29
UNet Based Approach to
2. Semantic Segmentation Medium 9 5 1 37
(UBASS)
Ideal 8 2 1 50
Low 19 14 3 31
End to End Deep
3. Medium 10 5 2 35
Learning Approach
Ideal 5 1 1 51
Authorized licensed use limited to: Birla Inst of Technology and Science Pilani Dubai.285
Downloaded on March 20,2024 at 10:06:53 UTC from IEEE Xplore. Restrictions apply.