Real Time Human Body Posture Analysis Using Deep Learning
https://doi.org/10.22214/ijraset.2023.52099
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue V May 2023- Available at www.ijraset.com
Abstract: We present a novel approach for accurately estimating the pose of objects in a low-cost and resource-efficient manner,
making it suitable for deployment on embedded systems. Our algorithm comprises two primary stages: object detection and spatial
reconstruction. In the first stage, we employ a Convolutional Neural Network (CNN) called PoseNet for object detection. This
approach has proven effective in detecting and localizing objects in an image. Next, using stereo correspondences, we reconstruct
the 3D spatial coordinates of multiple ORB features within the object's bounding box. This enables us to accurately estimate
the position of the object in space.
To calculate the final position of the object, we compute a weighted average of the stereo-corresponded key points' spatial
coordinates. The weights are proportional to the quality of each ORB stereo match, which yields a more accurate estimate
of the object's position in space. Our algorithm was tested in a calibrated environment, and we compared the results with a deep
learning-based method using various datasets. The results show that our approach outperforms existing methods in terms of
accuracy, while maintaining a low cost and efficient resource utilization. Our proposed method has several applications, including
the quantitative and qualitative analysis of human posture. By analyzing all aspects of a person's posture, we can determine if there
are any postural deviations, imbalances, or muscle weaknesses that may be causing pain or discomfort. This information can then be
used to develop personalized rehabilitation programs, reducing the risk of injury and enhancing athletic performance.
Furthermore, our approach can be used in various assistive technology applications, such as the control of robotic arms for pick-and-
place tasks. The low-cost and resource-efficient nature of our algorithm makes it ideal for deployment in embedded systems, enabling
us to develop affordable and accessible assistive technology solutions.
In conclusion, our proposed algorithm provides an accurate, low-cost, and resource-efficient solution for pose estimation, with a wide
range of potential applications in human posture analysis, assistive technology, and beyond.
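The position estimate described in the abstract (stereo reconstruction of matched features followed by a weighted average) can be sketched as below. This is a minimal illustration, not the authors' implementation: it assumes a rectified stereo pair, and the function names, the camera parameters (`focal_px`, `baseline_m`, `cx`, `cy`), and the mapping from match distance to weight are all assumptions, since the paper only states that the weights depend on the ORB stereo matching.

```python
import numpy as np

def triangulate_rectified(xl, yl, xr, focal_px, baseline_m, cx, cy):
    """Back-project matched pixels from a rectified stereo pair to 3D points.

    xl, yl: pixel coordinates in the left image; xr: column in the right image.
    Returns an (N, 3) array of points in the left-camera frame.
    """
    xl, yl, xr = (np.asarray(a, dtype=float) for a in (xl, yl, xr))
    disparity = xl - xr                      # positive for points in front of the rig
    z = focal_px * baseline_m / disparity    # depth from disparity
    x = (xl - cx) * z / focal_px
    y = (yl - cy) * z / focal_px
    return np.stack([x, y, z], axis=1)

def weighted_position(points_3d, match_distances):
    """Average 3D points, giving better matches (smaller distance) more weight."""
    d = np.asarray(match_distances, dtype=float)
    weights = 1.0 / (1.0 + d)    # assumed mapping; the paper does not specify one
    weights /= weights.sum()     # normalize so the weights sum to 1
    return weights @ np.asarray(points_3d, dtype=float)

# Hypothetical camera parameters and two matched features.
pts = triangulate_rectified(xl=[320, 330], yl=[240, 240], xr=[270, 280],
                            focal_px=500.0, baseline_m=0.1, cx=320.0, cy=240.0)
print(weighted_position(pts, match_distances=[0.0, 1.0]))
```

In practice the match distances would come from an ORB descriptor matcher; here they are plain numbers so the sketch stays self-contained.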
I. INTRODUCTION
Human Pose Estimation (HPE) involves identifying and classifying the joints of the human body to capture a set of coordinates for each
joint, also known as a key point, that together describe a person's pose. There are several methods for pose estimation, including OpenPose,
PoseNet, and DeepPose. This paper discusses the evolution of human pose estimation over the years and concludes that PoseNet is the
most suitable technique for a real-world Android application. PoseNet provides real-time pose estimation for the human body, allowing
for real-time evaluation on the client side. This is achieved with TensorFlow, which enables faster and more privacy-respecting model
inference on Android. PoseNet is an open-source model that can natively extract the essential 16-17 key points and draw a skeleton of
the human pose from them. This skeleton can be used to derive angles between key points, enabling effective correction of the user's
pose.
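As an illustration of deriving an angle from the skeleton, the sketch below computes the angle at a middle joint from three key points, for example shoulder, elbow, and wrist. The function name `joint_angle` and the sample coordinates are hypothetical; the paper does not specify how its angles are computed.

```python
import math

def joint_angle(a, b, c):
    """Return the angle at key point b, in degrees, between segments b->a and b->c.

    Each key point is an (x, y) pair in image coordinates.
    """
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    n1 = math.hypot(*v1)
    n2 = math.hypot(*v2)
    if n1 == 0 or n2 == 0:
        raise ValueError("key points must not coincide with the joint")
    cos_angle = (v1[0] * v2[0] + v1[1] * v2[1]) / (n1 * n2)
    cos_angle = max(-1.0, min(1.0, cos_angle))  # guard against rounding error
    return math.degrees(math.acos(cos_angle))

# Hypothetical shoulder, elbow, and wrist positions in pixels.
print(joint_angle((100, 50), (100, 150), (200, 150)))
```

Comparing such an angle against a target range is one simple way a posture-correction application could decide whether to prompt the user.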
Human pose estimation and tracking is a computer vision task encompassing the detection, association, and tracking of semantic key
points, such as "right shoulders," "left knees," or the "left brake lights of vehicles." Real-time performance of semantic key point tracking
in video footage demands significant computational resources, which limits the accuracy of pose estimation. Pose estimation has diverse
applications, including interactive installations that respond to human motion, augmented reality, animation, fitness tracking, and more.
We aim to foster experimentation with and application of pose detection in unique projects by making our model accessible. While
various alternative pose detection systems have been open-sourced, all require specialized hardware and/or cameras, as well as
extensive system setup.
A. Problem Statement
The optimization of human body productivity and enhancement of athletic performance through various techniques can facilitate the
development of numerous assistive technologies in the field of robotics.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 2951
B. Objective
The primary aim of this project is to promote healthy living and enhance posture through the use of advanced technologies such as
augmented reality, virtual reality, and training robots. The proposed project seeks to investigate new assistive technologies that can
positively impact our daily lives.
B. Raspberry Pi 4GB
The project uses a Raspberry Pi 4 (4 GB), a Raspberry Pi Camera Module, and a small breadboard with an LED, a resistor, and a push button. This is
the hardware configuration used to run the pipeline. A Raspberry Pi can be used for a wide range of applications: popular uses include turning it
into a retro arcade machine, a web server, or the brain of a robot, security system, IoT device, or dedicated Android
device.
C. Software Implementation
This implementation uses the PoseNet model integrated in TensorFlow Lite. Everything is written in Python to run on the Raspberry
Pi 4. The code written for this project builds a pipeline that feeds images to the model, processes them using the pretrained TensorFlow model,
decodes the model output, and draws key points and limbs on the processed images. Post-processing can be used to convert these images
into a video.
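The decoding step of such a pipeline can be sketched as follows. This is an illustration under stated assumptions, not the project's code: it assumes the commonly distributed single-pose PoseNet TensorFlow Lite model, whose outputs are a heatmap tensor of shape (9, 9, 17) and an offset tensor of shape (9, 9, 34) for a 257x257 input, with the first 17 offset channels holding y offsets and the last 17 holding x offsets. The function name `decode_single_pose` is hypothetical, and synthetic arrays stand in for real model output.

```python
import numpy as np

def decode_single_pose(heatmaps, offsets, input_size=257):
    """Convert PoseNet-style heatmaps and offsets into (y, x, score) key points.

    heatmaps: (rows, cols, K) raw scores; offsets: (rows, cols, 2K),
    assumed to hold y offsets in channels [0, K) and x offsets in [K, 2K).
    """
    heatmaps = np.asarray(heatmaps, dtype=float)
    offsets = np.asarray(offsets, dtype=float)
    rows, cols, num_kp = heatmaps.shape
    keypoints = np.zeros((num_kp, 3))
    for k in range(num_kp):
        # Grid cell with the highest response for this key point.
        r, c = np.unravel_index(np.argmax(heatmaps[:, :, k]), (rows, cols))
        # Map the cell to input-image pixels, then refine with the offset.
        y = r / (rows - 1) * input_size + offsets[r, c, k]
        x = c / (cols - 1) * input_size + offsets[r, c, k + num_kp]
        # Sigmoid turns the raw heatmap value into a confidence in (0, 1).
        score = 1.0 / (1.0 + np.exp(-heatmaps[r, c, k]))
        keypoints[k] = (y, x, score)
    return keypoints

# Synthetic model output: key point 0 peaks at grid cell (4, 2).
heatmaps = np.full((9, 9, 17), -10.0)
heatmaps[4, 2, 0] = 5.0
offsets = np.zeros((9, 9, 34))
offsets[4, 2, 0] = 3.0     # y offset for key point 0
offsets[4, 2, 17] = -2.0   # x offset for key point 0
print(decode_single_pose(heatmaps, offsets)[0])
```

Key points with a low score would typically be skipped before drawing limbs, with the threshold chosen empirically.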
D. LCD Display
LCD (Liquid Crystal Display) is a type of flat panel display that uses liquid crystals as its primary means of operation. LCDs have a
large and varied set of use cases for consumers and businesses, and are commonly found in smartphones, televisions, computer
monitors, and instrument panels. LCDs were a big leap over the technologies they replaced, which include light-emitting diode
(LED) and gas-plasma displays. LCDs allowed displays to be much thinner than cathode ray tube (CRT) technology. LCDs consume
much less power than LED and gas-plasma displays because they work on the principle of blocking light rather than emitting it. Where
an LED emits light, the liquid crystals in an LCD produce an image using a backlight.
IV. RESULT
Our research is the first to apply deep convolutional neural networks to end-to-end 6-degree-of-freedom (6-DOF) camera pose
localization. We have demonstrated that transfer learning from classifiers trained on relatively small datasets can be used to overcome the
need for millions of training images. Our findings show that these networks retain sufficient pose information in their feature vectors,
even though they are trained to produce pose-invariant outputs. Our future research aims to investigate the use of multiview geometry as
a source of training data for deep pose regressors and explore probabilistic extensions to this algorithm. Additionally, we acknowledge
that there is an upper limit to the physical area that can be localized by a finite neural network, which we leave for future exploration.
V. CONCLUSION
The primary advantage of the approach described is its ability to rapidly and autonomously provide high-probability detections based on
conservative estimations of key point parameters in image transformations. A concise list of the most probable pose positions, 3D poses,
and sizes is derived, which can be refined in a subsequent step to determine the correct object parameters. Additionally, as the algorithm
always obtains the most probable pose for each image position, the parameter space is significantly reduced, allowing more sophisticated
techniques to be employed to identify objects in situations where the highest convolution results do not match the object. Therefore, the
algorithm, as currently presented, is intended to provide a swift parameter space reduction, which can directly estimate poses or initiate a
refined search using alternative methods. Further research is needed to explore the full potential of this approach and to identify its
limitations.
BIOGRAPHIES
Ram krishna
Student,
Dept. of Electronics and
Telecommunication Engineering,
Sinhgad College of Engineering,
Maharashtra, India.
Vaibhav Shekhar
Student,
Dept. of Electronics and
Telecommunication Engineering,
Sinhgad College of Engineering,
Maharashtra, India.
Aman Raj
Student,
Dept. of Electronics and
Telecommunication Engineering,
Sinhgad College of Engineering,
Maharashtra, India.
Prof. Ganesh V. Madhikar
Assistant Professor,
Dept. of Electronics and
Telecommunication Engineering,
Sinhgad College of Engineering,
Maharashtra, India