Oalib2024 11null
Oalib2024 11null
Subject Areas
Artificial Intelligence, Image Processing
Keywords
Remote Photoplethysmography (rPPG), Heart Rate Measurement, Remote
Health Monitoring, Computer Vision, Deep Learning
1. Introduction
Heart rate (HR) is a significant physiological indicator, specifically for the rapid
notification of heart diseases and arrhythmias in the heart. The techniques of HR
monitoring can be categorized as direct and indirect contact with the human
Figure 2. (a) A wrist-mounted remote sensor pulse oximeter with a probe. (b) A pulse oximeter is a tool that non-
invasively evaluates oxygen saturation by transmitting light via the skin of a finger [2].
more research studies toward measuring heartbeats without contact with the
body. As wearable devices gained popularity, the incorporation of rPPG into
smartwatches, cellphones, and fitness trackers offers users immediate health data
without the necessity for invasive sensors. Remote tracking of health has been
gaining popular interest due to its offer of a more comfortable and unobtrusive
method for heart rate evaluation, especially for people who have sensitive or burnt
skin or require continuous vital signs measurement, as in an intensive care unit
(ICU). Therefore, methods directed to replace traditional measurement techniques
are based on the process known as rPPG [5] [6]. As mentioned above, rPPG is
helpful in cases of epidemics and infectious diseases, such as COVID-19, where it
is preferable to avoid direct contact with the patient [7]. Remote photoplethys-
mography (rPPG) uses a camera, including thermal, infrared spectrum, or RGB
cameras, to capture video of the individual and detect small color variations in
facial skin to produce the remote signal for PPG. The rPPG method operates on
an idea similar to the standard PPG method, where the rapid flow of blood within
the circulatory system alters the blood density within the blood vessels area below
the skin surface with every heartbeat, producing regular waves. The primary dis-
tinction between the two methodologies lies in the approach employed for col-
lecting the PPG signal; rPPG techniques obtain the signal by capturing the face of
the subject without contact. Conversely, traditional PPG techniques require a
physical sensor that stays in touch with the skin [8]. The principle of rPPG method
is ambient light falls on the skin; part of this light will suffer from a specular re-
flection from the skin surface, which is devoid of physiological information, and
the other parts of the light will suffer from a diffuse reflection from the blood vessels
inside the skin, changes in blood volume inside the vessels corresponding to heart-
beats result in minor differences in reflected light from the skin surface, from which
the PPG signal is derived following processing. This diffuse signal contains physi-
ological information that is captured using the camera, and the heart rate is calcu-
lated or estimated from this signal after filtering and processing it, as shown in
Figure 3 [9]. The rPPG has become one of the most widely used methods due to
Figure 3. Mechanism of rPPG using the dichromatic reflection model (DRM). The digital camera records both specular
and diffuse reflections from ambient light. Specular reflection includes surface information unrelated to signals from
the body, while diffuse reflection is affected by circulation. The signal of rPPG can be generated via additional pro-
cessing of signals [11].
the cheap cost of the standard camera, comfort for patients, and high accuracy of
the results. This method can be used in various contexts, including Telehealth,
which allows healthcare providers to monitor patients requiring continuous ob-
servation of health indicators remotely. Additionally, outside the medical domain,
this method can assess individuals’ stress levels during indirect investigations con-
ducted by authorities and the possibility of using it to monitor the vital indicators
of car drivers in order to alert the driver in the event of recording indicators for
an upcoming health crisis. This technique enables us to take health indicators
from a few meters away with standard cameras. Despite its many advantages,
rPPG faces some challenges; factors like motion artifacts and varying lighting con-
ditions can affect accuracy; performance can vary across different skin tones, re-
quiring additional research for equitable application; the diversity of the database
in the methods that use deep learning to extract the PPG signal affects the accu-
racy [10].
This research provides an in-depth review of rPPG monitoring utilizing com-
puter vision and deep learning approaches to determine the heart rate. According
to the work method, the review divides the heartbeat measurement into two types.
Section two lists the details related to the two types. Section three lists the Potential
applications. Section four lists the advantages and Limitations. Conclusions are in
section five.
2. Literature Review
The main signal being worked on to obtain the heartbeats is obtained in two ways.
The first method is to measure the amount of diffuse reflections light from the
skin to obtain the main signal, and then different algorithms are applied to process
the main signal to achieve the heart rate. The second method extracts the main
signal from subtle head movements generated by heartbeats and then applies
different algorithms and filters to signal processing to achieve the heart rate. Be-
low is the previous study according to how the main signal is obtained.
Figure 4. A description of no physical heart rate estimation using a thermal camera [4].
In 2008, Verkruysse et al. first studied rPPG. They used consumer-level digital
cameras for HR estimation from a distance of about 1 meter, depending on the av-
erage light from the around as a source. This method focuses on taking the initial
signal from the reflection of the light signal on the green channel for ROI, then
applying a bandpass filter with a range of frequency (0.8, 3.5) Hz to get a heart rate
range between (48, 210), then performed an FFT on the obtained signal to derive
heart rate. Two primary factors constrain the spatial clarity of the PPG data (power
and phase) and the level of accuracy of our existing methodology: firstly, movement
objects, and secondly, decreasing signal-to-noise ratio (SNR). The first issue may
be fixed by increasing the location of the volunteers. The second issue can be resolved
through the use of advanced movie cameras with enhanced frame quality and bet-
ter techniques for analyzing signals and processing [13].
In 2010, the author focused on blind source separation (BSS) utilizing independ-
ent component analysis (ICA). ICA is an approach for detecting independent source
data generated by a collection of signals that consist of linear combinations of the
fundamental sources, as seen in Figure 5. This new approach in healthcare signal
analysis is fast growing. ICA has been utilized to eliminate noise from electrocar-
diogram (ECG) and electroencephalogram (EEG) recordings, as well as to decrease
movement effect errors on the PPG signal [14]. The ICA method introduces an
innovative approach for indirect heart rate evaluation from video frames via blind
source separation. In this study, a laptop webcam was used for 12 participants with
varying skin colors. The trials were performed indoors, utilizing different amounts
of sunshine as the sole source of illumination. When recording a video of the skin
on the face with a webcam, the sensor of the camera detects a mix of the reflected
plethysmographic signal and various light fluctuations caused by artifacts, such as
motion variations in environmental illumination, resulting in a summation of in-
dependent random parameters that exhibit greater Gaussian characteristics than
the original parameters [15].
Figure 5. Methodology for heart pulse monitoring. (a) The ROI is dynamically identified with a face detector.
(b) From the ROI region, extracted RGB channels and counted the averaged change to obtain (c) the unfiltered
RGB signals. ICA is utilized on the normalized RGB signals for recovery (d) of three independent source signals
[15].
After recording the video, face detection was performed to identify the ROI and
count the average colors: red, green, and blue. The normalized raw signals are fur-
ther divided into three independent source signals utilizing ICA. This method sep-
arates the extracted signal and divides it into three sources. There will be one or
more sources close to the heart rate indicators, and the other source that contains
noise resulting from abnormal movement and changes in lighting is ignored, then
passed through a bandpass filter with a range of frequency (0.75, 4) Hz. According
to this frequency range, the heart rate range must be between (45 and 240) beats
per minute (bpm). The recorded videos included two statuses: the first involved
no movement, and the second involved typical movement. The results were com-
pared with a blood volume pulse (BVP). This method allows the heart rate of three
people to be read simultaneously. Table 1 shows the results before and after using
ICA to demonstrate this method’s efficiency. There was a noticeable improvement
in the accuracy of this method, but it still suffered from limitations, including be-
ing affected by changes in lighting and movement [15].
from RGB channels obtained from participants’ faces using an ordinary webcam
to determine the HR. The LE method is distinguished from previous techniques,
such as ICA and PCA, in that it processes the signal non-linearly to achieve a more
accurate HR signal. The basic principle of LE is that it operates by embedding
high-dimensional data into a lower-dimensional space while preserving the local
geometric structure. It does this by constructing a graph representation of the
data, where each data point is a node, and edges represent relationships between
points based on proximity. The method seeks to minimize the distance between
connected points in the new space while maximizing distances between discon-
nected points. The method was conducted on 20 people using a standard webcam
to record a 30-second video at an average rate of 30 frames per second. The face
was detected using the method described in [17]. The RGB signal was extracted,
the LE algorithm was applied, and then a bandpass filter was used to delete unwanted
frequencies and calculate the peaks of the final signal to obtain the heartbeats, as
shown in Figure 6. Although the results presented showed better performance
than the previous study, which used a linear process for heart rate estimation, it
was more sensitive to noise and computational complexity [18].
model across new people may produce various results or display poor perfor-
mance when additional people not included in the training dataset are introduced
to the system, potentially resulting in failures in practical applications. Addition-
ally, the research was performed in a regulated setting with defined illumination
parameters [19].
Eulerian video magnification (EVM) was originally employed to determine var-
iations in the facial region of subjects. The core idea of EVM is to amplify small,
subtle changes in a video over time while keeping the overall structure of the video
intact. It focuses on temporal changes in pixel values, which can indicate physio-
logical signals like blood flow. After recording the video, it is converted into se-
quential frames. The RGB signals are extracted for each frame. Then, the filter is
applied to remove unwanted frequencies. After that, the EVM is used to magnify
the slight change in the RGB signal for all frames. Then, the heartbeats are calcu-
lated from the final signal after applying FFT. This strategy was used for data in
which subjects were stationary. This method is impractical for heart rate moni-
toring in real-world scenarios when people may be in motion. Furthermore, its
performance may be degraded in the presence of significant noise within the sig-
nal [20].
In 2020, Laurie et al. proposed an exposure management strategy that improves
the SNR by modifying the camera shot period to maximize it with the absence of
distortion. Nonetheless, the method is weak in terms of subject flexibility, and
shows a prolonged time of execution compared to normal controls. Subjects must
remain somewhat motionless to ensure accuracy, as movement may result in color
channel overloading. The extended execution time may restrict its applicability in
real-time contexts [21].
In 2013, the authors designed a new rPPG signal filtering technique, which de-
pends on chrominance and is termed CHROM. They researched the movement
problem, leading to the development of far more advanced chrominance-based
approaches. The primary issue with rPPG is its resilience to subject movement.
The authors examined how motion affects the pulse signal and derived more ro-
bust rPPG algorithms from this analysis, achieving superiority surpassing all pre-
vious techniques in SNR and movement resilience. The main idea behind chro-
minance algorithms is to isolate and analyze the color components (like red,
green, and blue) of the video to detect subtle changes caused by blood flow. When
blood rushes to the skin’s surface, it alters the skin’s color, which can be quantified
and used to extract the heartbeat signal. The study was conducted on 117 healthy
volunteers with different skin colors. Face detection, as established by Viola and
Jones [17], was utilized to mark an ROI. Subsequently, RGB color data served as
the basis for analysis. A Bandpass filter is applied to the initial signal to remove
unwanted frequencies. After that, the CHROM method is used to analyze the sig-
nal and get a signal close to the heartbeat signal, and finally, FFT is applied to
obtain the HR result, as shown in Figure 7. The SNR was also affected by the color
of the skin type; it suffered from a decrease in people with dark skin compared to
people with white skin, and this decrease is logical; the elevated melanin level
Soaks up a portion of the reflected, diffuse light that conveys signal information
PPG. CHROM showed more sensitivity to motion-induced distortions compared
to approaches based on PCA and ICA [22].
their size, the diversity of samples (including different skin colors), and the variety
of registered environments. All of these factors must be considered to increase the
accuracy of this approach [25].
Figure 8. Summary of our heartbeat prediction methodology. (a) A specific area within the
human head has been chosen, and feature locations are monitored across all video frames.
(b) The vertical component is collected from every feature element direction. (c) Each track
is later dynamically filtered to remove unnecessary frequencies. (d) PCA splits each trajec-
tory into signals from the source s1, s2, s3, s4, and s5. (e) The component showing the
clearest focused frequency is chosen. (f) Peak monitoring determines the beats of the signal
[26].
2010 Poh et al. [15] (with slight movement) 4.59 4.36 0.95
2014 Monkaresi et al. [19] (CA + KNN + Regression) 13.7 13.69 0.58
3. Potential Applications
Heart Rate Monitoring: rPPG can be employed in fitness applications and devices
to track heart rate during exercise, helping users enhance their daily activities. In
hospitals, the rPPG allows continuous heart rate monitoring of patients without
physical touch, therefore reducing the danger of infection [35].
Stress Detection: This method can measure the level of stress in individuals
5. Conclusion
This paper offers a review of rPPG-related studies that can measure heart rate. The
paper discussed the reason for the interest in this technology and the increasing
amount of research related to it, in addition to discussing the methods used, sam-
ples, and the environment in which the experiments were conducted. The databases
Conflicts of Interest
The authors declare no conflicts of interest.
References
[1] Lee, R.J., Sivakumar, S. and Lim, K.H. (2023) Review on Remote Heart Rate Meas-
urements Using Photoplethysmography. Multimedia Tools and Applications, 83,
44699-44728. https://doi.org/10.1007/s11042-023-16794-9
[2] Ghamari, M. (2018) A Review on Wearable Photoplethysmography Sensors and Their
Potential Future Applications in Health Care. International Journal of Biosensors &
Bioelectronics, 4, 195-202. https://doi.org/10.15406/ijbsbe.2018.04.00125
[3] Johnston, W. and Mendelson, Y. (2005) Extracting Heart Rate Variability from a Wear-
able Reflectance Pulse Oximeter. Proceedings of the IEEE 31st Annual Northeast Bi-
oengineering Conference, Hoboken, 2-3 April 2005, 157-158.
[4] Waqar, M. (2019) Contact-Free Heart Rate Measurement from Human Face Videos and
Its Biometric Recognition Application. Ph.D. Thesis, Aberystwyth University.
[5] Al-Naji, A., Gibson, K., Lee, S. and Chahl, J. (2017) Monitoring of Cardiorespiratory
Signal: Principles of Remote Measurements and Review of Methods. IEEE Access, 5,
15776-15790. https://doi.org/10.1109/access.2017.2735419
[6] Yan, Y., Ma, X., Yao, L. and Ouyang, J. (2015) Noncontact Measurement of Heart Rate
Using Facial Video Illuminated under Natural Light and Signal Weighted Analysis.
Bio-Medical Materials and Engineering, 26, S903-S909.
https://doi.org/10.3233/bme-151383
[7] Rohmetra, H., Raghunath, N., Narang, P., Chamola, V., Guizani, M. and Lakkaniga,
N.R. (2021) AI-Enabled Remote Monitoring of Vital Signs for COVID-19: Methods,
Prospects and Challenges. Computing, 105, 783-809.
https://doi.org/10.1007/s00607-021-00937-7
[8] Chen, X., Cheng, J., Song, R., Liu, Y., Ward, R. and Wang, Z.J. (2019) Video-Based
Heart Rate Measurement: Recent Advances and Future Prospects. IEEE Transactions
on Instrumentation and Measurement, 68, 3600-3615.
https://doi.org/10.1109/tim.2018.2879706
[9] van der Kooij, K.M. and Naber, M. (2019) An Open-Source Remote Heart Rate Imaging
Method with Practical Apparatus and Algorithms. Behavior Research Methods, 51,
2106-2119. https://doi.org/10.3758/s13428-019-01256-8
[10] Rouast, P.V., Adam, M.T.P., Chiong, R., Cornforth, D. and Lux, E. (2018) Remote Heart
Rate Measurement Using Low-Cost RGB Face Video: A Technical Literature Review.
Frontiers of Computer Science, 12, 858-872.
https://doi.org/10.1007/s11704-016-6243-6
[11] Cheng, C., Wong, K., Chin, J., Chan, T. and So, R.H.Y. (2021) Deep Learning Methods
for Remote Heart Rate Measurement: A Review and Future Research Agenda. Sensors,
21, Article 6296. https://doi.org/10.3390/s21186296
[12] Pavlidis, I. (2003) Continuous Physiological Monitoring. Proceedings of the 25th An-
nual International Conference of the IEEE Engineering in Medicine and Biology So-
ciety, Cancun, 17-21 September 2003, 1084-1087.
[13] Verkruysse, W., Svaasand, L.O., Stuart Nelson, J., Wieringa, F.P., Mastik, F. and van der
Steen, A.F.W. (1937) Multilaser Photoplethysmography Technique. Institute of Elec-
trical and Electronics Engineers.
[14] Yuan, Z. (2013) Concurrent Spatiotemporal Analysis of Functional Near-Infrared Spec-
troscopy Data Using Independent Component Analysis. Proceedings of the 2013 6th
International Congress on Image and Signal Processing (CISP), Hangzhou, 16-18 De-
cember 2013, 794-798. https://doi.org/10.1109/cisp.2013.6745273
[15] Poh, M., McDuff, D.J. and Picard, R.W. (2010) Non-Contact, Automated Cardiac Pulse
Measurements Using Video Imaging and Blind Source Separation. Optics Express, 18,
10762-10774. https://doi.org/10.1364/oe.18.010762
[16] Lewandowska, M. and Nowak, J. (2012) Measuring Pulse Rate with a Webcam. Journal
of Medical Imaging and Health Informatics, 2, 87-92.
https://doi.org/10.1166/jmihi.2012.1064
[17] Viola, P. and Jones, M.J. (2004) Robust Real-Time Face Detection. International Jour-
nal of Computer Vision, 57, 137-154.
https://doi.org/10.1023/b:visi.0000013087.49260.fb
[18] Wei, L., Tian, Y., Wang, Y., Ebrahimi, T. and Huang, T. (2013) Automatic Webcam-
Based Human Heart Rate Measurements Using Laplacian Eigenmap. 11th Asian Con-
ference on Computer Vision, Daejeon, 5-9 November 2012, 281-292.
https://doi.org/10.1007/978-3-642-37444-9_22
[19] Monkaresi, H., Calvo, R.A. and Yan, H. (2014) A Machine Learning Approach to Im-
prove Contactless Heart Rate Monitoring Using a Webcam. IEEE Journal of Biomed-
ical and Health Informatics, 18, 1153-1160.
https://doi.org/10.1109/jbhi.2013.2291900
[20] Wu, H., Rubinstein, M., Shih, E., Guttag, J., Durand, F. and Freeman, W. (2012) Eulerian
Video Magnification for Revealing Subtle Changes in the World. ACM Transactions
on Graphics, 31, 1-8. https://doi.org/10.1145/2185520.2185561
[21] Laurie, J., Higgins, N., Peynot, T. and Roberts, J. (2020) Dedicated Exposure Control for
Remote Photoplethysmography. IEEE Access, 8, 116642-116652.
https://doi.org/10.1109/access.2020.3003548
[22] de Haan, G. and Jeanne, V. (2013) Robust Pulse Rate from Chrominance-Based RPPG.
IEEE Transactions on Biomedical Engineering, 60, 2878-2886.
https://doi.org/10.1109/tbme.2013.2266196
[23] Wu, B.-F., Huang, P.-W., He, D.-H., Lin, C.-H. and Chen, K.-H. (2019) Remote
[38] Smith, W., Colbert, B.M., Namouz, T., Caven, D., Ewing, J.A. and Albano, A.W. (2024)
Remote Patient Monitoring Is Associated with Improved Outcomes in Hypertension:
A Large, Retrospective, Cohort Analysis. Healthcare, 12, Article 1583.
https://doi.org/10.3390/healthcare12161583
[39] Gibson, K., Al-Naji, A., Fleet, J., Steen, M., Esterman, A., Chahl, J., et al. (2019) Non-
contact Heart and Respiratory Rate Monitoring of Preterm Infants Based on a Computer
Vision System: A Method Comparison Study. Pediatric Research, 86, 738-741.
https://doi.org/10.1038/s41390-019-0506-5
[40] Nowara, E.M., Marks, T.K., Mansour, H. and Veeraraghavan, A. (2018) SparsePPG:
Towards Driver Monitoring Using Camera-Based Vital Signs Estimation in Near-In-
frared. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops,
Salt Lake City, 18-22 June 2018, 1385-1394.