1. Introduction
Photoplethysmography (PPG) has proven effective in monitoring cardiovascular-related physiological signs, especially heart rate (HR), oxygen saturation (SpO
2) and blood pressure (BP) [
1]. Due to the advantages of low cost and convenience, PPG sensors are widely applied in wearable healthcare. Though having a great potential for wearable healthcare, the accuracy of PPG sensors during motion such as exercise by the user is still unsatisfactory due to motion artifacts (MA) [
2]. The MA is typically caused by the change of blood flow velocity induced by the motion [
3] or the relative movement between PPG sensors and human skin [
4]. The wide frequency range of MA with time-varying nature makes it difficult to use traditional filtering techniques for the removal of motion artifacts [
5]. Thus, to improve the reliability and accuracy of health monitors based on PPG sensors, the removal of MA continues to be a technical challenge that needs to be tackled.
Early studies attempted to reconstruct PPG signal from MA-corrupted signals with traditional signal processing methods. Independent component analysis (ICA) is a common method [
6], which can separate motion artifacts and clean PPG signal from multi-channel corrupted PPG signals. However, if the statistical independence between motion artifacts and clean PPG signals is not well satisfied, the method could hardly be effective [
7]. Researchers have also adopted wavelet transform for PPG MA removal [
8]. Nevertheless, this method needs users to empirically set the threshold values for wavelet de-noising, which limits its usage. The empirical mode decomposition (EMD) was also proposed for removing motion artifacts [
9], but the performance of EMD may be affected by intermittency signals and noise, known as the mode-mixing problem. Later, more researchers tried to combine different techniques to achieve better results. In [
10], researchers proposed a method that combines Fourier series reconstruction and frequency-domain independent component analysis (FD-ICA) to reduce PPG motion artifacts in a step-by-step manner. Another method proposed in [
11] combines multiple signal processing techniques and adaptive noise cancellation (ANC), which uses fast Fourier transform (FFT), singular value decomposition (SVD) and ICA to extract the noise reference from corrupted PPG signal. However, these methods are still not effective for restoring the PPG signal from a heavily corrupted signal.
Recently, some application-oriented studies tried to skip exact PPG signal recovery and directly estimated the physiological signs from preliminary de-noised PPG signal during intensive exercise, especially in the application of HR monitoring [
12]. In [
13], researchers combine sparse signal reconstruction (SSR) and a special HR tracking scheme to accurately estimate HR from de-noised PPG signal. The method proposed in [
14] applies an empirical and complex tracking scheme that incorporates the ensemble empirical mode decomposition (EEMD) algorithm to estimate HR. Another method proposed in [
15] combines the phase vocoder technique, a HR tracking and a smoothing stage for HR estimation from de-noised PPG signal. There are also many other methods which follow a similar routine as the ones mentioned above. Although such methods may be able to estimate HR from MA-corrupted PPG signal accurately, it could not be applicable to other physiological signs estimation, such as BP monitoring, because these signs need to extract more comprehensive information from PPG signal, which has stricter requirements for signal quality.
However, because the useful part of PPG signal is mainly constructed by spectral components around its fundamental frequency and harmonic frequencies [
16], the accurate estimation of HR can be a key for restoring PPG signal due to the high correlation between heart rate frequency (HRF) and fundamental frequency of PPG. In this paper, a robust method is proposed for removing motion artifacts in PPG signals during intensive exercise. The method combines accurate HRF estimation with notch filtering. It first reduces the motion artifacts through a cascaded ANC stage that filters the corrupted PPG signal using three-axis acceleration signals. Then a simple heart rate frequency tracking (HRFT) scheme and a specially-designed heart rate frequency correction (HRFC) stage are applied for HRF estimation. After that, two tunable notch filters are constructed using the HRF and its second harmonic frequency to restore the PPG signal. Finally, the estimated HR value and the restored PPG signal are obtained by the proposed method.
2. Proposed Method
Shown in
Figure 1 is the overall signal flow of the proposed method. The method requires PPG signal(s) and three-axis acceleration signal(s) as its inputs. The 4th order 0.4 Hz–4.0 Hz Butterworth band-pass filters (BPF) are firstly employed to remove the out-of-band noise of the input PPG and acceleration signals. Then a cascaded ANC stage is adopted to adaptively cancel the motion artifacts in the PPG signals using respectively x, y and z-axis acceleration signals as reference noise. After that, a simple HRFT scheme is applied to give a preliminary estimation for the HRF from the spectrum of the ANC-de-noised PPG signal. Then a novel HRFC stage is used to correct possibly wrong HRF estimation and give the final estimated HRF value. The HRF and its second harmonics frequency are used as notch frequencies to construct two notch filters. The notch filters remove the PPG components from the ANC-de-noised PPG signal and extract the motion artifacts. And the MA-removed PPG signal is finally obtained through subtracting the motion artifacts from the ANC-de-noised PPG signal.
A widely-utilized open source data set provided for the 2015 IEEE Signal Processing Cup [
12] is used for preliminary evaluation of different stages in the proposed method. The data set contains signals collected from 12 subjects (11 males and one female). For each subject, two channels of PPG signals and three-axis acceleration signals are recorded from one wrist, with one channel electrocardiograph (ECG) signal recorded simultaneously for about 5 min with a sampling rate of 125 Hz. As the ECG signal is barely influenced by MA, it is used for calculating the reference HR. During the 5 min recording, the subjects exercise on a treadmill in the following pattern: 1–2 km/h walking for 30 s, 6–8 km/h running for 60 s, 12–15 km/h running for 60 s, 6–8 km/h running for 60 s, 12–15 km/h running for 60 s and 1–2 km/h walking for 30 s. For the first 30 s, subjects are in a relatively static initialization stage. Average Absolute Error (AAE) between reference HR and estimated HR from the proposed method [
12] is used for evaluating performance of the proposed method, which denoted as
HRAAE. The
HRAAE for each subject is calculated using Equation (1).
HREST(i) and
HRTRUE(i) are respectively the
i-th HR estimation results and reference HR value.
N is the number of total estimations for a subject.
2.1. Cascaded Adaptive Noise Cancellation
Adaptive noise cancellation is an effective technique to de-noise a noise-corrupted signal given a reference signal that is highly correlated with the noise [
17].
Figure 2 shows the signal flow of typical ANC. In the proposed method, the least mean squares-Newton (LMS-Newton) algorithm [
18] is applied for adaptive filtering, which features a faster convergence speed than conventional LMS-based algorithms. The LMS-Newton algorithm is given in Equations (2) and (3). The parameter
x(k) is the reference signal of MA, which is the acceleration signal.
W(k) is the weight vector of the finite impulse response (FIR) filter, and
e(k) is the output for de-noised PPG signal. The parameter α and
μ in Equation (3) determine the speed of convergence.
Because MA can be decomposed into motions in different directions, a cascaded topology of ANC is used here to reduce the motion artifacts better, as is illustrated in
Figure 1. The cascaded adaptive noise cancellers use respectively x, y and z-axis acceleration signals as their MA reference inputs. Band-pass filtered PPG and acceleration signals are streamed into this step at a specified time interval
ltv. In order to avoid an unmatched scale between the PPG signal and the acceleration signal, Z-score standardization is adopted to normalize the inputs of each adaptive noise canceller. Especially, if there are multiple PPG signals at the input end of the cascaded ANC, the PPG signals will be averaged into a single PPG signal before the first adaptive noise canceller.
Shown in
Table 1 is the experimental results of
HRAAE before and after applying the cascaded ANC stage of the proposed method. When only BPF is applied to the MA-corrupted signals, the estimation errors for HR values are very large, reaching an average
HRAAE of 11.47 bpm over the 12 subjects, which indicates that the MA frequencies are strongly deviated from the HRFs and the spectral magnitudes of MA are larger. After applying the cascade ANC stage for MA reduction, the HR estimation errors significantly decreased.
2.2. Heart Rate Frequency Tracking
After the cascaded ANC stage, the de-noised PPG signal
SPPG is used for a preliminary estimation of current HRF. FFT is first applied to acquire the spectrum of
SPPG, where only the magnitude information is considered. A relatively large number of points for FFT
LFFT is used to increase frequency resolution. Then a simple HRFT scheme from [
15] is used for preliminary HRF estimation.
Flowchart of the HRFT scheme is shown in
Figure 3. The variable
i corresponds to the
i-th PPG signal sequence being processed by the proposed algorithm since it starts.
fHRlow and
fHRhigh are the low and high boundaries of typical human HRF. The partameter
fPHR denotes the value of preliminary HRF estimation result and
fHR is the final estimated HRF value. For the first signal sequence,
fPHR is directly selected to be the frequency that corresponds to the largest spectral magnitude of
SPPG within the human HRF range. Starting from the second signal sequence,
fPHR is tracked from a range of ±
Δf around
fHR of the previous sequence. When
i is smaller or equivalent to
Ninit,
Δf is set to be an initial constant
Δf0.
Ninit corresponds to the initialization stage of the proposed algorithm that typically takes 30 s to 1 min, where the users are required to stay motionless. If the initialization stage has finished,
Δf is updated according to be the sum of the largest absolute difference of previous consecutive HRF estimation results and a bias
b. Then,
fPHR is selected to be corresponding to the largest spectral magnitude within a range of ±
Δf around previous
fHR and also within human HRF range. The calculated
HRAAE results after applying the HRFT stage have been improved, as shown in
Table 2.
2.3. Heart Rate Frequency Correction
After the preliminary HRF estimation, a heuristic HRF correction scheme is further designed to correct possibly wrong
fPHR results and give the final HRF estimation result
fHR. The flow of this scheme is illustrated in
Figure 4. The value of
fHR is initialized as
fPHR. For a relatively small time interval, the change in human HRF is not expected to be very large. So if the difference between current
fHR and HRF estimation result of the previous PPG signal sequence
fHR,pre becomes larger than a threshold
Th0, it is highly possible that either
fHR or
fHR;pre is a wrong estimation result. Under that circumstance, the HRFC scheme tries to judge whether
fHR is estimated to be wrong and will correct it if so (As the proposed method targets real-time online application,
fHR,pre should not be corrected).
To correct a possibly wrong fHR result, the HRFC stage first tries to find fN, which corresponds to the largest current PPG spectral peak between fHR and fPHR. If fN is found, it is highly possible that fN is the correct HRF estimation result for fHR. However, it need to further check its fidelity to make the final decision. It first needs to check whether fACC is within a small range of ±Δ around fHR, where fACC is the frequency of the largest spectral magnitude of SACC, and SACC is the magnitude spectrum of acceleration signal calculated from the average of the three-axis acceleration sequences. If that is true, fHR is highly possible to be a wrong result which actually corresponds to motion artifacts, and HRFC further checks whether the spectral magnitude of fN is large enough when compared to the one of fHR so that fN can be the correct estimation result of fHR.
If fACC is not found to be around fHR, HRFC cannot decide whether fHR is a wrong result, and it further checks whether fN can be a faked HRF peak which is actually caused by motion artifacts. If fACC is found to be around fN and the spectral magnitude of fN is not large enough, HRFC keeps the value of fHR unchanged. If HRFC is not able to make a decision after all those checks, it simply compares the spectral magnitude of fN and fHR and set fHR to be fN if the magnitude of fN is large enough. For the spectral magnitude comparisons between fN and fHR, the range of thresholds Th1, Th2 and Th3 is (0, 1). After final estimation of HRF, the HR value (60 fHR bpm) for current PPG sequence is also calculated as an output of the proposed method.
As shown in
Table 3, the final average
HRAAE over the 12 subjects is 0.92 bpm for the complete HRF estimation algorithm, with an average standard deviation (SD) of 1.50 bpm (SDs are calculated within the estimation results of every subjects and then averaged over the 12 subjects).
2.4. Notch Filtering
Cascaded notch filters are used to restore the PPG signal based on the estimated HRF. As is discussed above, the PPG signal can be treated as a quasi-periodic signal with spectral components mainly distributed as peaks around its HRF and the harmonic frequencies of HRF. Thus notch filters or comb filters can be used to eliminate unwanted components and keep only the PPG-related ones. Shown in Equation (5) is the typical transfer function of a digital notch filter [
19]. In Equation (4),
fn is the notch frequency and
Fs is signal sampling rate. Shown in
Figure 5 is the frequency domain amplitude response of the notch filter, where the parameter
r controls its bandwidth. It can be observed from
Figure 5 that notch filters actually suppress the signal around notch frequencies while keep the components of other frequencies close to their original amplitudes.
For every processing iteration (every Itv s), two notch filters are constructed using respectively current HRF estimation result fHR and the second harmonic frequency of fHR, because the main features of time domain PPG signal in one period are its main pulse and dicrotic pulse. For the first notch filter, notch frequency is set to be fHR directly, while for the second one, its notch frequency is set to be the one that corresponds to the largest spectral peak around 2 fHR due to the quasi-periodicity of PPG signals. The output of the cascaded ANC stage is filtered using the two notch filters, where the PPG components are removed while the MA is kept as it is. Through finally subtracting the notch filtering output from the ANC-de-noised PPG signal, the restored PPG signal are achieved.
Figure 6 shows the performance of the proposed method to restore the PPG signal from MA-corrupted signal. The PPG signal was seriously distorted during intensive exercise, and the amplitude of the PPG component in the frequency domain was not obvious. After the proposed method was applied, the waveform of the PPG signal was restored and only the fundamental and second harmonic frequency of the PPG signal were found in the frequency domain. Because there was no reference PPG signal in the open source data set, a preliminary judgment was made through ECG that the peak number and interval of the recovered PPG signal are consistent with that of ECG.
4. Discussions and Conclusions
PPG sensors have been widely used in wearable health tracking, from the early HR monitoring to the present BP and SpO2 monitoring. However, the accuracy of PPG sensors during exercise is unsatisfactory due to the motion artifact. At present, many studies have proposed methods to improve the accuracy of HR monitoring, which ignore the morphological recovery of the PPG signal. It limits the application of PPG sensors. In this paper, a novel method is proposed for PPG MA removal during intensive exercise. On the one hand, an LMS-Newton-based cascaded ANC and unique HRF estimation scheme are applied to increase the accuracy of HR calculation. The mean AAE of HR during intensive exercise is improved from 11.47 bpm to 0.92 bpm, which verified preliminary on a widely-used open source data set of 12 subjects. On the other hand, the notch filters are employed here to recover the PPG components with the calculated HRF, which maintaining the morphological characteristics of PPG signals during intensive exercise. In the practical experiment, the proposed method achieves mean AAE result of 0.89 bpm, which shows the stability and strength of the proposed method for accurate HR calculation during intensive exercise. At the same time, the average correlation coefficient between recovered PPG signal and reference PPG signal reaching 0.86. The main features of PPG signal are restored, which is helpful to the future study of PPG sensors, such as wearable BP monitoring based on PPG. In addition, the proposed method has the advantage of real-time performance. It only costs an average of 0.83 ms to process the PPG signal with 25 Hz sampling rate. The proposed method provides a good basis for the improvement of the wearable PPG sensors.