1. Introduction
The development of wearable technology for the assessment of respiratory activity has grown in the last decades due to the broad range of available sensors [
1,
2,
3,
4]. Lightweight sensors and electronics embedded in customized gear (e.g., pants and
t-shirts) or accessories (e.g., belts and bras) are designed to gather relevant information from the users to monitor their respiratory activity and recognize early signs of abnormal physiological conditions, such as cardiac or respiratory arrest [
4], chronic respiratory diseases [
5,
6], or other psycho-physiological stressors [
7].
Several recent review studies provide an overview of the currently available wearable technology to assess the most relevant features of respiratory mechanics. Among those reviews, Dinh and colleagues provided an up-to-date summary of stretchable respiration sensors based on different sensing principles [
1]. The authors noted that the integration of sensors with other electronics and power sources into a single wearable device represents acurrent challenge that needs to be addressed. In addition, they noted that additional effort is required to improve the stretchability and wearability of respiration sensors without causing performance degradation. Soon and colleagues provided an overview of devices that are currently available on the market and designed for the remote monitoring of vital signs, including respiration, particularly in outpatient settings [
3]. These authors, in accordance with other ones [
4], noted that the routine use of these devices, in both outpatient and healthcare settings, is still limited. In addition, they pointed out that the literature lacks solid evidence to verify the effectiveness of some of the reviewed products. Massaroni and colleagues focused their attention on contact-based methods for measuring the respiratory rate [
7]. The authors identified a wide taxonomy of the available technology and provided a detailed analysis of their metrological properties, characteristics, and potential applications. Finally, in one of our recent works, we collected research that supports the use of wearable devices to continuously monitor tidal volume in both daily activities and clinical settings [
8]. We concluded that some of the proposed methodological approaches (i.e., optical, resistive, and inductive stretch sensors) have been widely investigated in the literature, while others are at an early stage. However, in both cases, the accuracy of the proposed wearable solutions might not be sufficient. Overall, current evidence does support the use of wearable technology to monitor respiratory activity, particularly in subjects affected by respiratory diseases outside the clinical setting. However, additional effort is required to overcome the limits documented in earlier reports.
The use of inertial measurement units (IMUs) has become popular in all branches of human biomechanics, such as in sports [
9,
10], clinical applications [
11,
12], biomedical engineering [
13,
14], and virtual reality [
15,
16]. These sensors are typically composed of a multi-axial accelerometer and a multi-axial gyroscope; thus, they are sensitive to the linear acceleration and angular velocity of the body segment they are attached to. During the last two decades, several studies have investigated the accuracy of different algorithms, parsing the output (acceleration and/or angular velocity) of one or more IMUs to assess respiratory activity as well.
Authors have adopted a single dual- or tri-axial accelerometer as an inclinometer to reconstruct the slow angular movement of either the chest or the abdomen and to extract some of the features (i.e., respiratory rate and tidal volume) related to respiratory activity [
17,
18,
19]. Others determined the respiratory-related variation in the trunk orientation by fusing signals from a tri-axial accelerometer and a tri-axial gyroscope by means of a Kalman filter [
20,
21,
22,
23]. These approaches focus on the variation in the trunk orientation determined by the respiratory activity based on the evidence that the respiratory-related magnitude of the inertial acceleration is relatively small compared to a change in the gravitational components. Other authors analyzed the frequency content of signals from a single dual- or tri-axial accelerometer located on either the chest or the abdomen to estimate the respiratory rate [
24,
25,
26]. Fekr and colleagues adopted a similar sensory setup (i.e., one tri-axial accelerometer located on the sternum) to reconstruct respiratory patterns and, accordingly, recognize respiratory disorders [
27,
28]. Two studies dealt with the use of either two tri-axial accelerometers or two tri-axial IMUs located on the anterior and posterior chest walls [
29,
30]. By using differential approaches, these authors estimated the respiratory-related chest expansion by reducing the effects of the translational movements of the user, particularly during dynamic motor tasks. Finally, De la Fuente and colleagues adopted a network of 13tri-axial accelerometers to develop an automatic strategy revealing costal-superior and costal-abdominal respiratory patterns [
31].
Based on the current literature, the main strengths of an IMU-based approach to monitoring respiratory activity are: (i). the low cost of sensors that are widely available on the market; (ii). their low power consumption, allowing for a long lifespan between charges; (iii). the inherent lightweight and small dimensions of hardware modules embedding sensors, a microcontroller, power supply, and memory and/or wireless data transmission technologies, which make these devices unobtrusive and potentially powerful for human wellbeing monitoring during daily activities; and iv. the respiratory activity assessment through IMUs can be considered one of the most convenient and noninvasive strategies to monitor subjects, such as children, who lack cooperation when undergoing mildly invasive approaches, such as spirometry.
Despite the mentioned, though non-exhaustive, array of studies dealing with IMU-based approaches to monitoring respiratory activity, a wide variability can be found among them. This variability mainly refers to the adopted sensors (e.g., accelerometers vs. gyroscopes), the sensory location (e.g., chest vs. abdomen and sternum vs. left costal margin), the algorithms for the assessment of the most relevant features of breathing patterns (e.g., trigonometric vs. Kalman filter-based data fusion), and the application-related approach (e.g., static vs. dynamic motor tasks and single vs. differential sensory strategy). In addition, some of the proposed strategies have been roughly validated, so the outcomes of these studies appear to be quite preliminary [
7,
8].
The aim of this study was to evaluate the feasibility of an IMU-based strategy to estimate respiratory patterns in subjects lying on a bed. In this respect, our study was designed to overcome some of the main methodological limits of the current literature (e.g., limited number of enrolled subjects, subject-specific tuning of algorithms, and different experimental setups) to achieve robust results that unequivocally support or reject the use of IMUs for the estimation of respiratory patterns. In addition, since our results corroborated the former hypothesis (i.e., feasibility of an IMU-based strategy to estimate respiratory patterns), the accuracy that we achieved with the proposed approach represents a benchmark to compare other algorithms. In the long term, we would like to contribute to the development of an alternative, noninvasive methodology to monitor significant variations in respiratory activity (e.g., changes in lung elasticity and progression of chronic respiratory diseases) in bedridden patients.
To achieve this multifaceted goal, our work was structured in two steps. During the first step, we analyzed the kinematics captured by a network of 9IMUs and 45 reflective markers equally distributed on the anterior wall of the trunk of subjects lying on a bed (
Section 2.1,
Section 2.2 and
Section 2.3). Then, we estimated the respiratory patterns by means of optoelectronic plethysmography and used them as reference variables (
Section 2.4). After that, through principal component analysis (PCA), we searched for the minimal set of signals coming from the network of IMUs where there was a match with linear combinations for the respiratory patterns of the whole chest wall, the rib cage compartment, and the abdominal compartment as a function of time (
Section 2.5). This last task was based on the evidence that signals collected by an IMU as a function of time likely follow chest movements during respiratory activity [
17,
18,
19]. The second step consisted in developing a model relating the retained minimal set of signals coming from the network of IMUs to respiratory patterns. This model was finally validated by using the leave-one-out cross-validation (LOOCV) procedure (
Section 2.6). Notably, the adopted two-step strategy was designed to prevent any significant effect of anatomical dimorphisms due to gender and subject-specific features on the accuracy of the estimates.
2. Materials and Methods
2.1. Enrolled Participants
Eighteen healthy young adults were enrolled in this study. Participants had no history of diseases (e.g., respiratory, postural, or neurological) that could interfere with the experimental sessions. Age and anthropometrical features are reported in
Table 1. All participants signed an informed consent form before starting the experimental sessions.
2.2. Experimental Setup and Protocol
The participants were asked to lie supine on a bed at rest while the respiratory movements of the chest and abdomen were contemporaneously recorded by an optoelectronic system (8-camera optical system Smart-DX, BTS Bioengineering, Milano, Italy) and nine IMUs (Xsens wireless Motion Tracker Awinda system [
32]), as depicted in
Figure 1. A set of 45 reflective markers (diameter of 10 mm) were located on the anterior wall of the trunk, from the collarbone to the segment connecting the anterior iliac spines. The location of the marker set likely resembled that reported in earlier studies dealing with optoelectronic plethysmography [
33,
34,
35]. Nine of the markers were attached in the case of the nine IMUs (
Figure 1). The axes of all IMUs were oriented as follows: the x-axis ran longitudinally, towards the head; the y-axis was the latero-lateral axis, towards the right side; and the z-axis was arranged according to the right-hand role (vertical, upward orientated). Data collected by the optoelectronic system and the IMUs were digitalized at 50 Hz and 100 Hz, respectively.
During the experimental sessions, participants were asked to breathe normally fora 2 min time window. At the beginning of each trial, IMU
1 (
Figure 1) was delicately beat about three times to induce quasi-impulsive variations in the kinematics measured by the optoelectronic system and the inertial sensors. These impulsive signals were used for offline synchronization of both data streams, as described elsewhere [
36] and in
Section 2.3. For each participant, data related to 2 repetitions were collected. The time gap between the repetitions was about 2 min. Overall, we collected 36 pairs of data streams (i.e., 18 subjects × 2 repetitions).
The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of the Scuola Superiore Sant’Anna, Pisa (IT) (protocol code: 01/2021; date of approval: 11 February 2021).
2.3. Pre-Processing
Both the 3D kinematics of the 45 reflective markers and the output of each of the 9IMUs (3 linear accelerations and 3 angular velocities) were pre-processed offline according to the following pipeline: i. gap filling, if any; and ii. zero-lag, low-pass filtering (Butterworth, 4th order) with cut-off at 10 Hz. Notably, although the literature agrees on the evidence that most of the power spectral density of chest motion kinematics while breathing normally falls within 1.5–2 Hz [
18,
27,
30], we decided to account for a wider frequency bandwidth, i.e., 0–10 Hz, to prevent any lack of information that could negatively affect the outcome of the PCA.
After that, spline interpolation was applied to the 3D kinematics of the reflective markers to virtually double its sample rate and match that of the IMUs (i.e., 100 Hz). This step was required to minimize the error associated with the estimated time lag during the time alignment procedure.
Data from the camera system and the IMUs were time-aligned by assessing the time lag between them. Specifically, we computed the cross-correlation between the second time derivative of the vertical component of the marker on IMU1 and the Z-component of the acceleration detected by the same IMU within a time window including the quasi-impulsive artifacts induced by the beats. The time lag between the data streams coincided with the abscissa of the maximum of their related cross-correlation function.
2.4. Assessing the Tidal Volume as a Function of Time by Optoelectronic Plethysmography
The approach to estimating tidal volume as a function of time based on the 3D kinematics of the trunk, namely, optoelectronic plethysmography, is widely described in earlier literature [
33,
34]. Briefly, this method consists of approximating the chest wall in a series of non-overlapped triangles in which the vertexes coincide with three neighboring markers. Then, the frame-by-frame variation in chest wall volume, namely, V
CW, is computed as the summed contribution of all volumes enclosed by these triangles.
With respect to our study, we assumed that the respiratory activity was mostly reflected in the movement of the frontal view of the chest, since the posterior wall was constrained due to the subject’s supine position on the bed. In addition, we assumed that the bias due to the thickness of the IMUs (~13 mm) on the vertical component of the related markers did not substantially alter the tidal volume estimation. Then, based on our marker set, we defined 64 non-overlapped triangles that were symmetrically arranged with respect to the midline (vertical line M in
Figure 1). After that, V
CW was estimated as the volume enclosed by all these triangles and split with respect to the rib cage (where V
RC is the volume enclosed between lines A and E) and the abdomen (where V
AB is the volume enclosed between lines E and G) compartments.
2.5. Principal Component Analysis (PCA) of IMU Outputs
The PCA was run to reduce the dimensionality of the input dataset consisting of the output from the IMUs. In particular, it was carried out to identify a subset of IMU-related homologous signals (i.e., either accelerations or angular velocities) where a linear combination allowed for an optimal match with VCW, VRC, and VAB.
For each subject and each trial, we initially defined eight data matrices by pooling homologous signals related to all IMUs as follows:
where:
- -
and represent the ith component (i.e., X, Y, or Z components) of acceleration and angular velocity, respectively, measured by the jth IMU (j = 1, 2, …, 9);
- -
and represent the Euclidean norm of the three components of the acceleration and angular velocity, respectively, related to the jth IMU.
Each of these eight data matrices was first z-scored and then parsed by the PCA. Briefly, the PCA projects the initial dataset (e.g.,
,
,
, and
) in a new domain of orthogonal (i.e., uncorrelated) variables, namely, the principal components (PCs), through a squared weight coefficient data matrix reflecting the degree of association between the initial data and PCs [
37]. Next, we retained the PCs that had the best match with V
CW, V
RC, and V
AB according to the Pearson correlation coefficient (ρ). The whole process is described in
Figure 2.
After that, we pooled data among the subjects and trials and identified the subset of IMUs sharing more variance with the retained best-matching PCs, i.e., those more correlated with respiratory patterns (i.e., V
CW, V
RC, and V
AB). Specifically, we assumed that the outcome of an IMU was significantly associated with one of the retained best-matching PCs when the related absolute weight coefficient, as averaged across participants and trials, was ≥0.32. This threshold implies that the observed variable (i.e., output of the IMU) shared more than 10% of its variance with that PC [
37].
Finally, for all subjects and all trials, the signals related to the selected subset of IMUs and the VCW, VRC, and VAB values as a function of time were low-pass filtered (zero-lag, Butterworth, and 4th order) with a cut-off at 2.5 Hz, demeaned, and used to develop and validate a model relating IMU outputs to tidal volumes.
2.6. Developing IMU-Based Models for Tidal Volume Estimation
In this study, we assumed that a linear combination of signals collected by a minimal set of IMUs can accurately estimate respiratory volumes, in accordance with previous findings [
17,
18,
19]. We adopted the LOOCV strategy to test the ability of the designed model to predict new data that were not used to tune the model itself [
38]. According to LOOCV, datasets (i.e., both volumes and retained IMUs outputs) related to 17 out of 18 subjects were selected to train the model. The training process consisted in finding optimal
n-tuples of model coefficients that minimized the residuum between the measured and estimated volumes across those subjects and throughout the entire time window. After that, the optimal
n-tuples of the model coefficients were used to estimate the respiratory volumes related to the 18th subject for both repetitions.
For both the measured and estimated volumes, we identified the relative minima and maxima within the entire time window, which represented the onsets of the inhalation and exhalation phases, respectively. Based on these time events, the respiratory rate (where RR is the number of respiratory cycles per minute), the duration of the inhalation and exhalation phases (DI and DE, respectively), and the inhaled and exhaled volumes (VI and VE, respectively) for the chest wall, ribcage, and abdomen compartments were computed and averaged across cycles. Finally, the relative error between measured and estimated respiratory variables (i.e., ΔRR%, ΔDI%, ΔDE%, ΔVI%, and ΔVE%) was computed and analyzed across conditions (see
Section 2.7).
The next paragraphs provide a detailed description of both the algorithm that we used to develop the model relating the tidal volume to the output of the retained IMUs and the strategy that we used to validate it through the LOOCV. For the sake of simplicity, the following sections report the procedure adopted for the estimation of VCW only. The same procedure was separately implemented for VRC and VAB.
The relationship between the
estimated chest wall volume and signals collected by the minimal set of IMUs (outcome of the PCA) is represented by the following equation:
where:
- -
m refers to the body mass of the subject and was introduced to account for the monotonic relationship between body mass and volume capacity [
39,
40];
- -
t refers to the time;
- -
is the estimated chest wall volume as a function of time;
- -
var1(t), var2(t), and varn(t) represent the n-tuple of signals collected by the retained minimal set of IMUs as the outcome of the PCA, where these variables are subject-dependent;
- -
k1CW, k2CW, and knCW, namely, the model coefficients, are the coefficients of the linear model, where the training procedure consisted of tuning these coefficients.
A system of 34 (i.e., 2 repetitions × 17 subjects) non-linear equations was then formulated as follows:
where:
- -
and are the measured and estimated VCW, respectively, for the ith subject and jth repetition;
- -
RMSDi,j is the root mean square (rms) of the difference between the measured and estimated VCW for the ith subject and jth repetition.
After that, a minimum search algorithm (see fminsearch in Matlab) was run to find the optimal n-tuple for the model coefficients (i.e., k1CW, k2CW, …, and knCW) that minimized the norm of the vector [RMSD1,1, RMSD1,2, …, RMSD17,1, and RMSD17,2].
Once the optimal n-tuple for the model coefficients (i.e., k1CW, k2CW, …, and knCW) was found, it was used to estimate the volume of the 18th subject for both repetitions. This process was complemented by a comparison of respiratory parameters (i.e., the respiratory rate and inhalation and exhalation durations and volumes) between the values obtained through optoelectronic plethysmography and those estimated by the model in terms of relative error percentage (i.e., ΔRR%, ΔDI%, ΔDE%, ΔVI%, and ΔVE%).
2.7. Statistical Analysis
The mean and standard deviation were used as the main descriptive statistics to refer to the central tendency and dispersion of all independent variables (i.e., age and anthropometrical features of the enrolled participants, outcomes of the PCA analysis, respiratory variables, and errors). After that, inferential statistics, especially unpaired t-test and analysis of variance (ANOVA) with repeated measures, were used to investigate the effects of gender, IMU-related outputs, compartments, and models on all independent variables. Notably, when the data distribution of independent variables did not meet the requirements for a normal distribution (Kolmogorov–Smirnov normality test) and homogeneity of variance (Levene’s test), these statistical tests were carried out on the Log10 transformed data.
The significance for all statistical tests was set at p < 0.05. The data analysis was carried out using Matlab (The MathWorks, Inc., Natick, MA, USA).
4. Discussion
This study evaluated novel IMU-based strategies to estimate respiratory parameters in subjects lying on a bed while breathing normally.
Our approach consisted in contemporaneously collecting data from both a network of 9 IMUs and a set of 45 reflective markers uniformly distributed on the frontal chest wall (
Figure 1). The inertial kinematics were initially parsed by PCA to identify them inimum set of signals and IMUs whose linear combination best matched the tidal volume measured by optoelectronic plethysmography. The results (
Table 2,
Table 3 and
Table 4;
Figure 3 and
Figure 4) revealed that linear accelerations as a function of time related to two different subsets of IMUs had the best contemporaneous associations with tidal volumes related to the whole chest wall, rib cage, and abdominal compartments.
Based on these findings, two different models were developed: i. the first model, namely,
Model 1, consisted of the linear combination of the 3D acceleration measured by two IMUs laterally located on the abdominal compartment (see IMU
7 and IMU
9 in
Figure 1) and one IMU centrally located on the segment connecting lower costal margins (see IMU
5 in
Figure 1); ii. the second model, namely,
Model 2, consisted of the linear combination of the Y-component of two accelerometers symmetrically located on the abdominal compartment (see IMU
7 and IMU
9 in
Figure 1). The models were assumed to depend on the body mass of the subjects. They were finally tuned and validated through a LOOCV approach to assess the extent to which subject-independent models can accurately estimate five respiratory parameters (i.e., RR, DI, DE, VI, and VE) related to three trunk compartments (i.e., the whole chest wall, rib cage, and abdominal).
The results revealed that both models accurately estimated the respiratory rate with an average relative error lower than 1.5% across subjects and conditions (
Figure 7). The duration of the inhalation and exhalation phases (i.e., DI and DE) was more accurately assessed by Model 2, where the amplitude of the average relative error across subjects was lower than 5% for all compartments (
Figure 7). Finally, the values of the tidal volume during the inhalation and exhalation phases were more accurately estimated by
Model 1. In this case, the absolute relative error averaged across subjects was lower than 5%, with no difference among the compartments (
Figure 7 and
Table 6).
4.1. Achieved Accuracy
Several studies tested IMU-based strategies to assess respiratory parameters under comparable experimental conditions. Certain authors estimated the respiratory rate from the output of a tri-axial accelerometer by using different approaches and reported either a relative error ranging from 0.53% [
27,
28] to 10% [
26] or an absolute error falling in the range 0.38–0.50 breaths/min [
18,
24]. Other authors working with the output of a gyroscope reported an absolute error in the respiratory rate ranging from 0.3 to 7.9 breaths/min [
21,
22]. As far as the inhalation and exhalation volume estimation is concerned, earlier studies mainly showed a correlation coefficient between the measured tidal volume and that estimated through IMU-based methods (range: 0.77–0.96) as a function of time [
18,
27]. However, no information concerning breath-by-breath accuracy was reported.
Compared to results in the literature, our findings appear to be more accurate and may even be further improved. The accuracy underlying the estimation of RR obtained in our study (mean relative error equal to 1.5%, roughly corresponding to 0.24 ± 0.06 breaths/min) can be considered comparable to or even better than that reported in previous papers [
21,
22,
26,
27,
28]. In addition, the error related to the chest wall tidal volume estimation, which was lower than 5% on average, is equivalent to that achieved by the most accurate wearable-based strategies reviewed in our recent study [
8]. In our opinion, this performance can be further improved by adopting suitable expedients. For instance, it is worth noting that wearable-based strategies for the assessment of respiratory parameters typically rely on a subject-specific calibration procedure. Conversely, we evaluated the feasibility of subject-independent strategies to assess the main respiratory parameters. In this respect, we envisage that a subject-specific tuning of the models can result in improved accuracy for the estimation of respiratory parameters. Moreover, the PCA-based approach that we used to identify a minimal set of signals and IMUs can potentially represent another factor limiting the overall accuracy of the developed models. Other data reduction strategies (e.g., factor analysis) or decomposition procedures (e.g., wavelet decomposition) in conjunction with a non-linear combination of IMU signals may allow for the development of more accurate models that match the tidal volume as a function of time. Overall, the accuracy that we achieved when estimating RR can be considered acceptable, but that related to the tidal volume is not yet in the range of ±3.5%, which has been recommended for calibration checks during spirometry [
41]. However, some of the weak points of our methodological strategy can be strengthened, and this strategy deserves to be further investigated to verify whether the overall accuracy can be improved.
4.2. Comparison of Model 1 and Model 2
One of the unexpected outcomes of our study was the identification of two alternative models to contemporaneously estimate respiratory activity among trunk compartments: the first relies on the 3D acceleration measured by three IMUs (i.e., IMU
5, IMU
7, and IMU
9 in
Figure 1), and the second relies on the Y-component only of the acceleration measured by two IMUs located on the abdominal compartment (i.e., IMU
7 and IMU
9). It is worth remarking that there is a significant overlap between them; that is, both models mostly rely on the respiratory-related movements of the abdominal compartment. This result corroborates findings in the literature, where the authors documented a significant association between respiratory movements of the abdominal compartment and the supine position [
42,
43]. As such, others concluded that abdominal movements can be used to represent diaphragmatic excursion in a healthy population while lying supine [
35].
Model 1 resulted in a more accurate estimation of the tidal volume, whereas
Model 2 provided a more accurate estimation of the duration of the inhalation and exhalation phases (
Figure 7 and
Table 6). This evidence suggests that rib cage movements, in conjunction with a more informative set of signals due to the 3D components of acceleration, can better fit the tidal volume across compartments, as highlighted by
Model 1 (i.e., V
CW, V
RC, and V
AB; see
Figure 7). However, the 3D components of acceleration and the physiological phase shift between rib cage and abdominal movements [
44] would result in a superimposition of components with a different frequency content (compare
Model 1 and
Model 2 in
Figure 5) that involve a fictive shift in either the peaks or valleys of the tidal volume as a function of time. Accordingly, the onset of either the inhalation or exhalation phases in
Model 1 would artificially appear to be premature or delayed. In turn, the duration of both the inhalation and exhalation phases would be reciprocally biased (see
Figure 7,
Model 1, ΔDI% and ΔDE%).
4.3. Limitations of This Study
The main limitation of our study is that the models that we developed to estimate the respiratory parameters during normal breathing in a supine position cannot be generalized to either different motor tasks or breath patterns. The performance of IMU-based strategies to assess the main respiratory parameters decreases during dynamic tasks (e.g., walking, running, and cycling) due to body motion artifacts [
7,
8]. In addition, pathological breath patterns (e.g., bradypnea, tachypnea, Cheyne–Stokes, Kussmaul, and Biot’s) can be characterized by arrhythmias or different movements of the chest wall compared to normal breathing. Accordingly, in both cases, we envisage that the final models that we used to estimate respiratory parameters might not be optimal to accurately assess respiratory parameters in different experimental conditions. Despite this, the approach that we used to identify the minimal set of IMUs and how to relate their output to respiratory activity deserves to be fully investigated and validated across different experimental conditions.