Conditional Regression
Conditional Regression
net/publication/247777746
CITATIONS READS
2 921
4 authors, including:
Roland Örtengren
Chalmers University of Technology
111 PUBLICATIONS 3,370 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Dan Högberg on 02 June 2014.
Abstract
In digital human modelling (DHM) systems consideration of anthropometry is central. Important functionality in
DHM tools is the regression model, i.e. the possibility to predict a complete set of measurements based on a
number of defined independent anthropometric variables. The accuracy of a regression model is measured by
how well the model predicts dependent variables based on independent variables, i.e. known key anthropometric
measurements. In literature, existing regression models often use stature and/or body weight as independent
variables in so-called flat regressions models which can produce estimations with large errors when there are low
correlations between the independent and dependent variables. This paper suggests a conditional regression
model that utilise all known measurements as independent variables when predicting each unknown dependent
variable. The conditional regression model is compared to a flat regression model, using stature and weight as
independent variables, and a hierarchical regression model that uses geometric and statistical relationships
between body measurements to create specific linear regression equations in a hierarchical structure. The
accuracy of the models is assessed by evaluating the coefficient of determination, R2 and the root-mean-square
deviation (RMSD). The results from the study show that using a conditional regression model that makes use of
all known variables to predict the values of unknown measurements is advantageous compared to the flat and
hierarchical regression models. Both the conditional linear regression model and the hierarchical regression
model have the advantage that when more measurements are included the models will give a better prediction of
the unknown measurements compared to the flat regression model based on stature and weight. A conditional
linear regression model has the additional advantage that any measurement can be used as independent variable.
This gives the possibility to only include measurements that have a direct connection to the design dimensions
being sought. Utilising the conditional regression model would create digital manikins with enhanced accuracy
that would produce more realistic and accurate simulations and evaluations when using DHM tools for the
design of products and workplaces.
Keywords: Anthropometry, Regression, Correlation, Multivariate, Prediction, Digital Human Modelling.
The conditional linear regression model analyses This conditional expected value, considered as a
the covariance between the independent and function of Z is called the multivariate regression
dependent variables to calculate the regression of the vector Y on Z. It is composed of j univariate
coefficients. Based on the regression coefficients regressions. The j×k matrix
and the mean values, for both the independent and
−1
dependent variables, linear regression equations can 𝜷 = 𝚺𝒀𝒁 𝚺𝒁𝒁
2
Erik Brolin, Conditional Regression Model for Prediction of Anthropometric Variables
is called the matrix of regression coefficients and Table 1 The 56 anthropometric measurements included
the j×1 vector in the evaluation of regression models
−1
𝜷𝒐 = 𝝁𝒀 − 𝚺𝒀𝒁 𝚺𝒁𝒁 𝝁𝒛 # Anthropometric measurement
1 Stature
2 Weight
is the vector containing the intersection point for 3 Acromial height
each regression equation. 4 Knee height at midpatella
5 Trochanterion height
2.2. Description of comparison procedure 6 Thumb-tip reach
7 Waist circumference at omphalion
The described conditional regression model was 8 Buttock circumference
compared to a flat regression model based on 9 Chest circumference
stature and weight and a hierarchical regression 10 Elbow circumference
11 Forearm-hand length
model based on the method presented by You and 12 Functional leg length
Ryu (2005). In the analyses gender was treated 13 Hand circumference at metacarpale
separately by creating specific regression equations 14 Hand length
for each gender for the flat and hierarchical 15 Head circumference
16 Thigh circumference, proximal
regression models and letting the conditional 17 Wrist circumference, stylion
regression model analyse both female and male 18 Ankle circumference
data. 56 anthropometric measurements (Table 1) 19 Axilla height
were included in the analysis and four comparative 20 Arm circumference at axillar
21 Foot circumference
tests were done where the number of independent 22 Biacromial breadth
variables varied for each test. The first test was 23 Bideltoid breadth
done with stature and weight as independent 24 Buttock depth
variables which are the measurements that are 25 Buttock-knee length
26 Buttock-popliteal length
necessary in the flat and hierarchical regression 27 Calf circumference
models. The second and third test was done using 28 Cervicale height
the first 7 and 17 measurements respectively 29 Chest breadth
according to Table 1. These measurements were 30 Chest depth
31 Crotch height
chosen as they could be found high up in the 32 Eye height (sitting)
hierarchal model described by You and Ryu (2005). 33 Foot breadth
The last and final test was done using the last three 34 Foot length
measurements in Table 1, hip breadth (sitting), 35 Forearm circumference, flexed
36 Gluteal furrow height
popliteal height and radiale-stylion length. These 37 Hand breadth at metacarpale
measurements are found further down in the 38 Head breadth
hierarchy of the hierarchical model, but could still 39 Head length
be interesting to use, for example in the design of 40 Heel breadth
41 Hip breadth
an office chair. The last test was not possible to 42 Interpupillary distance
perform with the flat and hierarchical model, since 43 Knee circumference
these models require stature and weight as 44 Knee height (sitting)
independent variables, but was useful to show the 45 Lateral malleolus height
46 Neck circumference over larynx
capability of the conditional regression model. The 47 Shoulder-elbow length
three regression models were compared by 48 Sitting height
assessing the coefficient of determination, R2, as 49 Thigh clearance
50 Waist breadth at omphalion
51 Waist depth at omphalion
∑𝑛𝑗=1(𝑦�𝑗 − 𝑦�)2
𝑅2 = 52 Waist height at omphalion
∑𝑛𝑗=1(𝑦𝑗 − 𝑦�)2 53 Wrist to centre-of-grip length
54 Hip breadth (sitting)
55 Popliteal height
and the root-mean-square deviation (RMSD) as 56 Radiale-Stylion length
∑𝑛𝑗=1(𝑦𝑗 − 𝑦�𝑗 )2
𝑅𝑀𝑆𝐷 = � 3. Results
𝑛
In the regression models gender was treated
where 𝑦𝑗 is the measured value, 𝑦� the mean value, separately and the coefficient of determination and
the root-mean-square deviation was calculated for
𝑦�𝑗 the predicted value and 𝑛 the number of
each dependent variable for each test. However,
measured individuals (1774 males and 2208 only the combined average results, for both genders
females). and the dependent variables for each test, are
presented in the following text and figures (Table 2,
Figure 2 and Figure 3).
3
Erik Brolin, Conditional Regression Model for Prediction of Anthropometric Variables
Table 2 Average R2 value and the root-mean-square model by 58.2% compared to the flat model and
deviation for the dependent variables for each test 16.4% compared to the hierarchical model.
Rootmean-square deviation
independent variables the resulting R2 value and 20
Conditional model
root-mean-square deviation were approximately the
same for all three regression models. However 15
when the number of independent variables
increases the accuracy of the flat regression model
decreases compared to the hierarchical and 10
conditional regression models. In test 2, when 7
measurements were used as independent variables,
5
the conditional model had an average R2 value that
was 23.3% higher than that of the flat model and
11.6% higher than that of the hierarchical model. 0
Analysis of root-mean-square deviation showed a #1,2 #1-7 #1-17 #54-56
decrease for the conditional model by 34.2% Predictive variables (according to Table 1)
compared to the flat model and 11.1% compared to Figure 3 Graph illustrating the evaluation of the root-
the hierarchical model. mean-square deviation for the dependent variables based
on the results from the four different tests (Lower value
100%
indicates higher accuracy)
90% Flat model
Coefficient of determination, R2
Hierarchical model
80% In total the conditional regression model shows the
Conditional model
70%
highest accuracy when predicting unknown
variables. For the first three tests the conditional
60% model had, on average, an accuracy that was 31.7%
50% higher than that of the flat model and 9.3% higher
40% than that of the hierarchical model (depending on if
the coefficient of determination or the root-mean-
30%
square deviation was assessed). In test 4 the
20% conditional model was the only model that could
10% produce any results.
0%
#1,2 #1-7 #1-17 #54-56 4. Discussion
Predictive variables (according to Table 1)
The results from the study shows that using a
Figure 2 Graph illustrating the evaluation of the R2 value conditional regression model that makes use of all
for the dependent variables based on the results from the known variables to predict the values of unknown
four different tests (Higher value indicates higher measurements is advantageous compared to the flat
accuracy) and hierarchical regression models. Both the
hierarchical regression model and the conditional
In test 3, when 17 measurements were used as linear regression model have the advantage that
independent variables, the conditional model had an when more measurements are included the models
average R2 value that was 78.0% higher than that of will give a better prediction of the unknown
the flat model and 10.2% higher than that of the measurements compared to the flat regression
hierarchical model. Analysis of root-mean-square model based on two variables, stature and weight.
deviation showed a decrease for the conditional A conditional linear regression model has the
additional advantage that any measurement can be
4
Erik Brolin, Conditional Regression Model for Prediction of Anthropometric Variables
5
Erik Brolin, Conditional Regression Model for Prediction of Anthropometric Variables