Types of Biological Variables: Shreemathi S. Mayya, Ashma D Monteiro, Sachit Ganapathy
Types of Biological Variables: Shreemathi S. Mayya, Ashma D Monteiro, Sachit Ganapathy
Types of Biological Variables: Shreemathi S. Mayya, Ashma D Monteiro, Sachit Ganapathy
Abstract: Identification and description of variables used in any study is a necessary component in
biomedical research. Statistical analyses rely on the type of variables that are involved in the study. In this
short article, we introduce the different types of biological variables. A researcher has to be familiar with
the type of variable he/she is dealing with in his/her research to decide about appropriate graphs/diagrams,
summary measures and statistical analysis.
Submitted Apr 19, 2017. Accepted for publication May 09, 2017.
doi: 10.21037/jtd.2017.05.75
View this article at: http://dx.doi.org/10.21037/jtd.2017.05.75
© Journal of Thoracic Disease. All rights reserved. jtd.amegroups.com J Thorac Dis 2017;9(6):1730-1733
Journal of Thoracic Disease, Vol 9, No 6 June 2017 1731
© Journal of Thoracic Disease. All rights reserved. jtd.amegroups.com J Thorac Dis 2017;9(6):1730-1733
1732 Mayya et al. Types of variables
measures for continuous variables with symmetrical value for t-test compared to chi-square test. An important
distributions. Median and inter-quartile range are to be message that we try to convey here is that, statistical tests
computed to summarize quantitative variables with skewed will have more power for a continuous variable than the
distributions. Range is informative if used as a supplement corresponding nominal or ordinal variables (2). In other
to standard deviation or inter quartile range. Discrete words, to achieve the same power as that of a parametric
variables may be summarized and analyzed either as a test, non-parametric tests require larger sample size than a
continuous variable or as an ordinal variable depending on parametric test. Therefore, one may categorize the data for
the number of distinct values. the purpose of presentation (e.g., hypertensive/normal), but
Quantitative data can be represented graphically by means not for statistical analysis (3).
of a histogram. Histogram is useful to decide about the Detailed discussion of various tests is out of the scope
shape of the distribution, symmetrical or skewed. But, with of this article. Campbell & Swinscow (2) have summarized
small samples, histogram may not be useful to identify the the tests suitable for various types of variables in a single
shape. As a rule of thumb, if the mean is smaller than twice table. For computation procedure and more details about
the standard deviation the data are likely to be skewed for various parametric tests, researchers may refer some standard
variable with positive values (5). Quantitative data can also be text books (1,3,8). For a good discussion of a number of
displayed as stem & leaf plots, dot plots, box & whisker plots nonparametric tests readers may refer Siegel and Castellan (9)
and scatterplots, depending on the situation (6). and Conover (10).
Type of the variables decides the type of statistical analyses The type of descriptive and analytical measures to be used
to be performed, parametric or non-parametric. Parametric in data summarization and analysis, all depend on the type
methods, such as t-tests, ANOVA, Pearson’s correlation, of variables. Therefore, to obtain the relevant measures for
and regression, require the assumption that the data follow dataset at hand, we recommend the researchers to study the
a normal distribution and that variances of the distributions characteristics of data (categorical, quantitative) and shape
are equal. Frequently used nonparametric methods are of the frequency distribution (symmetrical bell shaped,
Mann-Whitney or Wilcoxon rank sum test, Wilcoxon skewed) before deciding about the descriptive measures,
signed rank test and rank correlation. Non-parametric graphs and diagrams, and statistical tests suitable for the
methods, make no assumptions about the distribution of presentation and analysis of data.
the data; they use the rank order of observations rather
than actual measurements (7). Chi-square test (or Fisher’s
Acknowledgements
exact test if the numbers are very small) is the most often
used method to compare categorical data. Failure to pay None.
attention to assumptions and their implications can lead to
increase in type I or type II errors.
Footnote
We analyze data from similar studies, completely
differently depending on the type of variable involved. For Conflicts of Interest: The authors have no conflicts of interest
example, let us say that our target population is 50+ age to declare.
group in a certain population and we have measured the
variable systolic blood pressure in a sample of 40 male and
References
40 female subjects, and our null hypothesis is “Male and
female population have the same systolic blood pressure”. 1. Daniel WW. editor. Biostatistics: A foundation for analysis
We would compare the mean blood pressure in males and in the health sciences. 6th ed. New York: John Wiley &
females with a two-sample t-test (parametric test). If the Sons, 1995.
variable is converted to hypertension status (hypertensive/ 2. Campbell MJ, Swinscow TD. editors. Statistics at Square
normal), it is a nominal variable, and we would compare One. 11th ed. Oxford: Wiley-Blackwell, 2009.
the hypertension frequencies in males and females with a 3. Altman DG, Bland JM. The cost of dichotomizing
Chi-square test (non-parametric test). We find smaller P continuous variables. BMJ 2006;332:1080.
© Journal of Thoracic Disease. All rights reserved. jtd.amegroups.com J Thorac Dis 2017;9(6):1730-1733
Journal of Thoracic Disease, Vol 9, No 6 June 2017 1733
4. McDonald JH. editor. Handbook of biological statistics. 8. Bland M. editor. An Introduction to Medical Statistics. 3rd
Baltimore, MD: Sparky House Publishing, 2009. ed. Oxford University Press; 2000.
5. Altman DG, Bland JM. Detecting skewness from summary 9. Siegel S, Castellan NJ. editors. Nonparametric statistics
information. BMJ 1996;313:1200. for the behavioral sciences. 2nd ed. New York: McGraw-
6. Freeman JV, Walters SJ, Campbell MJ. editors. How to Hill, 1988.
display data. Oxford: Blackwell, 2008. 10. Conover WJ. editor. Practical nonparametric statistics. 3rd
7. Altman DG, Bland JM. Parametric v non-parametric ed. New York: John Wiley, 1998.
methods for data analysis. BMJ 2009;338:a3167.
© Journal of Thoracic Disease. All rights reserved. jtd.amegroups.com J Thorac Dis 2017;9(6):1730-1733