0% found this document useful (0 votes)

146 views3 pages

Statistics IV Interpreting The Results of Statistical Tests

This document discusses statistical analysis and interpretation of results. It covers choosing appropriate statistical tests based on data type and distribution, assessing normality, paired vs unpaired data, the null hypothesis and P-values, and the difference between statistical and clinical significance. Precise P-values should always be reported rather than significance cutoffs. Confidence intervals are also important to interpret results and determine if differences are statistically or clinically meaningful. Sample size impacts power and ability to detect differences between groups.

Uploaded by

jeremie carpio

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

146 views3 pages

Statistics IV Interpreting The Results of Statistical Tests

Uploaded by

jeremie carpio

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 3

This is the fourth in a series of articles in this journal on the use of statistics

in medicine. In the previous issue, we described how to choose an appropriate

statistical test. In this article, we consider this further and discuss how to
interpret the results.

More on choosing an appropriate statistical test

Deciding which statistical test to use to analyse a set of data depends on the type
of data (interval or categorical, paired vs unpaired) being analysed and whether or
not the data are normally distributed. Interpretation of the results of statistical
analysis relies on an appreciation and consideration of the null hypothesis, P-
values, the concept of statistical vs clinical significance, study power, types I
and II statistical errors, the pitfalls of multiple comparisons, and one vs two-
tailed tests before conducting the study.

Assessing whether a data set follows a normal distribution

It may be apparent from constructing a histogram or frequency curve that the data
follow a normal distribution. However, with small sample sizes (n < 20), it may not
be obvious from the graph that the data are drawn from a normally distributed
population. The data may be subjected to formal statistical analysis for evidence
of normality using one or more specific tests usually included in computer software
packages, such as the Shapiro�Wilkes test. Such tests are fairly robust with larger
sample sizes (n > 100). However, the choice between parametric and non-parametric
statistical analysis is less important with samples of this size as both analyses
are almost equally powerful and give similar results. With smaller sample sizes (n
< 20), tests of normality may be misleading. Unfortunately, non-parametric analysis
of small samples lacks statistical power and it may be almost impossible to
generate a P-value of <0.05, whatever the differences between the groups of sample
data.

When in doubt as to the type of distribution that the sample data follow,
particularly when the sample size is small, non-parametric analysis should be
undertaken, accepting that the analysis may lack power. The best solution to
avoiding mistakes in choosing the appropriate statistical test for analysis of data
is to design a study with sufficiently large numbers of subjects in each group.

Unpaired vs paired data

When comparing the effects of an intervention on sample groups in a clinical study,
it is essential that the groups are as similar as possible, differing only in
respect of the intervention of interest. One common method of achieving this is to
recruit subjects into study groups by random allocation. All subjects recruited
should have an equal chance of being allocated into any of the study groups.
Provided the sample sizes are large enough, the randomization process should ensure
that group differences in variables that may influence outcome of the intervention
of interest (e.g. weight, age, sex ratio, and smoking habit) cancel each other out.
These variables may themselves be subjected to statistical analysis and the null
hypothesis that there is no difference between the study groups tested. Such a
study contains independent groups and unpaired statistical tests are appropriate.
An example would be a comparison of the efficacy of two different drugs for the
treatment of hypertension.

Another method of conducting this type of investigation is the crossover study

design in which all subjects recruited receive either treatment A or treatment B
(the order decided by random allocation for each patient), followed by the other
treatment after a suitable �washout� period during which the effects of the first
treatment are allowed to wear off. The data obtained in this study would be paired
and subject to paired statistical analysis. The effectiveness of the pairing may be
determined by calculating the correlation coefficient and the corresponding P-value
of the relationship between data pairs.
A third method involves defining all those characteristics that the researcher
believes may influence the effect of the intervention of interest and matching the
subjects recruited for those characteristics. This method is potentially
unreliable, depending as it does on ensuring that key characteristics are not
inadvertently overlooked and therefore not controlled.

The main advantage of the paired over the unpaired study design is that paired
statistical tests are more powerful and fewer subjects need to be recruited in
order to prove a given difference between the study groups. Against this are
pragmatic difficulties and additional time needed for crossover studies, and the
danger that, despite a washout period, there may still be an influence of the first
treatment on the second. The pitfalls of matching patients for all important
characteristics also have to be considered.

The null hypothesis and P-values

Before undertaking statistical analysis of data, a null hypothesis is proposed,
that is, there is no difference between the study groups with respect to the
variable(s) of interest (i.e. the sample means or medians are the same). Once the
null hypothesis has been defined, statistical methods are used to calculate the
probability of observing the data obtained (or data more extreme from the
prediction of the null hypothesis) if the null hypothesis is true.

For example, we may obtain two sample data sets which appear to be from different
populations when we examine the data. Let us consider that the appropriate
statistical test is applied and the P-value obtained is 0.02. Conventionally, the
P-value for statistical significance is defined as P < 0.05. In the above example,
the threshold is breached and the null hypothesis is rejected. What exactly does a
P-value of 0.02 mean? Let us imagine that the study is repeated numerous times. If
the null hypothesis is true and the sample means are not different, a difference
between the sample means at least as large as that observed in the first study
would be observed only 2% of the time.

Many published statistical analyses quote P-values as =0.05 (not significant),

<0.05 (significant), <0.01 (highly significant) etc. However, this practice
resulted from an era before the widespread availability of computers for
statistical analysis when P-values had to be looked up in reference tables. This
approach is no longer satisfactory and precise P-values obtained should always be
quoted. The importance of this approach is illustrated by the following example. In
a study comparing two hypotensive agents, drug A is found to be more effective than
drug B and P < 0.05 is quoted. We are convinced and immediately switch all our
hypertensive patients to drug A. Another group of investigators conduct a similar
study and find no significant difference between the two drugs (P = 0.05). We
immediately switch all our hypertensive patients back onto drug B as it is less
expensive and seems to be equally effective. We may also be somewhat confused by
the apparently contradictory conclusions of the two studies.

In fact, if the actual P-value of the first study was 0.048 and that of the second
study was 0.052, the two studies are entirely consistent with each other. The
conventional value for statistical significance (P < 0.05) should always be viewed
in context and a P-value close to this arbitrary cut-off point should perhaps lead
to the conclusion that further work may be necessary before accepting or rejecting
the null hypothesis.

Another example of the arbitrary nature of the conventional threshold for

statistical significance may be considered. Suppose a new anti-cancer drug has been
developed and a clinical study is undertaken to assess its efficacy compared with
standard treatment. It is observed that mortality after treatment with the new drug
tends to be lower but the reduction is not statistically significant (P = 0.06). As
the new drug is more expensive and appears to be no more effective than standard
treatment, should it be rejected? If the null hypothesis is true (both drugs
equally effective) and we were to repeat the study numerous times, we would obtain
the difference observed (or something greater) between the two study groups only 6%
of the time. At the very least, a further larger study needs to be undertaken
before concluding with confidence that the new drug is not more effective�as we
shall see later, the original study may well have been under-powered.

Statistical vs clinical significance

Statistical significance should not be confused with clinical significance. Suppose
two hypotensive agents are compared and the mean arterial blood pressure after
treatment with drug A is 2 mm Hg lower than after treatment with drug B. If the
study sample sizes are large enough, even such a small difference between the two
groups may be statistically significant with a P-value of <0.05. However, the
clinical advantage of an additional 2 mm Hg reduction in mean arterial blood
pressure is small and not clinically significant.

Confidence intervals
A confidence interval is a range of sample data which includes an unknown
population parameter, for example, mean. The most commonly reported is the 95%
confidence interval (CI 95%), although any other confidence interval may be
calculated. If an investigation is repeated numerous times, the CI 95% generated
will contain the population mean 95% of the time.

Confidence intervals are important when analysing the results of statistical

analysis and help to interpret the P-value obtained. They should always be quoted
with the P-value. Consider an investigation comparing the efficacy of a new
hypotensive agent with standard treatment. The investigator considers that the
minimum clinically significant difference in mean arterial blood pressure after
treatment with the two drugs is 10 mm Hg. If P < 0.05, three possible ranges for CI
95% may be considered (Fig. 1). If P = 0.05, four possible ranges for CI 95% may be
considered (Fig. 2). These ranges for the CI 95% are summarized in Table 1.

ISO 9001-2015 IA Exam
81% (21)
ISO 9001-2015 IA Exam
7 pages
MRCP Part 1STATISTICS NOTES PDF
100% (2)
MRCP Part 1STATISTICS NOTES PDF
3 pages
FSVPSOP
No ratings yet
FSVPSOP
2 pages
Lecture 4
No ratings yet
Lecture 4
161 pages
G Power Calculation
100% (1)
G Power Calculation
9 pages
Basic Biostats, 2
No ratings yet
Basic Biostats, 2
58 pages
How To Choose The Right Statistical Test
No ratings yet
How To Choose The Right Statistical Test
3 pages
Editorial How To Choose The Right Statistical Test?
No ratings yet
Editorial How To Choose The Right Statistical Test?
0 pages
Choose Statistical Test
No ratings yet
Choose Statistical Test
2 pages
How To Choose The Right Statistical Test?
No ratings yet
How To Choose The Right Statistical Test?
2 pages
Hypothesis Testing, P Values, Confidence Intervals, and Significance
No ratings yet
Hypothesis Testing, P Values, Confidence Intervals, and Significance
6 pages
Nonparametric Statistics
No ratings yet
Nonparametric Statistics
32 pages
New Normal MPA Statistics Chapter 2
No ratings yet
New Normal MPA Statistics Chapter 2
15 pages
MMJ2001 0015
No ratings yet
MMJ2001 0015
4 pages
Sample Size: How Many Is Enough?
No ratings yet
Sample Size: How Many Is Enough?
10 pages
Defining Hypothesis Testing
No ratings yet
Defining Hypothesis Testing
17 pages
Parametric Noparametric Tests
No ratings yet
Parametric Noparametric Tests
25 pages
Sharma 2021 Clinical Significance
No ratings yet
Sharma 2021 Clinical Significance
4 pages
Statistical Significance Versus Clinical Relevance
No ratings yet
Statistical Significance Versus Clinical Relevance
38 pages
Marshall Jonker Inferent Stats 2010
No ratings yet
Marshall Jonker Inferent Stats 2010
12 pages
Determinacion Tamaños Muestra Exp Clinicos (Correlación)
No ratings yet
Determinacion Tamaños Muestra Exp Clinicos (Correlación)
7 pages
Name: Deepak Kumar Singh Student Reg. No. 1708004923
No ratings yet
Name: Deepak Kumar Singh Student Reg. No. 1708004923
6 pages
Hypothesis Testing-2 PDF
No ratings yet
Hypothesis Testing-2 PDF
16 pages
4.1 Common Statistical Tests and Applications in Epidemiological Literature
No ratings yet
4.1 Common Statistical Tests and Applications in Epidemiological Literature
6 pages
Sample Size Calculation
No ratings yet
Sample Size Calculation
9 pages
Data Analysis Lecture
No ratings yet
Data Analysis Lecture
17 pages
Notes5c Paired Ttest
No ratings yet
Notes5c Paired Ttest
15 pages
Hoare Hoe - Statistics 2 Final
No ratings yet
Hoare Hoe - Statistics 2 Final
18 pages
Comparisons of Superiority, Non-Inferiority, and Equivalence Trials With Sample Size Calculation
No ratings yet
Comparisons of Superiority, Non-Inferiority, and Equivalence Trials With Sample Size Calculation
4 pages
What Is Hypothesis Testing
100% (1)
What Is Hypothesis Testing
32 pages
Inferentialstatistics 210411214248
No ratings yet
Inferentialstatistics 210411214248
102 pages
Cureus 0012 00000010047
No ratings yet
Cureus 0012 00000010047
10 pages
T Test As A Parametric Statistic: Tae Kyun Kim
No ratings yet
T Test As A Parametric Statistic: Tae Kyun Kim
7 pages
Bio Statistics
No ratings yet
Bio Statistics
20 pages
Reviewer's Quick Guide To Common Statistical Errors
No ratings yet
Reviewer's Quick Guide To Common Statistical Errors
1 page
Common Statistical Errors
No ratings yet
Common Statistical Errors
1 page
Introduction To Hypothesis Testing24
No ratings yet
Introduction To Hypothesis Testing24
54 pages
90156hypothesis Testing
No ratings yet
90156hypothesis Testing
34 pages
Introduction of Non-Parametric Test
No ratings yet
Introduction of Non-Parametric Test
9 pages
Confidence Interval or P-Value?
No ratings yet
Confidence Interval or P-Value?
5 pages
Statistics Basic Concepts
No ratings yet
Statistics Basic Concepts
13 pages
Chapter 17 Damasceno BP
No ratings yet
Chapter 17 Damasceno BP
19 pages
Testing Hypothesis
No ratings yet
Testing Hypothesis
11 pages
Point Estimation and Interval Estimation: Learning Objectives
No ratings yet
Point Estimation and Interval Estimation: Learning Objectives
58 pages
15-16!17!18 Significance in Continuous Variables
No ratings yet
15-16!17!18 Significance in Continuous Variables
29 pages
1.basic Theory Statistics
No ratings yet
1.basic Theory Statistics
6 pages
Ranganathan Et Al., 2015 - Valores de P e IC
No ratings yet
Ranganathan Et Al., 2015 - Valores de P e IC
2 pages
Nciph ERIC2
No ratings yet
Nciph ERIC2
7 pages
P Value Calculation
No ratings yet
P Value Calculation
9 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Choosing Statistical Tests
No ratings yet
Choosing Statistical Tests
6 pages
explainPValues - Pdfmarketing Research
No ratings yet
explainPValues - Pdfmarketing Research
3 pages
Choosing A Significance Test Objectives
No ratings yet
Choosing A Significance Test Objectives
15 pages
BIOstat T-Test Anova
No ratings yet
BIOstat T-Test Anova
10 pages
9-Sig. Tests Workshop 5-2025
No ratings yet
9-Sig. Tests Workshop 5-2025
49 pages
Lecture 3.measures of Effectiveness
No ratings yet
Lecture 3.measures of Effectiveness
38 pages
03 Fact Sheet HME712 Bos - 3 General Principles of Hypothesis Testing
No ratings yet
03 Fact Sheet HME712 Bos - 3 General Principles of Hypothesis Testing
2 pages
Week5 Inferentionalstat
No ratings yet
Week5 Inferentionalstat
54 pages
Module 004 - Parametric and Non-Parametric
No ratings yet
Module 004 - Parametric and Non-Parametric
12 pages
Concise Biostatistical Principles & Concepts: Guidelines for Clinical and Biomedical Researchers
From Everand
Concise Biostatistical Principles & Concepts: Guidelines for Clinical and Biomedical Researchers
Franklin Opara
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet
Initial Risk Assessment of Entry in Enclosed Space
No ratings yet
Initial Risk Assessment of Entry in Enclosed Space
2 pages
Sports Medicine Foot and Ankle Part VI PowerPoint DH9W1qp
No ratings yet
Sports Medicine Foot and Ankle Part VI PowerPoint DH9W1qp
14 pages
Practical Journals I II III IV BAMS 120115
No ratings yet
Practical Journals I II III IV BAMS 120115
229 pages
SOP For Cobas 5800
No ratings yet
SOP For Cobas 5800
4 pages
Histopathologic & Cytologic Techniques
No ratings yet
Histopathologic & Cytologic Techniques
9 pages
Coping With Exams
No ratings yet
Coping With Exams
3 pages
list-SAA-29 03 2025
No ratings yet
list-SAA-29 03 2025
1 page
Laboratory Brochure
No ratings yet
Laboratory Brochure
2 pages
FFD Exam Guide To Online Applications
No ratings yet
FFD Exam Guide To Online Applications
5 pages
2016 Bauer Prediction of Future Falls in A Community
No ratings yet
2016 Bauer Prediction of Future Falls in A Community
16 pages
PPEPL-SOP-06 Procedure For Performance Evaluation
No ratings yet
PPEPL-SOP-06 Procedure For Performance Evaluation
4 pages
Article 1 (Module1researchact)
No ratings yet
Article 1 (Module1researchact)
33 pages
GMP Audit Cosmetics Products: Example Report
No ratings yet
GMP Audit Cosmetics Products: Example Report
13 pages
L5 Psychometrics
No ratings yet
L5 Psychometrics
4 pages
NBP Planners For Branches (Weekdays) - NEET-2024
No ratings yet
NBP Planners For Branches (Weekdays) - NEET-2024
2 pages
Rutuja Garghate
No ratings yet
Rutuja Garghate
10 pages
Notice Technician
No ratings yet
Notice Technician
2 pages
Asbestos Control - Surveys, Removal, and Management (PDFDrive)
No ratings yet
Asbestos Control - Surveys, Removal, and Management (PDFDrive)
108 pages
B.Sc. Nursing Course (4years) Common Entrance Test (CET) 2021
No ratings yet
B.Sc. Nursing Course (4years) Common Entrance Test (CET) 2021
11 pages
Process Validation of Sterile Manufacturing
No ratings yet
Process Validation of Sterile Manufacturing
7 pages
Validation of Amharic Version of SAQin Addis Ababa
No ratings yet
Validation of Amharic Version of SAQin Addis Ababa
10 pages
Cebu December Summary
No ratings yet
Cebu December Summary
7 pages
Medical Certificate - MLC 2006
No ratings yet
Medical Certificate - MLC 2006
2 pages
Field Training Exercises
No ratings yet
Field Training Exercises
9 pages
Standard Operating Procedure (SOP)
No ratings yet
Standard Operating Procedure (SOP)
57 pages
12 - Eye Irritation Study On Rabbits
No ratings yet
12 - Eye Irritation Study On Rabbits
23 pages
TR - Classification of Vehicle Maintenance and Repair Centers - Version 1
No ratings yet
TR - Classification of Vehicle Maintenance and Repair Centers - Version 1
34 pages
The Procedure: Cleaning Validation
100% (1)
The Procedure: Cleaning Validation
4 pages

Statistics IV Interpreting The Results of Statistical Tests

Uploaded by

Statistics IV Interpreting The Results of Statistical Tests

Uploaded by

This is the fourth in a series of articles in this journal on the use of statistics

in medicine. In the previous issue, we described how to choose an appropriate

More on choosing an appropriate statistical test

Assessing whether a data set follows a normal distribution

Unpaired vs paired data

Another method of conducting this type of investigation is the crossover study

The null hypothesis and P-values

Many published statistical analyses quote P-values as =0.05 (not significant),

Another example of the arbitrary nature of the conventional threshold for

Statistical vs clinical significance

Confidence intervals are important when analysing the results of statistical

You might also like