Statistical Research Methods A Guide For Non Statisticians Accessible DOCX Download
Statistical Research Methods A Guide For Non Statisticians Accessible DOCX Download
Visit the link below to download the full version of this book:
https://medipdf.com/product/statistical-research-methods-a-guide-for-non-statist
icians/
1 Introduction 1
1.1 Statistical Methods as a Part of the Research Process . . . . 1
1.1.1 Populations and Samples . . . . . . . . . . . . . . . . 1
1.1.2 Parameters and Statistics . . . . . . . . . . . . . . . . 2
1.2 The Statistical Method . . . . . . . . . . . . . . . . . . . . . . 3
1.2.1 Research Question and Hypothesis Generation . . . . 3
1.2.2 Statistical Assumptions . . . . . . . . . . . . . . . . . 4
1.2.3 Statistical Method . . . . . . . . . . . . . . . . . . . . 5
1.3 Writing in the IMRaD Format . . . . . . . . . . . . . . . . . 6
1.4 The R Statistical Software Package . . . . . . . . . . . . . . . 7
1.4.1 Getting Started . . . . . . . . . . . . . . . . . . . . . . 10
1.4.2 Loading Data . . . . . . . . . . . . . . . . . . . . . . . 10
1.4.3 Working with Data . . . . . . . . . . . . . . . . . . . . 11
2 One-Sample Proportions 13
2.1 Introduction: Qualitative Data . . . . . . . . . . . . . . . . . 13
2.2 Establishing Hypotheses . . . . . . . . . . . . . . . . . . . . . 14
2.3 Summarizing Categorical Data (with R Code) . . . . . . . . . 15
2.4 Assessing Assumptions . . . . . . . . . . . . . . . . . . . . . . 16
2.5 Hypothesis Test for Comparing a Population Proportion
to a Hypothesized Value . . . . . . . . . . . . . . . . . . . . . 17
2.5.1 Behavior of the Sample Proportion . . . . . . . . . . . 17
2.5.2 Decision Making . . . . . . . . . . . . . . . . . . . . . 18
2.5.3 Standard Normal Distribution . . . . . . . . . . . . . 20
2.6 Performing the Test and Decision Making (with R Code) . . 21
2.6.1 Test Statistic . . . . . . . . . . . . . . . . . . . . . . . 21
2.7 Formal Decision Making . . . . . . . . . . . . . . . . . . . . . 24
2.7.1 Critical Value Method . . . . . . . . . . . . . . . . . . 24
2.7.2 p-value Method . . . . . . . . . . . . . . . . . . . . . . 26
2.7.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . 27
2.7.4 Confidence Intervals . . . . . . . . . . . . . . . . . . . 28
2.8 Contingency Methods (with R Code) . . . . . . . . . . . . . . 29
2.9 Communicating the Results (IMRaD Write-Up) . . . . . . . . 32
v
vi CONTENTS
2.10 Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
2.11 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
3 Two-Sample Proportions 37
3.1 Summarizing Categorical Data with Contingency Tables
(with R Code) . . . . . . . . . . . . . . . . . . . . . . . . . . 37
3.2 Hypothesis Test for Comparing Two Population
Proportions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.2.1 Generating Hypotheses About Two Proportions . . . 39
3.2.2 Statistical Assumptions . . . . . . . . . . . . . . . . . 40
3.3 Performing the Test and Decision Making (with R Code) . . 42
3.3.1 Critical Value Method . . . . . . . . . . . . . . . . . . 43
3.3.2 p-Value Method . . . . . . . . . . . . . . . . . . . . . 44
3.3.3 Confidence Intervals . . . . . . . . . . . . . . . . . . . 45
3.3.4 Chi-Square Test . . . . . . . . . . . . . . . . . . . . . 47
3.4 Contingency Methods (with R Code) . . . . . . . . . . . . . . 51
3.5 Odds Ratio (with R Code) . . . . . . . . . . . . . . . . . . . . 51
3.6 Communicating the Results (IMRaD Write-Up) . . . . . . . . 53
3.7 Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
3.8 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
4 Multi-category Data 59
4.1 Introduction: Types of Multi-categorical Data . . . . . . . . . 59
4.2 Summarizing Categorical Data (with R Code) . . . . . . . . . 60
4.3 Establishing Hypotheses: Difference
Between Comparisons and Association . . . . . . . . . . . . . 63
4.4 Assessing Assumptions (with R Code) . . . . . . . . . . . . . 65
4.5 Performing the Test and Decision Making (with R Code) . . 67
4.5.1 Critical Value Method . . . . . . . . . . . . . . . . . . 68
4.5.2 p-Value Method . . . . . . . . . . . . . . . . . . . . . 70
4.5.3 Interpretation of Results . . . . . . . . . . . . . . . . . 70
4.6 Contingency Methods (with R Code) . . . . . . . . . . . . . . 72
4.7 Communicating the Results (IMRaD Write-Up) . . . . . . . . 73
4.8 Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
4.9 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
9 Power 167
9.1 Making Mistakes with Statistical Tests . . . . . . . . . . . . . 167
9.2 Determinants of Sample Size . . . . . . . . . . . . . . . . . . 169
9.3 Categorical Outcomes . . . . . . . . . . . . . . . . . . . . . . 170
9.3.1 One-Sample Case . . . . . . . . . . . . . . . . . . . . . 170
9.3.2 Two-Sample Case (with R Code) . . . . . . . . . . . . 171
9.4 Continuous Outcomes . . . . . . . . . . . . . . . . . . . . . . 173
9.4.1 One-Sample Case (with R Code) . . . . . . . . . . . . 173
9.4.2 Two-Sample Case (with R Code) . . . . . . . . . . . . 173
9.4.3 Multi-sample Case (with R Code) . . . . . . . . . . . 174
9.5 Post-hoc Power Analysis . . . . . . . . . . . . . . . . . . . . . 175
9.6 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
Bibliography 211
Chapter 1
Introduction
populations from which they are drawn (if you ask a statistician, no sample
is large enough), yet manageable enough to be cost effective. For many of
the procedures we will discuss throughout this text we will have rules for
determining how large a sample we need. We will also focus – in Chapter 9 –
on performing a sample size or power analysis, which helps us determine the
sample size we need to collect before we conduct the study.
State the Result: Once we have conducted the statistical analysis, we will
need to make sense of our results. As mentioned earlier, we do this by stating
which hypothesis the evidence supports. Recall that though we are trying to
learn some characteristic about some population, that characteristic exists
or is true; we simply don’t know what it is. So when we state our result, we
can make one of two decisions: the evidence supports the first hypothesis, or
the evidence supports the second hypothesis. Since the conditions stated in
one of the two hypotheses we’ve created must be true, we can make two types
of mistakes called Type I and Type II errors. We will focus on errors of the
first type throughout this text, and we will cover errors of the second type in
Chapter 9. In practice, if we have set the table correctly by following sound
scientific methods in our data collection and sampling methodology, these
types of errors are of little concern and we can put a great deal of faith in
our statistical conclusions. Of course, the key aspect of any statistical result
lies in translating it into a meaningful statement that can be understood by
curious and critical readers.
and its placement of material, so much that they often skip to the parts they
are interested in to glean information quickly. We will spend a great deal
of effort understanding this method and its pieces, as well as practicing how
they apply to specific statistical methods.
Introduction: Here we provide details on the scientific problem in which we
are interested, and then describe the populations of interest. The treatment
or intervention specific to the current study is introduced, and the scientific
research question is un-categorically introduced (i.e. in the form from which
you will create your testable hypotheses).
Methods: In an actual publication, this is the section where you would
describe the setting of your sample, including such details as where and when
subjects were observed. A thorough description of what was measured and
the process under which those measurements were taken would then be pro-
vided. Any technological processes specific to the particular science and mea-
surements in question would be described here. Generally, a description of
the statistical methods used to analyze the sample measurements in light of
the research questions and hypotheses would be placed in the last sub-section
of the Methods Section (and often in small font to indicate its importance).
Here you will state how you summarized your data, how you analyzed the
data, and how you will make your decisions based on that analysis. You
must also specify any details that aid in that process, such as the statistical
software used for analysis.
Results: The details of the statistical analyses are presented here, start-
ing with a summary of the sample data (including any tabular or graphical
representations), continuing with the results from the analysis of the primary
research question, and ending with any secondary or sub-analyses not speci-
fied in the primary research question. An unequivocal answer to the primary
research question must be provided in this section.
Discussion: A brief summary of the results is provided at the beginning of
this section, where here the results are described in words (no statistics). The
scientific or clinical implications of these results are then expounded upon,
specifically with regards to how these results compare to those from previous
research studies. Any study limitations – there will always be something –
must be identified and described in this section, as should a justification as
to how they do or do not affect your results. This section often ends with a
prognostication of what these results mean for future research or what steps
need to be taken to continue this research.