Chapter 1. Biostatistics
Chapter 1. Biostatistics
• The field of statistics has two main areas: mathematical statistics and applied
statistics.
Data Information
Sample
— A sample is a set of data drawn from the population.
— Potentially very large, but less than the population.
E.g. a sample of 765 voters exit polled on election day.
Key Statistical Concepts…
Parameter
— A descriptive measure of a population.
Statistic
— A descriptive measure of a sample.
1.7
Key Statistical Concepts…
Population Sample
Subset
Statistic
Parameter
Populations have Parameters,
Samples have Statistics.
1.8
Types of Statistics
1.11
Statistical Inference…
Statistical inference is the process of making an estimate,
prediction, or decision about a population based on a sample.
Population
Sample
Inference
Statistic
Parameter
1.13
Types of Data….
Quantitative data
Numbers representing counts or
measurements
Discrete
Quantitative
Data
Continuous
Definitions
Discrete
data result when the number of possible values is
either a finite number or a ‘countable’ number of
possible values
0, 1, 2, 3, . . .
Continuous
(numerical) data result from infinitely many possible
values that correspond to some continuous scale or
interval that covers a range of values without gaps,
interruptions, or jumps
2 3
Examples
Discrete
The number of eggs that hens lay; for
example, 3 eggs a day.
Continuous
The amounts of milk that cows produce; for
example, 2.343115 gallons a day.
Interval Scale
Researcher can specify rank ordering of
variables and distance between
Intervals are equal but no rational zero
point (example IQ scale, Fahrenheit
scale)
Data can be treated mathematically,
most statistical tests are possible
Ratio Scale
Highest level of measurement
Rational meaningful zero point
Absolute magnitude of variable (e.g.,
mgm/ml of glucose in urine)
Ideal for all statistical tests
Types of Data (Qualitative)
1.25
Nominal Measurement
Lowest Level
Sorting into categories
quantitative significance
Assign equivalence or nonequivalence
1 2
Rules of Nominal system
All of members of one category are
assigned same numbers
No two categories are assigned the
same number (mutual exclusivity)
Cannot treat the numbers
mathematically
Mode is the only measure of central
tendency
• The ordinal level of measurement
classifies data into categories that
can be ranked; precise differences
between the ranks do not exist.
Ordinal Data…
Ordinal Data appear to be categorical in nature, but their
values have an order; a ranking to them:
1.30
The Ordinal Scale
Sorting variations on the basis of their
relative standing to each other
Attributes ordered according to some
3.5
2.5
1.5
0.5
0
level level level