Chapter 1
Chapter 1
ABDOUN S.
[email protected]
Introduction
• Statistics is the science of collecting, processing and analysing data from
the observation of random phenomena.
Population
Subset
Exemples
Characteristics and modalities
• Character (statistical variable)
Types of characters
Qualitative Quantitative
The modalities are the different values that the character X can take.
Example:
•The modalities of the character "number of
brothers and sisters of a student" are:
Frequency, relative frequency
Cumulative
Cumulative Relative relative
Frequency
frequency frequency frequency
The cumulative
The number, denoted The cumulative The frequency of the
relative frequency of
called the frequency in modality
the modality
frequency of the (respectively in (respectively of the
(respectively of the
modality [, ) , denoted as class , ),
class [, ), is
(respectively the class a such that: denoted as is
called the number
hh hh ) is defined defined by:
denoted s such that:
as the number of
times the modality
a has been observed
(respectively, where a
value in the class
[ , [ is observed)
for .
Representation of data
Qualitative Quantitative
character character
Discrete Continuous
Bar chart Pie chart character character
Stick
Histogram
diagram
Graphical representations Qualitative
character
Bar chart is a graph that associates each A PIE CHART is a graph that divides a disk into
modality of the character with a rectangle angular sectors whose central angles are
of constant base and height proportional to proportional to the frequency (or relative
the frequency (or relative frequency). frequency) of each modality.
The center angle of a modality of size
aa is given (in degrees) by:
Graphical representations
Quantitative
character
Stick Histogram
diagram
Position Dispersion
parameters parameters
Mean Median
Variance
standard deviation
Mode
Position parameters:
Arithmetic mean
The mode, denoted, is the most frequent value taken by the character, i.e.,
the one that repeats most often in the data.
The median, denoted , is the value that divides the studied population into two
groups of equal size (50% of the observations are less than or equal to it, and 50%
are greater than or equal to it).
The standard deviation, denoted , is defined as the square root of the variance.
It is written: