3rd Week
3rd Week
Cut off-
Significancy
Std deviation
Causation
| ABOUT THE PREVIOUS COURSE |
Causation
Correlation
Types Of Data
Nominal Data
Ordinal Data
Interval Data
Continous Data
| WEEKLY LEARNING OUTCOMES |
Students will
• RANGE: 51-70
• Class Interval Size or Class Width
• The difference between the upper
and lower boundaries of any class.
• The class width is also the
difference between the lower
limits of two consecutive classes
or the upper limits of two
consecutive classes.
• It is not the difference between
the upper and lower limits of the
same class.
Class Limits and the
Midpoint
Size of the class interval (interval
of class) i=U-L
• The point halfway between adjacent
intervals
i = size of a class interval
• Upper and lower limits U = upper limit of a class interval
– Distance from upper and lower limit L = lower limit of a class interval
determines the size of class interval
The Midpoint
• The middlemost score value in a class interval
– The sum of the lowest and highest value in a class
interval divided by two
lowest score value highest score value
m=
2
9
Central Tendencies Dispersion
• Central tendency is a property • Dispersion is a
of the data that they tend to
be clustered about a single property of the data
center point. that they tend to be
• Numbers spread out.
• used to present the center • Refers to the spread of
• or the middle set of the data values about the mean
Quartiles
mean (generally not part of the data set)
Range
median (may be part of the data set)
Variance
mode (always part of the data set)
Standart Deviation
1.Arithmetic Mean
• Themean or arithmetic
mean is the "average« of the
sum.
•which is obtained by;
adding all the values in a
sample or population
and dividing them by the
number of values.
17
18
Properties of the mean
1. Uniqueness -- For a given set of data there is one and
only one mean.
2. Simplicity -- The mean is easy to calculate.
3. Affected by extreme values -- The mean is
influenced by each value. Therefore, extreme values
can distort the mean.
19
Calculating Sample Mean
1. Uniqueness -- For a given set of Add up all of the data
data there is one and only one mean. points and divide by the
2. Simplicity -- The mean is easy to number of data points.
calculate.
3. Affected by extreme values --
The mean is influenced by each
value. Therefore, extreme values
can distort the mean.
51 1 51 x 1 51
66 3 66 x 3 198
72 4 72 x 4 288
82 5 82 x 5 410
94 7 94 x 7 658
n: 20 1605
=80,25
Mean of Grouped Data
Freq MidPoint
0--4 2 2 4
4--8 3 6 18
8--12 8 10 80
12--16 3 14 42
16--20 2 18 36
18 180 180/18 10
Groups Xi
f
i Fi x i
0-10 (0+10)/2 =5 3 15
10-20 15 12 180
20-30 25 35 875
10
∑ 𝑓𝑖=340
30-40 35 45 1575
50-60 55 45 2475
𝑖=1
60-70 65 35 2275
70-80 75 30 2250
X̄ = = 49,47
80-90 85 15 1275
90-100 95 10 950
Credits 6 4 2 5 3
Grade 54 50 54 51 67
c) Weighted
Geometric Mean
• In a setting, when the growth in the
number of something per area is
examined over time.
• Or contamination
• G.O =
• The number does not change by the
same amount from one period to the
next, but the change is proportional to
the initial geometric mean is used.
• Another way of saying this is that the
growth is multiplicative, not additive.
• Geometric mean is the nth root of the
product of the values for n
observations
Example
• In a set • G.0=
• The incease in the 1st day is • =
0,8
• For the second day the
increase is by : 0,10 • =0,212
• And for 3rd : 0,12.
• What is the average increase
rate?
Harmonic Mean
• Harmonic mean is another measure
of central tendency
• Like arithmetic mean and geometric
mean, harmonic mean is also useful
for quantitative data.
• The harmonic mean can be
expressed as the reciprocal of the
arithmetic mean of the reciprocals
of the given set of observations
• Or quotient of the “number of the ("Reciprocal" just means
given values” and the“sum of the
reciprocals of the given values”. (1/ value )
• Harmonic Mean: Ideal for rates • The harmonic mean can be
and ratios, especially when the expressed as the reciprocal of
data points are inversely the arithmetic mean of the
related to the quantities of reciprocals of the given set of
interest (e.g., speed and time). observations
• Or quotient of the “number of
the given values” and the“sum
of the reciprocals of the given
values”.
Example
A driver has traveled 200
km road in 2 hours and
returned in 4 hours.
Calculate the average
speed of the driver on
this journey?
400 / 6 = 66.67
km/s
• A company has three machines
with production rates of 60
units/hour, 90 units/hour, and
120 units/hour.
• What is the average production
rate of the machines?
• So the first machine’s rate is 1/60
• 2nd machine’s rate is 1/90
• 3rd machine’s rate is 1/120
Descriptive Statistics
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
88 + 95 = 183
160 160 165 170,1 170,1 175 178 178 178 180 181 183 185 186 187 190
Grouped Data
160-165 3
170-175 3
176-180 4
181-185 3
186-190 3
Finding the Median in a Grouped Data
• L= Class of medians lower boundry
•L + . İ • N=Total frequency
• F= Total frequency above the box
• i= Class interval size
• f= Frequence in box
Cummulative Frequency
2
5
13
16
18
Cummulative Frequency
2
5 (F)
L= 13
16
18 L+ .İ
= 10
Group Xi frequenciei Cummulative • Median : 340/2 = 170th
Frequency • L= Class of medians lower
boundry
0-10 (İ) 5 3 3
• N=Total frequency
10-20 15 12 15
(İ)
• F= Total frequency above the
20-30 25 35 50 box
30-40 35 45 95 (F) • i= Class interval size
40-50 45 110 (f) 205 • F= Frequence in box
(L)
50-60 55 45 250
60-70
70-80
65
75
35
30
285
315
• 40 + . 10
80-90 85 15 330 • = = 46,81
90-100 95 10 340
Total 340
Descritive Biostatistics
Example 1: 2, 3, 4, 5, 7, 7, 8. Mode = 7
Example 2: 2, 3, 4, 5, 7, 7, 8, 8.
Mode = 7, Polymodal
8
Example 3: 1,2,3,1,1,6,5,4,1,4,4,3 -> 1 has the highest
frequency (f=4)
Mode is 1 (not 4 )
The mode is not the frequency of the most
frequent score but the value
Finding the Mode in Grouped Data
Mode = L + . h
L: Mode’s class
𝝙 1 = Mod class frequency – previously class’s frequency
𝝙 2= Mod class frequency – Next class frequency
Finding the Mode in Grouped Data
Mode = L + . h
Groups Xi
f
i
frequency
Mod class frequency – Next
10-20 15 12
𝝙
class frequency
20-30 25 35
2=
30-40 35 45
40-50 45 110
• Mode = L + . h
50-60 55 45
60-70 65 35
• 40+ . 10
70-80 75 30
80-90 85 15
• Mode = 45 90-100 95 10
Toplam 340
Hints
700
Chart Title
• If the number of repetitions
600
of each observation is the
500
same, there is no mode in
400
that data set.
300 • Distributions with a single
200 value with the highest
100 number are called a unimode
0
A
B
C
Hints
300
200
100
• Thus, the mode value becomes
0
A
single so.
B
C
D
E
F
Descritive Biostatistics
Central Tendencies Dispersion
Mean Quartiles
Median Percentage
Mode Variance
Standart Deviation
Quartiles
Measures of central tendencyQ is equal to the 25th percentile
1
that divide a group of data
into spesific subgroups Q is located at 50th percentile and equals
2
the median
• Q1: 25% of the data set is
below the first quartile Q3 is equal to the 75th percentile
• Q2: 50% of the data set is Quartile values are not necessarily
below the second quartile members of the data set
• Q3: 75% of the data set is Q1 Q2 Q3
below the third quartile
25% 25% 25% 25%
Variance vs Variety
• It is the distribution or spread • Values in different data sets
around the mean can have the same mean even
• Variety means having many though their values are totally
different values different.
SET 1
SET 1 SET 2
2.5
2 90 2
1.5
2 70 1
0.5
2 -20 0
0 1 2 3 4 5 6 7 8 9 10
SET 2
2 -50
100
2 8 80
60
2 -10 40
2 28 20
0
0 1 2 3 4 5 6 7 8 9 10
2 -9 -20
-40
2 -89 -60
-80
-100
Variance
A measure of the spread of the recorded values on a variable. A measure of dispersion.
The larger the variance, the further the individual cases are from the mean.
Mean
The smaller the variance, the closer the individual scores are to the mean.
Mean
Shape of the
Distribution
Symmetrical Distributions
• The mode, median, and mean have identical
values
Skewed Distributions
• The mode is the peak of the curve
• The mean is closer to the tail
• The median falls between the two
Bimodal Distributions
• Both modes should be used to describe the data
4
6
Variance
1. Find the mean of the data.
Hint – mean is the average so add up
the values and divide by the number
of items.
2. Subtract the mean from each value – the
result is called the deviation from the
mean.
3. Square each deviation of the mean.
4. Find the sum of the squares.
5. Divide the total by the number of items.
1. Find difference between each data point and mean.
2. Square the differences, and add them up.
3. Divide by the number of data points.
n
Mean is represented by and n is the number of
items.
If measuring variance of population, denoted by 2 (“sigma-
squared”).
If measuring variance of sample, denoted by s2 (“s-squared”).
Measures average squared deviation of data points from their mean.
Highly affected by outliers. Best for symmetric data.
SAMPLE VARIANCE VS POPULATION VARIANCE
N (x ) 2
(x i x)
s 2 i 1 n
n 1
(x i x)
s 2 i 1
n 1
= 1970/ 5
= 394
394
50
55
I have to find the mean:
58
60 x̄ =
64 x̄ =63
70
71
76
Weight frequency (Xİ-X) (Xi-X) (Xi-X)² x̄ =63
50 1 50-63 -13 169 Variance:
55 1 55-63 -8 64 SD²= 68,75
58 1 58-63 -5 25
60 1 60-63 -3 9
64 1 64-63 1 1
70 1 70-63 7 49
71 1 71-63 8 64
76 1 76-63 13 169
504 550
For Grouped Data
Classes
Örnek:
f
i
0-10 3 • Find the variance
10-20 12
20-30 35
30-40 45
40-50 110
50-60 45
60-70 35
70-80 30
80-90 15
90-100 10
Toplam 340
Classes Xi
f
i Fi x i
0-10 (0+10)/2 =5 3 15
10-20 15 12 180
20-30 25 35 875
30-40 35 45 1575
10
∑ 𝑓𝑖=340
40-50 45 110 4950
50-60 55 45 2475
60-70 65 35 2275
𝑖=1
70-80 75 30 2250
80-90 85 15 1275
X̄ = = 49,47
90-100 95 10 950
• S² =
• =358,24
• S=
• =358,24
• =18,93
• =49,47 ± 18,93
Coefficient of variation
Coefficient of variation is a measure of the relative amount of variation
as opposed to the absolute variation.
80
| QUESTIONS AND SUGGESTIONS |
| RECOMMENDED WEEKLY STUDIES |
•Field, A. (2018). Discovering statistics using IBM SPSS Statistics (5th ed.). SAGE Publications.
•Sullivan, M. (2019). Statistics (6th ed.). Pearson.
•Bluman, A. G. (2018). Elementary statistics: A step by step approach (10th ed.). McGraw-Hill
Education.
•Moore, D. S., McCabe, G. P., & Craig, B. A. (2018). Introduction to the practice of statistics (9th
ed.). W.H. Freeman.
| ABOUT THE NEXT WEEK |
STANDART DEVIATION
VARIANCE
"Peace at Home, Peace in the World.”