0% found this document useful (0 votes)
37 views14 pages

075 DescriptiveStatistics

The document discusses methods for organizing and analyzing raw data through frequency distributions and calculations. It describes procedures for creating a frequency distribution table by grouping raw data into class intervals and counting frequencies. It then shows how to calculate various data characteristics like range, mean, median, mode, quantiles, variance and standard deviation from the raw data or frequency distribution table.

Uploaded by

temestruc71
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
37 views14 pages

075 DescriptiveStatistics

The document discusses methods for organizing and analyzing raw data through frequency distributions and calculations. It describes procedures for creating a frequency distribution table by grouping raw data into class intervals and counting frequencies. It then shows how to calculate various data characteristics like range, mean, median, mode, quantiles, variance and standard deviation from the raw data or frequency distribution table.

Uploaded by

temestruc71
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Frequency distributions

Classification
When raw data are collected, they are distributed into classes, or
categories, ot intervals and the number of times the items appear
in a class-interval is known as the class frequency.
Procedure for tabulating a frequency distribution
(i) Fix the magnitude of the class interval
(ii) If possible allow for a maximum of 20-25 class-intervals
(iii) Classify the items according to (i) and (ii).
(iv) Draw up a table showing te frequency of each class-interval

Raw data:
7.1, 7.5, 8.2, 9.2, 5.4, 6.1, 2.3, 5.4, 2.8, 5.8, 8.5, 4.2, 6.8, 4.8, 5.8, 1.8,
5.8, 5.9, 7.8, 4.7, 6.8, 9.9, 10.2, 15.8, 12.8, 7.8, 4.5, 6.7, 5.7, 6.6, 8.5,
7.5, 4.1, 5.8, 2.8, 6.8, 5.5, 5.4, 6.7, 8.7, 5.7, 12.4, 7.7, 5.8, 8.2, 5.6

Raw data:
7.1, 7.5, 8.2, 9.2, 5.4, 6.1, 2.3, 5.4, 2.8, 5.8, 8.5, 4.2, 6.8, 4.8, 5.8, 1.8,
5.8, 5.9, 7.8, 4.7, 6.8, 9.9, 10.2, 15.8, 12.8, 7.8, 4.5, 6.7, 5.7, 6.6, 8.5,
7.5, 4.1, 5.8, 2.8, 6.8, 5.5, 5.4, 6.7, 8.7, 5.7, 12.4, 7.7, 5.8, 8.2, 5.6
Minimum item = 1.8
Maximum item = 15.8
Number of class-intervals = 15
Class-interval length = 1

C-I
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
1.5
2.5
3.5
4.5
5.5
6.5
7.5
8.5
9.5 10.5 11.5 12.5 13.5 14.5 15.5
R
1
3
0
5
13
7
6
5
2
1
0
2
0
0
1
f
1
4
4
9
22
29
35
40
42
43
43
45
45
45
46
cf
0
0,109 0,283 0,152 0,13 0,109 0,043 0,022
0
0,043
0
0
0,022
rf 0,022 0,065
1
rcf 0,022 0,087 0,087 0,196 0,478 0,63 0,761 0,87 0,913 0,935 0,935 0,978 0,978 0,978

C-I class interval


R = Representative
f = Frequency
cf = Cummulative frequency
rf = Relative frequency
rcf = Relative cummulative frequency

Frequency distribution histogram


14

Frequency

12

10

0
1.5

2.5

3.5

4.5

5.5

6.5

7.5

8.5

Values

9.5

10.5

11.5

12.5

13.5

14.5

15.5

Cummulative frequency

Cummulative frequency distribution histogram


50
45
40
35
30
25
20
15
10
5
0
C-I

Values

10

11

12

13

14

15

Relative frequency distribution pie-chart


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15

With a given set of raw data x1 , x2 ,K , xn


or a frequency distribution table

r1

r2

...

rk

f1

f2

...

fk

We can calculate various data characteristics such as


range, arithmetic mean, median, mode, quantiles,
variance, standard deviation

Range
Range is the difference between the magnitude of the
largest and the smallest of data items.

The arithmetic mean

x=

x
i =1

n
k

x
x=

j =1
k

f
j =1

fj
j

Median
Arrange the items of a distribution in order of magnitude
strating with either the largest or the smallest, then
(i) if the number of items is odd, the median is the value of
the middle item
(ii) if the number of items is even, the median is the
arithmetic mean of the two middle items

The mode
The mode of a distribution is the most frequent,
or most 'popular' item. It may not exist and,
even if it does, it may not be unique

Quantiles
median - divides a distribution into two equal parts
quartiles Q1, Q2, Q3, Q4 - divide a distribution into four
equal parts
deciles D1, ..., D10 - divide a distribution into ten equal
parts
percentiles P1, ..., P100 - divide a distribution into a hundred
equal parts
Q2 is the median

Variance, standard deviation

(x x )
n

s =
2

(x x )
n

s=

s' =
2

n 1
2
(
)
x

x
i
n

(x x )
n

s' =

n 1

You might also like