0% found this document useful (0 votes)

4 views20 pages

Chapter Two

Uploaded by

Gezae Gebredingl

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views20 pages

Chapter Two

Uploaded by

Gezae Gebredingl

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 20

CHAPTER TWO

DATA COLLECTION AND PRESENTATION

2.1. DATA COLLECTION
2.1.1. Meaning of Collection of Data
Collection of data implies a systematic and meaningful assembly of information for the
accomplishment of the objective of a statistical investigation. It refers to the methods used in
gathering the required information from the units under investigation.
2.1.2. Primary and Secondary Data
2.1.2.1.Meaning and distinction between primary and secondary data
Statistical data may be obtained either from primary or secondary source. A primary source is a
source from where first-hand information is gathered. On the other hand, secondary source is the
one that makes data available, which were collected by some other agency. Clearly, a source, which
is not primary, is necessarily a secondary source. Primary sources are original sources of data.

Data obtained from a primary source is called primary data.

data. Likewise, data gathered from a
secondary source is known as secondary data.
data.
2.1.2.2.Methods of collecting primary data
After discussing the two sources of data, primary and secondary, it is logical to say a few words
about the methods employed in collecting data from its original or primary source.
Many authors commonly state three methods of collecting primary data. These are:
a. Interview method
b. Observation
c. Questionnaire method
a) Interview method (Personal Enquiry Method)
In personal enquiry method, a question sheet is prepared which is called schedule. The schedule
contains all the questions, which would extract a complete report from a respondent. Depending on
the nature of the interview, personal enquiry method is further classified into two types.
 Direct Personal Interview:
Interview: It is a type of personal enquiry where there is a fac-to-face contact
with the persons from whom the information is to be obtained
 Indirect Personal Enquiry (Interview): It is the second type of personal enquiry where the
investigator contacts third parties called witnessed who are capable of supplying the necessary
information.
b) Observation

1
It includes all methods from simple visual observations to the use of high-level machines and
measurements, sophisticated equipment or facilities, such as radiographic, X-ray machines,
microscope.
In a direct observation approach, an investigator stays at the place of survey and notes down the
observation himself.
c) Questionnaire Method (Self-administered)
Under this method, a list of questions related to the survey is prepared and distributed by hand to
the respondents or sent to the various respondents by post, Web sites, e-mail, etc. However, this
method cannot be used if the respondent is illiterate. It is a method that is often used in many
statistical investigations.

Types of questions used in a questionnaire

Depending on how questions are asked and recorded we can distinguish two major types of
questions- Open –ended questions, and closed ended questions.
a) Closed- ended questions:
questions: Closed questions offer a list of possible options or answers
from which the respondents must choose.
b) Open-ended questions:
questions: Open-ended questions permit free responses that should be
recorded in the respondent’s own words. The respondent is not given any possible
answers to choose from.
2.1.3. Types of Variables and Level (Scale) of Measurement
Types of Variables
Variables are divided into two: Qualitative and quantitative variable
1. Qualitative variables are nonnumeric variables and cannot be measured. Examples: gender,
religious affiliation, and state of birth.
2. Quantitative Variables are numerical variables and can be measured. Examples: balance in
checking account, number of children in family.
Note that quantitative variables are either discrete or continuous
Discrete variable:
variable: It assumes a finite or countable number of possible values. It is usually obtained
by counting. Example: number of children‘s in a family, number of cars at a traffic light
Continuous variable:
variable: It can assume any value within the defined range. Continuous variables are
usually obtained by measuring. Example: weight in kg, height, time, air pressure in a tire.
Level (Scale) of Measurement
There are four general levels of measurements:
These are: Nominal, ordinal, interval and ratio levels of measurements

2
1. Nominal level
The terms nominal level of measurements and nominal scale are commonly used to refer to data
that can only be classified into categories. In the strict sense of the words, however, there are no
measurements and no scales involved. Instead, there are just counts.
There is no any mathematical difference between categories.
Examples:
1. Sex of a person (male and female could be coded as 0 and 1).
2. Ethnic group (black, white and oriental may be coded as 0, 1 and 2).
This indicates that for nominal level of measurement, there is no particular order for the groupings.
Further, the categories are considered to be mutually exclusive. Note that in all the above cases one
cannot say that 1> 0 or 2> 1 etc.
Nominal level is considered the most primitive, the lowest or the most limited type of measurement
2. Ordinal Level
Ordinal data are nominal data, which have order and consensus. Measurements with ordinal scales
are ordered in the sense that higher numbers represent higher values, i.e., they can have meaningful
inequalities (< or >). In such kind of data, only counting and ranking are possible but it is not likely
to find exact differences.
Examples
1. Military ranks: comparing 3 stars general and 4 stars general.
2. Linkers scale such as 1= poor, 2= fair, 3= good and 4= excellent.
The major difference between a nominal level and an ordinal level of measurement is the “greater
than” relationship between the ordinal-level categories. Otherwise, the ordinal scale of
measurement has the same characteristics as the nominal scale; namely, the categories are mutually
exclusive and exhaustive.
2. Interval level
The interval scale of measurement is the next higher level. It includes all the characteristics of the
ordinal scale, but in addition, the distance between values is a constant size. If one observation is
greater than another by a certain amount, and the zero point is arbitrary, the measurement is on at
least an interval scale. For example, the difference between temperatures of 70 degrees and 80
degrees is 10 degrees. Likewise, a temperature of 90 degrees is 10 degrees more than a temperature
of 80 degrees, and so on. Scores on a statistics or mathematics examination are also examples of the
interval scale of measurement.

3
3. Ratio level
Ratio level is the highest level of measurement. This level has all the characteristics of interval
level. The distances between numbers are of a known, constant size; the categories are mutually
exclusive, and so on.

The major differences between interval and ratio levels of measurement are these: (1) Ratio-level
data has a meaningful zero point and (2) the ratio between two numbers is meaningful. Money is a
good illustration having zero dollars has meaning you have none! Weight is another ratio-level
measurement.
Examples:

1. Income is a ratio data because zero dollars is truly “no income”

2. Measurement data like height, weight, volume and area.
If the dial on a scale is zero, there is a complete absence of weight. Also, if you earn $40,000 a year
and John earns $ 10,000, you earn four times what he does. Likewise, if you weigh 80 kg and John
weight 40 kg., you weigh twice John. But such comparisons are impossible in interval level of
measurement.

2.2. CLASSIFICATION OF DATA

After collecting relevant information (data) for the purpose of statistical investigation, the next
important task is classification of this data. It is difficult to group the meaning of any considerable
volume of numerical data unless their mass is somehow reduced to relatively few convenient
classes or categories.

Classification of data is the process of arranging things in groups or classes according to their
resemblance.
Purposes of Classification:
Classification: -
 To eliminate unnecessary detail.
 To bring out clearly points of similarity & dissimilarity
 To enable one to form mental pictures of objects on measurements
 To enable one to make comparisons and draw inferences

Types of Classification
1. Geographical Classification:
Classification: - Data are arranged according to places like continents, regions,
and countries
2. Chronological Classification:
Classification: - Data are arranged according to time like year, month.

4
3. Qualitative Classification: - Data are arranged according to attributes like color,
religion, marital-status, sex, educational background, etc.
4. Quantitative Classification:
Classification: - In this type of classification, the statistical data is classified
according to some quantitative variables. The variable may be either discrete or continuous.
2.3. PRESENTATION OF DATA
This section deals with the study of organizing a set of raw data into a Frequency Distribution (FD)
and describes the distribution graphically in a histogram, a frequency polygon, & a cumulative
frequency curve (ogive). The other types of numerical information will be summarized & presented
in the form of bar chart, pie chart or a pictogram.

Definition: Presentation is a statistical procedure of arranging and putting data in a form of

tables, graphs, charts and/or diagrams
2.3.1. Tabular Data Presentation
Tabulation is a logical or systematic listing of related data in rows and columns. The row of a table
represents the horizontal arrangement of data and column represents the vertical arrangement of
data.
Simple or one-way table
Table 1: Educational level of 200 employees in a certain organization
Educational level Number of employees Percent (%)
BA 120 60
MA 60 30
PhD 20 10
Two-way table:
table: This table shows two characteristics and is formed when either the row or
the column is divided into two or more parts.
Table 2: Educational level by marital status of employees in an organization
Educational level
Marital BA MA PhD Total
Single
Status 90 50 18 158
Married 30 10 2 42
Total 120 60 20 200
2.3.2. Frequency Distribution
When the raw data have been collected, they should be put into an ordered array in an ascending or
descending order so that it can be looked at more objectively. Then this data must be organized into
a “FD” which simply lists the values or classes with their corresponding frequencies in a tabular
form. Here, frequency refers to the number of observations a certain value occurred in a data.
The tabular representation of values of a variable together with the corresponding frequency is
called a Frequency Distribution (FD).

5
Definition: A frequency distribution is the organization of raw data in table form, using classes and
frequencies.
Frequency distribution is of two kinds: Ungrouped and Grouped Frequency Distribution
2.3.2.1. Ungrouped Frequency Distribution (UFD)
Shows a distribution where the values of a variable are linked with the respective frequencies.
Example 1: Consider the number of children in 15 families.
1 0 3 2 0
2 4 1 3 1
4 1 2 2 3
Construct ungrouped FD for the above data.
Solution:
No. of Children No. of Family Frequency Relative
(Values) (Tallies) Frequency
0 // 2 0.13
1 //// 4 0.27
2 //// 4 0.27
3 /// 3 0.2
4 // 2 0.13
Total 15 1
Exercise 1

Consider the following scores in a statistics test obtained by 20 students in a given class.

10, 4, 4, 7, 5, 7, 7, 8, 5, 7, 8, 5, 10, 8, 7, 5, 7, 8, 7, 4
Prepare an ungrouped FD
2.3.2.2. Grouped Frequency Distribution (GFD)
If the mass of the data is very large, it is necessary to condense the data into an appropriate number
of classes or groups of values of a variable and indicate the number of observed values that fall into
each class. Therefore, a GFD is a frequency distribution where values of a variable are linked into
groups & corresponded with the number of observations in each group.
Example 2
Values (xi) 1 - 25 26 - 50 51 - 75 76 - 100
Frequency (fi) 3 10 18 6

Common Terminologies in a GFD

1. Class: - group of values of a variable between two specified numbers called lower class limit
and upper class limit
Lower Class Limit (LCL) & Upper Class Limit (UCL)
In example 2, the GFD contains four classes: 1 – 25, 26 – 50, 51 – 75, and 76 – 100
LCL1 = 1, UCL1 = 25 LCL3 = 51, UCL3 = 75
6
LCL2 = 26, UCL2 = 50 LCL4 = 76, UCL4 = 100
2. Class Frequency (or Simply Frequency):
Frequency): refers to the number of observations corresponding
to a class.
In example 2 the class frequency of the 1st, 2nd, 3rd, & 4th classes are respectively 3, 10, 18 and 6.
3. Class Boundaries: are boundaries obtained by subtracting half of the unit of measurement (u)
from the lower limits or by adding ½ (u) on the upper limits of a class.
i.e LCBi = LCLi - ½ (u)
UCBi = UCLi + ½ (u)
Where LCBi = Lower Class Boundaries and
UCBi = Upper Class Boundaries
Remark:
Remark: The unit of measurement (u) is the gap between any two successive classes. i.e
u = lower limit of a class – upper limit of the preceding class.
In example 2, consider the 2nd class, 26 – 50, since u = 26 – 25 = 1,
LCL2 = 26 UCL2 = 50
LCB2 = 26 - ½(1) = 25.5 UCB2 = 50 + ½(1) =50.5
4. Class Width (size of a class or class interval)
interval): it is the difference between the upper and lower
class limits or the difference between the upper and lower class boundaries of any class.

Remarks:
a. If both the LCL & UCL are included in a class, it is called an inclusive class. E.g 0 – 10,
11 – 20, 21 – 30, and so on. For inclusive classes,
Class width (cw) = UCBi - LCBi
b. If LCL is included and the UCL is not included in a class, it is called an exclusive class.
E.g. 0 – 10, 10 – 20, 20 – 30, and so on. For exclusive classes
cw = UCLi – LCLi
To be consistent, we use inclusive classes.
5. Class Mark (cm): it is the midpoint (center) of a class cmi = UCBi + LCBi
2
Note:-
Note:- the difference between any two successive class marks is equal to the width of a class
6. Range (R) : is the difference between the largest (L) and the smallest (S) values in a data
R=L–S

Steps in constructing a Grouped Frequency Distribution

1. Find the Largest and the smallest value and compute the range; R = L – S,

7
2. n 1 + 3.322LogN; up/down to
Determine the number of classes (n) using sturgles formula n
the nearest whole number, where N=Total number of observation
3. Find the class width (cw) by dividing the range(R) by the number of classes (n) and round to
the nearest integer. cw=R/n
4. Identify the unit of measurement usually as 1, 0.1, 0.01…
5. Determine the class limits
a. Determine the lower-class limit of the first class (LCL1), pick a suitable starting point less
than or equal to the minimum value. then
LCL2 = LCL1 + cw, LCL3 = LCL2 + cw,… LCLi+1 = LCLi + cw
b. Determine the upper class limit of the first class (UCL1) i.e.
UCL1 = LCL1 + cw – u, where u = the unit of measurement, then
UCL2 = UCL1 + cw, UCL3 =UCL2 + cw, … , UCLi+1 = UCLi + cw
6. Compute class boundaries (UCBi = UCLi + ½*U and LCBi = LCLi – ½*U) and class mark
(Cmi=(UCBi+LCBi)/2) then complete the GFD with the respective class frequencies.
Example 3: The number of customers for consecutive 30 days in a supermarket was listed as
follows:
20 48 65
25 48 49
35 25 72
42 22 58
53 42 23
57 65 37
18 65 37
16 39 42
49 68 69
63 29 67
a. Construct a GFD with a suitable number of classes
b. Complete the distribution obtained in (a) with class boundaries & class marks

Solution:
1. Range = Largest value – smallest value = 72 – 16 = 56
2. N = 30 (total number of observations)
 Number of classes, n = 1 + 3.322 log30
 n = 1 + 3.322 log30
= 1 + 3.322 (1.4771)
= 5.9
Hence a suitable number of class n is chosen to be 6

8
3. Class width = Range  = 9.33 = cw
56
n 6

9
For the sake of convenience, take cw to be 10 (note that it is also
possible to choose the cw to be 9).
4. u=1
5. Take lower limit of the 1st class (LCL1) to be 16 & u = 1
LCL1 = 16 and UCL1 = LCL1 + cw – u =16+10-1 = 25
LCL2 = LCL1 + cw = 16 + 10 = 26 UCL2 = UCL1 + cw = 25 + 10 = 35
LCL3=LCL2 + cw = 26 + 10 = 36 UCL3 = UCL2 + cw = 35 + 10 = 45 and so on

6. LCBi = LCLi – ½(u) UCB i=

= UCLi + ½(u) Cmi=(UCBi+LCBi)/2
LCB1 = LCL1 – ½(u) =16-½(1) =15.5, LCB2 = LCL2 – ½(u) =26-½(1) =25.5 and so on
UCB1= UCL1 + ½(u) =25+½(1) =25.5, then continue to UCL6
Therefore, the GFD would be
a) b)
Class (xi) Frequency (f
(fi) Class (xi) Frequency (f
(fi) CBi cmi
16 – 25 7 16 – 25 7 15.5 – 25.5 20.5
26 – 35 2 26 – 35 2 25.5 – 35.5 30.5
36 – 45 6 36 – 45 6 35.5 – 45.5 40.5
46 – 55 5 46 – 55 5 45.5 – 55.5 50.5
56 – 65 6 56 – 65 6 55.5 – 65.5 60.5
66 – 75 4 66 – 75 4 65.5 – 75.5 70.5

Exercise 3: Consider the following data about the ages of 50 persons.

10
37 40 69
35 36 70
72 62 36
72
65 64 47
59 55 42
45 50 46
65
54 63 51
50 61 60
58 58 56
58
55 45 49
51 50 56
44 60 70
44
52 43 55
46 42 62
57 48 60
55
A. How many classes would you recommend?
B. What class interval would you suggest?
C. What would you recommend as the lower limit and upper limit of the first class?
D. Organize the data into a grouped frequency distribution based on the appropriate
class interval.
E. What is the modal age?
2.3.2.3. Cumulative Frequency Distribution (CFD)
It is the collection of values of a variable above or below specified values in a distribution. GFD
is of two types.
a. ‘Less Than’ Cumulative Frequency Distribution (<CFD): shows the collection of cases
lying below the upper class boundaries of each class.
b. ‘More Than’ Cumulative Frequency Distribution (>CFD): shows the collection of cases
lying above the lower class boundaries of each class.
Remark: The frequency distribution does not tell us directly the number of units above or
below specified values of the classes this can be determined from a “cumulative Frequency
Distribution’

11
Example 4: Convert the absolute frequency distribution in example 3 into a less than and more
than cumulative frequency distribution
Class Class Frequency Less than Cumulative More than Cumulative
(xi) Boundaries (fi) Frequency (<cfi) Frequency (>cfi)
16 – 25 15.5 – 25.5 7 7 30
26 – 35 25.5 – 35.5 2 9 23
36 – 45 35.5 – 45.5 6 15 21
46 – 55 45.5 – 55.5 5 20 15
56 – 65 55.5 – 65.5 6 26 10
66 – 75 65.5 – 75.5 4 30 4
This means that from ‘less than’ cumulative frequency distribution there are 7 observations less
than 25.5, 9 observations below 35.5, etc and from ‘more than’ cumulative frequency
distribution 30 observations are above 15.5, 23 observations are above 25.5 etc.
2.3.2.4. Relative Frequency Distribution (RFD)
It enables the researcher to know the proportion or percentage of cases in each class. Relative
frequencies can be obtained by dividing the frequency of each class by the total frequency. It
can be converted in to a percentage frequency by multiplying each relative frequency by 100%.
i.e.

fi
Rf i =
n
Where Rfi – is the relative frequency of the ith class Note: Pfi = Rfi  100%
fi – is the frequency of the ith class Where Pfi is percentage frequency of each class. n – is
the total number of observations
Example 5: The relative and percentage frequency distribution of Example 3 is:

Class Class Frequency Rfi (Pfi)

16(xi)
– 25 15.5 – 25.5
Boundaries 7
(fi) 7/30=0.23 0.23*100=23%
26 – 35 25.5 – 35.5 2 2/30=0.07 0.07*100=7%
36 – 45 35.5 – 45.5 6 6/30=0.2 0.2*100=20%
46 – 55 45.5 – 55.5 5 5/30=0.17 0.17*100=17%
56 – 65 55.5 – 65.5 6 6/30=0.2 0.2*100=20%
66 – 75 65.5 – 75.5 4 4/30=0.13 0.13*100=13%

Exercise 4

12
Consider the following data.
22, 11, 12, 12, 13, 20, 14, 14, 3, 4, 5, 6, 7,7,8,7,8,9,10, 11, 13, 15, 16, 17, 17, 18, 15, 19, 12,
14,15,18
A. Prepare an ungrouped frequency distribution
B. Construct a grouped frequency distribution with appropriate number of classes and
compute the class boundaries and class marks
C. Construct a less than and more than cumulative frequency distribution
D. Compute the relative and percentage frequency distribution
2.3.3. GRAPHIC METHODS OF DATA PRESENTATION
2.3.3.1. Histogram
After you complete a frequency distribution, your next step will be to construct a “picture” of
these data values using a histogram. A histogram is a graph consisting of a series of adjacent
rectangles whose bases are equal to the class width of the corresponding classes and whose
heights are proportional to the corresponding class frequencies. Here, class boundaries are
marked along the horizontal axis (x – axis) and the class frequencies along the vertical axis (y –
axis) according to a suitable scale. It describes the shape of the data. You can use it to answer
quickly such questions as are the data symmetric? And where do most of the data values lay?
Example 6: Construct a histogram for the following distribution.

Class boundaries 14.5-24.5 24.5-34.5 34.5-44.5 44.5-54.5 54.5-64.5

Frequency 3 4 8 6 7
Solution
9

5
Series1
4
F

q
u
r
e

0
14.5-24.5 24.5-34.5 34.5-44.5 44.5-54.5 54.5-64.5

Class Boundaries

13
Exercise 5: Construct a histogram for the following distribution
Class (xi) 5 – 10 10 – 15 15 – 20 20 – 25 25 - 30 30 – 35
Frequency (f
(fi) 4 7 9 12 6 5

2.3.3.2. Frequency Polygon

Frequency polygon is constructed by plotting the class frequencies against class marks and
joining them by a set of line segments.
Although a histogram does demonstrate the shape of the data, perhaps the shape can be more
clearly illustrated by using a frequency polygon. Here, you merely connect the centers of the
tops of the histogram bars (located at the class midpoints) with a series of straight lines. The
resulting figure is a frequency polygon. Here the class marks are plotted along the x – axis and
the class frequencies along the y – axis.
Note: Two classes with zero frequencies at the two ends of the Frequency distribution are added
to complete the polygon
Example 7: Consider the frequency distribution given in the above example 6 and construct a
frequency polygon.

Frequency Polygon

9
8
7
6
5
Frequency
F

q
u
r
e

4
3
2
1
0
9.5 19.5 29.5 39.5 49.5 59.5 69.5
Class Marks
.
Exercise 6: Construct
Construct a frequency polygon for the frequency distribution given in exercise 5
2.3.3.3. Cumulative Frequency Curve (Ogive)
It is the graphic representation of a cumulative frequency distribution Ogives are of two kinds.
‘Less than’ ogive and ‘more than’ Ogive (< Ogive and > Ogive).
A) ‘Less than’ ogive:
ogive: here, upper class boundaries are plotted against the ‘less than’
cumulative frequencies of the respective class & they are joined by adjacent lines.

14
Example 8: Draw a ‘less than’ Ogive for the following frequency distribution
Class (xi) 3-6 7 – 10 11 – 14 15 – 18 19 – 22
Frequency (f
(fi) 4 7 10 6 3
Class Boundaries 2.5-6.5 6.5-10.5 10.5-14.5 14.5-18.5 18.5-22.5
<Cumulative FD 4 11 21 27 30
>Cumulative FD 30 26 19 9 3

A less than ogive showing the frequency distribution above

Less than cumulative frequency (<Cfi)

35
30
25
20
15
10
5
0
6.5 10.5 14.5 18.5 22.5

Upper class boundary (UCBi)

B
. ‘More
‘More than’ Ogive:
Ogive: here, lower class boundaries are plotted against the ‘more than’ cumulative
frequencies of their respective class and they are joined by adjacent lines.
Example 9: Draw a ‘More than’ Ogive for the frequency distribution given in example 8
Solution:

15
A more than ogive for the above frequency distribu-
tion
More than cumulative frequency (>Cfi

35
30
25
20
15
10
5
0
2.5 6.5 10.5 14.5 18.5
lower class boundaries (LCBi)

2.3.3.4. Line Graph

It represents the relationship between time (on the x-axis) and values of variable (on the y-axis).
The values are recorded with respect to the time of occurrence.
Example 10: Draw a line graph for the following time series data.

Year 1986 1987 1988 1989 1990 1991

Values 20 10 30 15 25 10
Solution
A line graph showing the above time series

40
30 30
Values

25
20 20
15
10 10 10
0
1986 1987 1988 1989 1990 1991
Year

16
2.3.3.5. Bar Chart (Bar Diagram)
Histogram, Frequency polygon, ogives are used for data having an interval or ratio level of
measurement. The other kinds of presenting statistical data suitable for a particular kind of
situations are bar charts, pie chart and pictograph.
Bar chart is a series of equally spaced bars of uniform width where the height (length) of a bar
represents the amount (magnitude) of frequency corresponding with a category. Bars may be
drawn horizontally or vertically. Vertical bar graphs are preferred as they allow comparison with
other bars.

Types of Bar Charts

A. Simple Bar Chart: It represents a single set of data (variable) classified in different
categories. Singular bars are drawn with the respective frequencies.
Example 11: Revenue (in millions of Birr) of company x from 1980 to 1982 is given below
Year Revenue
1980 50
1981 150
1982 200

A simple bar chart showing revenues of company X from 1980 to 1982

250
Revenue

200
150
100
50
0
1980 1981 1982
year

B. Multiple Bar Chart: Here two or more bars are grouped with the corresponding frequency to
represent two or more interrelated data in each category. The bars of related variables are kept
adjacent to each other for every set of values. These charts can be used if the overall total is not
required and each bar is shaded or colored separately and a key is given to distinguish them.

17
Example 12: The following table shows the production of wheat and maize in hundreds of
quintals.
Year 1980 1981 1982
Maize 40 20 60
Wheat 80 60 100
Solution:

The number of quintals(in thousands) of

wheat and maize production
100
100 80
80 60 60
Number of60 40
quintals
40 20 maize
20 wheat
0
1980 1981 1982
Year

C. Subdivided (Component) Bar Chart: It is used to present data by subdividing a single bar
with respect to the proportional frequency. Each portion of the bar is then shaded or colored and
a key is given to distinguish them.
Example 13: The number of quintals of wheat and maize (in millions of quintals) produced by
country x in the indicated years.

Year Wheat Maize

1980 150 150
1981 300 200
1982 350 100

Solution:

18
The number of quintals of wheat and maize
produced by country X

Number of quintals
600

400 200 100 Maize

200 150 Wheat

300 350
150
0
1980 1981 1982

Year

D. Percentage Bar Chart:

Chart: It is a subdivided bar chart where percentages are used in each
classification rather than the actual frequencies.
Example 14: Construct percentage bar chart for the data given in Example 13.
Year % of Wheat Production % of Maize Production
1980 150/300  100 = 50 150/300  100 = 50
1981 300/500  100 = 60 200/500  100 = 40
1982 350/450  100 = 78 100/450  100 = 22
Percentage produced

Percentage of wheat and maize production from 1980-1982

100%
22
80% 50 40
60% wheat
40% 78 maize
50 60
20%
0%
1980 1981 1982
Year
Solution:
2.3.3.6. Pie Chart
A pie chart is a circle divided in to various sectors with areas proportional to the value of the
component they represent. It shows the components in terms of percentages not in absolute
magnitude. The degree of the angle formed at the center has to be proportional to the values
represented.

Example 15: The monthly expenditure of a certain family is given below.

Items Expenditure % Proportion (Pf
(Pfi)

19
Clothing 100 100/1000  100 = 10%
Food 350 350/1000  100 = 35%
House Rent 250 250/1000  100 = 25%
Miscellaneous 300 300/1000  100 = 30%
Total 1000 100%
Solution: The pie chart for the above expenditure is as follows

Food
30% 35% House rent
Clothing
Misc.
10% 25%

2.3.3.7. Pictograph (Pictogram)

A pictograph is a graph that uses symbols or pictures to represent data.
Example 16: In comparing the population of a country from 1990 to 1992, we simply draw
pictures of people where each picture may represent 1,000,000 people.

1992 -  Key:  = 1,000,000

1991 - 

1990 - 

U-5 Data Collection & Sources of Data: Lecturer: Dr. Meseret E
No ratings yet
U-5 Data Collection & Sources of Data: Lecturer: Dr. Meseret E
28 pages
Data Collection Under Research
No ratings yet
Data Collection Under Research
126 pages
BRM CH-6
No ratings yet
BRM CH-6
33 pages
Lecture 7
No ratings yet
Lecture 7
43 pages
Boistats
No ratings yet
Boistats
26 pages
Metho of Data Collection
No ratings yet
Metho of Data Collection
33 pages
MethodsofDatacollection
No ratings yet
MethodsofDatacollection
45 pages
Collection of Data
No ratings yet
Collection of Data
78 pages
Socio
No ratings yet
Socio
16 pages
3is q4 Module-2 Lesson-2 Data-Collection
No ratings yet
3is q4 Module-2 Lesson-2 Data-Collection
34 pages
Ca - 1
No ratings yet
Ca - 1
23 pages
Data Gathering Procedure
No ratings yet
Data Gathering Procedure
43 pages
Method of Data Collection
No ratings yet
Method of Data Collection
33 pages
Grade 11 Collection of Data
No ratings yet
Grade 11 Collection of Data
94 pages
Chapter 2-Primary and Secondary Data
No ratings yet
Chapter 2-Primary and Secondary Data
27 pages
Statistics 1 2025
No ratings yet
Statistics 1 2025
29 pages
Concept and Sources of Primary Data and Secondary
No ratings yet
Concept and Sources of Primary Data and Secondary
47 pages
Data Collecation Newsest 1
No ratings yet
Data Collecation Newsest 1
42 pages
Module 2 - Stats
No ratings yet
Module 2 - Stats
31 pages
Data Collection and Analysis
No ratings yet
Data Collection and Analysis
137 pages
Methods of Data Collection
No ratings yet
Methods of Data Collection
18 pages
18bzo64e U3
No ratings yet
18bzo64e U3
24 pages
01-Obtaining Data
No ratings yet
01-Obtaining Data
92 pages
Advanced Excel
No ratings yet
Advanced Excel
48 pages
Nihon Kohden 6450 Operator Manual
No ratings yet
Nihon Kohden 6450 Operator Manual
335 pages
Data Science
No ratings yet
Data Science
310 pages
Metho of Data Collection
No ratings yet
Metho of Data Collection
33 pages
Data Collection
No ratings yet
Data Collection
40 pages
Sources and Collection of Data
No ratings yet
Sources and Collection of Data
38 pages
Unit - III Data Collection
No ratings yet
Unit - III Data Collection
95 pages
Collection of Data
No ratings yet
Collection of Data
28 pages
Primary Data Are Those Which Are Collected Afresh and For The First Primary Data Are Those Which Are Collected Afresh and For The First
No ratings yet
Primary Data Are Those Which Are Collected Afresh and For The First Primary Data Are Those Which Are Collected Afresh and For The First
18 pages
Collection and Presentation of Data
No ratings yet
Collection and Presentation of Data
10 pages
Chapter 6 - Data Collection
No ratings yet
Chapter 6 - Data Collection
25 pages
Data Collection
No ratings yet
Data Collection
25 pages
Basic Business Stats (Foster Stine Water)
No ratings yet
Basic Business Stats (Foster Stine Water)
239 pages
Methods of Data Collection
No ratings yet
Methods of Data Collection
5 pages
AI Handout For CS
No ratings yet
AI Handout For CS
65 pages
Statistics
No ratings yet
Statistics
15 pages
CHAPTER TWO Stat I
No ratings yet
CHAPTER TWO Stat I
25 pages
Problem Solving - Basic 7QC Tools
No ratings yet
Problem Solving - Basic 7QC Tools
86 pages
Ids Unit 5 By1
No ratings yet
Ids Unit 5 By1
24 pages
Module 1
No ratings yet
Module 1
19 pages
Stat Chapter Two
No ratings yet
Stat Chapter Two
10 pages
Data Collection Methods: Interview and Observations: Group Members Submitted To: DR Swati Upveja
No ratings yet
Data Collection Methods: Interview and Observations: Group Members Submitted To: DR Swati Upveja
27 pages
Concept and Sources of Primary Data and Secondary
No ratings yet
Concept and Sources of Primary Data and Secondary
47 pages
ES209 Module 1 - Data Management
No ratings yet
ES209 Module 1 - Data Management
19 pages
Lecture 7 - Computer Security
No ratings yet
Lecture 7 - Computer Security
46 pages
Q3-M6 - 3is - Data Collection Procedure
No ratings yet
Q3-M6 - 3is - Data Collection Procedure
30 pages
Practice Exam 1
No ratings yet
Practice Exam 1
22 pages
Collection of Data
No ratings yet
Collection of Data
7 pages
Document
No ratings yet
Document
14 pages
Data Collection
No ratings yet
Data Collection
30 pages
TOC GTU Study Material Presentations Unit-1 18062021085415AM
No ratings yet
TOC GTU Study Material Presentations Unit-1 18062021085415AM
65 pages
Statistics - Chapter 2
No ratings yet
Statistics - Chapter 2
13 pages
Chapter Two Ins
No ratings yet
Chapter Two Ins
35 pages
Metho of Data Collection
No ratings yet
Metho of Data Collection
33 pages
BRM 1
No ratings yet
BRM 1
32 pages
Data Collection Meaning, Types of Data, Methods: Deppt. of Hotel and Tourism Management
No ratings yet
Data Collection Meaning, Types of Data, Methods: Deppt. of Hotel and Tourism Management
9 pages
Grade: Self-Learning Module in Inquiries, Investigations and Immersion Lesson
No ratings yet
Grade: Self-Learning Module in Inquiries, Investigations and Immersion Lesson
11 pages
04 Data Collection
No ratings yet
04 Data Collection
48 pages
Handnote On B.Stat 2nd Chapter
No ratings yet
Handnote On B.Stat 2nd Chapter
55 pages
CET0212 Module 1
No ratings yet
CET0212 Module 1
31 pages
Statistics Lecture PDF
No ratings yet
Statistics Lecture PDF
51 pages
Statistical Analysis With Software Application
No ratings yet
Statistical Analysis With Software Application
3 pages
Statistical Process Control
No ratings yet
Statistical Process Control
39 pages
Microsoft Excel Data Analysis For Beginners, Intermediate and Expert
100% (1)
Microsoft Excel Data Analysis For Beginners, Intermediate and Expert
19 pages
Collection of Data
No ratings yet
Collection of Data
5 pages
Collection of Data
No ratings yet
Collection of Data
11 pages
Chp. 3 - Data Management
No ratings yet
Chp. 3 - Data Management
32 pages
Tabular and Graphical Presentation of Data1
100% (1)
Tabular and Graphical Presentation of Data1
7 pages
11 Economics-Collection of Data-Notes and Video Link
No ratings yet
11 Economics-Collection of Data-Notes and Video Link
11 pages
Notes - Chapter 2 - IT Skills and Data Analysis I
No ratings yet
Notes - Chapter 2 - IT Skills and Data Analysis I
22 pages
Describing Data: Graphs and Tables
No ratings yet
Describing Data: Graphs and Tables
14 pages
Describing Data:: Frequency Tables, Frequency Distributions, and Graphic Presentation
No ratings yet
Describing Data:: Frequency Tables, Frequency Distributions, and Graphic Presentation
35 pages
Data Collection and Sources of Data
No ratings yet
Data Collection and Sources of Data
22 pages
CSIT366-Lab File
No ratings yet
CSIT366-Lab File
17 pages
Chapter 7 Project Communication Management
No ratings yet
Chapter 7 Project Communication Management
8 pages
Nama: Etik Eliza NIM: B1A020107 Kelas: C Biologi Praktikum Statistika Deskriptif
No ratings yet
Nama: Etik Eliza NIM: B1A020107 Kelas: C Biologi Praktikum Statistika Deskriptif
23 pages
Six Sigma Yellow Belt 1698378517
No ratings yet
Six Sigma Yellow Belt 1698378517
64 pages
Summer Task Mathamatics-VIII
67% (3)
Summer Task Mathamatics-VIII
5 pages
PDF
No ratings yet
PDF
114 pages
UPDATED - Black Belt Cheat Sheet - 17NOV2017
No ratings yet
UPDATED - Black Belt Cheat Sheet - 17NOV2017
5 pages
STA211
No ratings yet
STA211
14 pages
Topic 5. Graphical Methods of Data Representation
No ratings yet
Topic 5. Graphical Methods of Data Representation
6 pages
Leperltier C. 1969. A Simplified Statistical Treatment of Geochemical Data by Graphical Representation
No ratings yet
Leperltier C. 1969. A Simplified Statistical Treatment of Geochemical Data by Graphical Representation
13 pages
Matching Form Column A and B
No ratings yet
Matching Form Column A and B
2 pages
L3 Probability Distributions and Histogram For The Probability Distribution
No ratings yet
L3 Probability Distributions and Histogram For The Probability Distribution
10 pages
Statistics Notes
No ratings yet
Statistics Notes
17 pages
Frequency Polygon Homework Tes
100% (1)
Frequency Polygon Homework Tes
5 pages
Exp19 Excel Ch08 Cap Golden State 5K Instructions
No ratings yet
Exp19 Excel Ch08 Cap Golden State 5K Instructions
2 pages
2008 Project
No ratings yet
2008 Project
1 page
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet

Chapter Two

Uploaded by

Chapter Two

Uploaded by

CHAPTER TWO

DATA COLLECTION AND PRESENTATION

Data obtained from a primary source is called primary data.

Types of questions used in a questionnaire

1. Income is a ratio data because zero dollars is truly “no income”

2.2. CLASSIFICATION OF DATA

Definition: Presentation is a statistical procedure of arranging and putting data in a form of

Common Terminologies in a GFD

Steps in constructing a Grouped Frequency Distribution

6. LCBi = LCLi – ½(u) UCB i=

Exercise 3: Consider the following data about the ages of 50 persons.

Class Class Frequency Rfi (Pfi)

Class boundaries 14.5-24.5 24.5-34.5 34.5-44.5 44.5-54.5 54.5-64.5

2.3.3.2. Frequency Polygon

A less than ogive showing the frequency distribution above

Upper class boundary (UCBi)

2.3.3.4. Line Graph

Year 1986 1987 1988 1989 1990 1991

Types of Bar Charts

A simple bar chart showing revenues of company X from 1980 to 1982

The number of quintals(in thousands) of

Year Wheat Maize

400 200 100 Maize

200 150 Wheat

D. Percentage Bar Chart:

Percentage of wheat and maize production from 1980-1982

Example 15: The monthly expenditure of a certain family is given below.

2.3.3.7. Pictograph (Pictogram)

1992 -  Key:  = 1,000,000

You might also like