Chapter 2
Chapter 2
CHAPTER 2
2. METHODS OF DATA PRESENTATION
Once it is decide what type of study is to be made, it becomes necessary to collected
information about the concerned study, mostly in the form of data. In order to generate
valid conclusion from a data, information has to be collected in a systematic manner.
Whatever the quality of sampling and analysis method, a haphazardly collected dataset is
less likely to produce valuable and generalizable information.
Types of Data:- There are two types (sources) for the collection of data.
1) Primary data
The primary data are the first hand information collected, compiled and published by
organization for some purpose. They are most original data in character and have not
undergone any sort of statistical treatment.
Refer to those that are collected by conducting survey to meet the specific problem needs
at hand.
Example: Population census reports are primary data because these are collected,
complied and published by the population census organization.
2) Secondary data
The secondary data are the second hand information which are already collected by
someone(organization) for some purpose and are available for the present study.
The secondary data are not pure in character and have undergone some treatment at least
once.Data taken from already available published or unpublished source.
2.1 Methods of collection
There are three major methods of data collection
1. self-administered questionnaire
2. direct investigation-measurement (observation) of the subject and interviewing(face-
toface, telephone, --- )
3. the use of documentary source
1. Self-administered questionnaire
Questionnaire is the main data collection instrument in formal sample survey. Before
examining the steps in designing a questionnaire we need to review the types of questions
used in questionnaires. Depending on the amount of freedom given to respondent in
offering responses, there are two basic types of questions that can be used in
questionnaires: openended questions and closed ended questions.
The type of questions for use will be determined by the form of responses wanted, the
nature of the respondents and their ability to answer the questions.
Open–ended questions:- allows the respondent to answer it freely in his or her own
words
Example: what do you think are the reasons for a high drop-out rate of village health
committee members?
Closed–ended questions:- Predetermined list of alternate responses is presented to the
respondent for checking the appropriate one(s). It implies that the respondent‟s answers
are restricted in some way to a limited range of alternatives.
2. Direct investigation
Page 1 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Data can be obtained through direct observation or measurement that provides accurate
information but it is expensive and inconvenient
eg: Land area measurement, Animal weight gain, Physical examination, direct
observation of work.
ii) Interview
a) Face-to-Face interview
Advantage:-
Interviewers can observe the surroundings and can use nonverbal communication and
visual aids.
The interviewer can help the respondent if he/she has difficulty in understanding the
questions.
Respondent is likely to answer all the questions alone
Disadvantage:-
Cost is high
Interviewer bias is also high
Untrained interviewer may distort the meaning of the questions
b) Telephone Interview
Advantage:-
It is less expensive in time and money compared to face to face interviews
Relatively high response rate
Reach people who would not open their doors to an interviewer, but might willing to
talk on the telephone
Disadvantage:-
Unrepresentative of the groups which do not have telephones
Unlisted telephone numbers are excluded from the study.
Respondent may be substitute by another
3. The use of documentary source
Extracting information from existing resources.
Is much less expensive than any other two sources
It is difficult to get the information needed when records are compiled in un standardized
manner.
Example: - Hospital records, professional institutes, Official statistics, - - -
Page 2 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
qualitative and quantitative data. Tabular and graphical summaries of data can be
obtained in annual reports, newspaper articles and research studies. Everyone is exposed
to these types of presentations, so it is important to understand how they are prepared and
how they will be interpreted.
Modern statistical software packages provide extensive capabilities for summarizing data
and preparing graphical presentations. MINITAB, SPSS and STATA are three packages
that are widely available.
2.2.1 Editing of Data.
After collecting the data either from primary or secondary source, the next step is its
editing.Editing means the examination of collected data to discover any error and mistake
before presenting it. It has to be decided before hand what degree of accuracy is wanted
and what extent of errors can be tolerated in the inquiry. The editing of secondary data is
simpler than that of primary data.
2.2.2. Classification of Data
The process of arranging data into homogenous group or classes according to some
common characteristics present in the data is called classification. For Example, The
process of sorting letters in a post office, the letters are classified according to the regions
and further arranged according to zones, cities, etc.
Bases of Classification:- There are four important bases of classification:
(1) Qualitative Base:- When the data are classified according to some quality or
attributes such as sex, religion, literacy, intelligence etc…
(2) Quantitative Base:- When the data are classified by quantitative characteristics like
heights,weights, ages, income etc…
(3) Geographical Base:- When the data are classified by geographical regions or
location, like states, provinces, cities, countries etc…
(4) Chronological or Temporal Base:- When the data are classified or arranged by their
time of occurrence, such as years, months, weeks, days etc…
For Example: Time series data.
2.2.3 Tabulation of Data
The process of placing classified data into tabular form is known as tabulation. A table is
a symmetric arrangement of statistical data in rows and columns. Rows are horizontal
arrangements whereas columns are vertical arrangements.
Frequency distribution: is the organization of raw data in table form using classes and
frequencies.
Raw data: recorded information in its original collected form, whether it is counts or
measurements, is referred to as raw data.
Page 3 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Used for data that can be place in specific categories such as nominal, or ordinal. e.g.
marital status.
Example 2: a social worker collected the following data on marital status for 25
persons.(M=married, S=single, W=widowed, D=divorced)
M S D W D
S S M M M
W D S M M
W D D S S
S W W D D
Solution:
Page 4 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Since the data are categorical, discrete classes can be used. There are four types of marital
status M, S, D, and W. These types will be used as class for the distribution. We follow
procedure to construct the frequency distribution.
Step 2: Tally the data and place the result in column (2).
Step 3: Count the tally and place the result in column (3).
f
% * 100
n Where f= frequency of the class, n=total number of value.
Percentages are not normally a part of frequency distribution but they can be added since
they are used in certain types diagrammatic such as pie charts.
Combing the entire steps one can construct the following frequency distribution.
S //// // 7 28
D //// // 7 28
W //// 5 24
Page 5 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
ih
-Is a table of all the potential raw score values that could possible occur in the data along
with the number of times each actually occurred.
First find the smallest and largest raw score in the collected data.
Arrange the data in order of magnitude and count the frequency.
To facilitate counting one may include a column of tallies.
Example:
80 76 90 85 80
70 60 62 70 85
65 60 63 74 75
76 70 70 80 85
Page 6 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
-When the range of the data is large, the data must be grouped in to classes that are more than
one unit in width.
Definitions:
Page 7 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
5. The classes must be equal in width. The exception here is the first or last class. It
is possible to have an "below ..." or "... and above" class. This is often used with
ages.
Example*:
11 29 6 33 14 31 22 27 19 20
18 17 22 38 23 21 26 34 39 27
Solutions:
Step 1: Find the highest and the lowest value H=39, L=6
Page 8 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Step 6: Find the upper class limit; e.g. the first upper class=12-U=12-1=11
11, 17, 23, 29, 35, 41 are the upper class limits.
So combining step 5 and step 6, one can construct the following classes.
Class limits
6 – 11
12 – 17
18 – 23
24 – 29
30 – 35
36 – 41
Class boundary
5.5 – 11.5
11.5 – 17.5
17.5 – 23.5
23.5 – 29.5
29.5 – 35.5
35.5 – 41.5
Step 9: Write the numeric values for the tallies in the frequency column.
Page 9 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Class Class boundary Class Tally Freq. Cf (less Cf (more rf. rcf (less
limit Mark than than type) than type
type)
6 – 11 5.5 – 11.5 8.5 // 2 2 20 0.10 0.10
12 – 17 11.5 – 17.5 14.5 // 2 4 18 0.10 0.20
18 – 23 17.5 – 23.5 20.5 ////// 7 11 16 0.35 0.55
24 – 29 23.5 – 29.5 26.5 //// 4 15 9 0.20 0.75
30 – 35 29.5 – 35.5 32.5 /// 3 18 5 0.15 0.90
36 – 41 35.5 – 41.5 38.5 // 2 20 2 0.10 1.00
These are techniques for presenting data in visual displays using geometric and pictures.
Importance:
-The three most commonly used diagrammatic presentation for discrete as well as qualitative
data are:
Pie charts
Pictogram
Bar charts
Pie chart
- A pie chart is a circle that is divided in to sections or wedges according to the percentage of
frequencies in each category of the distribution. The angle of the sector is obtained using:
Page 10 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Solutions:
Step 3: Using a protractor and compass, graph each section and write its name corresponding
percentage.
Pie Chart
Boys,15%
Men,25%
-In these
diagrams,
we represent
data by
means of
some picture symbols. We decide about a suitable picture to represent a definite
number of units in which the variable is measured.
Page 11 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Populatio
n
Bar Charts:
- A set of bars (thick lines or narrow rectangles) representing some magnitude over time space.
- They are useful for comparing aggregate over time space.
- Bars can be drawn either vertically or horizontally.
- There are different types of bar charts. The most common being :
Solutions:
30
25
Sales in $
20
15
10
5
0
A B C
product
Page 12 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
-The bars represent total value of a variable with each total broken in to its component parts and
different colours or designs are used for identifications
Example:
Draw a component bar chart to represent the sales by product from 1957 to 1959.
Solutions:
100
80
Sales in $
Product C
60
Product B
40
Product A
20
0
1957 1958 1959
Year of production
Example:
Draw a component bar chart to represent the sales by product from 1957 to 1959.
Page 13 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Solutions:
60
50
40
Sales in $
Product A
30 Product B
20 Product C
10
0
1957 1958 1959
Year of production
Histogram
A graph which displays the data by using vertical bars of various height to represent
frequencies. Class boundaries are placed along the horizontal axes. Class marks and class limits
are some times used as quantity on the X axes.
Page 14 of 16
Lecture notes on Introduction to Statistics Chapter 2 METHODS OF DATA PRESNTATION
Frequency Polygon:
- A line graph. The frequency is placed along the vertical axis and classes mid points are placed
along the horizontal axis. It is customer to the next higher and lower class interval with
corresponding frequency of zero, this is to make it a complete polygon.
Example: Draw a frequency polygon for the above data (example *).
Solutions:
8
4
Value Frequency
0
2.5 8.5 14.5 20.5 26.5 32.5 38.5 44.5
Example: Draw an ogive curve(less than type) for the above data.(Example *)
Page 15 of 16