Chapter-2-Static of Data-1
Chapter-2-Static of Data-1
CHAPTER 2
Any scientific investigation requires data related to the study. The required data is
obtained from two sources called primary & secondary.
1. Primary Data
Data measured or collect by the investigator or the user directly from the
source.
Primary data are data originally collected for the immediate purpose. The
sources of primary data are the objects under study themselves and there is
also a direct contact between the investigator and the items (objects)
under investigation because of this it is more expensive.
2. Secondary Data
When an investigator uses data, which have already been collected by others, such
data are called "Secondary Data". Such data are primary data for the agency that
collected them, and become secondary for someone else who uses these data for
his own purposes.
The secondary data can be obtained from journals, reports, government
publications, publications of professionals and research organizations. Secondary
data are less expensive to collect both in money, cost and time.
Note:
Data which are primary for one may be secondary for the other.
Primary data are more expensive than secondary data.
In primary data collection, you collect the data yourself using methods such as
I. Questionnaire methods: it includes personal interview (face to face,
telephone) & mail interview.
II. Observation: It involves recording the behavioral patterns of people, objects
and events in a systematic manner.
III. Laboratory experiment: Conducting laboratory experiments on fields of
chemical, biological sciences and so on.
Having collected and edited the data, the next important step is to organize it. That is to
present it in a readily comprehensible condensed form that aids in order to draw inferences
from it. It is also necessary that the like be separated from the unlike ones.
The presentation of data is broadly classified in to the following two categories:
Tabular presentation
Diagrammatic and Graphic presentation.
Used for data that can be place in specific categories such as nominal, or ordinal. e.g. marital
status.
Example: a social worker collected the following data on marital status for 25
persons.(M=married, S=single, W=widowed, D=divorced)
M S D W D
S S M M M
W D S M M
W D D S S
S W W D D
Solution:
Since the data are categorical, discrete classes can be used. There are four types of marital
status M, S, D, and W. These types will be used as class for the distribution. We follow
procedure to construct the frequency distribution.
Step 3: Count the tally and place the result in column (3).
f
% * 100 Where f= frequency of the class, n=total number of value.
n
Percentages are not normally a part of frequency distribution but they can be added since they
are used in certain types diagrammatic such as pie charts.
Combing all the steps one can construct the following frequency distribution.
Each individual value is presented separately, that is why it is named ungrouped frequency
distribution.
-When the range of the data is large, the data must be grouped in to classes that are more than
one unit in width.
Definitions:
Example*:
11 29 6 33 14 31 22 27 19 20
18 17 22 38 23 21 26 34 39 27
Solutions:
Step 1: Find the highest and the lowest value H=39, L=6
Step 6: Find the upper class limit; e.g. the first upper class=12-U=12-1=11
11, 17, 23, 29, 35, 41 are the upper class limits.
So combining step 5 and step 6, one can construct the following classes.
Class limits
6 – 11
12 – 17
18 – 23
24 – 29
30 – 35
36 – 41
Class boundary
5.5 – 11.5
11.5 – 17.5
17.5 – 23.5
23.5 – 29.5
29.5 – 35.5
35.5 – 41.5
Step 9: Write the numeric values for the tallies in the frequency column.
Class Class boundary Class Tally Freq. Cf (less Cf (more rf. rcf (less
limit Mark than than type) than type
type)
6 – 11 5.5 – 11.5 8.5 // 2 2 20 0.10 0.10
12 – 17 11.5 – 17.5 14.5 // 2 4 18 0.10 0.20
18 – 23 17.5 – 23.5 20.5 7 11 16 0.35 0.55
//////
24 – 29 23.5 – 29.5 26.5 //// 4 15 9 0.20 0.75
30 – 35 29.5 – 35.5 32.5 /// 3 18 5 0.15 0.90
36 – 41 35.5 – 41.5 38.5 // 2 20 2 0.10 1.00
These are techniques for presenting data in visual displays using geometric and pictures.
Importance:
-The three most commonly used diagrammatic presentation for discrete as well as qualitative data
are:
Pie charts
Pictogram
Bar charts
Pie chart
- A pie chart is a circle that is divided in to sections or wedges according to the percentage of
frequencies in each category of the distribution. The angle of the sector is obtained using:
Valueofthepart
Angleof sec tor *100
thewholequantity
Solutions:
Step 3: Using a protractor and compass, graph each section and write its name corresponding
percentage.
CLASS
Boy s Men
Girls Women
Pictogram
-In these diagram, we represent data by means of some picture symbols. We decide
abut a suitable picture to represent a definite number of units in which the variable is
measured.
Bar Charts:
- A set of bars (thick lines or narrow rectangles) representing some magnitude over time space.
- They are useful for comparing aggregate over time space.
- Bars can be drawn either vertically or horizontally.
- There are different types of bar charts. The most common being :
Simple bar chart
Deviation o0r two way bar chart
Broken bar chart
Component or sub divided bar chart.
Multiple bar charts.
Solutions:
30
25
Sales in $
20
15
10
5
0
A B C
product
100
80
Sales in $
Product C
60
Product B
40
Product A
20
0
1957 1958 1959
Year of production
Solutions:
60
50
Sales in $
40 Product A
30 Product B
20 Product C
10
0
1957 1958 1959
Year of production
A graph which displays the data by using vertical bars of various height to represent frequencies.
Class boundaries are placed along the horizontal axes. Class marks and class limits are some times
used as quantity on the X axes.
Frequency Polygon:
- A line graph. The frequency is placed along the vertical axis and classes mid points are placed
along the horizontal axis. It is customer to the next higher and lower class interval with
corresponding frequency of zero, this is to make it a complete polygon.
Example: Draw a frequency polygon for the above data (example *).
Solutions:
8
4
Value Frequency
0
2. 5 8. 5 14 .5 20 .5 26 .5 32 .5 38 .5 44 .5