0% found this document useful (0 votes)
6 views

Data

This document covers the importance of data collection, organization, and presentation in statistics, emphasizing the data handling cycle. It outlines objectives for learning statistical data gathering, organization in frequency tables, and appropriate graphical representation. The document also details types of data, sampling techniques, and methods for organizing and presenting data effectively.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

Data

This document covers the importance of data collection, organization, and presentation in statistics, emphasizing the data handling cycle. It outlines objectives for learning statistical data gathering, organization in frequency tables, and appropriate graphical representation. The document also details types of data, sampling techniques, and methods for organizing and presenting data effectively.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 26

TOPIC: DATA COLLECTION, ORGANIZATION, AND

PRESENTATION

I. INTRODUCTION

Statistics is a valuable part of the scientific world as it allows us


to interpret what has happened in the past, so that we can foresee
what is possible to happen in the future, and plan for it. A vital part of
working with statistics is collecting data and/or information, recording
it, and then presenting it so that it can be understood effortlessly and
used by other people. This process is called data handling.

In this module, you will learn more about data collection,


organization, and presentation of data as the first process of the data
handling cycle.

II. OBJECTIVE/S
At the end of this module, you will be able to:
 gather statistical data.
 organize data in a frequency distribution table.
 use appropriate graphs to represent organized data: pie chart,
bar graph, frequency polygon, histogram, and ogive.

III. CONCEPT SUMMARY


 Data handling is gathering and recording information, and the
presenting it in a way that can be understood easily and used by
other people.
 Data are used to undertake particular problems or to provide a
basis which certain decisions are generated.
 Sampling is a technique of selecting individual members or a subset
of the population to make statistical inferences from them and
estimate characteristics of the whole population.
 Data presentation also needs planning and presentation. If data are
properly and interestingly presented, the benefits will not only go to
the readers or users out more so to the statisticians who will make
the analysis and interpretation of the data gathered.

IV. BRIEF DISCUSSION


1. Concept
Development DATA
COLLECTION

Data are needed whenever we undertake studies or researches.


They are used to undertake problems or to provide a basis which
certain decisions are generated. The next step after the problem
has been defined in the study is the data collection.

Types of Data

a. Primary Data
These are the data collected by the research himself. It can be
more effective and informative if given the necessary
preparation and planning. These are first – hand or original
sources. They can be collected through the following:

 Observation and Measurement


It involves looking at something that actually
happens, and then measuring and recording
it.

e.g. counting the number of cars


passing Guadalupe bridge.
(Observation) measuring the
masses of all the employees in a
company. (Measurement)

 Interview
It usually takes place between two people.
One of the people is called the interviewer
and the other is the interviewee or
respondent. We use interviews when we
can talk to the respondents directly.

e.g. interview people leaving a shop to find out


whether they were happy with the way they were
treated by the people working in the shop.
Job and Parent – Teacher interviews
 Questionnaire
It is a set of questions given to many people to
complete. It useful for getting information from
many people, as it can be handed out and then
collected later, and does not need an
interviewer. It is the best method of collecting
data when you want to ask a large group of
people what they think about a specific
issue. A
questionnaire should have a short explanation of what your
research is about.

e.g. National Population Census in 2012 in Nigeria

 Database or Registry
It is an organized collection of data that
someone else has already organized and
presented.

e.g. birth, death, and marriage certificate

b. Secondary Data
This are information taken from published or unpublished
materials previously gathered by other researchers or agencies
such as book, newspapers, magazines, journals, published and
unpublished thesis and dissertations.

Quick Check!
Identify the best way to collect data given the following problems.

1. Johanna wants to know if her baby is crying too much.


2. Michael wants to find out his friends’ favorite television
program.
3. Pearl wants to know how the price of cooking oil has increase
over the last ten years.
SAMPLING TECHNIQUES

One of the most important parts of the research work that needs
preparation and planning is choosing the right and appropriate
sampling method. Any sampling procedure that produces an
inference that underestimates what is biased and erroneous.
Sampling is a technique of choosing individual members or a
subset of the population to make statistical inferences from them
and estimate characteristics of the entire population. Various
sampling methods are generally used by researchers in research so
that they do not need to research the whole population to collect
actionable insights. It is also a time –convenient and a cost –
effective method and hence forms the foundation of any research
design. Sampling techniques can be used in a research survey
software for optimum derivation.

Types of Sampling: Sampling Methods

There are two types of sampling techniques: probability sampling and


non –
probability sampling.

a. Probability sampling
It is a sampling technique where a researcher sets a selection of
a few criteria and chooses members of a population randomly. All
the members have an equal opportunity to be a part of the
sample with this selection parameter. The following are the uses
of probability sampling:

 Reduce Sample Bias


 From a population where the bias in the sample is
derived is negligible to non – existent.
 The selection of the sample mainly shows the
understanding and the inference of the researcher.
 It leads to higher quality data collection as the sample
properly represents the population.

 Diverse Population
 When the population is immense and varied, it is vital to
have suitable representation so that the data is not skewed
towards one demographic.
 Create an Accurate Sample
 Probability sampling helps the researchers plan and
create an accurate sample.

Types of Probability Sampling

 Simple Random Sampling


It is a reliable method of obtaining information where every
single member of a population is chosen randomly, merely by
chance. It is one of the best probability sampling techniques
that helps in saving time and resources.

 Cluster Sampling
It is a method where the researchers divide the entire
population into sections or clusters that represent a
population. Clusters are identified and included in a sample
based on demographic parameters like age, sex, location, etc.
This makes it very simple for a survey creator to derive
effective inference from the feedback.

 Systematic Sampling
Researchers use this method to choose the sample members
of a population at regular intervals. It requires the selection of
a starting point for the sample and sample size that can be
repeated at regular intervals. This type of sampling method
has a predefined range, and hence this sampling technique is
the least time – consuming.

 Stratified Random Sampling


It is a method in which the researcher divides the population
into smaller groups that don’t overlap but represent the entire
population. While sampling, these groups can be organized
and then draw a sample from each group separately.
Quick Check!
Classify each sample as random, systematic, stratified, or
cluster.

1. In a large sports arena, all spectators from two buildings


are interviewed to determine whether they believe
basketball players have more shooting practices to do
now than in previous years.
2. Every twenty – first customer entering a food chain is
asked to select his or her favorite meal for lunch.
3. Circuit – board production associates are selected using
random numbers in order to determine annual salaries.
4. Barangay officials in Metro Manila are divided into four
groups according to gender (male or female) and according
to whether they prefer day or night duties. Then 30 are
selected from each group and interviewed to determine

b. Non – probability sampling


The researcher chooses members for research at random. This
sampling method is not a fixed or predefined selection process.
This makes it difficult for all elements of a population to have
equal opportunities to be included in a sample. The following are
the uses of non – probability sampling:

 Create a Hypothesis
 It is used to create an assumption when limited to
no prior information is available.
 This method helps with the immediate return of data and
builds a base for more research.

 Exploratory Research
 This is widely use when conducting qualitative research,
pilot studies, or exploratory research.

 Budget and Time Constraints


 The survey design is not rigid.
 It is easier to pick respondents at random and have them
take the survey or questionnaire.

Types of Non – Probability Sampling


 Convenience Sampling
This kind of method is reliant on the comfort of access to
respondents such as surveying customers at a mall or
passers-by on a busy street. It is usually named as
convenience sampling, because of the researcher’s ease of
carrying it out and getting in touch with the subjects.
Researchers have nearly no power to select the sample
elements, and it’s purely done based on nearness and not
representativeness. It is used when there are time and cost
limitations in collecting feedback. In situations where there
are resource limitations such as the initial stages of research,
convenience sampling is used.

 Judgmental or Purposive Sampling


This are formed by the discretion of the researcher.
Researchers purely consider the purpose of the study, along
with the understanding of the target audience.

 Snowball Sampling
It is a sampling method that researchers apply when the
subjects are difficult to trace. Researchers also implement this
sampling method in situations where the topic is highly
sensitive and not openly discussed

 Quota Sampling
The selection of members in this sampling technique happens
based on a pre-set standard. In this case, as a sample is
formed based on specific attributes, the created sample will
have the same qualities found in the total population. It is a
rapid method of collecting samples.
DATA ORGANIZATION

After data has been collected, the next step is data


organization, by bringing it together in a systematic way that
makes it easier to read. You can organize data by using tallies and
frequency tables.

Raw data are data collected in an investigation and they are not
organized systematically. Raw data that are presented in the form
of a frequency distribution are called grouped data.

Methods of Data Organization


Using this data, let’s discuss the methods of data organization. The following are
the ages of the previous customer in a certain grocery store:

58 50 52 38 80 62 77 56
60 61 58 62 51 36 54 18
71 54 44 52 26 63 58 56
41 34 61 50 60 53 62 62
53 43 63 71 65 79 45 66

Stem and Leaf Diagram


Setting up the data into this diagram, the number (raw data) is broken into tens
and unit digits are tallied together whose values share with the tens digits. The
tens digit is the stem and the unit digit is the leaf.

1 8
2 6

3 4 6 8
4 1 3 4 5
5 0 0 1 2 2 3 3 4 4 6 8 8 8
6 0 0 1 1 2 2 2 2 3 3 6
7 1 1 7 9
8 0
a. Frequency Distribution
A frequency distribution refers to summarizing a large data set
into a small number of intervals. These are the steps in creating
a frequency distribution:

Step 1. Sort the data in ascending order. (see the stem – and –
leaf diagram)

Step 2. Calculate the range.

Range = Highest Observed Value – Lowest Observed


Value Range = 80 – 18 = 62

Step 3. Decide on the number of intervals in the frequency

distribution. If the number of classes is given:


Range
Class Interval =
Number of
Classes

If the number of classes is not given:


Range
Class Interval =
1 + 3.322 log N

where N is the number of observations.


62
Class Interval =
1 + 3.322 log 40 (1.6020) = 9.8 = 10

Step 4. Determine the intervals starting with the lowest


observed value until you reached the range which the highest
observed value is included.

18 – 27
28 – 37
38 – 47
48 – 57
58 – 67
68 – 77
78 – 87
Step 5. Tally and count the observations under each interval.

Classe Tallying Frequenc


s y
18 – 27 II 2
28 – 37 II 2
38 – 47 IIIII 5
48 – 57 IIIII – IIIII 10
58 – 67 IIIII – IIIII – IIIi 14
68 – 77 III 3
78 – 87 II 2
38

Step 6. Solve for the relative frequency (rf).


Frequency
Relative Frequency =
Total Number of Observations

Classe Frequenc Relative


s y Frequency
18 – 27 2 2 / 38 = 0.05
28 – 37 2 2 / 38 = 0.05
38 – 47 5 5 / 38 = 0.131
48 – 57 10 10 / 38 = 0.26
58 – 67 14 16 / 38 = 0.42
68 – 77 3 3 / 38 = 0.078
78 – 87 2 2 / 38 = 0.05
38

Step 7. Solve for class mark (x).

Lower Limit + Upper Limit


Class Mark = 2

Classe Frequenc Relative Class Mark


s y Frequency
18 – 27 2 0.05 (18 + 27) / 2 =
22.5
28 – 37 2 0.05 32.5
38 – 47 5 0.131 42.5
48 – 57 10 0.26 52.5
58 – 67 14 0.42 62.5
68 – 77 3 0.078 72.5
78 – 87 2 0.05 82.5
38
Step 8. Solve for the class boundaries.

Lower Boundary = Lower Limit –


0.5 Upper Boundary = Upper Limit
+ 0.5

Classe f rf x LB UB
s
18 – 27 2 0.05 22.5 18 – 0.5 = 27 + 0.5 =
17.5 27.5
28 – 37 2 0.05 32.5 27.5 37.5
38 – 47 5 0..13 42.5 37.5 47.5
1
48 – 57 1 0.26 52.5 47.5 57.5
0
58 – 67 1 0.42 62.5 57.5 67.5
4
68 – 77 3 0.07 72.5 67.5 77.5
8
78 – 87 2 0.05 82.5 77.5 87.5
38

Step 9. Solve for the less and greater than cumulative frequency.

Classe f rf x LB UB <cf >cf


s
18 – 27 2 0.05 22. 17. 27. 2 36 + 2 =
5 5 5 38
28 – 37 2 0.05 32. 27. 37. 2+2=4 33 + 2 =
5 5 5 35
38 – 47 5 0.13 42. 37. 47. 4+5=9 19 + 5 =
1 5 5 5 24
48 – 57 1 0.26 52. 47. 57. 9 + 10 = 9 + 10 =
0 5 5 5 19 19
58 – 67 1 0.42 62. 57. 67. 19 + 14 = 4 + 14 =
4 5 5 5 33 18
68 – 77 3 0.07 72. 67. 77. 33 + 3 = 2+3=5
8 5 5 5 36
78 – 87 2 0.05 82. 77. 87. 36 + 2 = 2
5 5 5 38
38

The complete frequency distribution table for the given data is:

Classe f rf x LB UB <cf >cf


s
18 – 27 2 0.05 22. 17. 27. 2 38
5 5 5
28 – 37 2 0.05 32. 27. 37. 4 35
5 5 5
38 – 47 5 0.13 42. 37. 47. 9 24
1 5 5 5
48 – 57 1 0.26 52. 47. 57. 19 19
0 5 5 5
58 – 67 1 0.42 62. 57. 67. 33 18
4 5 5 5
68 – 77 3 0.07 72. 67. 77. 36 5
8 5 5 5
78 – 87 2 0.05 82. 77. 87. 38 2
5 5 5
38
assignment
Quick Check!
Create a stem – and – leaf diagram and frequency distribution
table for the following data:
46 3 4 5 5 2 2 4 55
7 5 2 6 6 5 7
29 4 3 5 3 4 5 3 49
0 1 0 3 2 9 7

To learn more about frequency distribution table, click this


link: https://www.youtube.com/watch?v=pg6LAwkeCk0

DATA PRESENTATION

Presentation of data also needs planning and presentation. If


data are properly and interestingly presented, the benefits will not
only go to the readers or users out more so to the statisticians who
will make the analysis and interpretation of the data gathered.

Forms of Data Presentation

 Textual Form
The data are incorporated in the text of the report.

 Tabular Form
The data are presented in rows and columns.

 Graphical Form
It is utilized for an easy to digest information. This comes in
graphs and diagrams.

Graphical Representation of Frequency Distribution

 Histogram
It is one of the most easily interpreted charts. It is a graphic
representation of a frequency distribution where adjoined
vertical rectangles are drawn on the horizontal axis with the
centers of the bases located at the class
marks. The class boundaries are plotted against the frequencies.
It is called a frequency histogram when frequencies are plotted
along the vertical axis against the class boundaries.

To know the process of creating a histogram, click this


link: https://www.youtube.com/watch?v=YLPDPglvePY

 Frequency Polygon
It is a closed figure of n sides constructed by plotting the class
marks against the frequencies. Connecting the said points will
make a close polygon with the x – axis, where the two ends of
the n – sides are connected thru two more class marks on the x –
axis.
To know the process of creating a frequency polygon, click
this link: https://www.youtube.com/watch?
v=y7Wb9AjbRjo
 Ogive
An ogive is a graph representing the class boundaries along the
horizontal axis and the corresponding cumulative frequencies
along the vertical axis. The two types of ogives are less than and
greater than ogives.

To know the process of creating an ogive, click this link:


https://www.youtube.com/watch?v=13QGqWIsLWk

 Pie Chart
It is a type of graph that displays data in a circular graph. The
pieces of the graph are proportional to the fraction of the whole
in each category.
To know the process of creating a pie chart, click this link:
https://www.youtube.com/watch?v=UrCz6yDbhb8

 Bar Graph
It is a chart or graph that presents categorical data with
rectangular bars with heights or lengths proportional to the
values that they represent. The bars can be plotted vertically or
horizontally.

To know the process of creating a bar graph, click this link:


https://www.youtube.com/watch?v=ReW4MPqXTvA

Learning Task
Construct three different graphs using the given data. Use a
different sheet if the space is not enough.
Score Frequenc
y
3–5 6
6–8 8
9 – 11 10
12 – 14
14
15 – 6
17
18 – 7
20
21 – 19
23
24 – 11
26
27 – 6
29
30 – 13
32
Graphs:

2. Broadening of Concepts
Directions: Write your answers on the space provided.

 Why is it important to know how to collect, organize, and present


data?

 Beside from the discussed graphical forms of data, are there


other ways to represent data graphically? If so, explain its
function.

 If you are to make a research about your class, what data you
are interested to investigate? Why? How is this going to help
your class?
3. Integrations
Directions: Write your answers on the space provided.

 Data can be presented in different ways such as textual, tabular,


or graphical. As a person you also present yourself in different
roles and you wear different hats like: you are a student, you are
a son/daughter, you are sister or brother, you are a best friend,
you are a citizen of our country, you are class officer etc. Which
of these identified roles you are doing very best and doing least
best? Why?

 What are the things you consider in presenting yourself?

 Does having a good presentation of yourself gives you a good


impression from others? In what way?

 If you are going to present yourself to God, what are the data
you will consider presenting and sharing to God and why?

V. EVALUATION/ ASSESSMENT
Directions: Using the given raw data, create the following:

58 1 3 1 2 6 5 1 6 40
0 0 3 4 1 5 7 7
15 4 2 6 2 3 2 4 2 18
1 0 2 2 1 0 3 0
96 9 7 3 3 3 4 9 5 89
4 1 3 0 3 1 1 8
85 5 4 7 4 2 4 3 8 29
8 2 9 4 0 0 4 8
70 5 5 5 6 9 7 7 8 71
7 2 7 4 5 3 5 9
67 9 6 8 3 9 6 5 6 30
6 6 0 8 1 9 2 2
a. stem – and – leaf diagram

b. range

c. class interval

d. frequency distribution table

e. histogram
f. ogive

g. frequency polygon

VI. CLOSURE/ SYNTHESIS


Directions: Check your status right now.

I need help and I can’t keep


working.
I need help, but I can keep
working.
I’m working fine.

How will you summarize our lesson for today?

You’re done with


Module 2!
VII. REFERENCES
Siyavula Technology – Powered Learning. Collecting and Organizing
Data.
Retrieved February 2021 from
https://intl.siyavula.com/read/maths/jss1/data-collection-and-
presentation/17-data-collection-and-
presentation#:~:text=Collecting%2C%20organising%20and
%20presentin g%20data,type%20of%20research%20being
%20done.
Finance Train. How to Construct a Frequency Distribution? Retrieved
February 5, 2021 from https://financetrain.com/how-to-construct-
a-frequency- distribution/
Question Pro. Types of Sampling: Sampling Methods with
Examples. Retrieved February 5, 2021 from
https://www.questionpro.com/blog/types-of- sampling-for-
social-research/
Video Citations
Connolly, D. (12 January 2015). Primary and Secondary Data. Retrieved
February 5, 2021 from https://www.youtube.com/watch?
v=zSdnd9MznPk
Key Differences (8 June 2020). Primary vs. Secondary Data: Difference
Between them with Definition and Comparison Chart. Retrieved
February 5, 2021 from https://www.youtube.com/watch?
v=0_HpiRFrNAk
Mays, S. (26 August 2011). Simple Random Sampling. Retrieved
February 5, 2021 from https://www.youtube.com/watch?
v=yx5KZi5QArQ
Mays, S. (26 August 2011). Stratified Sampling. Retrieved February 5,
2021 from https://www.youtube.com/watch?v=sYRUYJYOpG0
Mays, S. (26 August 2011). Cluster Sampling. Retrieved February 5,
2021 from https://www.youtube.com/watch?v=QOxXy-I6ogs
Mays, S. (26 August 2011). Systematic Sampling. Retrieved
February 5, 2021 from https://www.youtube.com/watch?
v=QFoisfSZs8I
Mays, S. (26 August 2011). Convenience Sampling. Retrieved February
5, 2021 from https://www.youtube.com/watch?v=MJjq0NILrnk
Pornsukjantra, P. (9 November 2016). Judgmental Sampling.
Retrieved February 5, 2021 from
https://www.youtube.com/watch?v=5PUOCDNgK3I
Primmyl (9 October 2016). Snowball Sampling. Retrieved February 5,
2021 from https://www.youtube.com/watch?v=lq8dQel2ZRI
Nadhira, S. (4 January 2017). Quota Sampling. Retrieved February 5,
2021 from https://www.youtube.com/watch?v=4vMy5oxgPiw
The Organic Chemistry Tutor (11 January 2019). Stem and Leaf Plots.
Retrieved February 5, 2021 from https://www.youtube.com/watch?
v=MUCvUgGfzdo
Don’t Memorise (20 December 2014). What is a Histogram? Retrieved
February 5, 2021 from https://www.youtube.com/watch?
v=YLPDPglvePY
The Organic Chemistry Tutor (15 January 2019). How to Make a
Frequency Polygon? Retrieved February 5, 2021 from
https://www.youtube.com/watch?v=y7Wb9AjbRjo
Jones, A. (3 July 2017). Constructing an Ogive. Retrieved February 5,
2021 from https://www.youtube.com/watch?v=13QGqWIsLWk
Mathscasts (11 December 2012). How to Draw a Pie Chart? Mathscast.
Retrieved February 5, 2021 from https://www.youtube.com/watch?
v=UrCz6yDbhb8
Icon Math (12 October 2016). Creating Bar Graphs. Retrieved February
5, 2021 from https://www.youtube.com/watch?v=ReW4MPqXTvA
The Organic Chemistry Tutor (9 January 2019). How to Make a
Relative Frequency Distribution Table? Retrieved February 5,
2021 from https://www.youtube.com/watch?v=gq3FPpm2yvA
Math Dot Com (29 April 2019). How to Find Class Boundaries, Class
Marks, and Cumulative Frequency? Retrieved February 5, 2021
from https://www.youtube.com/watch?v=pg6LAwkeCk0

You might also like