0% found this document useful (0 votes)
67 views14 pages

Data Gathering Sampling Techniques 1

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
67 views14 pages

Data Gathering Sampling Techniques 1

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Statistics Chapter 2 - Collection of Data

Objectives of the Lesson


At the end of the lesson, you should be able to:
1. differentiate the types of data
2. tell the differences among the different methods of collecting data
3. determine the sample size
4. identify the appropriate sampling technique to be used in gathering data

Daily, we come across different kinds of information, data, facts, and figures
from various communication and information media. Some current examples are:
• surveys conducted by SWS on ratings of public offices and officials or opinions
of the public on issues using a sample of 1200 respondents
• the daily data on Philippine COVID-19 cases provided by the Department of
Health (such data include new cases, fully recovered, and deaths added to the
previous total cases) where the active cases are categorized into
asymptomatic, mild, severe, and critical with corresponding percent.
These data are also the basis for the prediction of UP experts to project the
total cases up to a certain period, which are seemingly accurate.
• weather conditions and forecasts
• the clinical trials for COVID-19 vaccines
• reports of DOTr on the number of commuters concerning the number of public
transportations to be allowed to operate in the quarantine periods
• reports on the stock market situation
• the estimated funds of Philhealth that were lost due to corruption

Data Gathering

Types of Data

There are two types of data, namely:


1. Primary data. These are data or information gathered by the researcher
from first-hand sources, like government offices, private organizations, business
establishments, or individuals with first-hand information about the needed data.

2. Secondary data. These are data or information obtained from published


or unpublished sources like newspapers, magazines, journals, books and theses,
and other republished materials

For example, the details in a vehicular accident gathered by a policeman who


interviewed the victims are primary data. The viewers who watched the news about
this incident are getting only secondary data.

Methods of Collecting Data


1. The Direct or Interview Method. This method involves the interviewer (the
person conducting the interview/the researcher), and the interviewee (the
person from whom data is being gathered). This method can provide the
researcher with the opportunity to ask more questions to the interviewee to
obtain all the information that he needs. He can also make clarifications, if
necessary. Questions can be repeated or rephrased for a clearer
understanding of the person being interviewed. The interviewer has to see
to it that he does not influence the responses of the interviewee in any
manner. However, this method is more time consuming and more costly.

Two forms of Interviews

1. Structured. The questions are closed-ended and are already prepared


and asked in the same wordings.
2. Unstructured. In this form, there are no prepared questions at hand
before the interview.

2. The Indirect or Questionnaire Method. This method utilizes


questionnaires to be answered by the respondents. These questionnaires
must be carefully prepared so that the respondents can clearly understand
the directions and the questions to give honest responses. Many
researchers use this method since it is less time-consuming. It is less
expensive because the questionnaires can be reused. A limitation of this
method is that it can only be used for literate persons.

These are the two forms of questions.

1. A fixed-alternative question is a question where the possible responses


from which the respondent is to choose his answer are given.

Example: How often do you read the newspaper?


O once a day O when there is an assignment
O every day O every other day
O once a week O never at all

2. An open-ended question allows the respondent to express his answer


freely.
Examples: 1) How much longer do you think it would take for the pandemic to
end?
2) What are the common problems that students would encounter
in using online mode of learning, and what can you suggest
to remedy these?

Other methods of collecting data are as follows:

1. The observation method is used when gathering data about the behavior of
individuals in the study.

2. Experimentation Method. In this method, the researcher observes the effect


of a variable on other variables. The independent variable of the study is the
variable that is manipulated to see its effect/s on the dependent variable (the
variable that may change).

3. Registration Method refers to the continuous, permanent, compulsory


recording of the occurrence of significant events and certain identifying or
descriptive characteristics concerning them, as provided through the civil code,
laws, or regulations of each country.
The vital events may be live births, fetal deaths, deaths, marriages, divorces,
judicial separations, annulments of marriage, adoptions, recognitions
(acknowledgments of natural children), legitimating.

4. Texting Method. In this method, the researcher gathers data in the survey
being conducted through text messages.

Determining the Sample Size

In conducting a study, the researcher must consider the time element and the cost
involved to complete the study. This is why most researchers make use of a sample
(the representative of the population and possesses the characteristic of the
population) instead of the population (the entirety of objects, individuals, events, or
things). Slovin's formula is used to determine an appropriate sample size from the
population.
𝑵𝑵
The Slovin's formula is n =𝟏𝟏+𝑵𝑵𝒆𝒆𝟐𝟐 Formula 1

where: n = sample size; N = population size ; e = margin of error

The margin of error shows how reliable the result of the survey is. A small
margin of error means that it is more likely that the results of the survey are true for
the population.

Example 1. A group of environmentalists is conducting a survey on the opinions


of people regarding the putting of dolomite sand in and around Manila Bay. If the
population of the study is 6000 residents in Metro Manila and the margin of error to be
used is 5%, what would be the sample size?

Solution:
Given: N = 6000 e = 5% = 0.05
𝑁𝑁 6000
n= = = 375
1+𝑁𝑁𝑒𝑒 2 1+(6000)(0.05)2

Example 2. Using example 1, what will be the sample size if the margin of error
is 8%?
𝑁𝑁 6000
n= = = 152.28 = 152
1+𝑁𝑁𝑒𝑒 2 1+(6000)(0.08)2

Did you notice that the bigger the margin of error, the smaller the
sample size becomes?

Example 3. Another researcher wants to conduct the same survey. However,


due to time constraints and limited budget, he will be using 100 respondents only.

To solve for the margin of error (e) in this example, the formula to be used is

𝑵𝑵−𝒏𝒏
e = � 𝒏𝒏𝒏𝒏 Formula 1a

In the formula, e is the margin of error, N is the population, and n is the sample size.
Solution:
Given: N = 6000 n = 100

𝑁𝑁−𝑛𝑛 6000−100
e=� = � = 0.09916 𝑜𝑜𝑜𝑜 9.92%
𝑛𝑛𝑛𝑛 100(6000)

Sampling Techniques

Do you still remember the difference between a population and a sample?


Now let us explore sampling.

Sampling is the process of choosing the samples from a population.

The two kinds of selecting/drawing samples are

1) Probability Sampling (or random sampling) is a sampling technique


where every member of the population has an equal chance of being selected to be
part of the sample. For example, if there are 200 members in the population, each of
these 200 can be part of the survey/study.

2) Non-Probability Sampling is a sampling technique where not all of the


members of the population have a chance to be included in the sample. For example,
a researcher is conducting a survey in a Barangay. He did not include in his survey
those who live far from the main road. Those people were not given a chance to be
part of the survey.

Probability Sampling

Here are the kinds of probability sampling.

1. Simple Random Sampling. This probability sampling is the simplest


among the types of probability sampling. In this method, numbers are assigned to the
members of the population. Numbers are drawn, and the element of the population
whose number is drawn becomes a part of the sample. This method is also known as
the fishbowl or lottery technique.

Example: A survey is to be conducted among the grade 11 students of a


school. There are 200 grade 11 students. A students' list is prepared, and pieces of
papers numbered 1 to 200 are placed in a container. The researcher draws papers
from this container. Students whose numbers are drawn will become part of the
sample in the survey. The Table of Random Numbers or a number generator can also
be used in drawing the sample from the population.

2. Systematic Random Sampling is a method where every nth or kth


element in the population list is selected in obtaining the sample needed.

Example:
Given: N= 1400 and n = 141
Step 1. Determine k (sampling interval) by dividing the population by the
sample size.
𝑵𝑵 1400
𝒌𝒌(𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠 𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖𝑖) = = = 9.93 𝑜𝑜𝑜𝑜 10 (This means that every
𝒏𝒏 141
th
10 element in the population list will be included in the sample until 141
samples are obtained.)

Step 2. The random start can be determined by choosing from numbers 1,


2, 3, 4, 5, 6, 7, 8, 9, and 10 (k=10). This can be done by lottery,

If number 4 happens to come out, then the random start is 4.

random start: 4 (the 4th member of the population is included in the sample)

The second number is obtained by adding k to the random start.


4 + k = 4 + 10 = 14
nd
The third number is the 2 number + k.
14 + k = 14 + 10 = 24

Repeat the procedure until the desired sample size is obtained.

These are the first 20 numbers of the samples: 4, 24,34, 34, 44,54, 64, 74, 84,
94, 104, 114, 124,134, 144, 154, 164, 174, 184, 194

3. Stratified Random Sampling is done by splitting the population into groups


or categories. The samples to be chosen from the groups must be proportional to the
size of the group. This means fewer samples will be taken from smaller groups, and
more samples will be taken from bigger groups.

Example:
Given: N = 4370 patients; n = 151
Male Patients – 2734; Female Patients – 1636

Step 1. Divide the sample by the population.


𝒏𝒏 151
𝒑𝒑(𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝 𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎) = = = .034554
𝑵𝑵 4370

Step 2. Multiply the result obtained in step 1 (p) by the size of each group to get
the number of samples to be taken from that group.

Category No. of patients p Sample Size (n)


Male 2734 . 034554 94
Female 1636 . 034554 57
Total 4370 151

4. Cluster Sampling is used to randomly select the samples from a population


spread out over a wide geographical area. The cluster is used as a sampling unit,
meaning that all the individuals in that cluster will be included in the sample. This is
sometimes called area sampling.

A cluster is a group where the objects or individuals in the group are more
similar to each other as compared to those from other groups.

Example: A sample of 100 health workers in Malolos will be chosen as


respondents in a study. The researcher may consider the barangays as
clusters. He can select the clusters randomly. After he has chosen the
barangays, he can now include all the health workers belonging to the
chosen barangays as part of the sample.

5. Multi-Stage Sampling is done by utilizing a combination of sampling


techniques. This is used when drawing samples from a huge population.

Example: A survey is to be conducted on the opinion of the beneficiaries of the


Social Amelioration Program (SAP) in the province of Bulacan. Randomly
select/draw the municipalities in the province. Then randomly choose the
barangays in the chosen municipalities. Lastly, randomly choose the
beneficiaries in the chosen barangays.

Non-probability Sampling

The non-probability sampling has three kinds.

1. Quota Sampling is a sampling technique where the population is


divided into categories, just like in stratified sampling. But there is no required sampling
frame that must be used in determining the samples of the study.

Example: A researcher decided to interview 50 audiences during the concert


of a K-pop group at the Philippine Arena. Since he already knows that most
of the viewers are female, he decided to have 80% female respondents and
20% male respondents. So he approached and interviewed 40 female
audiences and ten (10) male audiences.

2. Purposive Sampling is done by selecting a sample based on the


purpose or needs of the study.

Example: A researcher would like to know why grade 10 students preferred to


transfer to public schools this school year. For this purpose, he will interview
only grade 10 transferees.

3. Incidental/Accidental/Convenience Sampling. This sampling


technique is often used in market research. People that are easy to reach to get a
quick response are the ones who are chosen to become samples in the study.

Example: A researcher wanted to now the acceptance of consumers of a brand


of disinfectant. He will possibly interview his friends, relatives, neighbors, or
go to public places nearby and conduct an interview.
Activity 4.1
Collection of Data

Name: _______________________________ Score:__________________


Program/Year/Section: ______________ Descriptive Rating: _______
Date Accomplshed: _________________ Instructor: ______________

I. Identify whether the source of data is primary or secondary. Write P if it is a primary


source and S if it is a secondary source.

___1. A written commentary from other researchers


___2. Data and information downloaded from the internet
___3. Information from the birth certificate
___4. An interview with the health officials concerning the effectiveness of the COVID-
19 vaccine
___5. Student’s online evaluation of the performance of the faculty
___6. A TV host was asked about the status of the entertainment industry.
___7. There is a price adjustment on our electric bill according to the latest news.
___8. Students of BS-Biology gathered important features of their research from the
data published in the newspapers.
___9. A weatherman reports the development of a typhoon signal in the affected area.
___10. A student of BS Psychology is directly observing the behavior of patients with
depression.

II. Complete the table below. (show your complete solution)

Problem Population Sample Margin of


size Size error
N n e
1. Given 1,500 as the population size
and 3% margin of error. Find the
desired sample size.

2. What should be the margin of error


when the population size is 5,000,
and the sample size is 400?

3. Suppose the population size is


11,250, and the margin of error is
8%. Find the sample size.
III. Determine the sample size for each of the following problems.

1) The College of Science surveys its graduating students to determine the starting
salaries and the job market for the new graduates. The population of graduating
students according to majors is recorded as follows: BS Bio (85), BSM- CS (102),
BSM- BA (78), and BSM- AS (75). If you would like to select 100 samples, how
large must a sample be for each major using proportionate allocation?

Courses Number of Proportion Samples


students
BS Biology 85
BSM- CS 102
BSM- BA 78
BSM- AS 75
Total 100

2). The event organizer plans to give 20 complimentary tickets to the presidents of the
school organizations so they can attend the upcoming event for free. Since the tickets
will be given only to 20 out of 100 presidents, the event organizer decided to choose
a systematic random sampling procedure. Let the random start be at number three
(3).

1. 6. 11. 16.
2. 7. 12. 17
3. 8. 13. 18.
4. 9. 14. 19.
5. 10. 15. 20.

End of Activity

Self-Check: Turn to page 131 for the Answer Key. Write your score and descriptive
rating on the space provided.

Score Descriptive Rating

31 - 45 Very Good
16 - 30 Good
1 - 15 Practice More

Lesson Check-Up Test 4.1

Student Name: ________________________ Score: ___________________


(Surname, First Name, MI)
Program/Year/Section: ____________ Descriptive Rating: __________
Date Accomplished: ______________ Instructor: _________________

Time Allotment: 1 hours Time Started: _______ Time Finished: __________

Write the capital letter of the correct answer on the space provided before each
number.

___1) Which of the following is NOT a probability sampling method?


A. Cluster sampling C. quota sampling
B. Systematic random sampling D. stratified sampling
___2) A health official wants to test if the water in a well is safe to drink or not, so he
takes one liter of it for water testing. In this situation, which is considered the
population?
A. health official C. water testing
B. water in the well D. one liter of water
___3) From statement number 2, which is considered the sample?
A. health official C. water testing
B. water in the well D. one liter of water
___4) It is a set of individuals or objects collected or selected from a larger population
using a pre-defined selection procedure.
A. Nominal C. Ordinal
B. Ratio D. Sample
___5) If you need to know the quality of several boxes of apples, how can you facilitate
the quality test?
A. By the process of organizing the items
B. By the process of sampling
C. By the process of cluster sampling
D. By the process of random sampling
___6) When a doctor examines a patient’s blood, they extract only a few cubic
centimeters of blood to determine their illness. Which phrase in the statement
represents the sample?
A. a few cubic centimeters of blood C. a patient’s blood
B. causes the patient’s illness D. cubic centimeters
__7) It refers to the set of people, objects, measurements, or happenings that belong
to a defined group.
A. variable C. population
B. statistics D. sampling
___8) These are information gathered by the researcher.
A. variable C. population
B. data D. sample
___9) Which data gathering is most appropriate in this situation: “If people are taking
vitamin C regularly, it can prevent them from the risk of iron deficiency.”
A. Census C. Sample Survey
B. Experimentation D. Observation
___10). A researcher plans to conduct a survey. Using a 3% margin of error, compute
the target population when the required sample size is 400.
A. 600 C. 650
B. 525 D. 625
___11). It is a sampling procedure where every element of a population is given an
equal chance of being selected as a member of the sample.
A. Probability C. Convenience
B. Non-probability D. Purposive
___12) A class president is conducting a survey in her section and decide that after
randomly selecting a start point, she will sample every 7th on the class list. What
type of sampling will she be using for the survey?
A. Systematic C. Simple random
B. Cluster D. Stratified
___13). When an instructor surveys the perceptions of students on the quality of life
on the campus, she gets only 50% of the total number of students currently
enrolled in the university to participate in the survey. Which constitutes the
population?
A. the quality of life in the campus
B. the total number of students currently enrolled
C. 50% of the total number of students
D. the perceptions of the students
___14) Which phrase describes the sample from item number 13?
A. the quality of life in the campus
B. the total number of students currently enrolled
C. 50% of the total number of students
D. The perceptions of the students
___15). This type of sampling method is applied if the population is too big. The
respondent population is divided into equal groups of elements.
A. Stratified sampling C. Cluster
B. Quota sampling D. Systematic
___16) In this method, a researcher obtains data by asking the respondents certain
information about facts and events which they may know.
A. Indirect method C. Registration method
B. Direct method D. Observation method
___17). This type of sampling method is done by creating different classes or strata
within the population.
A. Quota C. Stratified
B. Convenience D. Purposive
___18) A group of researchers was tasked by the Department of Education to survey
whether students in the National Capital Region favor using Flexible Modes of
Learning. If there are 2,000,000 students and a 5% margin of error is expected,
compute the sample size.
A. 380 C. 405
B. 420 D. 400
___19) In this type of sampling, the selection of informants is intentional and based on
their ability to explain a specific topic, idea, or situation.
A. Purposive C. Quota
B. Convenience D. Systematic
__20) A researcher wants to determine the effect of the implementation of flu
vaccination in community pharmacies. It is best if he will use the ____ method
of collecting data.
A. Experimentation C. Observation
B. Registration D. Questionnaire
___21) A type of non-probability sampling involves the sample being drawn from the
part of the population that is near or easy to reach.
A. Stratified C. Convenience
B. Purposive D. Cluster
___22) What is the margin of error if the population is 5,000 and the sample size is
150?
A. 8% C. 10%
B. 7% D. 9%
___23) A non-probability sampling method where the survey population is divided into
mutually exclusive subgroups that are of great interest to the study.
A. Purposive C. Cluster
B. Convenience D. Quota

___24) Which among the following sources refers to secondary data?


A. Data that is collected by a researcher
B. Data that an individual obtains directly by asking questions
C. Data that is collected first hand
D. Data that can be read in newspapers and magazines
___25). A doctor is watching a patient after giving a shot of injection. Identify the most
appropriate method to be used by the researcher.
A. Interview C. Experimentation
B. Questionnaire D. Observation
___26) The following are primary sources of statistical data except
A. encyclopedia C. organization
B. government institutions D. business agencies
___27) A procedure used to determine the individuals or members of a sample.
A. Presentation of data C. Collection of data
B. Sampling D. Observation
___28) Which of the following is not classified as sample?
A. three hundred prison inmates
B. monthly food budget of 25 families
C. selected senior students
D. all owners of food companies
___29) When a housewife buys a sack of rice, she examines only a handful of rice
from the sack to find out whether it is of good quality or not. Which phrase
represents the population?
A. housewife C. handful of rice
B. the sack of rice D. good quality
___30) There are 400 students in a certain school in which 140 are male, and 260 are
female. The school needs 12 students from each. Identify the sampling
procedure used in this situation.
A. stratified C. simple random
B. systematic D. cluster
___31) A researcher plans to get 588 sample units from a population N using a 4%
margin of error. What is the value of the population?
A. N = 9,900 C. N= 9,932
B. N = 9,392 D. N = 9,239
___32) From a population of 2500, estimate the amount of margin of error (e) if the
sample size is 100.
A. e = 11% C. e = 8%
B. e = 10% D. e = 12%
For numbers 33- 37, determine the method of collecting data in the following
problems:
A. Interview B. Questionnaire C. Observation
D. Experimentation E. None of these

___33) A study to find out if a new brand of medicine is effective in curing COVID-19.

___34) The security guard of the school monitors the number of students wearing their
IDs in entering the gate.
___35) A study to determine the cultures of Aetas in Pampanga.
___36) An inquiry on students’ experience with online classes in a particular school.
___37) A businessman was asked directly about his assessment of the country’s
economy.
___38) What happens to the sample size if the margin of error is increased?
A. The sample size gets bigger.
B. The sample size would equal the population.
C. The sample size gets smaller.
D. The sample size remains the same.
___39) Which of the following is not a primary source of data?
A. The data on COVID-19 cases as gathered directly from the Department of
health
B. Information on the economic status of the Philippines was gathered from the
newspaper
C. Forex rate data personally gathered from the Forex trading floor
D. Information from the registration record of the Businessman’s Association
___40). At a 3% margin of error, a researcher used 700 students from a public school
as a sample. What is the student population of the school?
A. 1829 C. 1982
B. 1895 D. 1892

End of Test

What you have answered is a Self-Check Test. Turn to page 132 for the answer
key. Write your score and the corresponding descriptive rating in the space provided
for using the score guide below:

31 - 40 Very Good. You are a focused learner. Continue the good work.
21 - 30 Good. You can achieve better if you are more focused.
11 - 20 Somewhat Good. Exert more effort to focus on the lessons.
1 - 10 Fair. It is best if you go back to the lessons where you missed the
correct answers. You can achieve better if you try harder.

You might also like