Data Collection, Analysis and Interpretation
Data Collection, Analysis and Interpretation
data.
➢ Systematic gathering of data for a particular purpose from various sources, that
has been systematically observed, recorded, organized.
➢ Data are the basic inputs to any decision making process in business.
➢ PURPOSE OF DATA COLLECTION:
➢ To obtain information
➢ To keep on record
➢ To make decisions about important issues
➢ To pass information on to others
➢ There are two types of data:
➢ Primary Data and Secondary Data
➢ PRIMARY DATA:
Data that is developed specifically for the purpose of study at hand.
➢ Sources Include:
Individual, group or organization, directly / indirectly related to your topic and
can provide data for testing of your hypothesis. Like:
Customers
Employees
Managers
People etc.
➢ SECONDARY DATA:
Data that was developed for some purpose, other than to solve the problem
at hand.
➢ Sources Include:
Newspaper
Journals
Magazine
Websites
Govt. publications
Other published data
❑ Population: whole area under observation.
❑ A finite group is called population whereas a non-finite (infinite) group is
called universe.
❑ Sample: part of population which represents the whole.
❑ A sample is a subset of a larger population of objects individuals, households,
businesses, organizations and so forth.
❑ Sampling enables researchers to make estimates of some unknown
characteristics of the population in question.
❑ Parameter: numerical characteristic of a population
❑ Statistic: numerical characteristic of a sample
❑ Two issues to satisfy representation of sample:
Size Of Sample:
➢ No hard & fast rule
➢ Depends upon many factors, Researcher has to decide
➢ For descriptive studies, 10% is advisable, if appropriate
✓ Budget and time Constraints (in case of large populations)
❖ A target population may be, for example, all faculty members in the
Department of Management Sciences, all pre-college students in Rawalpindi,
and all medical doctors in Pakistan
❖ The target group should be clearly delineated if possible, for example, do all
pre-college students include only primary and secondary students or also
students in other specialized educational institutions?
❖ The sampling frame is a list of all those population elements that will be used in
the sample
❖ Often, the list does not include the entire population. The discrepancy is often a
source of error associated with the selection of the sample (sampling frame
error)
▪ Every student at university whose first name begins with the letter “S”
▪ All child passengers under 18 years of age who are traveling in a train
from destination X to destination Y
▪ This is a technique which ensures that each element in the population has an
equal chance of being selected for the sample.
▪ Involves you selecting at random frame using either random number tables,
a computer or an online random number generator such as Research
Randomizer.
▪ All individuals must get EQUAL and INDEPENDENT chance of selection in the
sample.
▪ Example: Choosing raffle tickets from a drum, computer-generated
selections, random-digit telephone dialing, common example is balloting .
▪ The major advantage of simple random sampling is its simplicity
➢ Steps in Simple Random Sampling:
▪ Sampling in which individuals are selected from each strata randomly, with
the same proportion as they present in population.
➢ Stratified Random Sampling:
▪ The disadvantage lies therein, that it is more complex and information on the
strata may be difficult to obtain
➢ Stratified Random Sampling:
➢ Cluster Sampling:
▪ Sampling in which group(s), not individuals are selected randomly.
▪ Similar to stratified as you need to divide the population into discrete groups
prior to sampling. The groups are termed clusters in this form of sampling
and can be based in any naturally occurring grouping.
▪ For example, you could group your data by type of manufacturing firm or
geographical area.
▪ For cluster sampling your sampling frame is the complete list of clusters
rather than complete list of individual cases within population, you then
select a few cluster normally using simple random sampling.
▪ Data are then collected from every case within the selected clusters
➢ Cluster Sampling:
➢ Systematic Sampling:
▪ Systematic sampling involves you selecting the sample at regular intervals
from the sampling frame.
▪ Number each of the cases in your sampling frame with a unique number. The
first is numbered 0, the second 1 and so on.
▪ Select the first case using a random number.
▪ Calculate the sample fraction: Sampling in which individuals are selected
through ‘K’ number. K = Total population (List) / Sample size.
▪ Select subsequent cases systematically using the sample fraction to
determine the frequency of selection.
➢ Systematic Sampling:
▪ Example: From a list of 1500 name entries, a name on the list is randomly
selected and then (say) every 25th name thereafter.
▪ For systematic sampling to work best, the list should be random in nature
and not have some underlying systematic pattern
➢ Systematic Sampling:
➢ Multi-Stage Sampling:
▪ However, like cluster sampling you can use it for any discrete groups,
including those not are geographically based.
✓ Purposeful Random: If sample is larger than what can be handled and help to
reduce sample size (Like Random but no equal representation)
✓ The advantage of snowball sampling is that smaller sample sizes and costs
are necessary.
3 Dealers 44 Cluster 10