Lecture#2 - Probability & Statistics - Fall24
Lecture#2 - Probability & Statistics - Fall24
MT220
(FALL-2024)
Random Sampling
Lecture#2
By,
Dr. Mehwish Manzur
LAYOUT OF LECTURE
• Collection of Data
• Random Sampling
STEPS INVOLVED IN ANY STATISTICAL RESEARCH
The most important part of statistical work is perhaps the collection of data.
Statistical data are collected either by a COMPLETE enumeration of the whole
field, called CENSUS, which in many cases would be too costly and too time
consuming as it requires large number of enumerators and supervisory
staff, or by a PARTIAL enumeration associated with a SAMPLE which saves
much time and money.
COLLECTION OF DATA
Data that have been originally collected (raw data) and have not
undergone any sort of statistical treatment, are called PRIMARY data.
Data that have undergone any sort of treatment by statistical methods at
least ONCE, i.e. the data that have been collected, classified, tabulated or
presented in some form for a certain purpose, are called SECONDARY data.
5 Traditional Methods of Primary Data Collection
One or more of the following methods are employed to collect primary data:
i) Direct Personal Investigation.
ii) Indirect Investigation.
iii) Collection through Questionnaires.
iv) Collection through Enumerators.
v) Collection through Local Sources.
DIRECT PERSONAL INVESTIGATION
In this method, an investigator collects the information personally from the
individuals concerned. Since he interviews the informants himself, the
information collected is generally considered quite accurate and complete.
1.Target Population
The target population (also called the universe) refers to the entire group of individuals,
objects, or events that a researcher is interested in studying. This is the group from which you
ideally want to generalize your findings.
It’s the population you want to draw conclusions about.
In practice, it's often too large or difficult to study every single member of the target
population, so a sample is taken instead.
Example:
If a researcher wants to study college students' attitudes toward online learning, the target
population could be all college students in the world.
2. Sampled Population
The sampled population refers to the group from which the sample is actually drawn.
It is a subset of the target population that the researcher can realistically access.
The sampled population may differ from the target population due to practical
limitations such as time, cost, geography, or other constraints.
Example:
In the same study of college students' attitudes toward online learning, the sampled
population could be college students from a few universities or a specific region
(if the researcher cannot access all students worldwide).
Examples:
1. Health Survey Example:
1. Target Population: All adults in a country.
2. Sampled Population: Adults in selected cities (due to budget constraints,
the survey is only conducted in specific cities).
2. Customer Feedback Example:
1. Target Population: All customers who purchased a product in the past year.
2. Sampled Population: Customers who responded to an email survey (only
those who saw the survey and chose to participate).
Question
Suppose we desire to know the opinions of the college students in the Punjab
regarding the present examination system. Comment on Sampled and Targeted
Population.
Thus, our population will consist of the total no. of students in all the colleges in
the Punjab.
For example, suppose on account of shortage of resources or of time, we are able to
conduct such a survey on only 5 colleges scattered throughout the province.
In this case, the students of all the colleges will constitute the target population
whereas the students of those 5 colleges from which the sample of students will be
selected will constitute the sampled population.
SAMPLING FRAME
A sampling frame is a complete list of all the elements in the population from
which the sample is drawn.
Examples:
• The complete list of the BCS students of NUTECH on September 23 ,
2024.
• If a researcher is conducting a survey of university students, the sampling
frame could be a list of all enrolled students at that university, which might
be obtained from the registrar's office.
• If a company wants to survey its customers, the sampling frame could be a
customer database containing the contact information of all customers.
The Sampling frame should be free from various types of defects:
does not contain inaccurate elements
is not incomplete
group is surveyed.
The following frequency table of distribution gives the ages of a population of 1000
teen-age college students in a particular country. Select a sample of 10 students using the
random numbers table. Find the sample mean age and compare with the population mean
age.
Age Cumulative
Frequency
(years) Frequency
17 100 100
18 200 300
19 350 650
20 250 900
21 100 1000
Step 3: Random Numbers Generation
Now, we need 10 random numbers from 1 to 1000 to select our
sample. If we had a random number table (or a random number
generator), we could generate numbers like this:
•Generated Random Numbers: 75, 231, 512, 650, 789, 904, 112,
333, 480, 699
Step 4: Select the Sample Using Cumulative Frequencies
Using these random numbers and matching them to the cumulative frequency table:
•75 falls in the 17-year group (since 1-100 corresponds to 17 years old). Age Freque Cumulative
•231 falls in the 18-year group (since 101-300 corresponds to 18 years old). (years) ncy Frequency
•512 falls in the 19-year group (since 301-650 corresponds to 19 years old). 17 100 100
18 200 300
•650 falls in the 19-year group (since 301-650 corresponds to 19 years old).
19 350 650
•789 falls in the 20-year group (since 651-900 corresponds to 20 years old).
20 250 900
•904 falls in the 21-year group (since 901-1000 corresponds to 21 years old).
21 100 1000
•112 falls in the 18-year group (since 101-300 corresponds to 18 years old).
•333 falls in the 19-year group (since 301-650 corresponds to 19 years old).
•480 falls in the 19-year group (since 301-650 corresponds to 19 years old).
•699 falls in the 20-year group (since 651-900 corresponds to 20 years old).
So the sample we’ve selected based on these random
numbers is:
•1 student aged 17
•2 students aged 18
•4 students aged 19
•2 students aged 20
•1 student aged 21
Step 5: Sample Mean Age
Now, we calculate the mean age of the sample:
The sample mean age is very close to the population mean age, which
shows that our random sampling method has worked well in
approximating the population characteristics
Sampling Error: