0% found this document useful (0 votes)
14 views7 pages

Wa0006.

The document contains 6 multiple choice questions about analyzing and interpreting sample data. It includes sample exam questions on comparing variables, identifying outliers, determining distributions, and conducting statistical tests. It also provides a formula sheet for calculations.

Uploaded by

Talha Javid
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views7 pages

Wa0006.

The document contains 6 multiple choice questions about analyzing and interpreting sample data. It includes sample exam questions on comparing variables, identifying outliers, determining distributions, and conducting statistical tests. It also provides a formula sheet for calculations.

Uploaded by

Talha Javid
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Sample Exam Paper for Practice

This is just a sample paper for your practice. It is suggested to perform this after you are done
with you preparation to test yourself how much you can perform in the given duration of time
(2 hours) for the midterm exam. Best of luck! 

Question No. 1
Consider the dataset shown below..
Var1 Var2
2.5 16000
2.5 27000
4.0 12000
2.5 25000
2.0 95000
1.0 36000
3.5 50000
2.5 15004
3.5 400040
3.5 12030
5.0 105000
2.0 12000
1.5 2500
2.5 35000
3.0 2500

Compare the CGPA with Salary at percentiles 25, 50, 78 and 88. How the standard deviation varies at these
percentile values? (Show as area graph)

Question No. 2
For the dataset given below
Dataset:
X= 33.36, 336.71, 375.25, 327.12, 818.43, 371.18, 218.33, 819.14

a) Verify that data is cleaned from outliers. Justify using a particular plot.
b) What is the frequency distribution of the given dataset (show as bar graph) and what is its
interpretation?
c) Determine the predicted outcome X (taking it as population).

Question No. 3
For the dataset (X) shown below, determine the following

a) Verify that data is cleaned from outliers. Justify using a particular plot.
b) What is the frequency distribution of the given dataset (show as bar graph) and what is its
interpretation?
c) Determine the predicted outcome X (taking it as population).

Question No. 4
Suppose the mean length of time between submission of a state tax return requesting a refund and the
issuance of the refund is 47 days, with standard deviation 6 days. Find the probability that in a sample of
50 returns requesting a refund, the mean such time will be more than 50 days

Question No. 5
Consider the dataset as shown in the following comprising of 3 variables, Gender, Universityg and Grade.

Gender University Grade


female Iqra Pass
female Iqra Pass
male Iqra Fail
female IoBM Fail
male LUMS Pass
female LUMS Pass
male LUMS Pass
female IoBM Fail
male LUMS Pass
female IoBM Pass
female IoBM Fail
female LUMS Pass
male LUMS Fail
male IoBM Pass
male Iqra Pass
female IoBM Fail
male LUMS Fail
female LUMS Pass
male Iqra Fail
female IoBM Fail

a) Represent this information in a cross tabulation table (showing all three variables)
b) Plot all probability distribution plots of all intersection probabilities.
Question No. 6
Consider the following Data 1.

Data 1
21.5
24.5
18.5
17.2
14.5
23.2
22.1
20.5
19.4
18.1

Determine the following:


a) Estimated value of population mean at 99% CI
b) Perform one sample T test to test the claim that population mean is 17.8 at 99% CI. Justify
your results using
a. Significant value
b. Error bar graph and
c. Confidence Interval of the difference

Formula Sheet

Allowed maximum and minimum values for a dataset X:

𝑋𝑎𝑙𝑙𝑜𝑤𝑒𝑑 𝑚𝑖𝑛 = 𝑄1 − 1.5(𝐼𝑄𝑅)


𝑋𝑎𝑙𝑙𝑜𝑤𝑒𝑑 𝑚𝑎𝑥 = 𝑄3 + 1.5(𝐼𝑄𝑅)

𝐼𝑄𝑅 = 𝑄3 − 𝑄1

Z scores:

𝐷𝑎𝑡𝑎 − 𝑀𝑒𝑎𝑛
𝑍=
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛

Mean for sample and population in a dataset X


∑𝑁
𝑖=1 𝑥𝑖
µ=
𝑁
Here 𝑁 is the population size
∑𝑁
𝑖=1 𝑥𝑖
x̅ =
𝑁
Here N is the sample size

∑𝑛𝑖=1 𝑓𝑖 𝑥𝑖
µ=
𝑁
Here 𝑓𝑖 is the frequency of each group.
Here 𝑁 is the total number of observations in the population or N = ∑𝑛𝑖=1 𝑓𝑖
Here 𝑛 is the total no values of groups.

∑𝑛𝑖=1 𝑓𝑖 𝑥𝑖
x̅ =
𝑁
Here 𝑓𝑖 is the frequency of each group.
Here N is the total number of observations in the sample or N = ∑𝑛𝑖=1 𝑓𝑖
Here 𝑛 is the total no values of groups.

Variance (Population)

∑𝑵
𝒊=𝟏(𝒙𝒊 − 𝝁)
𝟐
𝝈𝟐 = ( )
𝑵
Here
𝜇 is the mean value of population
𝑁 is the population size

∑𝒏𝒊=𝟏 𝒇𝒊 (𝒙𝒊 − 𝝁)𝟐


𝝈𝟐 = ( )
𝑵
Here
𝜇 is the mean value of population
𝑁 is the population size
𝑛 is the total no values of groups
𝑓𝑖 is the frequency of each group.
Variance (Sample)

∑𝑵 ̅)𝟐
𝒊=𝟏(𝒙𝒊 − x
𝒔𝟐 = ( )
𝑵−𝟏
Here
x̅ is the mean value of sample
𝑁 is the sample size

∑𝒏𝒊=𝟏 𝒇𝒊 (𝒙𝒊 − x̅)𝟐


𝝈𝟐 = ( )
𝑵−𝟏
Here
x̅ is the mean value of sample
𝑁 is the sample size
𝑛 is the total no values of groups
𝑓𝑖 is the frequency of each group.

No. of groups in a dataset


𝐿𝑜𝑔10 (𝑁)
𝑛=
𝐿𝑜𝑔10 (2)

𝑛 is the total no. of groups/ classes


𝑁 is the dataset size

Class width and height


𝑅𝑎𝑛𝑔𝑒
𝑛

𝑛 is the total no. of groups/ classes

You might also like