Wa0006.
Wa0006.
This is just a sample paper for your practice. It is suggested to perform this after you are done
with you preparation to test yourself how much you can perform in the given duration of time
(2 hours) for the midterm exam. Best of luck!
Question No. 1
Consider the dataset shown below..
Var1 Var2
2.5 16000
2.5 27000
4.0 12000
2.5 25000
2.0 95000
1.0 36000
3.5 50000
2.5 15004
3.5 400040
3.5 12030
5.0 105000
2.0 12000
1.5 2500
2.5 35000
3.0 2500
Compare the CGPA with Salary at percentiles 25, 50, 78 and 88. How the standard deviation varies at these
percentile values? (Show as area graph)
Question No. 2
For the dataset given below
Dataset:
X= 33.36, 336.71, 375.25, 327.12, 818.43, 371.18, 218.33, 819.14
a) Verify that data is cleaned from outliers. Justify using a particular plot.
b) What is the frequency distribution of the given dataset (show as bar graph) and what is its
interpretation?
c) Determine the predicted outcome X (taking it as population).
Question No. 3
For the dataset (X) shown below, determine the following
a) Verify that data is cleaned from outliers. Justify using a particular plot.
b) What is the frequency distribution of the given dataset (show as bar graph) and what is its
interpretation?
c) Determine the predicted outcome X (taking it as population).
Question No. 4
Suppose the mean length of time between submission of a state tax return requesting a refund and the
issuance of the refund is 47 days, with standard deviation 6 days. Find the probability that in a sample of
50 returns requesting a refund, the mean such time will be more than 50 days
Question No. 5
Consider the dataset as shown in the following comprising of 3 variables, Gender, Universityg and Grade.
a) Represent this information in a cross tabulation table (showing all three variables)
b) Plot all probability distribution plots of all intersection probabilities.
Question No. 6
Consider the following Data 1.
Data 1
21.5
24.5
18.5
17.2
14.5
23.2
22.1
20.5
19.4
18.1
Formula Sheet
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
Z scores:
𝐷𝑎𝑡𝑎 − 𝑀𝑒𝑎𝑛
𝑍=
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
∑𝑛𝑖=1 𝑓𝑖 𝑥𝑖
µ=
𝑁
Here 𝑓𝑖 is the frequency of each group.
Here 𝑁 is the total number of observations in the population or N = ∑𝑛𝑖=1 𝑓𝑖
Here 𝑛 is the total no values of groups.
∑𝑛𝑖=1 𝑓𝑖 𝑥𝑖
x̅ =
𝑁
Here 𝑓𝑖 is the frequency of each group.
Here N is the total number of observations in the sample or N = ∑𝑛𝑖=1 𝑓𝑖
Here 𝑛 is the total no values of groups.
Variance (Population)
∑𝑵
𝒊=𝟏(𝒙𝒊 − 𝝁)
𝟐
𝝈𝟐 = ( )
𝑵
Here
𝜇 is the mean value of population
𝑁 is the population size
∑𝑵 ̅)𝟐
𝒊=𝟏(𝒙𝒊 − x
𝒔𝟐 = ( )
𝑵−𝟏
Here
x̅ is the mean value of sample
𝑁 is the sample size