Final Stat1 Hadath March2022 1stsemester en Solution v6
Problem 1: (35pts)
The team manager of the sales department wishes to determine how monthly working hours (X) are related
to sales volume (Y) for a population of 200 employees at a company. Their monthly working hours and
their sales volume (in thousands of dollars) are given.
                               Sales Volume (Y)
                               [2,6[    [6,10[   [10,20[
Monthly working    [40,60[       8        24        0
hours (X)          [60,80[       0        15        5
                   [80,90[       0         5       15
                   [90,100[      0         0        6
2. Calculate the mean (2+2pts) and standard deviation (4+4pts) for each of X and Y
Y class     midpoint mi     fi     fi mi     fi mi^2
[2,6[            4           8       32        128
[6,10[           8          44      352       2816
[10,20[         15          26      390       5850
Total                       78      774       8794
average of Y (2pts): 9.92307692
var Y: 14.27613
sigmaY(4pts): 3.778377
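The grouped-data calculation above can be checked with a short Python sketch. It assumes the population formulas from the formula sheet (divide by N, not n-1); the function name `grouped_mean_sd` is only illustrative and not part of the exam.

```python
from math import sqrt

# Class midpoints and frequencies for Y, copied from the table above.
y_midpoints = [4, 8, 15]
y_freqs = [8, 44, 26]

def grouped_mean_sd(midpoints, freqs):
    """Return (mean, variance, standard deviation) for grouped population data."""
    n = sum(freqs)
    sum_fm = sum(f * m for f, m in zip(freqs, midpoints))       # sum of fi*mi
    sum_fm2 = sum(f * m * m for f, m in zip(freqs, midpoints))  # sum of fi*mi^2
    mean = sum_fm / n
    var = sum_fm2 / n - mean ** 2   # mean of squares minus square of mean
    return mean, var, sqrt(var)

mean_y, var_y, sd_y = grouped_mean_sd(y_midpoints, y_freqs)
print(round(mean_y, 6), round(var_y, 5), round(sd_y, 6))
```

The same function should apply to X by passing the X midpoints (50, 70, 85, 95) together with the row totals of the joint table.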
Products fij xi yj (X midpoints in rows, Y midpoints in columns):
X \ Y        4        8        15
50        1600     9600         0
70           0     8400      5250
85           0     3400     19125
95           0        0      8550
XY_bar(3pts): 716.9871795
COV_XY(3pts): 46.5433925
rho(3pts): 0.758231114
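As a cross-check of the three values above, here is a minimal Python sketch that recomputes the covariance and correlation directly from the joint frequency table. It assumes class midpoints represent each interval and uses the population formulas Cov = mean(xy) - mean(x) mean(y) and rho = Cov / (sigma_x sigma_y); the variable names are illustrative.

```python
from math import sqrt

# Midpoints of the X classes (rows) and Y classes (columns).
x_mid = [50, 70, 85, 95]
y_mid = [4, 8, 15]

# Joint frequencies: one row per X class, one column per Y class.
counts = [
    [8, 24, 0],
    [0, 15, 5],
    [0, 5, 15],
    [0, 0, 6],
]

n = sum(sum(row) for row in counts)
row_totals = [sum(row) for row in counts]        # marginal frequencies of X
col_totals = [sum(col) for col in zip(*counts)]  # marginal frequencies of Y

mean_x = sum(f * x for f, x in zip(row_totals, x_mid)) / n
mean_y = sum(f * y for f, y in zip(col_totals, y_mid)) / n
var_x = sum(f * x * x for f, x in zip(row_totals, x_mid)) / n - mean_x ** 2
var_y = sum(f * y * y for f, y in zip(col_totals, y_mid)) / n - mean_y ** 2

# Mean of the products x*y, weighted by the joint frequencies.
mean_xy = sum(
    f * x * y
    for x, row in zip(x_mid, counts)
    for y, f in zip(y_mid, row)
) / n

cov_xy = mean_xy - mean_x * mean_y
rho = cov_xy / sqrt(var_x * var_y)
print(round(mean_xy, 4), round(cov_xy, 4), round(rho, 4))
```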
5. Estimate the percentage of employees whose monthly sales volume is between 3 and 12.5 thousand
dollars (8pts)
Y interval              interpolated frequency
3 to 6 (1pt)                    6
6 to 10 (1pt)                  44
10 to 12.5 (3pts)               6.5
Total                          56.5
% employees (3pts)              0.724 (i.e. 72.4%)
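The interpolation above assumes that observations are spread uniformly inside each class. Under that assumption, a small Python sketch (the helper `count_between` is hypothetical) reproduces the interpolated frequencies:

```python
# (lower bound, upper bound, frequency) for each Y class, from the table of Y.
y_classes = [(2, 6, 8), (6, 10, 44), (10, 20, 26)]

def count_between(classes, a, b):
    """Estimate how many observations fall in [a, b], assuming a uniform
    spread of the observations inside every class."""
    total = 0.0
    for low, high, freq in classes:
        overlap = max(0.0, min(b, high) - max(a, low))  # length of class inside [a, b]
        total += freq * overlap / (high - low)
    return total

n = sum(freq for _, _, freq in y_classes)
inside = count_between(y_classes, 3, 12.5)
share = inside / n
print(inside, round(share, 3))
```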
Problem 2: (25pts)
The weekly salary distribution (in $100) at LENOVO company is given.
Weekly salary    [3,7[    [7,9[    [9,11[    [11,15[    [15,21[
Frequency          9        5        8          2          1
1. Find the mode and the median. (16 points total, with partial credit as follows)
Median
Median class (2pts): [7,9[
Correct answer (2pts): 8.6
Mode
Modal class (2pts): [9,11[
delta1 (1pt): 1.5
delta2 (1pt): 3.5
Correct answer (2pts): 9.6
Class L    Class H    F    mi    mi fi    Wi (2pts)    Di (2pts)    CF (2pts)
   3          7       9     5     45         4           2.25           9
   7          9       5     8     40         2           2.5           14
   9         11       8    10     80         2           4             22
  11         15       2    13     26         4           0.5           24
  15         21       1    18     18         6           0.167         25
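The two answers can be reproduced with the Python sketch below. The median follows the formula sheet (with the ceiling of N/2). The mode formula is not on the formula sheet; the sketch assumes the usual density-based interpolation, Mode = L + delta1 / (delta1 + delta2) * w, where delta1 and delta2 are the differences between the modal density and the densities of the neighbouring classes, which is consistent with the deltas listed above. Function names are illustrative.

```python
from math import ceil

# (lower bound, upper bound, frequency) for each salary class.
classes = [(3, 7, 9), (7, 9, 5), (9, 11, 8), (11, 15, 2), (15, 21, 1)]

def grouped_median(classes):
    """Median by interpolation inside the median class, using ceil(N/2)."""
    n = sum(freq for _, _, freq in classes)
    target = ceil(n / 2)
    cf_before = 0  # cumulative frequency before the current class
    for low, high, freq in classes:
        if cf_before + freq >= target:
            return low + (target - cf_before) / freq * (high - low)
        cf_before += freq

def grouped_mode(classes):
    """Mode by interpolation inside the class with the highest density F/W."""
    dens = [freq / (high - low) for low, high, freq in classes]
    k = max(range(len(dens)), key=dens.__getitem__)  # index of the modal class
    prev_d = dens[k - 1] if k > 0 else 0.0
    next_d = dens[k + 1] if k + 1 < len(dens) else 0.0
    delta1 = dens[k] - prev_d
    delta2 = dens[k] - next_d
    low, high, _ = classes[k]
    return low + delta1 / (delta1 + delta2) * (high - low)

median = grouped_median(classes)
mode = grouped_mode(classes)
print(round(median, 2), round(mode, 2))
```

Densities rather than raw frequencies are used to pick the modal class because the class widths differ.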
2. The following table contains calculations that lead to the Gini coefficient. (total: 9pts)
a. Give proper labels for columns A, B, C, D.
A: midpoint; B: class income (CI); C: relative class income (RCI); D: cumulative RCI
(3pts: any 3 correct)
b. Complete the missing parts and find the Gini coefficient.
$Gini = 1 - \sum(\text{column } G) = 1 - 0.798 = 0.202$
(2pts for using the correct formula, 3pts for the answer)
c. Interpret. Fair distribution (1pt)
Low High Freq A B C D E F G
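The rows of this table are not reproduced here. Assuming it is built from the salary distribution of this problem (midpoints 5, 8, 10, 13, 18 and frequencies 9, 5, 8, 2, 1), the following Python sketch applies the formula-sheet expression G = 1 - sum of RF_i (CRI_i + CRI_{i-1}); the function name `gini_grouped` is hypothetical.

```python
# Assumed input: the weekly salary distribution of Problem 2.
midpoints = [5, 8, 10, 13, 18]
freqs = [9, 5, 8, 2, 1]

def gini_grouped(midpoints, freqs):
    """Gini coefficient for grouped data: 1 - sum(RF_i * (CRI_i + CRI_{i-1}))."""
    n = sum(freqs)
    class_income = [m * f for m, f in zip(midpoints, freqs)]  # income per class
    total_income = sum(class_income)
    column_sum = 0.0
    cri_prev = 0.0  # cumulative relative income before the current class
    for f, ci in zip(freqs, class_income):
        cri = cri_prev + ci / total_income
        column_sum += (f / n) * (cri + cri_prev)  # relative frequency times the CRI pair
        cri_prev = cri
    return 1.0 - column_sum

gini = gini_grouped(midpoints, freqs)
print(round(gini, 3))
```

Under this assumption the sum comes out close to the 0.798 used in part b, which suggests the table does refer to the same salary data.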
Problem 3: (15pts)
The daily fruit sales 𝑋 (in Kg) at a shop is normally distributed with a mean of 50 and a variance of 9.
1. Find 𝑃(𝑋 > 53.75) (5pts) and 𝑃(46.25 ≤ 𝑋 ≤ 53.75) (5pts)
$P(X > 53.75) = P\left(Z > \frac{53.75 - 50}{3} = 1.25\right) = 1 - F(1.25) = 1 - 0.8944 = 0.1056$
OR 0.5 − 0.3944 = 0.1056 (5 pts)
(partial credit: 2 pts for correct Z)
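Only the first probability is worked out above. By symmetry of the normal curve around the mean, the second one should equal 1 - 2(0.1056) = 0.7888 with table values. A minimal Python sketch, assuming the standard normal CDF is evaluated through the error function instead of a printed table (so the last digit can differ slightly), computes both; `normal_cdf` is an illustrative helper.

```python
from math import erf, sqrt

def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of a normal(mu, sigma) variable."""
    return 0.5 * (1.0 + erf((x - mu) / (sigma * sqrt(2.0))))

mu = 50.0
sigma = sqrt(9.0)  # the problem gives the variance, so sigma = 3

p_above = 1.0 - normal_cdf(53.75, mu, sigma)
p_between = normal_cdf(53.75, mu, sigma) - normal_cdf(46.25, mu, sigma)
print(round(p_above, 4), round(p_between, 4))
```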
Problem 4: (15pts)
The learning mode this year at the Lebanese University was the hybrid mode. Suppose that 20% of
students do not attend the exam. We select a random sample of n=20 students. Let X be the number of students
among the sample who do not attend the exam. Using this sample:
1. Find the expected value and standard deviation for the number of students who attend the exam.
2. What is the probability that at least 2 students do not attend the exam?
𝐸(20 − 𝑋) = 𝑛𝑝 = 20 × 0.8 = 16, where 𝑝 = 0.8 is the probability of attending (3 pts)
𝜎 = √𝑛𝑝𝑞 = √(20)(0.2)(0.8) = 1.789 (3 pts)
𝑃(𝑋 ≥ 2) = 1 − 𝑃(𝑋 < 2) (2 pts)
= 1 − (𝑃(𝑋 = 0) + 𝑃(𝑋 = 1))
𝑃(𝑋 = 0) = 0.0115; (2 pts)
𝑃(𝑋 = 1) = 0.0576; (2 pts)
𝑃(𝑋 ≥ 2) = 1 − 0.0115 − 0.0576 = 0.93 (3 pts)
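The binomial values above can be verified with a few lines of Python. The sketch assumes X follows a binomial distribution with n = 20 and p = 0.2 (probability of not attending); `binom_pmf` is an illustrative helper built on `math.comb`.

```python
from math import comb, sqrt

n = 20
p_absent = 0.2  # probability that a student does not attend

def binom_pmf(k, n, p):
    """P(X = k) for a binomial(n, p) variable."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

mean_attend = n * (1 - p_absent)             # expected number of attendees
sd = sqrt(n * p_absent * (1 - p_absent))     # same value for attendees and absentees

p0 = binom_pmf(0, n, p_absent)
p1 = binom_pmf(1, n, p_absent)
p_at_least_2 = 1 - p0 - p1
print(mean_attend, round(sd, 3), round(p0, 4), round(p1, 4), round(p_at_least_2, 2))
```

The standard deviation is the same for the attendee and absentee counts because the two counts always add up to 20.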
Formulas
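$\mu = \frac{\sum w_i x_i}{\sum w_i}$
$P_k = L + \left(\frac{\lceil \frac{Nk}{100} \rceil - CF_{before}}{F}\right) w$
$Med = L_{mc} + \left(\frac{\lceil \frac{N}{2} \rceil - CF_{before}}{F_{mc}}\right) w_{mc}$
Population
• $\sigma = \sqrt{Var} = \sqrt{\frac{\sum f_i (x_i - \mu)^2}{N}}$
• $\sigma = \sqrt{Var} = \sqrt{\overline{x_i^2} - (\bar{x})^2}$
• $\sigma = \sqrt{\frac{\sum f_i m_i^2}{N} - \left(\frac{\sum f_i m_i}{N}\right)^2}$
Sample
• $s = \sqrt{Var} = \sqrt{\frac{\sum f_i (x_i - \bar{x})^2}{n-1}}$
• $s = \sqrt{Var} = \sqrt{\frac{\sum x_i^2}{n-1} - \frac{(\sum x_i)^2}{n(n-1)}}$
• $s = \sqrt{Var} = \sqrt{\frac{\sum f_i m_i^2}{n-1} - \frac{(\sum f_i m_i)^2}{n(n-1)}}$
$G = \frac{A}{A+B} = 2A = 1 - 2B$
$G = 1 - \sum RF_i (CRI_i + CRI_{i-1})$
$Cov_{xy} = \overline{xy} - \bar{x} \cdot \bar{y}$
$\rho = \frac{Cov_{xy}}{\sigma_x \sigma_y}$
slope $m = \frac{Cov_{xy}}{\sigma_x^2}$, intercept $b = \bar{y} - m\bar{x}$
$E(x) = \sum p_i x_i$
$P(A, B) = P(A|B) P(B)$
$P(A|B) = \frac{P(A, B)}{P(B)} = \frac{P(B|A) P(A)}{P(B|A) P(A) + P(B|\bar{A}) P(\bar{A})}$
$z = \frac{\bar{x} - \mu}{\sigma}$
$P(x) = {}_nC_x \, p^x q^{n-x} = \frac{n!}{(n-x)! \, x!} p^x q^{n-x}$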