NMIMS
DECISION SCIENCE
APPLICABLE FOR JUNE 2020 EXAMS
1. Identify the type of the variable in the following table
TABLE GIVEN BELOW
Variable Data Type
a Gender
b Education Background
c Satisfaction
d Motivation
e Exchange Rate
f Gold price
g Preference of cars
h Teachers Feedback
i Grades in post-graduation
j Marital Status
k Quality of services
l Age group
m GDP
n Interest rate
o Twitter comments
p Facebook pictures
Answer:
Variable Data Type
a Gender Nominal variable
b Education Background Ordinal variable
c Satisfaction Ordinal variable
d Motivation Ordinal variable
e Exchange Rate Interval variable
f Gold price Interval variable
g Preference of cars Nominal variable
h Teachers Feedback Nominal variable
i Grades in post-graduation Ordinal variable
j Marital Status Nominal variable
k Quality of services Interval Variable
l Age group Interval Variable
m GDP
Ordinal variable
n Interest rate Interval variable
o Twitter comments Ratio variable
p Facebook pictures Ratio variable
2. Following data of performance scores is available of employees working with a
company. You are required to perform the following:
a. Make the frequency distribution, Calculate the frequency and the Cumulative frequency
b. Calculate the mean, median, quartiles and Mode
c. Calculate the variance and the standard deviation
Table: Performance score of the employees:
TABLE BELOW
5 33 70 95 5 61 47 60
2 7
5 64 54 94 3 61 89 48
7 8
5 39 94 63 5 31 88 46
0 9
6 88 93 48 8 82 72 73
8 2
7 70 92 76 9 91 80 68
4 8
3 33 31 75 5 48 62 53
2 4
3 64 63 66 9 98 91 42
6 2
3 54 71 86 8 55 33 43
6 4
9 34 64 67 8 78 47 62
1 9
9 92 53 56 6 55 36 67
7 8
9 42 51 77 3 93 51 66
3 6
4 66 63 33 6 79 92 76
4 8
8 53 86 76 3 40 43 46
3 5
5 41 36 39 4 96 42 77
5 2
6 53 38 51 9 56 93 63
0 5
4 69 49 33 9 37 83 64
8 5
8 62 96 34 8 32 40 85
3 5
3 59 77 62 3 34 39 92
9 5
5 89 36 45 8 34 86 90
4 3
3 61 88 86 5 33 77 40
9 5
6 54 30 38 7 77 44 59
9 9
9 34 38 91 8 90 58 40
5 0
8 45 95 71 8 43 89 53
8 0
6 40 31 61 5 53 88 94
1 8
9 63 60 94 9 53 53 45
1 8
5 34 75 74 9 98 87 66
0 0
Answer: a) Make the frequency distribution, Calculate the frequency and the Cumulative
frequency
performanc
e Mid Cumulative
scores Point* frequency frequency
30-39 34.5 36 36
40-49 44.5 27 63
50-59 54.5 32 95
60-69 64.5 33 128
70-79 74.5 21 149
80-89 84.5 26 175
90-99 94.5 33 208
*Mid point = (lower frequency +upper frequency)/2
b. Calculate the mean, median, quartiles and Mode
Mean
Perform
ance Midpoi freque
scores nt(x) ncy (f) f*x
124
30-39 34.5 36 2
120
40-49 44.5 27 1.5
174
50-59 54.5 32 4
60-69 64.5 33 212
8.5
156
70-79 74.5 21 4.5
219
80-89 84.5 26 7
311
90-99 94.5 33 8.5
131
208 96
Mean = ∑fx/∑f
= 13196/208
= 63.44
Therefore, mean = 63.44
Median
performanc
e Mid Cumulative
scores point frequency frequency
30-39 34.5 36 36
40-49 44.5 27 63
50-59 54.5 32 95
60-69 64.5 33 128
70-79 74.5 21 149
80-89 84.5 26 175
90-99 94.5 33 208
Median = (208+1)/2 = 209/2 = 104.5
Median = L + N/2 – C.Fp * (W)
Fmed
L = lower limit
CFp = cumulative frequency upto but not including the frequency of median class
Fmed = Frequency of median class
W = width of median class
N = total number of frequencies
Median = 60 + 208/2 – 95 * 10
33
= 60 + 104 – 95 * 10
33
= 60 + 9 * 10
33
= 60 + 90/33
= 60 + 2.73
= 62.73
Therefore, median = 62.73
Quartiles
performanc
e Mid Cumulative
scores point frequency frequency
30-39 34.5 36 36
40-49 44.5 27 63
50-59 54.5 32 95
60-69 64.5 33 128
70-79 74.5 21 149
80-89 84.5 26 175
90-99 94.5 33 208
Q1 = N/4 = 208/4 = 52
Q1 = Lq1 + N/4 – C.F * (W)
Fq1
= 40 + 208/4 – 36 * 10
27
= 40 + 52 – 36 * 10
27
= 40 + 16 * 10
27
= 40 + 160/27
= 40 + 5.93
= 45.93
Q3 = 3N/4 = 3*208/4 = 624/4 =156
Q3 = Lq3 + 3N/4 – C.F * (W)
Fq3
= 80 + 3*208/4 – 149 * 10
26
= 80 + 156 – 149 * 10
26
= 80 + 7 * 10
26
= 80 + 70/26
= 80 + 2.69
= 82.69
Mode
The mode for grouped data is the class midpoint of the modal class. The modal class is the
class interval with the greatest frequency. Using the data from Table above, the 30-39 class
intervals contains the greatest frequency, 36. Thus, the modal class is 30-39. The class
midpoint of this modal class is 34.5. Therefore, the mode for the frequency distribution
shown in Table above is 34.5.
c. Calculate the variance and the standard deviation
Performanc Mid Cumulative
e point Frequency frequency (x-
scores (x) (f) (cf) f*x x-µ µ)^2 f(x-µ)^2
30-39 34.5 36 36 1242 -28.94 837.66 30155.66
1201.
40-49 44.5 27 63 5 -18.94 358.81 9687.898
50-59 54.5 32 95 1744 -8.94 79.96 2558.876
2128.
60-69 64.5 33 128 5 1.06 1.12 36.91753
1564.
70-79 74.5 21 149 5 11.06 122.27 2567.724
80-89 84.5 26 175 2197 21.06 443.43 11529.09
3118.
90-99 94.5 33 208 5 31.06 964.58 31831.15
208 13196 88367.31
σ^2 = ∑f(x-µ)^2
N
= 88367.31/208
= 424.84
σ = √424.84 = 20.61
3. a. In continuation with the data of performance scores of employees in previous
example, perform the following:
a. Calculate the range and inter-quartile range
b. Calculate the z scores
c. Calculate the skewness and Kurtosis (using excel)
d. Comment on the distribution of the data
3. b. In continuation with the data of performance scores of employees in previous
example, perform the following:
a. Make the histogram
b. Plot the box-plot diagram
c. Plot the frequency polygon
d. Plot the Ogive diagram
Answer: a.a. Calculate the range and interquartile range
Range
The range often is defined as thedifference between the largest and smallest numbers. The
range for the data inTable above is 68 (98-30).
Inter-quartile range
IQR = Q3 - Q1
= 82.69- 45.93 = 36.76
b. Calculate the z scores
Note: To calculate the Z score, we are taking range as x
z=x-µ
σ
= 68- 63.44
20.61
= 4.56/20.61
= 0.22125
P-value from Z-Table:
P(x<68) = 0.58755
P(x>68) = 1 - P(x<68) = 0.41245
P(63.44<x<68) = P(x<68) - 0.5 = 0.087552
c. Calculate the skewness and Kurtosis (using excel)
lower upper lower upper class frequen Cumulative
limit limit boundary boundary mark cy frequency
30 39 29.5 39.5 34.5 36 36
40 49 39.5 49.5 44.5 27 63
50 59 49.5 59.5 54.5 32 95
60 69 59.5 69.5 64.5 33 128
70 79 69.5 79.5 74.5 21 149
80 89 79.5 89.5 84.5 26 175
90 99 89.5 99.5 94.5 33 208
Skewness = 0.097671068
Kurtosis = -1.285421147
d. Comment on the distribution of the data
The distribution is positively skewed
The distribution is Platykurtic (The term "platykurtic" refers to a statistical
distribution in which the excess kurtosis value is negative)
b.
lower upper lower upper class frequen Cumulative
limit limit boundary boundary mark cy frequency
30 39 29.5 39.5 34.5 36 36
40 49 39.5 49.5 44.5 27 63
50 59 49.5 59.5 54.5 32 95
60 69 59.5 69.5 64.5 33 128
70 79 69.5 79.5 74.5 21 149
80 89 79.5 89.5 84.5 26 175
90 99 89.5 99.5 94.5 33 208
a. Make the histogram
Histogram
40
30
Frequency
20 Frequency
10
0
39.5 49.5 59.5 69.5 79.5 89.5 99.5 More
Bin
b. Plot the box-plot diagram
Min
45 62 83 1 st quartile
2nd quartile
3 rd quartile
Max
0 50 100 150 200 250 300 350
c. Plot the frequency polygon
frequency Polygon
40
35
30
25
20
frequency
15
10
5
0
24.5 34.5 44.5 54.5 64.5 74.5 84.5 94.5 105.5
midpoint
d. Plot the Ogive diagram
score
250
200
150
cumulative frequency
100
50
0
39.5 49.5 59.5 69.5 79.5 89.5 99.5
upper boundary