Assignment-Case Study - QM
Assignment-Case Study - QM
Problem Statements
Group - 9
Members Name:
pg. 2
CONTENTS:
1. Abstract………………………………………………………………………………4
2. Problem Statements………………………………………………………………..5
3. Data Related Problems…………………………………………………………… 6
4. Proposed Questions………………………………………………………………..7
5. References…………………………………………………………………………35
pg. 3
Abstract:
Money Control- moneycontrol is India's No 1 Financial and Business portal. With in-
depth market coverage, analysis, expert opinions and a gamut of financial tools,
moneycontrol.com has been the premier destination for consumers and market
watchers. A part of Network18, moneycontrol.com is the most influential destination
for stock market news and advice, business news and news about the Indian and
global economy.
NIFTY 50- The NIFTY 50 is a benchmark Indian stock market index that represents
the weighted average of 50 of the largest Indian companies listed on the National
Stock Exchange. It is one of the two main stock indices used in India, the other being
the BSE SENSEX.
NSE- Powered by millions of dreams, hopes and aspirations, India today is brimming
with potential. At NSE, we are driven by this ambition that makes India charge ahead
and take a more prominent place on the global stage.
Our ambition gives us direction. It pivots, and propels us. It is the leap we take from
today, towards a brighter tomorrow.
NIFTY PSU Bank- NIFTY PSU Bank Index is computed using free float market
capitalization method, wherein the level of the index reflects the total free float
market value of all the stocks in the index relative to particular base market
capitalization value
The NIFTY PSU Bank Index is designed to reflect the performance of the
public sector banks.
Problem Statements:
pg. 4
Problem 1-
1. Data Source
1. Go to https://www.moneycontrol.com/ (Links to an external
site.)
2. Go to Market and chose Indian Indices, Chose NIFTY 50
3. Choose your company, scroll down to the financials heading
( for example for Adani port, financials are given
at https://www.moneycontrol.com/india/stockpricequote/infrastr
ucture-general/adaniportsspecialeconomiczone/
MPS#financials (Links to an external site.)).
4. Download/copy five-year quarterly data (total 20 data points)
of:
1. Total Income from Operations
2. Employee cost
3. Depreciation
4. Other Income
5. Other Expenses
5. Test
1. Is there any difference between total income from
Operations and total cost (Employee cost+
depreciation+ other expenses) at a 90% confidence
interval?
1. Chose a suitable statistical tool for it.
2. Frame null and alternate hypothesis
3. Test using the data if you have to reject the null
hypothesis using the critical value approach.
2. Is “other income” greater than the “total income from
operations”. Test at 95% confidence interval.
3. Is there any relationship between the “income form
operation” and “Employee cost”?
1. What are the collinearity coefficient and
coefficient of determination?
0. Show your calculations.
2. Test if income from operations is independent of
Employee cost at a 90% confidence interval.
4. Find the trend of the total income from operation with
respect to time.
1. State all your assumptions if any for all the
questions.
pg. 5
Problem 2 –
1. Go to
www1.nseindia.com/products/content/equities/indices/historical_index_dat
a.htm
2. Choose your indices
3. Download one-year data of
1. Open
2. Close
3. High
4. Low
4. Test
1. If open and high values are the same
2. Close and low values are the same
3. Open and close values are the same
4. Low and high values are the same
0. At 90% confidence interval.
5. Write a descriptive report using the information gained by the
analysis in not more than 500 words
pg. 6
Problem 1
Q1. Is there any difference between total income from Operations and total
cost (Employee cost+ depreciation+ other expenses) at a 90% confidence
interval?
1. Chose a suitable statistical tool for it.
2. Frame null and alternate hypothesis
3. Test using the data if you have to reject the null hypothesis using the
critical value approach.
Step 1:
H0: u1=u2, so here Total Income from operations is equals to Total cost, u1-
u2=0
Ha: u1 != u2, so here Total Income from operations is not equals to Total cost
pg. 7
n1 20
n2 20
x1bar 4,887.78
x2bar 1907.629
s1 785.2440557
s2 381.3309679
Step 2:
df =
df = 18,61,14= 19
Step 3:
Df=19
Rejection Rejection
0.05 0.05
- 1.729 0 1.729
pg. 8
Step 4:
t= 15.26
Hence t is greater than Rejection value. Hence, we have to reject the Null Hypothesis
value.
pg. 9
a) Chose a suitable statistical tool for it.
Microsoft Excel
pg. 10
MS Excel does offer a wide variety of tools for data visualization and simple
statistics. It’s simple to generate summary metrics and customizable graphics
and figures, making it a usable tool for many who want to see the basics of their
data. As many individuals and companies both own and know how to use Excel,
it also makes it an accessible option for those looking to get started with
statistics.
pg. 11
b) Frame null and alternate hypothesis:
pg. 12
c) Test using the data if you have to reject the null hypothesis using the
critical value approach.
Critical Value Approach to Two-Tailed Hypothesis Testing:
• The critical values will occur in both the lower and upper tails of the standard normal
curve.
• Use the standard normal probability distribution table to find za/2 (the t-value with an
area of a/2 in the upper tail of the distribution).
• The rejection rule is: Reject H0 if t < -ta/2 or t > ta/2
Calculations is:
pg. 13
xbar 2,980.15
s 571.160617
n 20
U0 59602.94
t(α/2, n - 1)
a/2 0.5
n 20
n-1 19
pg. 14
Q2. Is “other income” greater than the “total income from operations”. Test at
95% confidence interval?
Other Income Total income from
operations
Mar '22 1902 5739.72
Dec '21 1876.78 5494.94
Sep '21 1837.2 5463.06
Jun '21 1781.07 5297.53
Mar '21 1780.12 5080.59
Dec '20 1705.46 4955.16
Sep '20 1553.84 4765.31
Jun '20 1519.19 4454.99
Mar '20 1772 4551.23
Dec '19 1789.4 5912.70
Sep '19 1726.66 6103.05
Jun '19 1663.25 6277.72
Mar '19 1558.99 4266.06
Dec '18 1468.85 4985.66
Sep '18 1317.28 4636.98
Jun '18 1301.6 4506.84
Mar '18 1208.51 4076.36
Dec '17 1186.76 3820.50
Sep '17 1187.57 3727.11
Jun '17 1167.26 3640.01
TOTAL 31303.79 97755.52
x1 = 1565.15
x2 = 4887.77
The Other Income is not greater than the “Total income from the operations”.
pg. 15
What are Confidence Intervals in Statistics?
The confidence interval (CI) is a range of values that’s likely to include a population value
with a certain degree of confidence. It is often expressed as a % whereby a population mean
lies between an upper and lower interval.
The 95% confidence interval is a range of values that you can be 95% confident contains the
true mean of the population. Due to natural sampling variability, the sample mean (center of
the CI) will vary from sample to sample.
The confidence is in the method, not in a particular CI. If we repeated the sampling method
many times, approximately 95% of the intervals constructed would capture the true
population mean.
Therefore, as the sample size increases, the range of interval values will narrow, meaning
that you know that mean with much more accuracy compared with a smaller sample.
We can visualize this using a normal distribution (see the below graph).
pg. 16
For example, the probability of the population mean value being between -1.96 and +1.96
standard deviations (z-scores) from the sample mean is 95%.
Accordingly, there is a 5% chance that the population mean lies outside of the upper and
lower confidence interval (as illustrated by the 2.5% of outliers on either side of the 1.96 z-
scores).
To calculate the confidence interval, start by computing the mean and standard error of the
sample.
Remember, you must calculate an upper and low score for the confidence interval using
the z-score for the chosen confidence level (see table below).
0.90 1.645
0.95 1.96
pg. 17
Confidence Level Z-Score
0.99 2.58
Where:
For the lower interval score divide the standard error by the square root on n, and then
multiply the sum of this calculation by the z-score (1.96 for 95%). Finally, subtract the value
of this calculation from the sample mean.
For the above problem 95% confidence interval will be calculated as follows:
X (mean) = 86
Z = 1.960 (from the table above for 95%)
s (standard error) = 6.2
n (sample size) = 46
Therefore at 5% level of significance, the population mean is expected to lie within
the interval of: X + 1.96 standard error of mean.
pg. 18
Finding R Squared / The Coefficient of Determination
2.Calculation :
pg. 19
1. Test if income from operations is independent of Employee cost at a
90% confidence interval.
If either sample size is less than 30, then the t-table is used.
where:
For both large and small samples Sp is the pooled estimate of the common
standard deviation (assuming that the variances in the populations are similar)
computed as the weighted average of the standard deviations in the samples.
pg. 20
Please find the attached excel solution for reference
pg. 21
Q4. Find the trend of the total income from operation with respect to time.
The formula for trend analysis (change in amount) can be derived by subtracting the base
year amount from the current year amount. Mathematically, it is represented as,
Change in Amount = Current Year Amount – Base Year Amount
The formula for trend analysis (percentage change) can be derived by dividing the difference
between the current year amount and the base year amount by the base year amount.
Mathematically, it is represented as,
Percentage Change = [(Current Year Amount – Base Year Amount) / Base Year
Amount]
pg. 22
Change in Amount
Percentage change
60%
2000 50%
1500 40%
1000 30%
20%
500 10%
0 0%
Axis Title
Change in Amounts
(Trend of Total income from operation with time)
Percentage change
pg. 23
a) As per the Analysis, we have applied multiple test to get the less deviation from the mean
value, but somehow it got rejected.
b) Among these assumptions, the data must be randomly sampled from the population of
interest and the data variables must follow a normal distribution.
c) We calculated the values of mean, average, standard deviation, sample size etc
d) We used 2 sample z-test to evaluate the difference between the 2 groups and used the
formula for cumulative area for 90% confidence interval where level of significance is 5%
Problem 2
pg. 24
Q1. If open and High values are same
data.csv
n1 226
n2 226
s1 201.748
s2 206.138
0.90
0.05
Rejected Area 0
pg. 25
Z= (2637.15 - 2668.153) – (0)/sqr(201.748^2/226)+( 206.137^2/226)
Z= -30.9996/sqr(180.098+ 188.019)
Z = -30.9996/19.186
Z= -1.615
So, Here fails to reject Null Hypothesis. Z value does not lie in Rejected Area.
pg. 26
Q2. IF close and low values are the same?
For u1 = u2 = 0
For u1 – u2 !=0
And a = 0.05
As given,
Zc =1.645
0.90
0.50
Rejected Area 0
We can use the two-sample z-test to evaluate the difference between two groups:
pg. 27
or more formally:
Z= 30.502/sqr(352.657)
Z= 1.6242
So Now, Z value not lie in critical area reason, Hence we are not rejecting Null value Hypothesis.
pg. 28
Q3. Open and close values are the same
= sample variance
= sum of…
Χ = each value
= sample mean
n = number of values in the sample
Number of Values in the Sample (n)= n1= 226 and n2= 226
n1-1= 225, n2-1 = 225
X1 = 2637.15, X2 = 2632.34
Sample Var1 (open), By using above formula in Excel we got Var1 = (X1-Xbar1)2/(n1-
1) = 40702.27
pg. 29
Sample Var2 (Close), By using above formula in Excel we got Var2 =(X2 – Xbar2)2/(n2-
1) = 40901.15
Standard Deviation
By using above formula we have calculated Standard Deviation for Open and Close
Sd1 for Open σ 1 = 20351.13
Sd2 for Close σ 2 = 20450.57
Hypothesis test
Formula:
where and are the means of the two samples, Δ is the hypothesized difference
between the population means (0 if testing for equal means), σ 1 and σ 2 are the
standard deviations of the two populations, and n 1and n 2are the sizes of the two
samples which we have already calculated above.
pg. 30
We can use the two-sample z-test to evaluate the difference between two groups:
or more formally:
Please find the attached excel solution file for reference purpose
pg. 31
Q4. Low and high values are the same
For u1 = u2 = 0
For u1 – u2 !=0
And a = 0.05
As given,
pg. 32
Cumulative Area = Az = 0.95
Zc =1.645
0.90
0.05
Rejected Area 0
We can use the two-sample z-test to evaluate the difference between two groups:
or more formally:
Z= -66.32/sqr(177.66+ 188.00)
Z = -66.32/19.104
Z= -3.472
So here Z value is greater than critical value, hence we are rejecting Null value hypothesis
pg. 33
Q5. Write a descriptive report using the information gained by the analysis in
not more than 500 words
Provide your recommendations to an investor based on your analysis in
problems 1 and 2.
We have applied multiple analysis assumptions to get the less deviation in the
result.
You can never be too familiar with your data, and time spent examining it is
nearly always time well spent. Descriptive statistics and graphic displays can
also be the final product of a statistical analysis.
1.https://www.open.edu/openlearncreate/pluginfile.php/183122/mod_resource/
content/1/Report%20writingGRSQ61.pdf
2. https://www.moneycontrol.com/
3. www1.nseindia.com/products/content/equities/indices/historical_index_data.html
4. http://www.stat.ucla.edu/~cochran/stat10/winter/lectures/lect21.html
5. https://www.cliffsnotes.com/study-guides/statistics/univariate-inferential-tests/two-
sample-z-test-for-comparing-two-means
pg. 35