0% found this document useful (0 votes)
45 views11 pages

DMBA103 Gumbo

The document contains information about a statistics course taken by a learner, including the course code, name, semester, and name of the learner. It includes 3 questions asking to help choose an investment proposal based on net present value and standard deviation, identify the plant a defective pipe came from based on production volumes and defect rates, and explain different statistical sampling techniques.

Uploaded by

chetz1991
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views11 pages

DMBA103 Gumbo

The document contains information about a statistics course taken by a learner, including the course code, name, semester, and name of the learner. It includes 3 questions asking to help choose an investment proposal based on net present value and standard deviation, identify the plant a defective pipe came from based on production volumes and defect rates, and explain different statistical sampling techniques.

Uploaded by

chetz1991
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 11

SESSION JUL/AUG 2021

PROGRAMME MASTER OF BUSINESS ADMINISTRATION (MBA)


SEMESTER I
COURSE CODE & NAME DMBA103-STATISTICS FOR MANAGEMENT
NAME OF THE LEARNER
ROLL NUMBER

1. Mr. Vijay, a retired government servant, is considering investing his money in two
proposals. He wants to choose the one that has higher average net present value and
lower standard deviations. The relevant data are given below. Can you help him
choosing the proposal?

Proposal A Net present Chance of the


value (NPV) possible
outcome of
NPV
1559 0.30
5662 0.40
9175 0.30

Proposal B Net present Chance of the


value (NPV) possible
outcome of
NPV
- 0.30
10050 0.40
5812 0.30
20584

Answer:

Proposal A
PV Probability NPV*Probability NPV– 5485 = (B) (B)(B)*
Probability
(A)
59 .3 467.7 -3926 4624042.8

62 .4 2264.8 177 12531.6

75 .3 2752.5 3690 13616100

5485 18252674

Standard Deviation = 4272.3

Proposal B

PV Probability NPV*Probability NPV – 5485 = (B) (B)(B)*


Probability
(B)
0050 .3 -3015 -15535 72400867.5

12 .4 2324.8 329 42771.6

584 .3 6175.2 15099 683940.3

5485 73127579

Standard Deviation = 11867.5

Decision: NPV of both the proposals are same. But Standard deviation of Proposal A is
less. So Proposal A must be chosen.

Q2
A manufacturing firm produces steel pipes in three plants with daily production
volumes of 500, 1000 and 2000 units respectively. According to past experience, it is
known that the fraction of defectives output produced by the three plants are
respectively 0.005, 0.008 and 0.010. If a pipe is selected from a day’s total production
and found to be defective, find out (a) from which plant the pipe comes (b) what is the
probability that it comes from first plant?

Answer:

lants Production(A) Faulty (A)*(B)= C (C/30.5) 100 = D


Production(B)

500 .005 2.5 8.2%

1000 .008 8 26.23%

2000 .010 20 65.57%

3500 30.5 100%

a) As plant 3 is having substantial production it can be said that if pipe is selected from
days total production and found defective will be from plant 3.
b) Probability of first plant is 8.2%

Que3.Explain the following:


Ans a. Probability Sampling and Non-Probability Sampling
Probability sampling provides a scientific technique of drawing samples from the
population. The technique of drawing samples is according to the law in which each unit
has a predetermined probability of being included in the sample.
The different ways of assigning probability are as follows:
i) each unit is assigned with the same chance of being selected
ii) sampling units are assigned with varying probability depending on priorities
iii) units are assigned with probability proportional to the sample size

Non-Probability Sampling
Depending upon the object of enquiry and other considerations a predetermined number of
sample units is selected purposely so that they represent the true characteristics of the
population.
A serious drawback of this sampling design is that it is highly subjective in nature. The
selection of sample units depends entirely upon the personal convenience, biases,
prejudices and beliefs of the investigator. This method will be more successful if the
investigator is thoroughly skilled and
experienced.

b. Systematic sampling and Cluster Sampling

Ans.Systematic sampling is defined as a probability sampling method where the researcher


chooses elements from a target population by selecting a random starting point and selects
sample members after a fixed ‘sampling interval.’

Cluster Sampling
The total population is divided into recognisable sub-divisions, known as clusters such that
within each cluster, units are more heterogeneous and between clusters they are
homogenous. The units are selected from each cluster by suitable sampling techniques.

c. Advantages of Probability Sampling over Non-probability Sampling


Ans.A major advantage with non-probability sampling is that—compared to probability
sampling—it's very cost- and time-effective. It's also easy to use and can also be used when
it's impossible to conduct probability sampling (e.g. when you have a very small population
to work with)
Probability sampling involves random selection, allowing you to make strong statistical
inferences about the whole group. Non-probability sampling involves non-random selection
based on convenience or other criteria, allowing you to easily collect data.

d• Census vs. sample


A census is the procedure of systematically acquiring and recording information about the
members of a given population
In the census method, every unit or object of the population is included in the investigation.
For example, in the census method, if we want to study the average annual income of 500
families in a given area, we must study the income of all the families in that area. When the
population is large, applying the census method would be difficult.
Sample
Sample is a finite subset of a population. A sample is drawn from a
population to estimate the characteristics of the population.
e• Primary and Secondary Data
Primary data is the one, which is collected by the investigator for the purpose of a specific
inquiry or study. Such data is original in character and is generated by a survey conducted
by individuals or a research institution or any organization

Secondary data

Any information, that is used for the current investigation but is obtained from some data,
which has been collected and used by some other agency or person in a separate
investigation, or survey, is known as secondary data. They are available in a published or
unpublished form.

In published form, secondary data is available in research papers, newspapers, magazines,


government publication, international publication, and websites.

SET-II

4. A report says that 80% of India’s females aged 15-59 are not currently engaged in
the workforce. A national agency has an opinion that this percentage may be even
more. To validate its opinion, the agency did a survey of randomly chosen 1200
females of the age group 15-59 from the different parts of the country and found 228
females working. Do the figures of the survey help the agency in validation of its
opinion?

Answer:
The survey shows that around 19% of females are working that matches the expectation of
80% not working women in age of 15-59. 
But this doesn't helps in real analysis that is the validation of 80% non-workingwomen. 
The real problem here lies in the age group chosen as the 15-21 the most people
irrespective of gender are not working anyhow in INDIA. And also the analysis lacks
information whether women not working temporarily is due to pregnancy. 
Also general as per analysis men and women both in age 55-59 are not working due to
health reasons or not having financial requirements. 
The analysis needs to be done in groups for example- age 21-30, 31-40 , 41-50 and 51-59.
So this will help more in logical analysis that would help to find out real number of women
that are not working. 
And also 1200 population is really not enough to represent the total population of the
country. The analysts must try to get more number of people for survey so that they can be
analysed. 
Also the location of the records matters, because due to mind set of various places , the
attitude for women working is not supportive enough. 
Like the ratio in metropolitan cities would be much more than, ones in rural ones like
punjab, bihar, etc. 
These qualitative factors affects how to people choose to run their households. 
Moreover distinction can be made between women who are educated and the ones who are
not. Even if uneducated women are doing odd jobs, they have very less chance to be
reported.
Therefore we can conclude that figures of the survey doesn't helps in validation of opinion
of the agency 

Q5. What is regression analysis? Explain steps of performing regression analysis in


detail?
The following data shows the yearly sales (in million Rs.) of A2Z- corporation for the
last nine years. Develop a regression model.

Year 2011 2012 2013 2014 2015 2016 2017 2018 2019
Sale 2.3 5.3 5.1 3.5 3.4 2.7 2.8 4.1 2.9
Answer:
Regression analysis mind set is most mathematically minded method is usually why people
shy away from it. This technique is meant for those companies that need in-depth, granular,
or quantitative knowledge of what might be impacting sales and how it can be changed in
one direction or the other, as necessary.
The regression model equation might be as simple as Y = a + bX in which case the Y is
your Sales, the ‘a’ is the intercept and the ‘b’ is the slope. You would need regression
software to run an effective analysis. You are trying to find the best fit in order to uncover
the relationship between these variables

Steps of performing regression analysis

Regression Analysis is an analytical process whose end goal is to understand the inter-
relationships in the data and find as much useful information as possible. According to the
book, there are a number of steps which are loosely detailed below.

1. Problem definition

The very first step is to, of course, define the problem we are trying to solve. Perhaps a
business question that needs to be answered or simply a prediction we want to make based
on some set of data. In this stage we must know target variable and the attributes we
presume affects the target variable. This would be later analysed to judge its credibility.

2. Analyse Data

The key is to have visual representations of our data so we can better understand the ‘inter-
relationships’ of the variables and likely so, the book I was referring to earlier, highly
recommends using visual tools to make the EDA(Exploratory Data Analysis) process easier.
Finding correlation is an important step as it allows us to roughly pick the attributes that
have a relation with the response variable. We are most likely to pick the attributes/variables
that show a positive correlation with respect to the target variable.

3. Model Selection

Based on the data , we are to pick a suitable model or regression equation. You may be
familiar with many such models like Linear Regression, Support Vector Machine, Random
Forest etc. The task in this step is to pick one that we assume will express the relationships
of our data in the best way possible. This assumption can be later accepted or refuted based
on analysis after fitting the model.

4. Model Fitting

For simplicity’s sake, lets consider Linear regression

Y = a + bX. We have the data, we have a model.

5. Model evaluation

 Final step is model evaluation — measuring and criticising exactly how good is the model
fitting the data points. We run the model on the test data and check to see how accurately it
was able to predicit the output values.

X Y XY X*X

1 2.3 2.3 5.29

2 5.3 10.6 28.09

3 5.1 15.3 26.01

4 3.5 14 12.25

5 3.4 17 11.56

6 2.7 16.2 7.29

7 2.8 19.6 7.84

8 4.1 32.8 16.81


9 2.9 26.1 8.41

45 32.1 153.9 123.55


X= ∑

X / n = 45/9 = 5

Y 32.1

Y =∑ n = 9 =3.567 ¿

¿

A = Y – b X = 3.567 – (.0235)(5) = 3.4495


∑ XY −n X Y
153.9 – 9(5)(3.567)
B= ❑
❑ = = .0235
123.55−9 ( 5 ) ( 5 )


x∗x−n XX

2020 = 3.4495 + .0235(10) = 3.6845

2021= 3.4495 + .0235(11) = 3.708

2022 = 3.4495 + .0235(12) = 3.7315

2023= 3.4495 + .0235(13) = 3.755

6. Four observers determine the moisture content of samples of a powder, each man
taking a sample from each of six consignments. Their assessments are given below:
observation consignments
1 2 3 4 5 6
1 9 10 9 10 11 11
2 12 11 9 11 10 10
3 11 10 10 12 11 10
4 12 11 11 14 12 10
Discuss whether there is any significant difference between consignments. (Useful
data: Ftab(5,15):2.96, Ftab(3,15):3.29)

Answer:

Four observers determine the moisture content of samples of a powder, each man taking a sample from
each of six consignments. Their assessments are givenbelow:

Consignments
1 2 3 4 5 6
1 9 10 9 10 11 11
2 12 11 9 11 10 10
3 11 10 10 12 11 10
4 12 11 11 14 12 10

Discuss whether there is any significant difference between consignments. (Useful data: F tab(5,15):2.96,
Ftab(3,15):3.29)

Solution

P=4

Q=6

N=p*q=24

p q 2
G
TSS=∑ ∑
2
y −
ij
i=1 j=1 n
2
257
=[ 92 +122 +… … … … .+102 +102 ] −
24

=30.958
p 2
1 2 G
SSA= ∑ y i −
q i=1 n

1[ 2 2 2 2
] 2572
= 60 +63 +64 +70 −
6 24

=8.79
q 2
1 G
SSB= ∑
p j=1
2
y j−
n
2
1[ 2 257
44 + 42 +39 + 47 +44 +41 ] −
2 2 2 2 2
=
6 24

=9.708

SSE= TSA-SSA-SSB=12.46

dof Sum of squares Mean of sum of Factors


squares
Factor A p-1 SSA MSA=SSA/(p-1) FA=MSA/MSE
Factor B q-1 SSB MSB=SSB/(q-1) FB=MSB/MSE
Error (p-1)(q-1) SSE MSE=SSE/(n-p)

dof Sum of squares


Mean of sum of Factors
squares
Factor A 3 8.79 2.93 FA=4.703
Factor B 5 9.708 1.94 FB=3.11
Error 15 12.46 0.623
F(5,15)=3.11 >Ftab(5,15)=2.96, we conclude that 6 experimentations are not equal

F(3,15)=4.703 >Ftab(3,15)=3.39, we conclude 4 observers are not equal and there is significant difference in
result.

You might also like