06 Analyze
06 Analyze
2
Analyze
Purpose
To pinpoint and verify causes affecting the key input and output
variables tied to project goals. (“Finding the critical Xs”)
Deliverables
• Data charts and other analyses that show the link between the
targeted input and process (Xs) variables and critical outputs (Y)
3
Analyze
Key Steps
1.Conduct value analysis: Identify value-add and non-value-add
steps
4
Analyze
Key Steps
5. Generate theories to explain potential causes: Use
brainstorming, FMEA, Cause & Effect diagrams, and other tools
identify potential causes of the observed effects
5
Analyze
Hypothesis Testing
● A statistical hypothesis is an assumption about a population
parameter
● If sample data are not consistent with the statistical hypothesis, the
hypothesis is rejected.
6
Analyze
Hypothesis Testing
Hypothesis Testing enables us to:
7
Analyze
Hypothesis Testing
8
Analyze
Hypothesis Testing
Example:
Suppose we wanted to determine whether a coin was fair and balanced. A null
hypothesis might be that half the flips would result in Heads and half, in Tails.
The alternative hypothesis might be that the number of Heads and Tails would
be very different.
Symbolically, these hypotheses would be expressed as
H0: P = 0.5
Ha: P ≠ 0.5
Suppose we flipped the coin 50 times, resulting in 40 Heads and 10 Tails. Given
this result, we would be inclined to reject the null hypothesis.
We would conclude, based on the evidence, that the coin was probably not fair
and balanced.
9
Analyze
Hypothesis Testing
Example:
Now, suppose our sample says his performance level is 88%, which is very close
to manager’s statement. We are not absolutely certain to accept or reject
manager’s statement.
What would be hypothesis statement?
10
Analyze
Hypothesis Testing – Steps
● State the hypotheses: This involves stating the null and alternative
hypotheses. The hypotheses are stated in such a way that they are mutually
exclusive. That is, if one is true, the other must be false.
● Formulate an analysis plan: The analysis plan describes how to use sample
data to evaluate the null hypothesis. The evaluation often focuses around a
single test statistic.
● Analyze sample data: Find the value of the test statistic (mean, proportion,
etc.)
● Interpret results: Apply the decision rules. If the value of the test statistic is
unlikely, based on the null hypothesis, reject the null hypothesis.
11
Analyze
Hypothesis Testing – Decision Errors
12
Analyze
Hypothesis Testing – Confidence Interval
13
Analyze
Hypothesis Testing – Confidence Interval
● The mean of a sample does not normally equate exactly to the mean
of the population from which the sample is taken
14
Analyze
Hypothesis Testing – Decision Rules
Two ways to describe decision rules:
● P- value:
● The strength of evidence in support of a null hypothesis is measured by the
P-value.
● If the data show that the null hypothesis has less than a 5% chance of being
right, we say it’s wrong.
● By convention, if the p-value is less than 0.05 (p < 0.05), we conclude that
the null hypothesis can be rejected.
15
Analyze
Hypothesis Testing – Decision Rules
Region of Region of Region of
Two ways to describe decision rules: rejection acceptance rejection
● Region of acceptance :
Critical 0 Critical
● The region of acceptance is a range of values. value value
● If the test statistic falls within the region of acceptance, the null hypothesis is
not rejected.
● The set of values outside the region of acceptance is called the region of
rejection.
● If the test statistic falls within the region of rejection, the null hypothesis is
rejected.
● In such cases, we say that the hypothesis has been rejected at the α level
of significance.
16
Analyze
Statistical Tools
Analysis
Analysis
Continuous
Continuous Discrete
Discrete
data
data data
data
One
OneSample
Sample Two
TwoSample
Sample 11--Proportion
Proportion 22--Proportion
Proportion
TTTest
Test TTTest
Test Test
Test Test
Test
Statistical parameter Between two DPU vs Target Between two binomial
vs Target statistical parameters Chi proportions
ChiSquare
Square
ANOVA
ANOVA Test
Test
To compare three or
To compare three or
more samples
more samples
17
Analyze
One Sample T Test
● Tells us whether or not a statistical parameter (average, standard
deviation, etc.) is different from a value of interest e.g. target
Example:
18
Analyze
Two Sample T Test
● The two sample t is used to test whether or not the means of two
samples are the same
Example:
The same automobile manufacturer has data on another supplier and wants to
compare the two:
Supplier 1:
mean = 599.55 SD = 0.62
95% CI for mean is 599.43 to 599.67
Supplier 2:
mean = 600.23 SD = 1.87
95% CI for mean is 599.86 to 600.60
19
Analyze
ANOVA (Analysis Of Variance)
Purpose:
When to use:
20
Analyze
ANOVA (Analysis Of Variance)
● In the statistical world, inputs are sometimes referred to as factors
Example:
21
Analyze
ANOVA (Analysis Of Variance)
● To tell whether the three or more options are statistically different,
ANOVA looks at three sources of variation:
22
Analyze
1 – Proportion Test
● 1 proportion test: When historic data (DPU) is available
Example:
BK500EI yari was running in cell2 from past 1 year with DPU of 8%. Later the
same model is moved to cell1 and they produced 400 units out of 210 units
passed. Calculate for statistical difference between these cell performance.
H0: DPU = 8%
Ha: DPU ≠ 8%
Test and CI for One Proportion
Test of p = 0.8 vs p not = 0.8 P value is less than .05.
Exact
Hence, Reject null
Sample X N Sample p 95% CI P-Value
hypothesis
1 190 400 0.475000 (0.425155, 0.525217) 0.000
23
Analyze
2 – Proportion Test
● 2 proportion test: Performs a test of two binomial proportions
Example:
A corporation's purchasing manager need to authorize the purchase of twenty new
photocopy machines. After comparing many brands in terms of price, copy quality,
warranty, and features, he has narrowed the choice to two: Brand X and Brand Y. The
determining factor will be the reliability of the brands as defined by the proportion
requiring service within one year of purchase. Because corporation already uses both of
these brands, he was able to obtain information on the service history of 50 randomly
selected machines of each brand. Records indicate that six Brand X machines and eight
Brand Y machines needed service. With this information, he has to choose which brand
to purchase.
24
Analyze
2 – Proportion Test
Test and CI for Two Proportions
Sample X N Sample p
1 44 50 0.880000
2 42 50 0.840000
Difference = p (1) - p (2)
Estimate for difference: 0.04
95% CI for difference: (-0.0957903, 0.175790)
Test for difference = 0 (vs not = 0): Z = 0.58 P-Value = 0.564
That is, the proportion of photocopy machines that needed service in the first
year did not differ depending on brand. As the purchasing manager, he need to
find a different criterion to guide the decision on which brand to purchase.
25