0% found this document useful (0 votes)
122 views15 pages

Understanding Stratification in Statistical Process Control: Harry B. Rowe September 14, 2012

Statistical Process Control

Uploaded by

kt
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
122 views15 pages

Understanding Stratification in Statistical Process Control: Harry B. Rowe September 14, 2012

Statistical Process Control

Uploaded by

kt
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Understanding Stratification in Statistical Process Control

Harry B. Rowe

September 14, 2012


Understanding Stratification in Statistical Process Control

“Stratification” can be defined as “the process of dividing things into layers” or “the state
of being divided into layers”.

In statistical process control (SPC), “stratification” refers to a situation in which the data
being charted does not represent a single process, but comes from two or more different
processes. Sometimes this is inadvertent. Sometimes it occurs due to a misunderstanding
of SPC principles. When it happens, it can be quite interesting.

To illustrate, I will provide a few examples and explain what is occurring in each case.

Suppose there are two identical machines sitting side by side on a factory floor, making
the same product. If we measure a critical dimension, we find that the products
manufactured by each machine exhibit similar variability, but the average dimension of
the products produced on machine 1 differs slightly from those produced on machine 2.

Both machines are stable, and an SPC chart of the output of either would show it to be in
a state of statistical control.

This situation is not unusual at all. Variability in manufacturing is frequently a


consequence of the design of a machine, while the average, or process mean, is a function
of operator setup of the individual machine.

In this example, machine 1 is producing parts with an average dimension of 31 mm and


the average on machine 2 is 30 mm. The standard deviation in each case is 0.3 mm. If we
plot the part measurements sequentially, we might see something like this.

© 2012 Harry B. Rowe www.rowequality.com 2


Understanding Stratification in Statistical Process Control

If the part specification limits were, say, 28 to 34, the difference in process means might
easily go unnoticed. If we plot the dimensions from the two machines in different colors,
we can see a difference.

© 2012 Harry B. Rowe www.rowequality.com 3


Understanding Stratification in Statistical Process Control

A histogram of the data shows a slightly bimodal distribution.

© 2012 Harry B. Rowe www.rowequality.com 4


Understanding Stratification in Statistical Process Control

Now let us suppose that a young quality engineer is assigned to implement an SPC
program for the product. She decides to use an X-bar and R chart with a sample size of 4,
taking samples every 15 minutes. Since there are two machines making only one part, she
asks the operators to take a sample of four parts every fifteen minutes, alternating
between the two machines. That is, on the hour take four parts from machine 1. At :15
after, take four parts from machine 2. At the half hour, take four parts from machine 1. At
:45 after, four parts from machine 2. And so on. The parts are measured and the results
recorded in a spreadsheet.

The next morning, our young engineer excitedly enters the data into the plant’s SPC
program. Having been trained in SPC, she looks first at the R chart.

© 2012 Harry B. Rowe www.rowequality.com 5


Understanding Stratification in Statistical Process Control

There is nothing extraordinary in the R chart, so she goes on to the X-bar chart where she
sees this.

© 2012 Harry B. Rowe www.rowequality.com 6


Understanding Stratification in Statistical Process Control

Goodness! Twenty of the twenty-five points are outside the control limits! Something is
horribly wrong. But the parts manufactured overnight all passed inspection! What can it
be? (We know, but our young engineer does not, that both the processes are in control
when viewed as separate processes.)

Suspecting that there may have been something unusual about the previous day, our
young engineer decides to try again. But this time, instead of alternating samples between
the two machines, she instructs the operators to take two parts from each machine every
fifteen minutes.

The next morning she enters the data in a new SPC chart and again examines the R chart.

© 2012 Harry B. Rowe www.rowequality.com 7


Understanding Stratification in Statistical Process Control

Nothing extraordinary there, so she looks at the X-bar chart.

© 2012 Harry B. Rowe www.rowequality.com 8


Understanding Stratification in Statistical Process Control

Whew! That’s more like it. All the points are within the control limits.

But wait a minute. There are no points outside the control limits, but there aren’t any
points even near the limits. All of the points are concentrated quite close to the process
mean. Statistically, nearly one-quarter of the points should fall outside the two sigma
zone. Some SPC programs will flag a pattern of 15 consecutive points inside the one
sigma zone as a violating run. There is still something wrong!

So again, we have a startling result. What is going on in each case?

Recall that in SPC, the control limits are based on an estimate of process standard
deviation. That estimate is based on within sample variability. In the X-bar and R chart,
the estimated process standard deviation is based on R-bar.

© 2012 Harry B. Rowe www.rowequality.com 9


Understanding Stratification in Statistical Process Control

(Some SPC programs estimate process standard deviation using grouped standard
deviation rather than R-bar. The result, however, will be very similar.)

In the first case, alternating samples between machines, each sample is drawn from a
single machine. R, the difference between the largest and smallest values in a sample,
will be representative of the variability of the machines. The estimated process standard
deviation will be close to the standard deviation of the parts produced by either machine.
Note that the distance between the control limits is approximately 0.87.

But since each sample represents a single machine, the individual X-bar values will be
close to the process mean of the machine from which the that particular sample is drawn.
The result is an X-bar chart in which the values alternate between a value near 31 and a
value near 30, while the grand mean falls at 30.5, the average of 30 and 31. The sample to
sample variation is much greater than the variation predicted by the machine variability,
resulting in lots of points outside the control limits.

In the second case, since each sample contains two parts from each machine, each R
value will include both the machine variability and the difference between the two
process means. This results in an estimate of process standard deviation that is quite
large. Note that in this case, the distance between the control limits is almost 2.0.

At the same time, the X-bar values cluster around the grand mean, which is still midway
between the two process means.

While these two situations seem contrived, I have seen process engineers (and not
necessarily young ones) make both of these mistakes.

In a much more common scenario, suppose the two machines both discharge into a
common bin. The SPC sample of four parts is subsequently taken from the bin.

Assuming the bin is large and the sampling is random, the probability of a sample
containing a particular number of parts from each machine is as follows:

Parts from
Machine Probability
1 2
4 0 1/16
3 1 4/16
2 2 6/16
1 3 4/16
0 4 1/16

On an X-bar chart, this will result in X-bar values clustering into five bands, or strata.
• One sixteenth of the X-bar values, those from samples with four values from
machine 1, will lie around the mean of machine 1.

© 2012 Harry B. Rowe www.rowequality.com 10


Understanding Stratification in Statistical Process Control

• Another one sixteenth of the X-bar values, those from samples with four values
from machine 2, will lie around the mean of machine 2.
• The majority of the X-bar values, the six-sixteenths of them corresponding to
samples with two values from each machine, will lie at the grand mean, halfway
between the two machines’ process means.
• Four-sixteenths of the X-bar values corresponding to samples with three values
from machine 1 and one from machine 2, will lie at the midpoint between the
grand mean and the mean of machine 1.
• Four sixteenths of the X-bar values corresponding to samples with three values
from machine 2 and one from machine 1 will lie at the midpoint between the
grand mean and the process mean of machine 2.

The separation of the values into bands or strata leads to the name “stratification”.

In the example we have been using, that X-bar chart would look like this.

© 2012 Harry B. Rowe www.rowequality.com 11


Understanding Stratification in Statistical Process Control

It takes a bit of imagination to see the different strata in this chart. If, however, the two
process means are changed to 30 and 40, leaving the process standard deviation the same,
the stratification pattern becomes very clear. (The number of points plotted has also been
increased to 60.)

© 2012 Harry B. Rowe www.rowequality.com 12


Understanding Stratification in Statistical Process Control

The R chart associated with this data also exhibits a characteristic pattern as shown
below.

© 2012 Harry B. Rowe www.rowequality.com 13


Understanding Stratification in Statistical Process Control

Note that most of the R values fall in a band around 10, corresponding to the difference
between the two process means. These are all the samples containing at least one value
from each machine. The remaining points, those corresponding to samples containing
values from only one machine or the other, fall in a band much nearer to zero. These few
points are the ones that are truly representative of the process standard deviations of the
two machines.

In this case, the SPC program has identified 20 violating runs, corresponding to the
number of times there were an excessive number of consecutive points on one side of the
center line.

While the examples used here were X-bar and R charts using n=4, the same effects will
appear with different sample sizes, or with X-bar and S charts. As sample sizes increase,
however, so will the number of strata, making detection more difficult. The number of

© 2012 Harry B. Rowe www.rowequality.com 14


Understanding Stratification in Statistical Process Control

strata will increase linearly with n. If the number of different processes represented
increases, however, the number of strata increases geometrically.

The main lesson to be learned from this phenomenon is that one should be very careful to
avoid plotting data from more than one process on any given SPC chart. Whether it
occurs inadvertently or through a misguided effort to simplify sampling or charting, it is
always a bad idea.

(The data and graphs in this article were produced using the R
statistical environment, free software downloadable from www.r-
project.org. For SPC charts, use was made of the “qcc” package,
an extension to R developed by Dr. Luca Scrucca of the
Department of Economics Finance and Statistics of the Universita
degli Studi di Perugia in Perugia, Italy.)

© 2012 Harry B. Rowe www.rowequality.com 15

You might also like