How To Establish Sample Sizes For Process Validation Using Statistical T...
How To Establish Sample Sizes For Process Validation Using Statistical T...
Before we begin, we must establish our definitions of risk and their associated confidence
level and reliability value. These definitions can and should vary based upon the
organizational needs. A good place to determine the risk level is failure mode and effects
analysis (FMEA), a systematic group of activities designed to recognize, document, and
evaluate the potential failure of a product or process and its effects. FMEA uses a risk
priority number (RPN), which is based on the frequency, detection, and severity of a
potential failure mode. The higher the RPN, the higher the risk. However, a low probability
of occurrence in conjunction with high severity and high probability of detection may still
necessitate the appropriate controls for high risk.
Table 1 depicts an example FMEA with the associated risk levels. Once the risk level has
been determined (low, medium, high), the appropriate confidence level and reliability can
be selected using Table 3. Figure 1 depicts the linkage between FMEA, risk, and confidence
level and reliability.
Figure 1: Risk process for determining the appropriate confidence level and reliability
When using statistical tolerance intervals, there is an assumption and expectation that the
data is normally distributed. There are many ways to determine if data is normally
distributed, including computer programs and spreadsheets. However, for small samples
(15 or fewer) normal probability plots can be used to assess normality.
Normal probability plots can be constructed to look for linearity when using one variable.
Normal probability plots provide a visual way to determine if a distribution is
approximately normal. If the distribution is close to normal, the plotted points will lie close
to a line. Normal probability plots are constructed by doing the following:
(For an example of how to create and analyze normal probability plots, refer to Chapter 4
of Practical Engineering, Process, and Reliability Statistics, ASQ Quality Press, 2014.)
The first step in using statistical tolerance intervals to determine sample sizes for process
validation is to calculate the mean and standard deviation from a small sample, which
should capture the expected range of variation that can reasonably be expected from the
process (different batches of materials, operators, etc.). To determine this minimum initial
sample for a one-sided test:
Where:
α = confidence
β = reliability
A pouch-sealing operation is considered high risk according the FMEA in Table 1. High
risk requires 95% confidence (0.05α) and 99% reliability (0.01β), as shown in Table 3. The
validation team decided to use a δ of 1.0σ. Table 4 requires an initial sample of 16. The 16
samples were found to be normally distributed, with a mean of 3.2 lbs and sigma of 0.23.
The pouch-sealing process specification requires a minimum pull strength of 2.5 lbs.
The calculated k value is 3.043. (The k value selected should be equal to or less than the
calculated value.) High risk requires 95% confidence (0.05α) and 99% reliability (0.01β),
for which Table 5 indicates a minimum of 35 samples. A minimum of 35 samples must be
used for the validation. Because the company typically uses three batches for process
validation, 12 samples will be randomly drawn from each of three batches for a total of 36
samples.
Three validation lots were manufactured. The 36 samples were found to be normally
distributed with a mean of 3.1 lbs and sigma of 0.21.
Since the calculated tolerance interval (3.037 lbs) is greater than the minimum pull
strength of 2.5 lbs, the validation passed. We can state that we are 95% confident that the
process is 99% reliable. In other words, we can claim that 95% of the time, 99% of the
products produced will exceed the minimum pull strength of 2.5 lbs.
Sample size does matter! Let Mark Durivage show you which method
for determining sample size is best for your process validation
efforts:
The calculated k value is 2.326. (The k value selected should be equal to or less than the
calculated value.) Medium risk requires 95% confidence (0.05α) and 95% reliability
(0.05β), which Table 6 indicates a minimum of 80 samples. A minimum of 80 samples
must be used for the validation. Because the company typically uses three batches for
process validation, 27 samples will be randomly drawn from each of three batches for a
total of 81 samples.
Three validation lots were manufactured. The 81 samples were found to be normally with a
sample mean of 994 g and sample standard deviation of 4.2.
Table 6: 95% Two-Sided Tolerance k Factors (Partial)
Recall the tolerance is 1000 g ±5 g (995 g to 1005 g). Since the lower tolerance interval
(984.5 g) is below the lower specification of 995 g, the validation has failed. It should be
apparent that there are two issues with the process. The first is that the process is not
centered, and second is that there is excessive variation. The validation team must first
center the process and then determine the best method(s) to reduce the variation within
the process.
The methods presented here have been used and successfully defended during audits and
inspections. I cannot emphasize enough the importance of proceduralizing (documenting)
the statistical methods and rationale your organization may use for process validation
activities. Table 2 provides an example of documenting and standardizing risk levels,
defect classifications, and defect definitions. Table 3 provides an example for documenting
the confidence level and reliability requirements for process validation activities. I also
recommend that validation and statistical technique procedures include the formulas as
well as fully worked examples, like those demonstrated above, to provide clarity and
guidance for those individuals writing, performing, executing, and approving process
validation activities.
Subsequent articles in this series will provide additional how-to examples for applying
risk-based sample size techniques to process validations in your organization. Previous
articles include: