Statistical Mixing
Statistical Mixing
Statistical Mixing
Assessment of Blend
Homogeneity
Camilla Madsen
LYNGBY 2002
IMM-PHD-2002-99
ATV Erhvervsforskerprojekt EF 767
IMM
IMM-PHD-2002-99
ATV Erhvervsforskerprojekt EF 767
ISSN: 0909-3192
ISBN: 87-88306-15-1
c Copyright 2002 by Camilla Madsen.
This thesis has been prepared at the department Informatics and Mathematical
Modelling (IMM), Technical University of Denmark (DTU), and at the phar-
maceutical company Novo Nordisk A/S in partial fulfilment of the require-
ments for the industrial Ph.D. degree within the Mathematical Ph.D. Program
at DTU.
Camilla Madsen
iii
iv
Acknowledgements
I also would like to thank colleagues in the Quality Support Statistics depart-
ment, Novo Nordisk A/S and at IMM, DTU for a good and pleasant scientific
and social environment. Águsta Haflidadottir as well as my two officemates
Dorte Rehm and Dorte Vistisen deserves special acknowledgements for many
pleasant and constructive hours.
Last but not least I am grateful to my fiancee, my family and my friends for
their support, patience and encouragement during the hard parts of this work.
v
vi
Summary
In this thesis the use of various statistical methods to address some of the prob-
lems related to assessment of the homogeneity of powder blends in tablet pro-
duction is discussed.
Methods to evaluate external factors, that may have an influence on the content
in blend samples, as e.g. sampling device, have been presented. However,
the content in samples is also affected of internal factors to the blend e.g. the
particle size distribution. The relation between particle size distribution and
the variation in drug content in blend and tablet samples is discussed.
A central problem is to develop acceptance criteria for blends and tablet batches
vii
viii
Further, it is shown how to set up parametric acceptance criteria for the batch
that gives a high confidence that future samples with a probability larger than
a specified value will pass the USP three-class criteria.
Properties and robustness of proposed changes to the USP test for content uni-
formity are investigated by the use of simulations, and single sampling accep-
tance plans for inspection by variables that aim at matching the USP proposal
have been suggested.
Resumé (in Danish)
Metoder til at vurdere ydre faktorers betydning for indholdet af aktivt stof i
prøver fra blandingen er blevet diskuteret. En ydre faktor kan f.eks. være det
redskab, prøverne udtages med. Indholdet af aktivt stof i prøver fra blandingen
samt i tabletterne afhænger også af indre faktorer som f.eks. partikel størrelses-
fordelingen. Sammenhængen mellem partikel størrelsesfordeling og variation
i indholdet af aktivt stof i prøver fra blandingen og tabletbatchen er blevet
diskuteret.
ix
x
Det vises, hvordan et parametrisk accept kriterium for blandingen eller tablet-
batchen, der giver en fastlagt (stor) sikkerhed for at fremtidige stikprøver vil
have mindst en fastlagt sandsynlighed for godkendelse under et USP three-
class krav.
Egenskaber og robusthed ved ændringsforslag til USPs test for Content Unifor-
mity er belyst ved simuleringer, og enkeltprøvningsplaner for inspektion ved
kontinuert variation, der tilstræber at matche USP-forslaget er blevet forslået.
Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . vi
Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
I 1
1 Background 3
2 Introduction 7
xi
xii CONTENTS
2.3.1 Organizations . . . . . . . . . . . . . . . . . . . . . . 13
3.2.3 Example . . . . . . . . . . . . . . . . . . . . . . . . 25
4 Conclusion 31
II Included papers 35
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . 54
4.1 Conclusion . . . . . . . . . . . . . . . . . . . . . . . 59
5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
CONTENTS xv
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
8 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210
1 Purpose . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
2 Experimental . . . . . . . . . . . . . . . . . . . . . . . . . . 219
Part I
1
Chapter 1
Background
The background for the Ph.D. project is the general requirements to the phar-
maceutical industry of scientific based documentation of the methods used for
validation of processes.
3
4 C HAPTER 1. BACKGROUND
products. See [2] for a detailed overview over the compliance and science of
blend uniformity analysis.
The decision caused FDA (The Food and Drug Administration) to reexamine
and modify its policies on blend uniformity and sampling techniques. The
resulting policies are based on the assumption that current technology provides
a means to consistently collect minute representative samples from much larger
static powder beds.
At the moment the tablet press could be viewed as the ultimate sampling de-
vice, because the whole batch is being sampled at this stage of the process.
However it is not allowed solely to rely on this when demonstrating blend uni-
formity.
1.1. O UTLINE OF THE T HESIS 5
This thesis is organized into two parts. The first part is four chapters and con-
tains a description of tablet production and the relevant regulatory affairs as
well as a discussion of the results presented in part two.
Part two is six appendices with five articles and manuscripts to articles as well
as a case studies. The appendices deals with different aspects of assessment of
blend homogeneity and as such they represent the main part of the thesis.
In Appendix B two methods to assess the overall (total) variation in the blend
are discussed.
The effect of the particle size distribution on the distributed content in blend
and tablet samples are discussed in Appendix E.
Finally acceptance criteria for blend and tablet samples are discussed in Ap-
pendix C and Appendix D.
The appendices should be read together with the discussion in Chapter 3 as the
results are put in a larger perspective in this chapter.
6 C HAPTER 1. BACKGROUND
Chapter 2
Introduction
The technical area of this thesis is applied statistics and the field of application
is the pharmaceutical industry, specifically tablet production. Thus, the content
of the thesis is in the borderland between the areas of applied statistics and
pharmaceutical science.
A tablet consists of one or more active ingredients; the drug substance, and
some filling materials which have the main purpose to give the tablet suitable
physical, biological and chemical properties. For example assure that the drug
is released after a certain amount of time in the body or to assure the breaking
strength so the tablet does not break into pieces before it is consumed by the
patient.
All raw materials are in a powdered form. As the particle size distribution is
7
8 C HAPTER 2. I NTRODUCTION
Raw materials
Sieving
Blending
Granulation
Sieving
Compression
very important for a number of tablet properties as well as for the variation in
content in the tablets the raw materials are initially sieved to eliminate lumbs.
Then the raw materials are mixed in a blender until the blend is considered
homogeneous. Many different types of blenders exists. The differences may
e.g. be due to the physical presentation or due to the mechanical principles
used. Further, some blenders may also be used for an eventual granulation of
the blend. The purpose of granulation is to obtain particles of more uniform
size. This can be done either by breaking larger particles but most often by
combining smaller particles to larger particles.
The granulate may be sieved again to eliminate lumbs. Then the blend is com-
pressed at the tablet press. Some products are produced by direct compression
of the blend without granulation.
2.1. P RINCIPLES OF TABLET PRODUCTION 9
Figure 2.2 shows a row of punch stations on the tablet press. A and B are upper
and lower stamp respectively. C is the place where the blend is led to the press.
Beneath this place the matrix (D) passes at the same time as the dosing takes
place. Then the filled matrices and the corresponding punches passes two rolls
(E and F) that presses the punches together. Immediately after this the tablet is
ejected and pushed away.
After the compression at the tablet press the tablets may be coated for example
to protect the active ingredient from decomposition due to light or moisture or
to facilitate packaging, mask unpleasant taste or smell etc.
10 C HAPTER 2. I NTRODUCTION
The current state of the art regarding sampling technology is a device referred
to as a sampling thief. Many different types of sampling thieves have been
developed. However, in general a sampling thief consists of two concentric
tubes. The inner tube is solid except for one or more chambers that allow for
sample collection. The outer tube is hollow and contains openings that can
align with the chambers in the inner tube. A handle, located at the top of the
thief is used to rotate the inner tube within the outer tube in order to open or
close the thief. A sample is collected by inserting the closed thief into the
blend. Then the handle is rotated in order to open the thief allowing the sample
to flow into the sampling chamber in the inner tube. Then the thief is closed
and pulled out of the blend.
Figure 2.3 shows examples of two different types of sampling thieves. In these
thieves the sampling chamber is located at the tip of the thief rather than on the
side as described above.
However, the two “Golden Rules” of sampling [5] states that a powder should
be sampled when in motion and that the whole of the stream of powder should
be taken for many short increments of time. Any sampling methods which does
not adhere to these rules should be regarded as a second-best method liable to
lead to errors.
Collecting blend samples by the use of sampling thieves violates these Golden
2.3. R EGULATORY A FFAIRS 11
Figure 2.3: Two examples of sampling thieves. In these thieves the sampling
chamber is located in the tip of the thief.
Rules, and the result is the risk of sampling error or bias. The presence and
the size of such sampling errors and bias depends on factors such as sampling
device, sampling technique, blend formulation, blender size, sample location
and size of the collected sample. For a more detailed discussion of these factors
see e.g. [2].
Figure 2.4 shows a boxplot for the results of samples from blend and the cor-
responding tablets for seven batches produced at Pilot Plant, Novo Nordisk
A/S. Each batch corresponds to a different value of label claim (LC). It is seen
that for all batches the mean content in blend samples is larger than the mean
content in the tablet samples. This could be a typical result of sampling bias.
Pharmaceutical companies often sell their products in several parts of the world.
In order to do that the companies have to comply with all the requirements
covering the countries or areas in which they sell their products. In order to
reduce the need to duplicate the testing carried out during the research and de-
velopment of new medicines efforts are done at making greater harmonisation
in the interpretation and application of technical guidelines and requirements
for product registration. This work is particularly organized via The Interna-
tiona Conference on Harmonisation, ICH, which is a joint initiative between
12 C HAPTER 2. I NTRODUCTION
the regulatory authorities of Europe, Japan and the United States and experts
of the pharmaceutical industry in the three regions. Until this hamonization is
completed the pharmaceutical companies have to comply with various sets of
requirements and in this situation the American legislation is very important
because for many pharmaceutical companies the American market is one of
the most important markets. Therefore the focus in this thesis is on rules and
requirements in the American legislation.
2.3.1 Organizations
The main actors in the U.S. are the FDA, USP, PDA and PQRI. In the following
these organizations are introduced.
In deciding whether to approve new drugs, FDA does not itself do research,
but rather examines the results of studies done by the manufacturer. A part of
this investigation is to assess whether the new drug produces the benefits it is
supposed to without causing side effects that would outweigh those benefits
[6].
ensure the quality of medicines for human and veterinary use. Manufacturers
must meet these standards to sell their products in the U.S. The standards are
officially recognized standards of quality and authoritative information for the
manufacturing and use of medicines and other health care technologies [7].
A number of working groups are established. The ultimate goal of these work-
ing groups is to develop scientific knowledge that will result in appropriate
changes to regulatory policies to make them less burdensome [9].
2.3. R EGULATORY A FFAIRS 15
In 1996 FDA proposed some changes to 21 CFR Part 210 and Part 211. Re-
garding blend uniformity testing the most important change is a new paragraph
211.110(d) that specifically require blend samples to approximate the dosage
size of the product for blend uniformity analysis. Thus, this proposed amend-
ment would for the first time legally oblige the pharmaceutical industry to con-
duct blend uniformity analysis using unit dose testing.
1
CGMP regulations are based on fundamental concepts of quality assurance: (1) Quality,
safety, and effectiveness must be designed and built into a product; (2) quality cannot be in-
spected or tested into a finished product; and (3) each step of the manufacturing process must
be controlled to maximize the likelihood that the finished product will be acceptable. [10]
16 C HAPTER 2. I NTRODUCTION
Chapter 3
17
18 C HAPTER 3. R ESULTS AND DISCUSSION
Regarding the patients using the final tablets it is of less relevance if the varia-
tion between the doses is due to large, medium or small scale variation in the
blend. In this relation the magnitude of the total variation in the batch of tablets
is relevant. The total variation between the content of the tablets is closely re-
lated to the total variation in the blend. Therefore for practical purposes it
is relevant to control the total variation in the blend. In Appendix B the three
scales of homogeneity discussed in Appendix A are related to overall measures
of blend homogeneity. The measures of overall homogeneity are compared by
relating them to an acceptance criterion for blend uniformity.
Acceptance criteria for both blend and tablets are usually assessed assuming
a normal distribution of content in the samples. However, actual distributions
of particle sizes are often seen to be skewed. This might have an effect on the
shape of the distribution of content in blend and tablet samples. Therefore, in
Appendix E the effect of a skewed particle size distribution on the distribution
of content in the samples is discussed.
Keeping in mind, that for example a skewed particle size distribution can in-
fluence the distribution of the content in the blend and tablet samples, the
statistical properties of acceptance criteria for blend and tablet samples are
discussed under the normal assumption in Appendix C and Appendix D. Ap-
pendix C gives background and preliminary considerations to the analysis in
Appendix D. Further, the acceptance criteria analysed in Appendix C is an ear-
lier version of the corresponding acceptance criteria analysed in Appendix D.
In the following the results and discussions of these are given in more detail.
Rather, the ultimate sampling unit must be created, at the time of sampling, by
means of some sampling device. The size and form of the units depend on the
particular device employed, how it is used, the nature, condition, and structure
of the material, and other factors.
However, this definition of a unit is convenient and conceptual and further for
practical purposes the size of a sample do not differ much from the size of a
tablet produced from the blend. Thus, a unit defined in this way is in agreement
with the tabletting process and therefore makes it less complicated to compare
homogeneity in the blend to homogeneity in the tablets.
When a unit has been defined the next problem is to decide where to sample
and how many samples to collect to be able to estimate a variance that is rep-
resentative for the blend homogeneity. In this relation it should be mentioned
that as an example the total amount of drug in a 360 kg batch (drug and filling
material) could be as little as 0.5 kg, and the weight of a sample less than f.ex.
80 mg. With these orders of magnitude and in case of batch inhomogeneity a
variance estimated between samples sampled close to each other differs from
a variance estimated from samples collected far apart. Hence, for exploratory
purposes it is relevant to assess different types of variances, i.e. variances esti-
mated from samples sampled closely and variances based on samples sampled
far apart.
In case of blend homogeneity the large and the medium scale variation (mea-
suring differences between the mean content in respectively layers and areas
within a layer) are zero. The small scale variation is an inherent variation in
the blend and therefore it is not zero in case of homogeneity. However, in case
of homogeneity the small scale variation is independent of in which layer of
the batch it is estimated.
It should be noted that in the literature several examples exist of models taking
into account correlation between the samples measured as a function of the
distance between the spots in the blend from which the samples are collected.
(See e.g. [13]). However, these models are generally not used in practice. With
future techniques as e.g. NIR (near infra red) techniques correlation as a func-
tion of distance may be used in relation to image analysis methods. However,
NIR technology is not commonly introduced in production yet, and the focus
of this thesis is to develop and improve methods to assess uniformity within
the scope of current sampling technology, the sampling thieves.
The first method is to use the total variation from the analysis of variance
(ANOVA) table corresponding to the hierarchical model for the variation in
the batch as an estimate for the overall variation. The other method is to use
the total variation on a randomly collected sample from the blend as an esti-
mate of the overall variation in the blend.
The difference between the estimates of the overall variation obtained with
each of these two methods depends on the sampling plan used to collect the
samples on which the estimates are based.
3.1. VARIANCES AS A MEASURE OF HOMOGENEITY 21
If a patient only uses one randomly sampled tablet for example when taking
an aspirin to relieve the pain of a headache, he/she will experience a deviation
from LC corresponding to the variance on a randomly chosen tablet. However,
if the patient uses more than one tablet as part of an ongoing treatment, the total
variation in drug content experienced may depend on the way the tablets are
collected. Are they randomly chosen from the batch or do they all come from
the same part of the batch etc. The tablets in a single package will in general not
be sampled from a balanced, hierarchical sampling plan as in Appendix A and
Appendix B, and even if the tablets by accident were sampled in accordance
with a hierarchical sampling plan, the "sampling plan" would be unknown.
Hence, regarding the total variation experienced by a patient using more than
one tablet neither method of estimating the overall variation is ideal.
Another criteria for deciding which estimate to use as a measure for the overall
variation in the blend is to consider the properties of the acceptance criteria for
the blend. Acceptance criteria are discussed in more detail in Section 3.3. In
case of uncorrelated samples, which corresponds to the model in Appendix A
with no variation between layers and a hierarchical sampling plan with only
one replicate per area, the two measures for the overall variation in the blend
are identical. Otherwise the measure of the total variation corresponding to the
ANOVA table in general leads to more efficient and less ambiguous properties
of the acceptance criteria for blend homogeneity.
In the thesis two different approaches to assess homogeneity and factors that
may influence homogeneity have been introduced.
The first approach described in Appendix E leads to a model of the best ob-
tainable blend and content uniformity derived from the distribution of particle
radii. However, the best obtainable homogeneity is a ’theoretical’ limit that
holds for all batches with the same distribution of particle radii, and therefore
this approach does not lead to information on the actual homogeneity of a given
batch. The second approach described in Appendix A introduces two methods
to assess the homogeneity of a specific batch.
Particle size distributions are often seen to be skewed and it has been shown
in Appendix E that this feature affects the distribution of content in blend and
tablet samples.
Beside the variation in particle mass the variation in dose content is affected
by variation in the number of particle in a sample. For a homogeneous blend
with a random scattering of particles over the blend it is demonstrated that for
a given distribution of particle sizes the variation in the distribution of absolute
doses is proportional to the average number of particles in the samples (tablets).
Further, it is shown that the larger the average number of particles in the sample
the closer the distribution of content in the samples is to a normal-distribution.
3.2. M ETHODS TO ASSESS HOMOGENEITY AND FACTORS THAT MAY
INFLUENCE HOMOGENEITY 23
For spherical particles an explicit relation between the variation in the relative
doses and the mean and the coefficient of variation in the distribution of particle
radii is given.
GENMOD
However, depending on the experimental design and the assumptions made, the
method presented in this section can also be used to assess factors influencing
sample error (variation). For example the method can be used to test if one
sampling thief leads to larger variation between replicate samples than another
thief.
24 C HAPTER 3. R ESULTS AND DISCUSSION
Using the thief leading to the smallest variation between replicates will reduce
the risk of incorrectly rejecting a batch because of suspicion of inhomogeneity.
Other external sources of variation (as e.g. the sampling procedure) may have
a similar effect on the small scale variation. Examples of such analysis are
given in Appendix F.
GLM
When a Generalized Linear Model has been applied a General Linear Model
can be applied specially to assess the medium and the large scale variation as
well as factors that may influence these types of variation. In case the variation
between replicates is found not to be constant throughout the blend, this should
be corrected for in the analysis by introducing appropriate weights
Under the assumption that the variation between replicate samples is indepen-
dent of the layer and that there is no interaction between the factors in the
model, two statistical methods to describe blend homogeneity have been in-
vestigated.
The analysis showed that the two methods are approximately equally good
at detecting inhomogeneity. That is, an analysis according to the aggregated
model can be used to detect inhomogeneity even in situations with large scale
variation (variation between layers).
The most important difference between the two types of analysis is that when
inhomogeneity is declared according to the aggregated analysis the result does
not reveal whether this inhomogeneity is due to large or medium scale variation
in the batch. However, the hierarchical model explicitly assesses respectively
large and medium scale variation.
3.2. M ETHODS TO ASSESS HOMOGENEITY AND FACTORS THAT MAY
INFLUENCE HOMOGENEITY 25
The power of the respective tests as a function of the standard deviation corre-
sponding to respectively variation between layers and variation between areas
within a layer are shown in Figure 3 to Figure 5 in Appendix A. The standard
deviation corresponding to respectively variation between layers and variation
between areas within a layer are measured relatively to the standard deviation
corresponding to replicates.
Finally, for the given design (and for σrep = 1) the power of the test of a thief
effect was shown in Figure 6 in Appendix A. The test of the thief effect is
independent of σlayer and σarea,hi .
With the given design (and for σrep = 1) a difference greater than 1.5 in mean
content in samples from two different thieves will be detected with a probabil-
ity of at least 0.95.
3.2.3 Example
As an example of how the robustness and power of a test can be used to evaluate
the test result the test of the small scale variation in Appendix F is discussed in
relation to the results from Appendix A.
The estimated variance components, σrep2 , are given in the Table 3.1. The esti-
mates in the table are multiplied with 1000 compared to the results in Case F.
The reason is that in Appendix A the unit is % LC rather than fraction LC.
26 C HAPTER 3. R ESULTS AND DISCUSSION
Batch 1 Batch 1
Thief 1 2 3 1 2 3
2
σrep,top 0 0 1 0 3 8
2
σrep,bottom 1 7 17 4 35 98
2
It is seen from the three cases in the table where σrep,top 6= 0 that the estimate
2 2
of σrep,bottom is between 10 and 20 times as large as σrep,top . At first a dif-
ference that large may be expected to be found significant. However, for the
design in Appendix A it was showed that σrep in one layer should be more than
4.5 times as large as σrep in an other layer for the test to show significance with
a high probability. This corresponds to one variance estimate being 4.52 ≈ 20
times larger than the smallest. This result is valid for the balanced design with
12 pairs of replicates in each batch and analysed with a model with one factor
(layer).
The design in Case F is not balanced, it has only 6 pairs of replicates in each
batch and two factors included in the model (layer and thief). Hence, with
2
an estimate of σrep,bottom 2
being 10 to 20 times as large as σrep,top it is not
surprising that the test shows no significance - however, the estimated differ-
2
ence between σrep,top 2
and σrep,bottom may still be real and give practical and
valuable insight to the sampling process.
pendix A could also be taken as basis for a mixing index (see also [15]). In this
case the theoretical limit of homogeneity is the various variance components
being zero. It should be noted that a variance component being zero serves
more as a theoretical value for homogeneity. It is not useful as an acceptance
criteria. Further, it should be noted that in a homogeneous blend the large and
medium scale variation are negligible. However, the small scale variation is an
inherent variation that is non-zero even in a random blend.
The discussion of the acceptance criteria and the derivation of expressions for
acceptance probabilities in the appendices is performed under the assumption
that individual sample values may be represented by independent, identically
distributed variables and that the distribution of sample results may be de-
scribed by a normal distribution.
Thus, when the samples are tablets selected from a batch, the assumption cor-
responds to assuming that the overall distribution of dose content in the batch
may be represented by a normal distribution and that individual dosage units
are selected at random from the dosage units in the batch. When the samples
are blend samples, the assumption analogously corresponds to assuming that
the overall distribution of such potential samples from the blend may be repre-
sented by a normal distribution and that samples are taken at randomly selected
positions in the blend. Thus, the model will not be adequate when the overall
distribution in the blend (or batch) is bimodal or multimodal corresponding e.g.
to stratification, when the distribution is skewed, e.g. as a result of deblending,
or when the distribution has heavier tails than the normal distribution, e.g. as
a result of imperfect mixing (clustering) or of using drug particles that are too
large for the intended dosage (see Appendix E).
Under the assumption that the distribution of sample results may be described
by a normal distribution, the following results regarding acceptance criteria
have been derived in Appendix C and Appendix D.
However, regulatory practice for pharmaceutical products has most often speci-
fied criteria for sample values rather than providing specifications for the entity
under test. As therefore control and monitoring procedures in tablet production
are based upon samples from the blend, or from the batch of tablets, there is
an inherent uncertainty concerning the actual dispersion in the blend or batch
being sampled. This uncertainty is partly due to sampling and partly due to the
(in)homogeneity of the blend/batch.
The statistical tool used to link sample result and acceptance criteria to the
actual dispersion in the blend or batch is an OC-curve (or surface) that shows
the probability of passing the acceptance criteria as a function of e.g. fraction
nonconforming or true mean and standard deviation in the blend or batch, as
an OC-curve (or surface) reflects the effect of such sampling uncertainty.
In Appendix D statistical tools and methods that can be used to determine how
assurance depends on sample size (i.e. how to set up a criterion that gives a
3.3. A NALYSIS OF ACCEPTANCE CRITERIA 29
certain assurance with an appropriate sample size) are described and discussed
for simple acceptance criteria (sample standard deviation and coefficient of
variation as well as an USP criterion that includes a test by attributes).
In the literature changes to the procedure in USP have been proposed. In gen-
eral the proposed test procedure is similar to the parametric criteria mentioned
above. In the thesis simulations have been performed both for normally and
log-normally distributed content in the tablets. The simulations revealed that
the test is relatively robust to deviations from the normal distribution. This is
relevant as such deviations for example is seen in case of low-dose tablets with
large particle radii as also discussed in the thesis.
Finally single sampling acceptance plans for inspection by variables that aim
at matching the USP proposal have been suggested.
30 C HAPTER 3. R ESULTS AND DISCUSSION
Chapter 4
Conclusion
In this thesis the use of statistical methods to address some of the problems
related to assessment of the homogeneity of powder blends in tablet production
is discussed.
With this definition of a unit, variances between blend samples can be used as
a measure of the (in)homogeneity in a blend.
In the thesis a hierarchical (or nested) as well as an aggregated model has been
introduced to describe (in)homogeneity. The hierarchical model specifically
takes into account deblending in a specified direction. Both the hierarchical
and the aggregated model can be used to detect inhomogeneity. However, in
case of inhomogeneity the hierarchical model provides the most detailed infor-
mation on the type of inhomogeneity.
Regarding the end users of the tablets the total variation between the tablet
content is relevant. This variation is closely related to the overall variation in
the blend. Two methods to determine the overall variation in the blend have
31
32 C HAPTER 4. C ONCLUSION
been suggested. One of the methods (estimating the overall variation from the
ANOVA table) leads to less ambiguous properties of acceptance criteria for
blend uniformity. However, none of the methods truly describes the variation
experienced by a patient, as this variation depends on how the tablets in the
package are selected from the batch of tablets.
It has been shown that particle size distribution may have an influence on the
distribution of content in blend and tablet samples. Specially for low-dose
tablets it is important to keep the particle radii small (and the number of parti-
cles large) to minimize the variation in content in the blend and tablet samples.
Generalized Linear Models (GENMOD) can be used to assess factors that may
have an influence on a variance (e.g. the effect of layers on the replicate vari-
ance). General Linear Models (GLM) can be used to assess factors that may
have an influence on the mean content in blend samples, e.g. a sampling de-
vice leading to sampling bias. For a specific sampling design the power and
robustness of the statistical tests related to the GENMOD and GLM models
have been assessed.
A central problem is to develop acceptance criteria for blends and tablet batches
33
Regulatory practice related to tablet production are most often criteria specify-
ing limits for sample values rather than for the actual homogeneity in the blend
or batch of tablets. This leads to an inherent uncertainty concerning the homo-
geneity in the blend or tablet batch. This uncertainty is partly due to sampling
and partly due to (in)homogeneity of the blend or batch.
In the thesis it is shown how to link sampling result and acceptance criteria
to the actual quality (homogeneity) of the blend or tablet batch. Further it is
discussed how the assurance related to a specific acceptance criteria can be
obtained from the corresponding OC-curve.
Also in the thesis it is shown that a three-class attribute criteria as e.g. in USP
24 for content uniformity in essence controls the proportion of tablet samples
outside the inner set of limits for individual observations. For normally dis-
tributed observations this is identical to control the combination of batch mean
and standard deviation, i.e. a parametric acceptance criterion. It is shown how
to set up parametric acceptance criteria for the batch that gives a high confi-
dence that future samples with a probability larger than a specified value passes
the USP three-class criteria.
In the literature changes to the procedure in USP have been proposed. In gen-
eral the proposed test procedure is similar to the parametric criteria mentioned
above. In the thesis simulations have been performed both for normally and
log-normally distributed content in the tablets. The simulations revealed that
the test is relatively robust to deviations from the normal distribution. This is
relevant as such deviations for example is seen in case of low-dose tablets with
large particle radii as also discussed in the thesis.
34 C HAPTER 4. C ONCLUSION
II
Part II
Included papers
35
A
Paper A
37
39
1 Introduction
In this appendix the robustness and power of statistical methods to assess blend
homogeneity is discussed. The statistical methods assess various factors that
may alter batch homogeneity and therefore one could also say that the statisti-
cal methods assess blend inhomogeneity. For convenience the term homogene-
ity as well as the term inhomogeneity will be used in relation to the statistical
models. The discussion is based on simulations of blend samples in SAS [17].
However, before simulations of the blend samples are made it is necessary to
have a definition and a model for blend homogeneity.
All samples are simulated in accordance with the hierarchical model. How-
ever, both the aggregated and the hierarchical model are used in the statistical
analysis of the simulated samples.
1
The terms large, medium and small scale variation should be taken loosely. They are meant
as a convenient way to distinguish between different types of (in)homogeneities in a batch. By
large scale variation is meant variation between samples collected so far apart that the distance
between them should be measured on a ’large scale’, e.g. top, middle and bottom layer of the
batch. Similarly small scale variation means variation between samples collected so close to
each other (e.g. replicate samples) that the distance between them should be measured on a
’small scale’. However, it is very important to understand that large and small scale does not
refer to the size of the the respective variations. Thus, the large scale variation could actually be
smaller than the variation measured on the small scale.
40
In the following two different statistical models of batch homogeneity are in-
troduced: the aggregated and the hierarchical models.
The variation between samples collected in accordance with this sampling plan
41
(
i = 1, .., 12
Yij = µ + Ai + Eij , . (1)
j = 1, 2, 3
Yij is the content of a sample, µ the mean content in the batch (including bias
due to sampling and chemical analysis), Ai the effect corresponding to the
area from which the sample is taken, and Eij is the error term corresponding
2
to the j’th replicate from the i’th area. Further Ai ∈ N (0, σarea,agg ) and
2 2
Eij ∈ N (0, σrep ). The variation, σarea,agg , can be thought of as an aggregate
of the large and the medium scale variation in the batch.
If all samples are collected in random order the design is a simple single-factor
design. The experimental design, the analysis, assumptions and the statistical
terminology related to such a design is given by Montgomery [19].
Figure 2 shows an example of a batch divided into three layers. Within each
layer samples are taken from the points of a triangle, as well as from the centre,
thus four areas within each layer are chosen. By changing to the next level, the
triangle is rotated by 180o . In the left corner of the figure a top view of the
sampling scheme is shown.
42
Fig. 2: A batch divided in three layers: top, middle and bottom. Variation be-
tween the mean content in the layers represents large scale variation. Variation
between the mean content in the four areas within a layer represents medium
scale variation. To the left a top view of the sampling scheme is seen.
Variation between the mean content in the three layers can also be thought of
as large scale variation in the vertical direction. The aggregated model (1) does
not explicitly account for this type of variation or inhomogeneity. However, it
is accounted for in the following model:
i = 1, 2, 3
Yijk = µ + Li + A(L)j(i) + Eijk , j = 1, 2, 3, 4 . (2)
k = 1, 2, 3
Li is the effect from layer i, A(L)j(i) is the effect from the j’th area within
layer i and Eijk is the effect from the k’th replicate from area j in layer i.
2
Further it is assumed that Li ∈ N (0, σlayer 2
), A(L)j(i) ∈ N (0, σarea,hi ) and
2 2 2
Eijk ∈ N (0, σrep ). In contrast to σarea,agg , σarea,hi only accounts for the
medium scale variation in the blend, i.e. the variation between the areas within
a layer.
2
When there is no difference between layers (i.e. σlayer = 0) the aggre-
2
gated model (1) and the hierarchical model (2) are identical, i.e. σarea,agg =
2 2
σarea,hi = σarea . The hierarchical model corresponds to the definition of ho-
mogeneity used in the cases in Appendix F.
In practice when sampling replicates from the same area the replicates are sam-
pled as close as possible without overlapping. The reason is to avoid spots with
a high degree of deblending caused by the insertion of the thief when collecting
previous samples. Thus the variation between replicates includes variation be-
tween content in neighbouring spots within an area, i.e. variation in the batch
measured on a very small scale.
In a situation where deblending has occurred in one layer the variation be-
tween the content in neighbouring spots within an area may differ from the
corresponding variation in other layers, hence the small scale variation may
depend on the layer.
Further, the static pressure in the blend is lower near the surface than near
the bottom. This difference in static pressure may affect the sampling results
(see e.g. Berman [2]). One example would be the difference in static pressure
causing the variation between the content in replicate samples being larger
when sampled from the bottom layer than when samples from the top layer
(see e.g. Appendix F).
Neither the aggregated model (1) nor the hierarchical model (2) accounts for
this type of variation. However, with the following extension to the hierarchical
model (2), small scale variation is accounted for:
44
i = 1, 2, 3
2
Eijk ∈ N (0, σrep,i ), j = 1, 2, 3, 4 . (3)
k = 1, 2, 3
The full hierarchical model (the hierarchical model (2) with the extension (3))
is able to account for three different types of variation in the batch: variation
measured on a large scale (the variation between layers), a medium scale (vari-
ation between areas within a layer) and the small scale variation (variation be-
tween replicates within an area). This ability makes it a very versatile model.
The drawback is that the model includes six parameters (µ, σlayer 2 2
, σarea,hi ,
2 2 2
σrep,top , σrep,middle and σrep,bottom ), hence it is very difficult in a clear way
to analyse and present results with all parameters varying at a time. There-
fore in each series of simulated batches some of the parameters are kept fixed
depending on the aim (hypothesis) of the model under analysis.
Thus, the full hierarchical model (model (2) with the extension (3)) was used to
generate a number of datasets all with a total of 36 samples equally distributed
over 3 layers, 4 areas within each layer and 3 replicates within each area. The
sampling plan is shown in the first three columns of Table 1, page 52. On
the whole the number of samples corresponds to the number of samples in Ap-
pendix F as well as the sampling plan suggested by PQRI [18]. In each analysis
500 batches are simulated for each combination of the parameter values.
In the analysis the sample content is measured in percent of label claim, LC.
Thus µ = 100 corresponds to LC. However, as the overall mean content in the
batch is balanced out in the calculations the results are independent of the value
of the overall mean content. Further the results do only depend on the ratios
σlayer /σrep and σarea,hi /σrep (or σarea,agg /σrep ), and therefore the value of
the variance components is measured relatively to σrep 2 (or σ
rep,top in a model
with extension (3)), i.e. either σrep = 1 or σrep,top = 1.
Two types of analyses are made in the following sections. The first type of
analysis is described in Section 3. It focuses on factors influencing the mean
content of a sample corresponding to the terms Ai , A(L)j(i) and Li in the
45
aggregated model (1) and the hierarchical model (2). This corresponds to an
analysis of the large and the medium scale variation. The other type of anal-
ysis, described in Section 4 focuses on factors influencing the term Eijk in
extension (3) of the full hierarchical model. Thus this is an analysis of the
small scale variation.
In this section the simulated batches (some of which contain variation between
layers) are analysed in accordance with both the aggregated model (1) and
the hierarchical model (2). The robustness and the power of these methods is
assessed as a function of the variation between layers, σlayer , and the variation
between areas within a layer, σarea,hi .
Finally the effect of including an external factor (sampling thieves) in the anal-
yses of the mean content of the collected samples is discussed.
The tests are conducted using the GLM procedure in SAS. In simulations of
2 = 1.
samples for the analysis µ = 100 and σrep
46
where hP i
12
3× i=1 (ȳi. − ȳ.. )2
2
Sarea,agg = (5)
12 − 1
and P12 P3
2 i=1 j=1 (yij − ȳi. )2
Srep = . (6)
12 × (3 − 1)
ȳi. is the mean of the three replicates in area i and ȳ.. is the overall mean.
2
Under the aggregated model with σarea,agg = 0 the test statistic follows a
2
F (12 − 1, 12 × (3 − 1)) distribution, and therefore σarea,agg is declared signifi-
cantly different from 0 at the 5 % level when Zarea,agg > F (11, 24).95 = 2.18.
Under the aggregated model (1)
2
Sarea,agg 2
/(3 × σarea,agg 2 )
+ σrep
2 /(σ 2 )
∈ F (12 − 1, 12 × (3 − 1)). (7)
Srep rep
As the samples were simulated from a model including the variation between
layers, the samples from the same layer are centered around the mean content
in that layer. Hence the averages ȳi. s are correlated, and therefore the under-
lying assumption from the aggregated model (1), that the mean content in the
2
areas are uncorrelated, is violated. Strictly spoken this means that σarea,agg
2 2
from model (1) does not exist in a context with σlayer and σarea,hi .
Under the hierarchical model (2) - which corresponds to the model from which
the samples are simulated - the overall variation between areas is separated into
47
the variation between layers, σlayer2 , and the variation between areas within a
2
layer, σarea,hi . In relation to the specific sampling plan from which the samples
2
are sampled (i.e. four areas within each layer) the mean value of Sarea,agg is
2 24 2 33 2 2
E[Sarea,agg ] = 11 σlayer + 11 σarea,hi + σrep .
0.953
0.860
0.767
s 0.674
t
d
l 0.581
a
y 2
e
r 0.487
0.394
0.301
0.208
0
0 1 2 0.115
std area
0.05
Fig. 3: The probability of declaring the variation between areas under the ag-
2
gregated model, σarea,agg , significant at the 5% level as a function of σarea,hi
and σlayer . This is also called the power of the test. The probability is
low when both σarea,hi and σlayer are small. When σarea,hi is larger than
1.25×σrep the probability is more than 0.95 no matter the value of σlayer . Sim-
ilarly when σlayer is larger than 3.5 × σrep the probability of testing σarea,agg
significant is more than 0.95, irrespective of the value of σarea,hi
This is also seen from Figure 3. The figure shows level curves for the power of
the test in (7) as a function of σarea,hi and σlayer . The power of the test is the
2
probability of declaring σarea,agg significant (here at the 5% level). The prob-
ability is found for each set of parameter values by testing simulated samples
from 500 batches.
More explicit it is seen that, when σarea,hi is larger than 1.25 × σrep the prob-
2
ability of finding σarea,agg > 0 is at least 0.95. When σarea,hi is smaller the
2
probability of declaring σarea,agg significant depends on the value of σlayer .
The plot shows that because the variation between layers is not specifically ac-
counted for in the aggregated model (1), variation between layers in the batch
is identified as variation between areas. When the variation between layers,
2
σlayer , is more than 3.5×σrep the probability of testing σarea,agg > 0 is at least
24 2 33 2 2
0.95. In conclusion the test in 7 really measures 11 σlayer + 11 σarea,hi + σrep
2 .
relatively to σrep
2
Slayer
Zlayer = 2 (8)
Sarea,hi
and
2
Sarea,hi
Zarea,hi = 2
, (9)
Srep
where hP i
3
4×3× i=1 (ȳi.. − ȳ... )2
2
Slayer = (10)
3−1
and hP i
3 P4
3× i=1 j=1 (ȳij. − ȳi.. )2
2
Sarea,hi = (11)
3 × (4 − 1)
and P3 P4 P3
2 i=1 j=1 k=1 (ȳijk − ȳij. )2
Srep = (12)
3 × 4 × (3 − 1)
ȳ... is the overall mean, ȳi.. is the mean of the i’th layer and ȳij. is the mean of
the j’th area in the i’th layer.
49
The critical intervals for the test statistics are Zlayer > F (2, 9).95 = 4.26 and
Zarea,hi > F (9, 24).95 = 2.30
2
Under the hierarchical model the variation between areas, σarea,hi , is corrected
2
for the contribution from layers, σlayer . Hence, in this case the test of the
2
variation between areas, σarea,hi 2
= 0, is independent of the value of σlayer .
2
However, the test statistic, Zlayer , depends on Sarea,hi and therefore the test of
2 2
σlayer is expected to depend on the value of σarea,hi .
0.952
0.856
0.760
s 0.664
t
d
l 0.568
a
y 2
e
r 0.472
0.376
0.280
0.184
0
0 1 2 0.088
std area
0.05
2
Fig. 4: The power of the 5% level test of σarea,hi in the hierarchical model
(2). When σarea,hi is approximately 1.5 × σrep the probability of declaring
significance is more than 0.95. The probability is independent of σlayer .
2
For the test at the 5 % level Figure 4 shows the power of the test of σarea,hi as
a function of σarea,hi and σlayer .
As expected it is seen that now that the variation between layers is accounted
for in the model the test of the variation between areas does no longer depend
of the value of σlayer . In agreement with Figure 3, Figure 4 shows that when
2
σarea,hi is approximately 1.5 × σrep , the probability of declaring σarea,hi sig-
nificant is at least 0.95.
50
0.952
0.856
0.760
s 0.663
t
d
l 0.567
a
y 4
e
r 0.471
0.375
2
0.279
0.182
0
0 2 4 6 8 0.086
std area
0.05
2
Fig. 5: The probability of the 5% level test of σlayer in the hierarchical model
(2). The power is dependent of σlayer and σarea,hi .
2
Figure 5 shows the power of the test of σlayer as a function of σarea,hi and
2
σlayer . As expected the power of the test of σlayer depends on the value of
2
σlayer
σarea,hi , namely in essence of 2
1+3×σarea,hi
.
If the factors describing the variation between layers had been simulated with
three fixed (predefined) levels the test statistic would follow a non-central F-
distribution, but the overall picture would not be changed.
So far the homogeneity of the blend has been modelled by models taking into
account the large and the medium scale variation. Blend samples results are
however not entirely a consequence of the batch homogeneity. Sometimes it is
relevant to investigate whether other factors such as sampling thief or sampling
technique have an effect on the content in a sample. This is the situation in
Appendix F. Also Berman [23] has assessed the significance of factors as
51
A number of batches in which two different thieves were used for sampling
were simulated. The experimental design is shown in Table 1. The model is
i = 1, 2, 3
j = 1, 2, 3, 4
Yijkl = µ + Li + A(L)j(i) + tk + Eijkl , , (13)
k = 1, 2
l = 1, 2, 3
P
where tk is the effect of thief k, and tk = 0. Apart from that the assumptions
are identical to those from the hierarchical model (2).
For practical reasons the following parameters are fixed: σlayer = 2, σrep = 1
and µ = 100.
2
The test statistic for the hypothesis tk = 0, σlayer 2
= 0 and σarea,hi = 0 are
now
2
Sthief
Zthief = 2
, (14)
Srep
2
Slayer
Zlayer = 2 , (15)
Sarea,hi
and
2
Sarea,hi
Zarea,hi = 2
. (16)
Srep
2
Sthief denotes the mean sum of squares corresponding to variation between
thieves.
The critical value for the test statistic corresponding to thieves is Zthief >
F (1, 23).95 = 4.3. The test statistic corresponding to layers and areas within a
layer are unchanged from the hierarchical model (1).
As there were only three replicate samples in each area, the use of two different
thieves could not be completely balanced within an area, and therefore the
partitioning of the variation is not straightforward. However, as the design
allows for an estimation of the variation between replicates and as the use of
the two thieves has been balanced over areas and layers, it is seen from (14)
that the test of the effect of sampling thief, tk , is independent of σlayer and
σarea,hi .
0.961
0.883
0.805
t 0.726
h
i 3
e
f 0.648
d
i
f
. 0.570
2
0.492
0.414
1
0.335
0 2 4 6 8 0.257
std area
0.05
Fig. 6: The power of the 5% level test of the thief effect, tk , i.e. the probability
of declaring the thief effect significant.
Figure 6 illustrates the power of the 5 % level test of tk . The power is plotted as
2
a function of σarea,hi , and the effect of the thieves. The effect is the difference
between the mean of samples collected with one thief and the mean of samples
collected with the other thief.
54
The power of the test of the thief effect is independent of σarea,hi and a dif-
ference between samples collected with each of the two thieves of larger than
1.5% of LC has a probability of at least 0.95 of being detected with the given
design.
2
From expression (16) the test of σarea,hi is expected to be independent of both
2
σlayer and tk . Expression (15) shows that the power of the test of σlayer is
2
independent of tk but not of σarea . Hence, plot of the power of these two tests
are similar to Figure 4 and Figure 5.
3.4 Conclusion
Under the assumption that the homogeneity of real batches in this way may
consist of both a large and a medium scale variation the power of two statistical
methods to assess batch homogeneity has been found.
The power of the respective tests are shown in Figure 3 to 5. It should be noted
that σlayer and σarea,hi are measured relatively to σrep .
Finally, for the given design the power of the test of a thief effect was assessed
in Figure 6. In these simulations the variation between layers were held fixed
(σlayer = 2). However, the test of the thief effect is independent of σlayer and
σarea,hi .
With the given design a difference greater than 1.5 in mean content in samples
from the two thieves has a probability of at least 0.95 of being detected.
The method presented in this section can be used to assess factors with influ-
ence on either the small scale variation or eg. the size of the variation due to the
sampling, depending on the experimental design and the assumptions made.
In this analysis the factor under consideration is the effect of layers: Layers
having an effect on the small scale variation means the special kind of inhomo-
geneity where for example the variation between neighbouring spots within an
2
area is smaller in the top layer, σrep,top 2
, than in the bottom layer, σrep,bottom .
56
However, differences between the variation between replicates in the top layer,
2
σrep,top 2
, and the variation between replicates in the bottom layer, σrep,bottom ,
may also be due to the fact that it is more difficult to handle a long thief in the
bottom of the blend than handling a short thief in the top layer. Also the static
pressure in the bottom of a batch is greater than near surface, which may result
in different types of sampling bias [2] for example differences in the variation
between replicates. These are examples of layers having an effect on the size
of the sampling error (variation).
In this analysis it is assumed that the variation between replicates represents the
small scale variation. The variation due to the chemical analysis is assumed to
be neglectable.
For the analysis of the small scale variation samples from a number of batches
were simulated according to the hierarchical model (2) with extension (3).
P3
i = 1, 2, 3
2 k=1 (yijk
− ȳij. )2
Srep,ij = , j = 1, 2, 3, 4 (17)
3−1
k = 1, 2, 3
2 2
Srep,ij ∈ σrep,i χ2 (2)/2, (18)
see e.g. Montgomery [19], and therefore standard methods for analysis of
linear models for normally distributed observations are not applicable to assess
2
hypotheses concerning σrep,i .
57
Bartlett and Kendall [24] have suggested approximate tests for the analysis
of variance heterogeneity. However, utilizing the relation between the χ2 -
distribution and the gamma-distribution, models for variance heterogeneity
may be investigated using the theory for the so-called generalized linear mod-
els [25].
The hypothesis
2 2 2
σrep,top = σrep,middle = σrep,bottom (19)
2
ln(σrep,i ) = µ∗ + αi , (20)
Figure 7 shows the level curves for the power of the test as function of ln(σrep,middle )
and ln(σrep,bottom ). It is seen that the level curves have ellipsoid shape. The
reason is that the power depends on
" #2 " #2
2
ln(σrep,middle 2
) + ln(σrep,bottom ) 2
ln(σrep,middle 2
) − ln(σrep,bottom )
+3
2 2
= (ln(σrep,middle ) + ln(σrep,bottom ))2 + 3 [ln(σrep,middle ) − ln(σrep,bottom )]2
= C.
(22)
58
4
0.954
0.862
0.770
l
o
g
( 0.677
v
a
r
r 2
0.585
e
p
m 0.493
i
d
)
0.401
0.309
0.216
0
0 2 4 0.124
2
Fig. 7: Level curves for the power of the 5% level test for a dependency of σrep
on layers. The power is plotted against σrep,middle and σrep,bottom .
0.954
0.862
6 0.770
s
t 0.677
d
r
e 0.585
p
b 4
o
t
. 0.493
0.401
0.309
2
0.216
2 4 6 8 0.124
2
Fig. 8: Level curves for the power of the 5% level test for a dependency of σrep
on layers plotted against σrep,middle and σrep,bottom .
59
The shape of the level curves is similar for other values of σrep,top .
4.1 Conclusion
For samples simulated from model (2) with the extension (3) it has been shown
that for the 5 % level test the variation between replicates σrep,i has to be 4.5
times larger in one layer than the variation between replicates in another layer
for the effect of layers to be declared significant with a probability of at least
0.95.
Depending on the experimental design and the assumptions made the method
presented in this section can also be used to assess factors influencing sample
error (variation). For example the method can be used to test if one sampling
thief leads to larger variation between replicate samples than another thief.
5 Conclusion
As both large, medium and small scale variation may occur in real blend
60
batches all batches and samples were simulated from this hierarchical model.
However, the analysis of the simulated samples were conducted both in accor-
dance with an aggregated model that as a measure of inhomogeneity uses an
aggregate of the medium and the large scale variation, and in accordance with
a hierarchical model explicitly taking into account the large and the medium
scale variation. In essence the aggregated model relates to the acceptance cri-
teria suggested by FDA [11] and PQRI [18].
The power of the tests of the homogeneity according to respectively the aggre-
gated and the hierarchical model is found. Basically the probability of declar-
ing inhomogeneity is similar for analysis corresponding to each of the two
models. The power of the respective tests are given in more details in Figure 3
to Figure 5.
In the specific design used in this appendix (with σrep2 = 1) the 5% level test
for the thief effect has a probability of at least 0.95 of declaring significance
when the difference between the mean content in samples collected with each
of the two thieves is larger than 1.5.
arate analysis of the variation between replicate samples. It was shown that
when the variation between replicates is more than 4.5 times as large as the
variation between replicates in another layer, probability of detecting an effect
of layers is at least 0.95.
For all tests in this appendix it should be noted that the power of the tests
corresponds to the specific experimental plan used. For other experimental
plans the power of the corresponding test can be found after same principles as
in this appendix.
Further, when using the methods to assess factors with influence on the mean
content (see Section 3) it is assumed that the variation between replicates is
the same. In the analysis in this appendix only batches with constant variation
between replicates were simulated for this analysis. However, when analysing
the mean content in samples from real batches there is a risk that the variation
between replicates is not constant. The method presented in Section 4 can be
used to verify this assumption. Therefore in practice a suitable procedure is
to use the methods presented in this appendix in reversed order, i.e. first to
analyse the variation between replicates and if this variation is constant, then
continue with the analysis of the mean content. If the variation between repli-
cates is not the same, a so-called weighted squares of means method should be
used (see e.g. [19]).
62
B
Paper B
Comprehensive measures of
blend uniformity
63
65
1 Introduction
The discussion is exemplified by the first stage of a procedure for blend vali-
dation suggested by PQRI [18]. However, the principles in the discussion of
interpreting the total variation in the blend would be the same had a differ-
ent acceptance criteria been chosen, e.g. the acceptance criteria recommended
by FDA [11] or the standard prediction interval (SDPI) method suggested by
PDA [22].
dard deviation divided by the average) for the samples should be less than or
equal to 5.0% and all individual results should be within 90.0% - 110.0 % of
the mean result (of all samples). If these two criteria are met the first stage of
the suggested procedure is passed.
Fig. 1: The probability of passing the first stage of the PQRI acceptance criteria
for blend uniformity validation [18]. The probability is plotted as a function of
the total standard deviation in the blend.
this appendix.
Besides a slight variation due to the simulation procedure the plot shows a
slim, smooth curve indicating that in the situation of uncorrelated, identically
(normally) distributed samples, there exists an unambiguous relation between
the uniformity (total variation) of the blend and the probability for the batch
to pass the acceptance criteria. The question is how this relation is affected by
the interpretation of the total variation in the blend.
In the analysis in this appendix it should be noted that the values of the variance
components are no longer measured relatively to the small scale variation as in
Appendix A. All variance components are measured absolutely. For practical
purposes the small scale variation (variation between replicates from the same
area) has been fixed, σrep2 = 1, in these calculations and simulations. The
results in this appendix are only valid for the absolute value of the variance
components on which the calculations are based.
Table 1: Most of the simulations in this appendix are based on 12 samples from
each batch. The distribution of the samples in the batch appears from the table.
68 Paper B
As mentioned above the first stage of the procedure suggested by PQRI [18]
recommends to assay one sample per area in the batch. Samples should be
collected from at least ten different areas. In the simulations discussed in this
appendix samples from twelve different areas were collected. This sampling
plan does not specifically refer to layers in the batch. Therefore, as a first ex-
ample batches with no variation between layers and a fixed standard deviation
corresponding to replicates, σrep = 1, are simulated. With no variation be-
tween layers the hierarchical model (2) in Appendix A from which the batches
are simulated, is identical to the aggregated model (1) in Appendix A, hence,
2
σarea,hi 2
= σarea,agg 2
= σarea .
Two different principles of determining the total variation in the batch is pre-
sented in the following.
One way of defining the total variation in the batch is to use the expression for
the total variance obtained from the analysis of variance table corresponding to
an analysis of sample results obtained from the specific sampling plan, in this
case the plan sketched in Table 1. The model for this analysis is
Yij is the content of a sample, µ the mean content in the batch, Ai the effect
corresponding to the area from which the sample is taken, and Ei is the error
term also including the variation due to sampling. As only one sample from
each area is collected the terms Ai and Ei have identical indices. Further
2
Ai ∈ N (0, σarea,agg 2 ). The analysis of variance table
) and Ei ∈ N (0, σrep
corresponding to this model is given in Montgomery [19].
In this case where one sample is collected from each of twelve areas the anal-
ysis of variance (ANOVA) table is
69
ANOVA Table
Source of Sums of Degrees of Expected mean square
variation squares freedom (df) (EMS)
2 2
Between areas SSarea,agg 12 - 1 σarea,agg + σrep
2
Error SSrep 0 σrep
Total SStotal 12 - 1
As the terms in the ’sums of squares’ column are additive the total variation in
2
the blend, σtotal , assessed under this scheme is estimated from
2 SSarea,agg + SSrep
σtotal =
dftotal
dfarea,agg × M SEarea,agg + dfrep × M SErep
=
dftotal
2 2 )
11 × (σarea,agg + σrep
=
11
2 2
= σarea,agg + σrep . (2)
Another definition of the total variation in the blend is to consider the variance
2
on a randomly sampled unit. In relation to model (1) this variance is σsample =
2 2
σarea,agg + σrep . In this situation with one replicate per area and no variation
between layers, the total variation in the blend determined from the ANOVA
table is identical to the variance on a randomly sampled unit from the blend.
Figure 2 shows the level curves for the OC-surface of the PQRI acceptance
criteria as a function of the mean content and the total standard deviation in the
blend, σsample = σtotal , determined from the expression (2).
The figure shows that when the total standard deviation is less than approxi-
mately 3%, the probability of passing the first stage of the PQRI criteria is at
least 0.95. Further, it should be noted that even batches with a mean content
relatively far from LC (µ = 100) have a relative high probability of accep-
70 Paper B
10
0.950
0.850
8
0.750
s 0.650
t
d 6
b 0.550
a
t
c
h 0.450
4
0.350
0.250
2
0.150
mu
pass
Fig. 2: Level curves for properties (OC-surface) of the first stage of the pro-
cedure for blend validation suggested by PQRI [18]. The OC-surface is plot-
ted against the mean content in the blend, µ, and the total standard devia-
q in the blend in a situation with no variation between layers, σtotal =
tion
2
σarea,agg 2 . Under model (1) the expression for the total standard de-
+ σrep
viation determined from the ANOVA table is identical to the expression for
the total standard deviation on a randomly sampled unit from the blend, i.e.
σtotal = σsample . The standard deviation corresponding to replicates is fixed,
σrep = 1.
71
tance. The reason is that the PQRI criteria allows for sampling bias of content.
Allowing the mean content of samples to deviate from LC, reduces the risk of
rejecting homogeneous batches with a mean content close to LC in cases of
sampling bias. This is in contrast to the USP criteria for tablets which besides
uniformity also controls deviation from LC (see Appendix D and Appendix C).
Further it is seen that the level curves are rather smooth and almost linear and
almost independent of the mean content. In these simulations the level curves
2 +1
depend weekly on the mean content through CV = σtotal µ = σareaµ . Thus the
test is essentially a test on CV even though it also includes limits for individual
observations.
In this situation with fixed variation between replicates and no variation be-
tween layers, there exists an unambiguous relation between the probability of
passing the PQRI test and the total standard deviation in the batch. This is also
seen from Figure 3 which is a sectional plane of the OC-surface in Figure 2
for µ = 100. From superimposing Figure 3 on Figure 1 it is also seen that
the total standard deviation in the batch obtained from combining the variation
among areas and the replicate variation in accordance with expression (2) is
identical to the the total standard deviation among 12 uncorrelated, identically
(normally) distributed samples in Figure 1. This is not surprising as the areas
from which each of the 12 samples in model 1 is sampled are uncorrelated.
2
In this section batches with large scale variation, σlayer , (and fixed small scale
2
variation, σrep = 1), but no medium scale variation, σarea,hi , are considered.
72 Paper B
With three layers and four samples from each layer a model for the content in
a sample taking into account only large scale variation is
(
i = 1, 2, 3
Yij = µ + Li + Eij , . (3)
j = 1, 2, 3, 4
Yij is the content of a sample, µ the mean content in the batch, Li the effect
corresponding to the layer from which the sample is taken, and Eij is the error
2
term also including the variation due to sampling. Further Li ∈ N (0, σlayer )
2
and Eij ∈ N (0, σrep ).
ANOVA Table
Source of Sums of Degrees of Expected mean square
variation squares freedom (df) (EMS)
2 2
Between layers SSlayer 3-1 4σlayer + σrep
2
Error SSrep 3(4-1) σrep
Total SStotal 12 - 1
2 SSlayer + SSrep
σtotal =
dftotal
dflayer × M SElayer + dfrep × M SErep
=
dftotal
2 2 + 9σ 2
8σlayer + 2σrep rep
=
11
8 2 2
= σlayer + σrep . (4)
11
2
According to model (3) the variation on a sample from the blend is σsample =
2 2
σlayer + σrep . Thus, with the given sampling plan and under model (3) the
2
expression for the total variation in the batch according to Table 3, σtotal , and
the expression for the variation on a randomly sampled unit from the blend,
2
σsample , are not identical. The variation on a sample from the blend is al-
ways the sum of the variance components corresponding to the given model,
whereas the expression for the total variation derived from an ANOVA table is
only a sum with all coefficients equal to one in a few cases, depending on the
corresponding sampling plan.
As the dotted line in Figure 4 is more steep than the circled line it appears
that in a batch with large scale but no medium scale variation the acceptance
criteria is more efficient at separating batches with good quality from batches
with a low quality when the measure of total variation is as derived from the
ANOVA table rather than from the expression for the variance on a randomly
sampled unit from the batch.
Further as both lines in Figure 4 is less steep than the line in Figure 3 it is seen
that under a sampling plan with 12 observations from three layers and four
areas within each layer the acceptance criteria is more sensitive in the case of
medium scale variation but no large scale variation than in the case of large
scale variation and no medium scale variation in the blend.
In real batches both large and medium scale variation may occur at the same
time. Therefore, in this section expressions for the total blend homogeneity
is derived for batches with both large and medium scale variation (and fixed
small scale variation).
76 Paper B
Under a sampling plan with three layers, four areas within each layer and (in
accordance with the first stage of the PQRI procedure) only one sample from
each area, the hierarchical model (2) from Appendix A reduces to
(
i = 1, 2, 3
Yijk = µ + Li + A(L)j(i) + Eij , . (5)
j = 1, 2, 3, 4
Li is the effect from layer i, A(L)j(i) is the effect from the j’th area within
layer i and Eij is the error term also including the variation due to sampling.
2
Further it is assumed that Li ∈ N (0, σlayer 2
), A(L)j(i) ∈ N (0, σarea,hi ) and
2
Eij ∈ N (0, σrep ).
ANOVA Table
Source of Sums of Degrees of Expected mean square
variation squares freedom (df) (EMS)
2 2 2
Between layers SSlayer 3−1 4σlayer + σarea,hi + σrep
2 2
Between areas, SSarea,hi 3(4 − 1) σarea,hi + σrep
within a layer
2
Error SSrep 0 σrep
Total SStotal 12 - 1
2
In analogy with the derivations of σtotal in the previous sections, the expression
2 2 8 2 2 2 . The OC-
for σtotal derived from Table 4 is σtotal = 11 σlayer + σarea,hi + σrep
surface is plotted as a function of the corresponding standard deviation, σtotal ,
with dots in Figure 5 to Figure 7. The difference between the three figures is
that the mean content is fixed at respectively 90%, 100% and 110 % of LC. It
is seen that the plots in the three figures are very alike in agreement with the
fact that the acceptance criteria is only moderately dependent on the value of
the mean content in the batch, µ.
It is also seen that for this OC-surface there is not an unambiguous relation
77
between the total variation in the blend and the probability of passing the test 2 .
Figure 5 to Figure 7 is the resulting plot from superimposing all plots for spec-
ified values of respectively variation between areas and variation between lay-
ers. If σarea is fixed at same value the plot is a parallel displaced version of
Figure 4. If σlayer is fixed at same value the plot is a parallel displaced version
of Figure 3.
Fig. 5: The dots correspond to the total standard deviation as derived from
2 8 2 2 2 , and the circles correspond to the total
σtotal = 11 σlayer + σarea,hi + σrep
2
standard deviation on a randomly sampled unit from the blend, σsample =
2
σlayer 2 2
+ σarea,hi + σrep . The mean content in the batch is 90% of LC.
It should be remarked that if there had been three replicates per area as in
the batches simulated in Section 2 in Appendix A, the expression for the total
variation in the batch derived from the corresponding ANOVA table would be
8 2
2
σtotal = 35 σlayer + 33 2 2
35 σarea,hi + σrep .
2
In the simulations σarea,hi and σlayer vary between 0 and 10. Had larger values for the
two parameters been included in the simulations the OC-band would have been even broader in
these figures
78 Paper B
Fig. 6: The dots correspond to the total standard deviation as derived from
2 8 2 2 2 , and the circles correspond to the total
σtotal = 11 σlayer + σarea,hi + σrep
2
standard deviation on a randomly sampled unit from the blend, σsample =
2 2 2
σlayer + σarea,hi + σrep . The mean content in the batch is 100% of LC.
79
Fig. 7: The dots correspond to the total standard deviation as derived from
2 8 2 2 2 , and the circles correspond to the total
σtotal = 11 σlayer + σarea,hi + σrep
2
standard deviation on a randomly sampled unit from the blend, σsample =
2 2 2
σlayer + σarea,hi + σrep . The mean content in the batch is 110% of LC.
80 Paper B
Figure 9 is similar to Figure 8 except that the criteria for the individual tablets
are disregarded. It is seen that as long as the criteria for the sample coefficient
of variation is included, the criteria on the individual tablets does not have a
large effect on which batches are accepted by the acceptance criteria.
81
Fig. 9: The figure is similar to Figure 8 except that the criteria for the individual
tablets are left out. The dots correspond to σtotal and the circles correspond to
σsample .
82 Paper B
The total variance of a randomly sampled unit from the hierarchical model (5)
2
is σsample 2
= σlayer 2
+ σarea,hi 2 .
+ σrep
In the previous sections various expressions for the total variation in the blend
has been derived as a function of the variation between respectively layers, ar-
eas and between replicates. These expressions are not always straight forward
and they are dependent on the specific sampling plane. Thus in case of both
medium and large scale variation (and fixed variation between replicates) in
the blend it may be more straight forward to investigate the OC-surface for the
acceptance criteria directly as a function of σlayer and σarea,hi respectively. In
Figure 10 through Figure 12 this relation is plotted for the hierarchical model
with 12 samples. In the three figures µ is fixed at respectively 85 % LC, 100
% LC and 115 % LC. The variation between replicates is fixed, σrep = 1.
The aim of this appendix is to investigate how a concept of the total variation
in the blend, can be interpreted in case of inhomogeneities in the blend.
10
0.950
0.850
0.750
s 0.650
t
d
l 5 0.550
a
y
e
r 0.450
0.350
0.250
0.150
0
0 5 10 0.050
std area
pass
Fig. 10: Level curves for the OC-function of the first stage of the acceptance
criteria for blend uniformity suggested by PQRI. The mean content in the
batch, µ, is 85 % LC and the variation between replicates is fixed, σrep = 1.
84 Paper B
10
0.950
0.850
0.751
s 0.651
t
d
l 5 0.551
a
y
e
r 0.451
0.351
0.252
0.152
0
0 5 10 0.052
std area
pass
Fig. 11: Level curves for the OC-function of the first stage of the acceptance
criteria for blend uniformity suggested by PQRI. The mean content in the
batch, µ, is 100 % LC and the variation between replicates is fixed, σrep = 1.
85
10
0.950
0.850
0.750
s 0.650
t
d
l 5 0.550
a
y
e
r 0.450
0.350
0.250
0.150
0
0 5 10 0.050
std area
pass
Fig. 12: Level curves for the OC-function of the first stage of the acceptance
criteria for blend uniformity suggested by PQRI. The mean content in the
batch, µ, is 115 % LC and the variation between replicates is fixed, σrep = 1.
86 Paper B
In general the total variation is defined from the analysis of variance table leads
to more effective (steeper OC-surface) and less ambiguous (more narrow OC-
band) acceptance criteria than the total variation defined as the total variation
on a random sample from the batch. However, the expression for total variation
defined as the total variance from the analysis of variance table, depends on the
sampling plan used.
Paper C
87
Presented at ENBIS 2001 89
Abstract
Keywords
1 Introduction
To assure the therapeutic utility of dosage units such as tablets, the drug content
of each unit in a batch should not deviate too much from a chosen target value,
e.g. label claim (LC).
The current test for content uniformity is a two-stage test including tests by
attributes for the content of single tablets in the sample expressed relatively to
the label claim (LC) as well as a limit for the sample coefficient of variation.
duplicate the testing carried out during the research and development of new
drugs, efforts are made to achieve greater harmonisation in the interpretation
and application of technical guidelines and requirements for product registra-
tion.
Regarding testing for content uniformity this has so far resulted in a proposal
from the US authorities for an alternative to the current test procedure.
The proposed test procedure, which is the subject of this article, is a two-stage
test using in each stage a test by attributes on the content of the individual
tablets combined with a test by variables.
The first proposed amendment to the requirements in the form described above
was moved in 1997 [27]. The amendment has been commented and discussed
over the last years for the present resulting in a proposed test described in Phar-
macopeial Forum [28]. The version of the proposed test under concern in this
article is described in Pharmacopeial Forum [29]. However, the content of the
various versions of the proposed test are essentially the same, and somewhat
different from the the approach in the current requirement.
for the content in individual tablets in the sample. In this procedure the mean
content is not directly controlled. This is in line with the test being a test for
uniformity, i.e. variability, and not explicitly for mean content. The proposed
test, however, controls content variability by means of a linear combination of
the sample standard deviation and the absolute mean deviation from the target.
Also in this procedure a test for the content in individual tablets is included.
With the linear combination including sample standard deviation and mean
in the new procedure not only the variability but also the mean is controlled.
Thus, what historically started as a test for uniformity, still is termed a test for
content uniformity, but now also controls the mean content.
The details of the proposed test, see [29], are described in the following.
The proposed test is a two stage test which combines a criterion on the sample
mean and standard deviation, (x̄, s), with a criterion on individual tablets. The
details of the testing procedure is outlined schematically in Figure 11 .
Before the testing procedure is started 30 tablets are selected from the batch.
The sample used for test stage 2 consists of the 10 tablets from stage 1 com-
bined with the remaining 20 tablets. Stage 2 results in either acceptance or
1
For simplicity only the part of the proposed test that concerns compressed tablets with a
rubric mean less than 101.5 percent of label claim is considered. The rubric mean is the average
of the shelf limits specified in the potency definition of the drug.
92 Paper C
Stage 1
NO YES
REJECT
The combination of
sample mean and sample
standard deviation falls YES
within the acceptance area ACCEPT
for stage one
NO
Stage 2
NO YES
REJECT
The combination of
sample mean and sample
standard deviation falls YES
within the acceptance area
ACCEPT
for stage two
NO
REJECT
Table 1: Acceptance limits, measured relatively to label claim (LC), for the sample standard
deviation and for individual tablets. In general the limits are functions of the sample mean. If
x̄ ≤ 0.835 LC or x̄ ≥ 1.165 LC the batch is rejected. x̄ and s are based on a sample of 10
tablets on stage 1 and the total sample of 30 tablets on stage 2.
rejection of the batch. The batch is accepted if criteria for individual tablets as
well as criteria for the sample mean and sample standard deviation are satisfied.
Otherwise the batch is rejected.
Acceptance limits for individual tablets and the sample standard deviation as
a function of the sample mean are shown in Table 1. The corresponding ac-
ceptance area in the (x̄, s)-plane for the test on sample mean and standard
deviation is shown in Figure 2.
The proposed test evaluates a batch on the basis of the content and the varia-
tion of the active component (drug) in the batch. Therefore, assuming a nor-
mal distribution of tablet content, the properties of the test is described by the
94 Paper C
Acceptance area
0.14
0.12
Stage 1
Stage 2
sample std. dev. as fraction of LC
0.10
0.08
0.06
0.04
0.02
Acceptance area
0.0
Fig. 2: Acceptance area for sample mean content and standard deviation.
From Figure 3 it is seen that when the standard deviation, σ, exceeds 0.095 LC
the probability of accepting the batch is less than 0.1 no matter what the mean
content, µ, is. Similarly, when the mean content is less than 0.835 LC or larger
than 1.165 LC the probability of accepting the batch is less than 0.1, no matter
Presented at ENBIS 2001 95
0.12
0.10
0.10
0.08
sigma
0.1
0.06
0.50
0.04
0.99
0.02
0.25
0.10
0.25
0.50
0.75
0.0 0.90
0.99
mu
Fig. 3: Level-curves for the OC-surface for the test. The curves are based on 3000 simulations
for each lattice point (µ, σ).
As a single quality measure for a batch, one might consider the proportion
of tablets in the batch with a content outside some specified range of values.
The U.S authorities have not explicitly specified limiting values for the content
of tablets in the batch, nor is an unacceptable proportion specified. However,
the acceptance limits for individual tablets in Table 1 suggests that it is not
considered satisfactory for the content in a tablet to deviate more than 25%
from label claim. Figur 4 shows the proportion of tablets with a content outside
1 ± 0.25 LC in a batch as a function of µ and σ.
Comparing the proportion of tablets in a batch outside these limits with the
probability for acceptance of the batch is a way to evaluate the properties of
96 Paper C
0.12
0.10
0.08
sigma
0.06
0.15
0.04 0.0001
0.02
0.0005
0.1500
0.1000
0.0500
0.0100
0.0050
0.0005
0.0010
0.0001
0.0
mu
Fig. 4: Level-curves for the proportion of tablets in a batch outside the limits 1 ± 0.25 LC.
the test: Batches with a large proportion of tablets outside these limits should
have a high probability of being rejected by the test, and batches with a very
small proportion of tablets outside the limits should have a high probability of
acceptance.
As the curves in the two figures do not have the same shape it is not possible
to determine the probability of rejecting a batch as a single-valued function
of the proportion of tablets outside the specification, i.e. it is not possible
unambiguously to determine which proportion of unacceptable tablets in the
batch will lead to rejection with a given probability. As an example in some
cases batches with a proportion of 0.0001 outside the specification is rejected
with a probability of more than 0.9. However, in other cases, depending on the
value of mu, such batches have a probability of 0.99 of acceptance.
The acceptance limits for the sample standard deviation as a function of the
Presented at ENBIS 2001 97
sample mean (Figure 2 and Table 1) suggest also to consider the specification,
1 ± 0.165 LC, for the content in a tablet.
The proportion of tablets in the batch outside these limits as a function of the
batch mean and standard deviation (µ, σ), is shown in Figure 5.
Superimposing Figure 3 upon Figure 5 shows that the shape of the level-curves
in the two plots are similar. This means that it is (almost) possible to determine
a unique relation between the proportion of tablets outside these limits and the
probability of acceptance by the USP test.
0.12
0.10
0.08
sigma
0.06
0.09
0.02
0.090
0.050
0.040
0.030
0.020
0.010
0.005
0.0
mu
Fig. 5: Level-curves for the proportion of tablets in the batch outside the limits 1 ± 0.165 LC.
Batches with a proportion of less than 0.005 outside this specification have a
probability of more than 0.99 of acceptance by the test. Batches with a pro-
portion of 0.09 outside this specification corresponds to a probability of 0.1 of
acceptance.
The purpose of the test is to assess whether a batch may be considered satisfac-
tory (accept) or not (reject). This discriminatory ability was assessed above.
98 Paper C
0.12
0.10
0.10
0.08
sigma
0.1
0.06
0.50
0.04 0.99
0.02
0.25
0.10
0.25
0.50
0.75
0.0 0.99
mu
Fig. 6: Level-curves for the OC-surface for the test - disregarding the attribute test. The
level-curves are based on 3000 simulations for each lattice point (µ, σ).
To further investigate the effect of respectively the test by variables and the test
by attributes the level-curves for the OC-surface is plotted for a test procedure
disregarding the attribute test. These level-curves are shown in Figure 6. Su-
perimposing this figure upon Figure 3, the level-curves for the total test, reveals
that the probability for a batch to be accepted by the total test virtually does not
depend on whether the attribute test is included in the test or not. This is in line
with the fact that under the assumption of normality (x̄, s) are jointly sufficient
for (µ, σ), and hence the conditional distribution of the content of individual
Presented at ENBIS 2001 99
tablets in the sample for a given combination (x̄, σ) does not depend on the
value of (µ, σ).
Now, consider the two stages of the test. Figure 7 shows the probability of
invoking test stage 2. Batches corresponding to a (µ, σ)-combination in the
inner triangular area have a high probability of acceptance on stage 1. Batches
corresponding to a (µ, σ)-combination outside the plotted level-curves have a
high probability of being rejected on stage 1, whereas batches corresponding
to a (µ, σ)-combination in the ’sausage’-shape in the middle have a high prob-
ability of invoking test stage 2. For these latter batches 30 tablets instead of
10 shall be analysed, i.e. such batches are expensive regarding time and other
ressources to the testing and chemical analysis.
0.12
0.10
0.08
sigma
0.06
0.04
0.99
0.99 0.05 0.99
0.02
0.05
0.10
0.25
0.50 0.75
0.50 0.99 0.05
0.25
0.50
0.10
0.0 0.75
0.99 0.25
0.10
0.05
mu
Fig. 7: The probability of invoking stage 2 when both the variable and the attribute tests are
included in the test procedure. The level-curves are based on 3000 simulations for each lattice
point (µ, σ).
Figure 8 shows the probability of invoking test stage 2 when the attribute test
on individual tablets is disregarded in the procedure. Batches corresponding
to a (µ, σ)-combination in the inner triangular area have a high probability of
acceptance on test stage 1. However, disregarding the test for individual tablets
rejection on stage 1 is not possible, implying that all batches corresponding to
a (µ, σ)-combination outside the inner triangular area would have a high prob-
ability of invoking test stage 2, hence requiring the efforts of further testing.
100 Paper C
0.12
0.10
0.08
sigma
0.06
0.95
0.04
0.05
0.02
0.95
0.75
0.50
0.0 0.25
0.10
0.05
mu
Fig. 8: The probability of invoking stage 2 - disregarding the attribute test. The level-curves
are based on 3000 simulations for each lattice point (µ, σ).
Other benefits of including the test by attributes in the test procedure is that the
test provides robustness of the procedure in case of a non-normal distribution
of the tablet content. The discriminating properties of the test in the case of
non-normal distribution of the tablet content have not been investigated. The
limits for individual tablets may in these situations serve as a safety precaution.
Finally the test by attributes may serve a psychological purpose. The reason
is that the use of a test by variables - without a test by attributes - might (in
extreme cases) lead to acceptance of a batch even though tablets of a very ex-
treme quality is found in the sample. The attribute specification in the proposed
test prevents such a situation.
Presented at ENBIS 2001 101
Although the current trend in quality management is to shift the focus from the
final product inspection to the monitoring of the process, acceptance sampling
may still serve a purpose as part of quality control procedures as described e.g.
in the quality management standard QS 9000 [36] developed by the automo-
tive industry and the US military standard, MIL-STD-1916 [37] for acceptance
of product. However, in line with the quest for “zero defects”, the acceptance
sampling plans suggested by these standards are plans with acceptance num-
ber zero (“accept zero plans”) for sampling by attributes, and equivalent plans
for sampling by variables. Although the assumption of normally distributed
item characteristics is seldomly questioned in the calculation of process ca-
pability and performance measures, there is however, some reluctance towards
using variables sampling plans, partly because of the underlying assumption of
a normal distribution, and partly because the use of a variables sampling plan
might (in extreme cases) lead to acceptance of product even when a noncon-
forming item is found in the sample. Such situations are in apparent conflict
with the “ accept zero” philosophy.
and [39] lays down a sampling procedure to be used by the authorities for ver-
ifying compliance to the labelled content. The procedure is a combination of a
test by attributes for the proportion of packages with content less than specified,
and a separate test by variables leading to rejectance when the sample average
is significantly smaller than the specified content in the packages. Although
the test by variables is only concerned with the mean content, the combined
result of the two tests is an economic incentive to the producer to maintain a
small variance in order to avoid overfill.
Standard acceptance sampling plans for industrial use are based upon the usual
industrial practice of setting up specifications for the final product (and all pre-
vious stages). Such specification limits, or tolerances, facilitate the communi-
cation between the parties and provide a basis for assessing quality simply by
verifying compliance to specifications.
5 Conclusion
The proposed procedures all include both a test by attributes and a test by
variables whereas the current procedure includes test by attributes and a test of
the relative coefficient of variation.
In this article the discriminating properties of the proposed test procedure have
been assessed. Under the assumption of normally distributed content in the
tablets the qualities, (µ, σ), leading to acceptance of a batch have been de-
lineated. The analysis in this article reveals that the proposed procedure in
essence is a procedure for controlling the proportion of tablets with content
outside the 1 ± 0.165 LC, combined with an “accept zero” plan corresponding
to the limits 1 ± 0.25 LC.
Further the effect of individual elements of the test procedure have been as-
sessed.
105
107
1 Introduction
In the last decade there has been an increasing interest as well among phar-
maceutical manufacturers as in regulatory agencies to clarify and standardize
cGMP procedures for demonstrating blend uniformity, see the discussion in
PDA Technical Report No. 25 [22] and the recent review by Berman [2], and
to harmonize requirements on final product testing, see the series of proposed
amendments to the United States Pharmacopeia tests for uniformity of dosage
units, [27], [28].
As such control and monitoring procedures are based upon samples from the
blend, or from the batch of tablets, there is an inherent uncertainty concerning
the actual dispersion in the blend or batch being sampled. Therefore, the as-
surance provided by such procedures is of a statistical nature (i.e. depending
on the pattern of variation in the entity under test), and in order to assess the
influence of the uncertainty due to sampling the properties of the procedures
may therefore be assessed using statistical concepts and techniques under due
consideration to the potential sources of variation in the processes being mon-
108 Paper D
itored.
Stage 1: Assay 10 tablets. Pass if both of the following criteria are met:
Stage 2: Assay 20 further tablets. Pass if, for all 30 tablets, the following
criteria are met:
Otherwise fail
In the paper we shall therefore address some basic statistical issues related to
a crude demonstration of uniformity, i.e. provision of evidence that there is a
satisfactorily narrow dispersion of values in the entity under investigation.
The main issue of the paper is to discuss the influence of sampling uncertainty
in such demonstrations of uniformity, and in particular to discuss the assurance
provided against “unsatisfactorily large dispersions of values” under various
110 Paper D
In the paper we shall discuss some of the acceptance criteria for blend or
dose uniformity that have been suggested in the pharmaceutical literature. In
Section 4 criteria based solely upon a measure of dispersion in the sample
(standard deviation or coefficient of variation) are discussed. Such criteria are
mainly used for blend uniformity analysis where use of statistical measures of
location are not necessarily relevant (e.g. because of bias due to the sampling
device).
Thus, when a parametrized model for the sample results is assumed, we shall
formulate a hypothesis (null hypothesis) H0 regarding values of the parameters
of the model (i.e. quality of the blend or batch). Those parameter values that
do not belong to H0 constitute the alternative hypothesis, H1 .
with θ0 denoting some specified limiting value for the population parameter θ,
and assume that some statistic T provides information about θ.
Then the rule: “Reject H0 (and claim θ < θ0 ) whenever the upper (1 − α)
confidence limit for θ is less than the limiting value θ0 ” corresponds to a level
α test of the hypothesis H0 , (or, alternatively to a level (1 − α) test of the
hypothesis H0∗ : θ ≤ θ0 ). For a formal proof of the duality between statistical
confidence regions and tests seee e.g. [44] chapter 4.5.
Following the usual practice in the pharmaceutical literature we shall not con-
sider absolute dose values but assume that measurements are recorded relative
to some (known) target dose weight or label claim.
We shall let D and Sd denote the average relative dose and standard deviation
of relative doses in the sample,
X
n .
D = Di n (2)
i=1
X
n .
Sd2 = (Di − D)2 (n − 1) (3)
i=1
Moreover, as the term relative sample standard deviation may have different
interpretations (as standard deviation of measurements relative to target value,
or as standard deviation of measurements relative to the average sample value),
we shall throughout the paper use the unambiguous term sample coefficient of
variation to mean .
Z = Sd D. (4)
normal distribution,
To preserve the generality we shall use the term “population” when referring
to this distribution of potential values representing dose content measurements
from all doses in the tablet batch, or to the distribution of potential values
representing conceivable blend sample measurements.
It should be noted that the model (5) will not be adequate when the distribution
in the population is bimodal or multimodal corresponding e.g. to stratifica-
tion between locations (e.g. top, middle and bottom), when the distribution is
skewed, e.g. as a result of deblending, or when the distribution has heavier tails
than the normal distribution, e.g. as a result of imperfect mixing (clustering)
or of using dose particles that are too large for the intended dosage.
When sampling is performed from the “worst case positions in the blender”,
as suggested e.g. in the FDA guidelines for blend analysis [20], the population
refers only to sample results that might have been obtained under hypothetical
repeated sampling from these positions, and the model does not necessarily
reflect the population of values representing the totality of the blend.
Therefore, it might be considered to specify a limiting value, σlim for the sam-
ple standard deviation, and use the following acceptance criterion
The significance level of this test (6) of the hypothesis (7) is given as the prob-
ability of rejection in the borderline case when σd = σlim . In this borderline
case there is approximately a 50% risk of obtaining a sample standard devia-
tion exceeding σlim , and hence (referring to the hypothesis (7)) the significance
level is approximately 50% regardless of the sample size.
i.e. from the cumulative distribution function of the χ2n−1 distribution. In the
SAS-system software the acceptance probability may be found by calling the
function PROBCHI(x,df) with the arguments x = (n − 1)(σlim /σd )2 and
df = n − 1 with σd denoting the (true) standard deviation between doses. In
Microsoft Excel, the probability may be found from the function CHIDIST(x,df)
providing a value which is one minus the probability of acceptance.
(consumer’s risk quality). As the limiting value of the sample standard devi-
ation does not depend on the sample size, the consumer’s and producer’s risk
qualities move towards this limiting value with increasing sample sizes, i.e.
the (fixed) limiting value of the sample standarddeviation, σd = 0.06 is also
the borderline between batch standard deviations that are accepted under this
criterion and batch standard deviations that are not accepted.
This measure is used, eg. as part of the acceptance criterion in the test for
uniformity of dosage units in the currently valid version of the United States
Pharmacopeia, USP 24, [26].
It is interesting to note that although dosage values are already measured rela-
tive to the target, or label claim, and hence, the standard deviation of relative
dosage values already measures dispersion in relative units, it has been con-
sidered appropriate, to make a further adjustment (for possible sampling or
analysis bias ) by considering the standard deviation relative to the estimate, D
119
of actual content.
The operating characteristic for this acceptance criterion may be assessed un-
der the assumption of a normal distribution of dosage units, (5) utilizing that
the distribution of √
D n
T = √ = (11)
Sd / n Z
is a non-central t-distribution with non-centrality parameter
√
nµd
δ=
σd
and f = n − 1 degrees of freedom (see e.g. Johnson and Kotz [45]) and hence
for normally distributed measurements the distribution of the sample coeffi-
cient of variation depends only on the sample size and the batch coefficient of
variation, Cd = σd /µd .
In analogy with the test of the sample standard deviation, Figure 1, the set
of curves tend to intersect at the point Cd = 0.06, Pacc = 0.5 indicating that
acceptance under this scheme provides a confidence of approximately 50% that
the coefficient of variation in the blend (or batch) does not exceed the limiting
value, Clim = 0.06, independent of the sample size.
However, because of the sampling variation of the average dose, D, that is in-
troduced in the criterion (10) on the sample coefficient of variation, the curves
corresponding to this criterion (Figure 2) are not quite as steep as their counter-
parts for the criterion on the sample standard deviation (Figure 1). Thus, when
the coefficient of variation of dosage units in the batch is less than the limiting
value, Clim = 0.06, the acceptance probability under the sample coefficient of
variation criterion is slightly less than under the sample relative standard devi-
ation criterion, whereas it is slightly larger when the coefficient of variation in
the batch is larger than 0.06.
122 Paper D
When the acceptance criterion specifies a limiting value for some sample statis-
tic, independent of the size of the sample, this limiting value implicitly defines
a border value between population values that will pass under applicaton of
this criterion, and those that will not. For large samples, blends or batches
with a population value that does not exceed this limiting value will pass under
this criterion. For smaller samples the sampling uncertainty implies a lesser
discriminatory power, and the amount of sample evidence favouring the hy-
pothesis is given the same weight as the evidence against the hypothesis (the
statistical test is performed at a 50% significance level).
This feature is not very satisfactory when the limiting value used in the ac-
ceptance criterion has been chosen to represent some undesirable quality, and
when the purpose of the test is to provide some assurance against accepting
batches of such qualities, and therefore the sample standard deviation should
in some way be reduced by its uncertainty before comparison with the limiting
value.
Such an acceptance procedure that puts more focus upon the assurance pro-
vided by the procedure has been suggested in PDA Technical Report 25 [22].
The fundamental idea in this procedure is that the sampling uncertainty asso-
ciated with the sample under investigation shall be taken into account when
deciding whether or not to accept the blend or batch. The blend is passed only
if the sample provides strong evidence (even when accounting for the sampling
uncertainty) that the quality is better than some prescribed undesirable quality.
The statistical technical tool invoked in the design of this procedure is the so-
called Standard Deviation Prediction Interval to be explained below.
to provide a high assurance that batches that are accepted by the sampling plan
really are of a satisfactory quality. In [22] “satisfactory quality” is taken to
mean that the standard deviation in a sample (of size 10) from the batch does
not exceed sspec = 0.06, and the “assurance” is chosen to be 90%.
It is worth noting that the “specification” of batch quality is not explicitly stated
in terms of parameters like σd characterising the batch, but rather in terms of
properties of a future sample from the batch.
(a future sample).
Let
X
n .
Sd2 = (Di − D)2 (n − 1) (14)
i=1
denote the sample variance found in the current sample (of size n), and let
moreover
X
m .
0
Sp2 = (Di0 − D )2 (m − 1) (15)
i=1
denote the sample variance in a future sample of size m. Then, under the dis-
tributional model (5) assuming independence between the two sets of samples
and assuming the same underlying population variance, σd2 , for both samples,
it follows that the distribution of
Sp2
F = (16)
Sd2
125
or, equivalently
P Sp2 ≤ Sd2 F(m − 1, n − 1)1−α = 1 − α (18)
The reader is cautioned that in some (mainly US) textbooks and software
quantiles and percentiles of non-normal statistical distributions are sometimes
given as the so-called critical values indexed by the probability associated with
more extreme values in the distribution. To avoid the ambiguity when using
this practice in situations where one-sided as well as two-sided tails may be
appropriate, we have chosen the unambiguous approach to let symbols like
F(m − 1, n − 1)1−α , χ2f,1−α , tf,1−α , etc. refer to quantiles or percentiles of the
distributions under consideration.
In the SAS-system software, the quantile may be found by calling the function
FINV(p,ndf,ddf) with the arguments p = 1 − α, ndf = m − 1, ddf = n −
1, and in Microsoft Excel, by calling the function FINV(p,df1,df2) with the
arguments p = α, df 1 = m − 1, df 2 = n − 1 (as the Microsoft Excel function
returns the critical value).
q
Thus, when a value, sd = s2d , of the sample standard deviation is found in
the sample, the upper 1 − α prediction bound for the sample standard deviation
in a future sample from that batch may be found as
p
spred (sd ) = sd F(m − 1, n − 1)1−α . (19)
It follows from the formal calculations that as well the sampling uncertainty
associated with the standard deviation in the actual sample, Sd , as the uncer-
tainty associated with the standard deviation in the future sample, Sp , has been
taken into account in the derivation of this upper bound.
126 Paper D
Acceptance criterion
Now, assume that a limiting value, sspec for the sample standard deviation in a
future sample of size m has been specified.
Following the approach in [22] the following acceptance criterion may be con-
sidered.
Utilizing the relation (19), and observing that F(m − 1, n − 1)1−α = 1/F(n −
1, m−1)α the acceptance criterion may be phrased in terms of a limiting value,
slim of the sample standard deviation, sd in the current sample,
p
slim (n) = sspec F(n − 1, m − 1)α . (21)
Clearly, the limiting value of the sample standard deviation depends on the
sample size, n.
It may be shown that when the sample size, n, increases, the limiting value of
the sample standard deviation, slim (n) → σlim with
q
σlim = sspec (m − 1) χ2m−1,1−α (23)
n
10 15 20 25 30 35 40 ∞
3.841 4.119 4.260 4.346 4.440 4.444 4.476 4.697
Table 1: Limiting value, slim (n) of the sample standard deviation (in percent
of target value) ensuring that the 90% prediction bound for a future sample of
size 10 does not exceed 0.06
Table 1 shows values of the limiting value slim (n) for various sample sizes,
n = 10, 15, . . . , 40 for sspec = 0.06 and α = 0.10.
Under the distributional assumptions (5), the distribution of Sd , and, hence also
of spred (Sd ) depends on the population standard deviation, σd , and therefore
the acceptance criterion may be interpreted as defining a critical region for a
test of the hypothesis
with σlim given by (23). This limiting value of the population standard devia-
tion depends on the number, m, of prospective sample items, and the desired
confidence, 1 − α, that the standard deviation in this prospective sample shall
be less than the design value, sspec.
Operating characteristic
Under the standard deviation prediction interval acceptance criterion the prob-
ability of acceptance when σd = 0.06 is between 7% (for n = 10) and 1% (for
n = 40).
In general, the significance level of the test of the hypothesis (24) performed
using an acceptance criterion of the form (20) is found as 1 − Pacc (σlim ) with
129
The significance level depends on the sample size. For increasing sample size,
n, the significance level α∗ → 0.5.
The borderline between those population standard deviations that will lead
to acceptance according to this criterion, and those that are not accepted is
σd = 0.047. Thus, for a sample size n = 10, it is necessary that the batch
standard deviation is less than 0.03 (producer’s risk quality) in order to as-
sure a high probability of acceptance, but the sampling plan provides a con-
sumer protection (10% consumer risk) against a batch standard deviation of
0.06 (consumer’s risk quality). Using a larger sample, e.g. n = 35 allows the
manufacturer to operate with a population standard deviation, σd up to 0.04
and still retain a high probability of acceptance, and, moreover, for this sample
size, a batch with a standard deviation exceeding σd = 0.05, say, will only be
accepted with a probability less than 15%.
Realizing that the approach based upon the standard deviation prediction inter-
val is basically a statistical test of a hypothesis (24) concerning the population
standard deviation σd , a more direct approach using concepts from statistical
theories of hypothesis testing, one might choose a more direct approach and
use the standard test for the hypothesis
with some specified limiting value, σlim , e.g. given by (23). The test may be
found in introductory statistical textbooks (e.g. [47], sec. 10.13), and is also
provided in the ISO Standard ISO 2854 [48] with the corresponding power
curves given in ISO 3494 [49].
significance level α∗ is
with the limiting value, slim (n), for the sample standard deviation given by
q
slim (n) = σlim χ2n−1,1−α∗ /(n − 1) (30)
It follows from the discussion in Section 2.2 that the acceptance rule of this
test is equivalent with an acceptance rule of the form “accept when the upper
α∗ confidence limit for σd (as calculated from sd ) does not exceed σlim . Thus,
whenever a sample passes this test, there is a confidence of α∗ (e.g. α∗ = 0.8
corresponding to 80% confidence) that the population standard deviation does
not exceed σlim .
with the interpretation that the content uniformity test is passed whenever the
sample leads to rejection of this hypothesis (31) at a significance level, α =
0.20, say.
The formulation in terms of a statistical test of this null hypothesis (31) might
be more easily understood, as rejection of the hypothesis (31) takes place
whenever the sample result provides strong evidence (at the magnitude 1 − α)
against the hypothesis σd > σlim in favour of the alternative σd ≤ σlim , and,
hence failure to reject the hypothesis σd > σlim means that there is not suffi-
cient evidence (considering the sample uncertainty) to claim that σd ≤ σlim .
132 Paper D
Traditionally, tests for content uniformity of final product (dosage units) have
not only been concerned with the dispersion of dosage values, but require-
ments to dispersion (e.g. to the sample coefficient of variation) have been
supplemented by requirements on individual measurements.
Thus, in USP 21 [42] on each stage the criterion on the sample coefficient of
variation is combined with criteria on individual measurements.
The requirements for passing this test for content uniformity are
Stage 1: Assay 10 tablets. Pass if both of the following criteria are met:
Stage 2: Assay 20 further tablets. Pass if, for all 30 tablets, the following
criteria are met:
Otherwise fail.
134 Paper D
From a statistical point of view two different issues are involved in these re-
quirements
a) In each stage, the criteria include as well a limit on the sample coeffi-
cient of variation (inspection by variables) as limits on individual mea-
surements (inspection by attributes).
It is well-known from statistical theory (see e.g. [44]) that under the assump-
tion of independent, normally distributed sample values the sample average
D and sample standard variance Sd2 are jointly sufficient for the population
mean and variance, (µd , σd2 ), and therefore, when sample average and sample
standard deviation (D, Sd ) are known, the distribution of individual measure-
ment values (D1 , . . . , Dn ) does only depend on these sample statistics, and
not upon population mean and variance (µd , σd2 ). Thus, it is formally pos-
sible to device a test based solely upon sample average and sample standard
deviation in the two stages (otherwise disregarding individual measurements)
having the same probability of passing as the USP-21 test for all values of
population mean and standard deviation. However, as the level curves of the
probability, Pusp (µd , σd ) of passing the USP-test do not allow for a simple an-
alytical representation, the actual mathematical derivation of such a test is not
straightforward, and therefore more insight is gained by considering various
approximations to Pusp (µd , σd ).
Stage 1: Assay 10 tablets. Pass if no value is outside claim ±15% (LC ± ∆)1 .
continue to stage 2 if no more than one value is outside claim ±15%
(LC ± ∆), and no value is outside claim ±25% (LC ± ∆1 ).
Stage 2: Assay 20 further tablets. Pass if, for all these 20 tablets, no value is
outside claim ±15% (LC ± ∆).
Otherwise fail.
To aid the intuition the zones have been labelled with the colours on traffic
light signals.
Stage 1: Assay 10 tablets. Pass if all values are in the “green zone”,
continue to stage 2 if no more than one value is in the “amber zone” and
no values in the “red zone”.
Stage 2: Assay 20 further tablets. Pass if, for all these 20 tablets, all values
are in the “green zone”.
Otherwise fail.
For single sampling plans (i.e. acceptance sampling plans with only one stage
of sampling), sampling plans involving such attribute criteria with three classes
are often referred to as a “three-class” attribute sampling plans. Such sampling
plans were introduced by Bray et al [51] inspired by applications of lot accep-
tance sampling in food microbiology, and subsequently recommended by the
International Commison on Microbiological Specifications for Foods (ICMSF)
of the International Union of Microbiological Societies [52]. In the terminol-
ogy of three-class sampling plans, the “green” zone is often labelled “good”,
or “in compliance with upper limit of GMP”, the “amber” labelled “marginal”
and the “red” labelled “bad”. Three class attribute sampling plans (to be used
for one-stage sampling) usually specify sample size, maximum number, cm ,
of non-good (i.e. non-green) items in the sample and maximum number, cM ,
of bad (i.e. red) items in the sample. In microbiological applications cM is
often taken to be zero.
This probability is slightly larger than the value given by (35). For any value
of pa (µd , σd ), the largest value of pr (µd , σd ) occurs when µd = LC and
σd = ∆/z1−pa /2 . For values of ∆ and ∆1 used in pharmaceutical practice,
the approximation error incurred when substituting (35) by (36), i.e. by disre-
garding pr , is only marginal.
Figure 5 shows the maximal value of the difference, P3−c,appr (µd , σd )−P3−c (µd , σd )
as function of the the proportion, pa = pnonc (µd , σd ; ∆) for ∆ = 15% and
∆1 = 25%. The largest potential difference occurs for pa = 0.09 (i.e. 9% of
the population values falling in the amber or the red zone) with a maximum ap-
proximation error, P3−c,appr (µd , σd ) − P3−c (µd , σd ) = 0.003, i.e. a difference
in acceptance probability of 0.3 %.
139
P p0 (P )
0.99 0.0070
0.95 0.0165
0.90 0.0244
0.50 0.0800
(Dahms and Hildebrandt [53] have discussed the choice of limits ∆ and ∆1 for
applications in microbiological quality control under single sampling).
It is well-known, see e.g. the textbook by Schilling [12] that for a specified
fraction, p0 , of nonconforming product the set of solutions (µd , σd ) to
Wallis [54] has provided a parametric description of the curve using an auxil-
iary variable, π, (0 < π < p0 ) as
µd = LC + ∆ × (zp0 −π − zπ ) (zp0 −π + zπ ) (41)
σd = (µd − LC − ∆) zπ (42)
Figure 6 shows examples of the curve (41), (42) for ∆ = 16.5% and for p0 =
0.0165 and p0 = 0.0278, respectively.
A: µd = LC − ∆ ; σd =0
B: µd = LC − ∆(1 − z1−p0 /z1−p0 /2 ) ; σd = ∆/z1−p0 /2
(43)
C: µd = LC + ∆(1 − z1−p0 /z1−p0 /2 ) ; σd = ∆/z1−p0 /2
D: µd = LC + ∆ ; σd = 0.
The length of the baseline is 2∆, and the height of the trapezoid is ∆/z1−p0 /2 .
The slope of the left-hand side of the trapezoid is 1/z1−p0 . For p0 > 0.5
the oblique sides of the trapezoid are sloping outwards, and the upper side of
the trapezoid will be longer than the baseline. For p0 = 0.5 the trapezoid
degenerates to a rectangle, and for p0 < 0.5 the oblique sides of the trapezoid
are sloping inwards. The bounding trapezoids have also been shown in Figure
6.
In turn, assurance that the proportion, pnonc (µd , σd ; ∆), of units in the popu-
lation with values outside the interval LC ± ∆ does not exceed some specified
value, p0 , is equivalent to assuring that the combination (µd , σd ) is within an
143
p0 = 0.0165
p0 = 0.0278
Consider a level curve for the probability, Pusp (µd , σd ), of passing the USP-
test under a normal distribution of dosage content, e.g. the curve corresponding
to a probability γ of passing the test, i.e. values (µd , σd ) satisfying
for that value of γ. The curve defines a region in the (µd , σd )-plane of combi-
nations of batch mean and standard deviation with the properties that at least
the proportion γ of all samples tested for content uniformity under the USP-test
plan will pass the test.
Thus, Bergums approach shares the spirit of the prediction interval approach
discussed in Section 4.3 in the sense that acceptance of the batch is based
upon a strong confidence that a future sample from the batch will conform to
145
some specified requirements on that sample. In this way, when assessing the
current sample, the uncertainty associated with the current sample is taken into
consideration in order to provide the assurance desired.
Analogously, the relations (37) and (39) might be used to formulate an accep-
tance criterion based upon a (1 − α) confidence interval for the proportion
pa (µd , σd ) of doses in the batch with values in the amber and red zones, and
accept the batch when the upper bound of this interval does not exceed p0 (γ).
Alternatively one could use a formulation of this criterion in terms of a sta-
tistical tolerance interval that contains at least the proportion 1 − p0 (γ) with
confidence (1 − α), and accept the batch when the tolerance limits are within
the green zone, see e.g. Hahn and Meeker [46].
In order to maintain equivalence between the phrasing “passing the test” and
“accepting the hypothesis” the statistical hypothesis to be tested in (45) has
been formulated in accordance with the discussion in Section 2.1.
The statistical problem of testing a hypothesis of the form (45) was adressed by
Wallis, [54], who suggested to use a test based upon the estimator of pa (µd , σd )
obtained by substituting µd and σd in the generic function (34) by the usual
estimates, d¯ and sd
¯ sd ) ≤ plim , otherwise reject
Accept H0 if pa (d, (46)
i.e. use the sample average and sample standard deviation to compute an es-
timate of the fraction nonconforming items using (33), and compare this esti-
mate to some limiting value, plim .
In the following we shall follow the direct approach suggested by Wallis [54].
The difference between this direct approach by Wallis and use of the likelihood-
ratio principle is that under the likelihood-ratio approach, sd of
p the estimate,
σd is replaced by the maximum-likelihood estimate, (n − 1)/n sd . For
a comparison of the direct approach and the likelihood-ratio approach see Lei
and Vardeman [60].
147
Once, the limiting value, plim , of the estimated fraction nonconforming has
been established, the acceptance region (in the (d, ¯ sd )-plane) for the test may
¯
determined as the set of values, (d, sd ), such that
¯ sd ) ≤ plim
pa (d, (47)
with the function pa (·, ·) given by (33) and (34). The acceptance region is of
a similar shape as shown in Figure 6. The boundary of the region may be de-
termined from (41) and (42) substituting plim for p0 . When the significance
level of the test is less than 50%, plim will be larger than p0 (the sampling un-
certainty is “subtracted” from the estimated proportion nonconforming before
comparing with p0 ), and the acceptance region will include the region corre-
sponding to pa (d, ¯ sd ) ≤ p0 . When the significance level of the test is greater
than 50%, plim will be less than p0 (the sampling uncertainty is “added” to
the estimated proportion nonconforming before comparing with p0 ), and the
acceptance region will be inside the region corresponding to pa (d, ¯ s d ) ≤ p0 .
¯
The acceptance region may be expressed as a list of limiting values, slim (d),
for the sample standard deviation, sd , with the limiting values depending on
the sample average, d.¯ The maximal acceptable value of the sample standard
deviation is found from (42) for π = plim /2, d¯ = LC as
For any combination, (µd , σd ), of batch mean and standard deviation, the prob-
ability of passing this dosage uniformity test with a specified limiting estimated
proportion nonconforming, plim , may be determined by numerical integration
¯ sd ) over the acceptance region
of the joint probability density function for (d,
¯ sd )-plane). It follows from the mutual statistical independence of d¯
(in the (d,
148 Paper D
and sd (see [44]) that the joint probability function may be determined as the
product of density functions corresponding to the normal distribution of d¯ and
the σd2 χ2 (n − 1)/(n − 1) distribution of s2d .
Hence, for any specified limiting value, plim , the probability of passing this
test may be determined as function of population parameters (µd , σd ). As the
test is based upon a comparison of the estimated fraction nonconforming items
with some limiting value, it is to be expected that the probability of passing the
test depends only on the proportion, pa (µd , σd ) of nonconforming items in the
population. Although this is not exactly true, the assertion generally holds to
a degree of approximation satisfactory for all practical purposes, see Lei and
Vardeman [60].
The limiting value, plim , is determined from the requirement to the assurance
desired from the test procedure. In statistical terms, the assurance is expressed
in terms of the significance level, α, of the statistical test. Thus, using sig-
nificance level, α = 0.90 say, the borderline case with population proportion
nonconforming, py (µd , σd ) = p0 , the probability of rejecting H0 is 90% (the
probability of rejecting H0 is α).
When the design value p0 in (45) has been selected as the value p0 (γ) derived
from (37) and (39), it follows from the duality between statistical confidence
regions and tests that acceptance by the test at a significance level α provides
assurance with confidence 1−α that there is at least a probability γ that a future
sample from the batch will pass the three-class modification of the content
uniformity test.
The distribution of
√ U −D √
T = n = nW
Sd
is a non-central t-distribution with n − 1 degrees of freedom and with non-
√
centrality parameter δ = θ n.
√
As W + = T / n it follows that
√ √
P[W + ≤ k] = P[T / n ≤ k] = P[T ≤ k n]
and, hence
√
P[(U − D)/sd ≤ k] = P[W + ≤ k] = P[t(n − 1, δ(p)) ≤ k n] (53)
√ √
with δ(p) = θ n = z1−p n.
Thus, the critical value, k (for w+ ) at a level α test for the hypothesis (51) is
determined from
√
k = k(p0 ) = t(n − 1, δ(p0 ))α / n (54)
viz. the α percentile in the t(n − 1, δ(p0 ))-distribution. The hypothesis (51) is
accepted when w+ ≥ k(p0 ), and therefore, whenever w+ ≥ k(p0 ) it has been
demonstrated with a confidence α that p+ (µd , σd ) ≤ p0 .
k(0.0165) = 3.255.
¯ sd )-plane
The requirement determines a region in the (d,
z ≥ sd k
or,
U − d¯ ≥ sd k.
Inserting U = LC + ∆, the requirement w+ ≥ k may be expressed as
LC + ∆ − d¯ ≥ sd k(p0 ). (55)
151
with k = k(p0 ) given by (54). In this form the criterion simply compares
a suitably chosen linear combination of sample average and sample standard
deviation with the upper specification for individual measurement values.
For sample values, (d, ¯ sd ), to the left of this line the hypothesis (50) is ac-
cepted. It follows from (53) that the probability of obtaining a sample value,
¯ sd ), to the left of this line depends on (µd , σd ) only through p+ (µd , σd ). In
(d,
other words, all combinations of batch mean and standard deviation (µd , σd )
with the same proportion, p = p+ (µd , σd ) of nonconforming units (with re-
spect to the upper limit LC + ∆) will have the same probability of rejection
under this test. In statistical terms the critical region is said to be “similar”.
with the limiting value, plim for the estimated fraction nonconforming deter-
mined from
plim = Φ(−k) (60)
with k = k(p0 ) given by (54). Thus, the limiting value for the estimated
fraction nonconforming depends as well upon the sample size as upon the sig-
nificance level, α, of the test (the desired level of assurance of p ≤ p0 ).
152 Paper D
For a level α-test, the value of k is given by (54) and the acceptance probability
for this test is determined from (53) with that value of k, i.e.
By symmetry, it is found that a level α-test of the hypothesis (63) rejects the
hypothesis when
d¯ − LC − ∆ < sd k (64)
with k = k(p0 ) given by (54).
¯ + ksd ≤ ∆
Accept H0− when (LC − d) (65)
otherwise reject.
153
So far, the upper and lower limits, U and L, have been considered separately.
However, the quantity of interest is the proportion,
Accordingly, the estimates pb+ from (58) and pb− from (67) may be combined
to form an estimate,
¯ sd ) = p− (d,
pba (d, ¯ sd ) + p+ (d,
¯ sd ) (70)
¯ sd )-plane satisfying
A parametric form of the boundary of the region in the (d,
¯ sd ) ≤ plim is given by (41) and (42).
pba (d,
For any combination, (µd , σd ) of the population values, the acceptance prob-
ability under this criterion may be determined by numerical integration of the
joint probability density function for (D, Sd ) over this region.
However, as noted by Lei and Vardeman [60] for any specified limiting value,
plim , there is a whole band of operating characteristic curves, and there the
determination of a limiting value plim corresponding to a specified significance
level, α, of the test of the hypothesis (45) requires extensive use of numerical
search methods.
with p∗ = Φ(−k).
¯ sd )-space
It follows from the considerations in Section 5.4 that the region in (d,
¯ ¯
of values (d, sd ) such that pa (d, sd ) ≤ plim is bounded by this acceptance
polygon. Thus, the acceptance criterion (73) corresponding to the polygon
will be slightly more conservative (i.e. accept in more cases) than the criterion
155
(71) based upon a direct comparison of pa (d, ¯ sd ) with plim . However, as indi-
cated in Figure 6, the difference between these two acceptance regions is only
marginal. The difference is most apparent for larger values of sd close to smax .
Hence, for any given value of the proportion nonconforming, pa (µd , σd ), the
largest difference in acceptance probability occurs when µd = LC. This max-
imal difference in probability of acceptance under the two criteria has been
illustrated as function of pa (µd , σd ) in Figure 7. The largest difference occurs
for acceptance probabilities around 0.50.
Rearranging (55), the requirement for acceptance at the upper limit may be
expressed as
¯
sd ≤ (LC + ∆ − d)/k
or,
sd /d¯ ≤ (LC + ∆ − d)/(k
¯ ¯
d) (74)
which, corresponding to each value of the sample average, d, ¯ specifies an upper
limit to the sample coefficient of variation to be applied for LC < d.¯
Similarly, from rearranging (64), the requirement for acceptance at the lower
limit may be expressed as
sd ≤ (d¯ − LC − ∆)/k
or,
sd /d¯ ≤ (d¯ − LC − ∆)/(kd)
¯ (75)
providing an upper limit to the sample coefficient of variation to be applied for
d¯ < LC.
acceptance probability for plim (lower curve), and trap. region(upper curve)
1.0
0.8
0.6
Pacc
0.4
0.2
0.0
0.0 0.05 0.10 0.15 0.20
Fig. 7: Top: Acceptance probability under the criterion (71) based upon the
estimated proportion nonconforming, and under the criterion (73) for the cor-
responding trapezoidal rule. Bottom: P − P -plot of corresponding values of
the acceptance probability under the two criteria. The identity line has been
superimposed on the graph.
157
corresponding to the center part of the trapezoid (see (43)), the criterion to the
sample standard deviation is
sd ≤ smax
with the maximum sample standard deviation, smax given by (48). Hence, for
these values, the criterion to the sample coefficient of variation is
The requirements (74), (75) and (77) may be presented in a tabular form as in
PDA Technical Report No. 25 [22].
Combining (56) and (65) it is seen that the region bounded by the oblique sides
of the trapezoid may be delimited by the requirement
¯ + ksd < ∆
|LC − d| (78)
i.e. the absolute value of the deviation between sample average and LC plus
(or minus, depending on the sign of k) some adjustment for sampling uncer-
tainty shall not exceed the specification ∆ (e.g. 16.5%). It follows from (54)
that when the level of significance, α has been chosen to be larger than 0.50,
then k > 0 and, hence the uncertainty adjustment is added as an extra penalty,
whereas when α < 0.50, the uncertainty adjustment is subtracted before com-
parison with ∆.
In this section we shall briefly describe the effect of the choice of significance
level on the designation of the various limiting values. For ease of trans-
parency, we shall first consider a test of a one-sided upper specification as
discussed in the subsection page 149.
158 Paper D
The approach to the adjustment for sampling uncertainty depends on the value
of the significance level α. When α > 0.50 (and p0 < 0.5) then k will be
larger than z1−p0 , implying that the sample average shall be even less than
d¯lim (sd ) = (LC + ∆) − z1−p0 sd (79)
in order to provide satisfactorily strong evidence of pa < p0 .
For significance levels α < 0.50 the value of k will be smaller than z1−p0 , im-
plying that even sample averages exceeding d¯lim (sd ) may lead to acceptance,
if only the exceedance is not larger than what may be attributed to the sampling
uncertainty. Thus, for α < 0.50 the procedure accepts unless there is strong
sample evidence contradicting pa < p0 .
The larger the sample, the smaller is the adjustment for sampling uncertainty,
and the closer will k be to z1−p0 .
The interpretations above are valid also for the combined test in the sub sec-
tions page 153 and page 155.
5.6 Discussion
The statistical procedures for acceptance sampling by variables have been tay-
lored for assessing such population values, and the assurance provided by these
procedures under due consideration to the sampling uncertainty has been dis-
cussed in the statistical literature. An operational acceptance criterion may be
formulated in terms of a trapezoidal acceptance polygon in (d, ¯ sd )-space that
approximates the criterion (71) on the estimated proportion of values outside
the limits.
160 Paper D
In the following the statistical properties of the acceptance criteria for unifor-
mity of dosage units by the Content Uniformity method will be investigated.
We shall consider the acceptance criteria only for such situations where the
target test sample amount at the time of manufacture equals the label claim
(LC = 100%, or, in fraction LC = 1).
The proposal has been formulated in terms of an “acceptance value”, and limits
for individual units in the sample as follows:
161
¯ sd ) = |M − d| + ksd
A = A(d, (80)
with (
2.4 when n = 10
k = k(n) = (81)
2.0 when n = 30
and (
¯ 0.985}
max{d, when d¯ ≤ 1.0
¯ =
M = M (d) (82)
¯
min{d, 1.015} ¯
when 1.0 < d.
The requirements are met if the acceptance value of the first 10 dosage units is
less than or equal to 0.15 (denoted L1 ). If the acceptance value is greater than
0.15, test the next 20 units and calculate the acceptance value.
The requirements are met if the final acceptance value of the 30 dosage units
is less than or equal to 0.15 (L1 ), and no unit is over the deviation of 0.25
¯
(denoted L2 ) from the calculated value of M (d).”
a) In each stage, the criteria include as well a parametric test based upon
the acceptance value, (inspection by variables), as limits on individual
measurements (inspection by attributes)2 .
As the requirements in the attribute test involves an “accept zero” crite-
rion, we shall use the term “non-satisfactory units” for units with mea-
surements outside the limits for individual units.
As the criteria involve the individual values of the 30 sample units, a direct as-
sessment of the operating properties of the test by analytical methods requires
numerical integration over a region in a 30-dimensional space, and therefore
we have chosen to assess the properties using numerical simulation.
However, before reporting the results of the simulations, the criterion on the
acceptance value will be further discussed.
where the inequality to be satisfied in order to pass the criterion (84) depends
¯
on the value of d.
A: d¯ = 0.985 − L1 ; sd = 0
B: d¯ = 0.985 ; sd = L1 /k
¯ (87)
C: d = 1.015 ; sd = L1 /k
D: d¯ = 1.015 + L1 ; sd = 0.
Comparing the region defined by (87) with the trapezoidal acceptance region
derived in the subections page 153 and page 155, it is seen that letting ∆ =
L1 + 0.015 and k(p0 ) be given by (81), the two approaches lead to trapezoids
with coinciding oblique sides. They only differ in the position of the upper
horizontal line specifying the maximum sample standard deviation. Hence,
on each stage, the criterion on the acceptance value essentially controls the
proportion of population units outside the interval LC ± (0.015 + L1 )
Table 3 summarizes the criteria in terms of sample average and standard devi-
¯ sd ), and limits for individual measurements3 .
ation (d,
Limits
Acceptance limits
for individual
Value of d¯ for sd
units in the sample
Stage 1, 10 units
d¯ < 0.835 - -
0.835 ≤ d¯ ≤ 0.985 ¯
sd ≤ (d − 0.835)/2.4 0.74 - 1.23
0.985 ≤ d¯ ≤ 1.015 sd ≤ 0.0625 0.75d¯ − 1.25d¯†
1.015 ≤ d¯ ≤ 1.165 ¯
sd ≤ (1.165 − d)/2.4 0.76 − 1.27
1.165 < d¯ - -
Stage 2, all 30 units
d¯ < 0.835 - -
0.835 ≤ d¯ ≤ 0.985 ¯
sd ≤ (d − 0.835)/2.0 0.74 - 1.23
0.985 ≤ d¯ ≤ 1.015 sd ≤ 0.075 0.75d¯ − 1.25d¯
1.015 ≤ d¯ ≤ 1.165 ¯
sd ≤ (1.165 − d)/2.0 0.76 − 1.27
1.165 < d¯ - -
Table 3: Acceptance limits for sample standard deviation sd and limits for
individual sample units at each stage in the dosage uniformity test
165
The simulations were performed in a lattice in (µd , σd )-space, with 3000 sam-
ples (each consisting of 30 units) simulated in each lattice point. Hence, the
uncertainty due to simulation is at most ±0.02 (for values of the acceptance
probability in the neighbourhood of Pacc = 0.5) and less than ±0.0001 when
Pacc ≥ 0.75, or Pacc ≤ 0.25.
Figure 8 shows the level-curves (in (µd , σd )-space) for the overall probabil-
ity of acceptance under the procedure. The shape of the curves resembles the
curves of the solution to pnonc (µd , σd ; ∆) = p (see (40) and Figure 6) for ∆ =
0.165. For comparison, Figure 9 shows the solution to pnonc (µd , σd ; 0.165) =
p for various values of p. Superimposing the graphs in the two figures it is seen
that there is a good agreement between the shapes of the two set of curves.
Thus, the criterion on the acceptance value that at each stage controls the pro-
portion of population units outside the interval LC ± (0.015 + L1 ), apparently
also has a dominant effect on the properties of the overall two-stage procedure
with supplementary attribute criteria on individual measurement values, di .
Comparing the graphs in the two figures it is seen that populations with a pro-
portion pnonc (µd , σd ; 0.165) = 0.005 of values outside the interval 1.0±0.165
will have a probability of more than 99% of being accepted by the procedure;
when the population proportion of values outside the interval 1.0 ± 0.165 is
0.09, the acceptance probability is 10%.
For each lattice point (µd , σd ), the value of the acceptance probability Pacc (µd , σd )
has been plotted against the corresponding value of pnonc (µd , σd ; 0.165) in
Figure 10. The rather narrow sigmoid-shaped scatter of points confirms the
impression from Figure 8 that the criteria essentially controls the population
proportion of values outside the interval 1.0 ± 0.165.
0.12
0.10
0.08
sigma
0.1
0.06 0.50
0.04
0.99 0.75
0.99
0.02
0.10
0.25
0.50
0.75
0.0 0.99
mu
Fig. 8: Level-curves for the overall probability of acceptance for the procedure.
0.12
0.10
0.08
sigma
0.06
0.04
0.010
0.04 0.005
0.02
0.0 0.040
0.035
0.030
0.025
0.020
0.015
0.010
0.005
mu
However, as the scatter in the vertical direction is larger than what can be ex-
plained by simulation uncertainty, there is not a unique value of the acceptance
probability corresponding to a given value of pnonc (µd , σd ), but rather an in-
terval of values with the specific value of the acceptance probability depending
on the particular combination (µd , σd ) giving rise to this proportion noncon-
forming. Thus, the acceptance probabilities are represented by an OC “band”
rather than a single OC-curve. It is well-known that even in the case of sin-
gle sampling plans such an OC band is an inherent feature of the statistical
test, but the thickness of the band depends on the test method selected (see
Lei and Vardeman [60]). The thickness of the band clearly has an effect on the
steepness of the OC-curve. The thicker the band the less steep is the OC-curve.
For the two-stage plan under study it is believed that the thickness is further
enhanced by the choice of the position of the upper horizontal line of the
trapeziodal acceptance region in the two stages. Moreover, as discussed in
the subsection below (page 167), the additional attribute-criterion on individ-
ual measurement values further contribute to the thickness of the band.
Grossly speaking, the limits to be applied for the individual measurement val-
ues correspond to an interval ±25% (termed L2 ) around the label claim.
•••••
•
••• •
•
••••••
•••••
Probability of acceptance
• •
•••
0.6
••••••
••• •
•••••••••
•••••
•• ••
•••••••
0.4
•• •
•••
•• •
•••••
••• ••••
• ••
••••
••••••• •
•••••
0.2
••••••••••
• •••••••
•• ••••• •
••••••••••••••••
••• ••••••••• •••
• • ••••••••••
• ••••• ••••••••••••••••••••••••••
• • • •• • •••••••• •••••• ••••••••• •••••••••••••••••••••••• •• • • • •• • •
0.0
• • • ••• • • •
0.25), Figure 11 shows the solution to pnonc (µd , σd ; 0.25) = p for various
values of p. Superimposing the graphs in Figure 11 on the graph of the level-
curves of the OC-surface of the procedure (Figure 8) it is seen that the dis-
crimatory power against proportion of nonsatisfactory units depends strongly
upon the particular combination of population mean and standard deviation,
(µd , σd ). In some cases, populations with 0.01 % nonsatisfactory units will
be rejected with a high probability (larger than 0.9) whereas in other cases
(depending on the value of µd ) such populations will be accepted with a prob-
ability higher than 0.99.
However, in greater detail as seen from Table 3 (page 164), the limits depend
on the value of the sample average, thereby extending the range of values to the
slightly asymmetric interval 0.74 − 1.27. Moreover, as the limit to be applied
depend on the sample average d¯ that is subject to random error, the criteria on
the individual limits might contribute some random “noise” to the acceptance
value criterion and result in a less steep OC-surface.
In order to assess the effect of the attribute criteria on the discriminatory power
169
0.12
0.10
0.0500
0.08
sigma
0.06
0.15
0.04 0.0001
0.02
0.0005
0.0001
0.1500
0.1000
0.0500
0.0100 0.1000
0.1500
0.0 0.0050
0.0010
0.0001
0.0005
mu
of the procedure, the simulation resuls were recalculated disregarding the cri-
teria on individual measurement values and using only the criterion based upon
the acceptance value for determining acceptability. Figure 12 shows the level-
curves (in (µd , σd )-space) for the probability of acceptance when disregarding
the criteria for the individual measurement values. Superimposing these graphs
upon the level-curves in Figure 8, it is seen that there is virtually no difference
between the two sets of level-curves. Thus, under the assumption of normally
distributed measurement values, the criteria for the individual measurement
values does not affect the ability to discriminate between different combina-
tions (µd , σd ) of population values. This is in line with the fact that (d, ¯ sd )
are jointly sufficient for (µd , σd ) and therefore knowledge about the individual
measurement values does not add to the information provided by (d, ¯ sd ).
0.12
0.10
0.08
sigma
0.1
0.06 0.50
0.04
0.99 0.75
0.99
0.02
0.10
0.25
0.50
0.75
0.0 0.99
mu
Fig. 12: Level-curves for the overall probability of acceptance for the proce-
dure, disregarding the criteria on individual measurement values.
An overview of the procedure and the possible conclusions at each stage has
been provided in Figure 13.
171
Stage 1
NO YES
REJECT
The combination of
sample mean and sample
standard deviation falls YES
within the acceptance area ACCEPT
for stage one
NO
Stage 2
NO YES
REJECT
The combination of
sample mean and sample
standard deviation falls YES
within the acceptance area
ACCEPT
for stage two
NO
REJECT
Fig. 13: Schematic representation of the procedure for content uniformity test-
ing.
After assaying the first sample, three different actions are possible
3. Reject (i.e. the test is not passed) when one or more units is beyond the
attribute limits, irrespective of the acceptance value4 .
4
The wording in [28] does not explicitly specify this option; however in previous versions
explicit provisions were given for this option. Moreover, the option makes sense in practice, as
172 Paper D
Thus, it is possible to conclude the test after testing only 10 units, by outright
acceptance (when the acceptance value is satisfactorily small), or by outright
rejection (when one or more nonsatisfactory units are found in the first sample).
0.12
0.05
0.10
0.08
sigma
0.06
0.04
0.05
0.02
0.99
0.05
0.10
0.25
0.50 0.75
0.50
0.25 0.99 0.25
0.10
0.05
0.0 0.75
0.99 0.10
0.05
mu
Fig. 14: Level-curves of the probability of invoking the second stage of the
procedure.
Thus, from a purely statistical point of view, the test by attributes in stage 1
serves the purpose of saving testing ressources for batches that would have
been rejected anyhow (after testing all 30 units). This might, however, have
been achieved by using also a criterion based upon sample average and stan-
dard deviation for rejection in the first stage. Schilling [12] describes the design
of two-stage sampling plans by variables that allow for rejection also at stage
1.
the criterion on sample average and standard deviation has the statistical ad-
vantage to make the test procedure more robust towards deviations from the
assumption of a normal distribution of dosage content in the tablets.
The simulations were performed in the same lattice in (µd , σd )-space as in the
previous section. In each lattice point, (µd , σd ), 3000 samples (each consisting
of 30 units) were simulated as random values from a lognormal distribution
with that mean and standard deviation.
Figure 15 shows the level-curves (in (µd , σd )-space) for the overall probability
of acceptance under this distributional assumption. Comparing with the level-
curves in Figure 8 it is seen that the part of the curves corresponding to values
1.0 < µd and larger values of σd have been shifted slightly to the right un-
der the lognormal distribution, indicating a slighty larger probability of accep-
tance under the lognormal distribution than under a normal distribution with
the same mean and variance. This may be interpretated by the fact that under
the lognormal distribution there is a heavier concentration of probability mass
to the left of the mean than under the corresponding normal distribution, and
this outweighs the heavier right-hand tail of the lognormal distribution.
However, the heavier tail of the lognormal distribution might imply that the
attribute criterion on individual measurement values would have a greater in-
fluence than under the normal distribution. In analogy with Figure 12 for nor-
mally distributed measurement values, Figure 16 shows the level-curves for
the probability of acceptance when disregarding the criteria for the individual
measurement values. Superimposing these graphs upon the graph in Figure
15 it is seen that there is virtually no difference between the two sets of level
curves. Thus, also under a lognormal distribution of individual measurement
175
0.12
0.10
0.08 0.25
sigma
0.1
0.06
0.04
0.99
0.02
0.10
0.25
0.50
0.75
0.0 0.99
mu
Fig. 15: Level-curves for the overall probability of acceptance for the proce-
dure under a lognormal distribution of measurement values.
values, the criteria on the accectance values are the dominant criteria of the
procedure.
0.12
0.10
0.08 0.25
sigma
0.1
0.06
0.04
0.99
0.02
0.10
0.25
0.50
0.75
0.0 0.99
mu
Fig. 16: Level-curves for the overall probability of acceptance for the pro-
cedure under a lognormal distribution, disregarding the criteria on individual
measurement values
When a single point, (p, Pacc (p)) on the OC-curve has been specified, it fol-
lows from the considerations in Section 5.5, that such a specification defines a
relation between sample size n and limiting value, plim = plim (n), of the es-
timated proportion nonconforming units such that the OC-curves of all such
sampling plans with (n, plim ) satisfying this relation will pass through the
specified point (p, Pacc (p)). The larger the sample size, the steeper the OC-
curve.
Correspondingely, when two points (pa , 1 − α) and (pr , β)) on the OC-curve
have been specified, it is possible to determine a unique combination, (n, plim )
such that the OC-curve for that sampling plan satisfies
Often pa is termed the “producer’s risk quality” with α denoting the corre-
sponding producer’s risk, and pr is termed the “consumer’s risk quality” with
177
The table below shows the single sampling plans “matching” the acceptance
procedures in the USP preview dosage uniformity test for various choices of
matching points, (pa , 1 − α) and (pr , β)). It is seen that the discriminatory
power of the 10 − 20 two-stage plan corresponds to a single sampling with
sample size slightly larger than 20 units. Thus, for good quality productions
the savings when using the two-stage plan is 50% (corresponding to acceptance
in stage 1), and the extra effort when analysis of the second sample is called
for, also amounts to 50% of the sample size for the single sampling plan.
pa α pr β n k smax plim
0.010 0.10 0.10 0.05 22 1.89 0.076 0.0299
0.006 0.05 0.10 0.05 21 1.91 0.075 0.0278
0.010 0.10 0.075 0.10 24 1.90 0.075 0.0278
7 Further issues
One such factor could be the variation introduced by the analytical procedure.
The effect of measurement error on the operating properties of procedures for
acceptance sampling by variables has been discussed in [31] and [32].
178 Paper D
acceptance probability for plim (lower curve), and trap. region(upper curve)
1.0
0.8
0.6
Pacc
0.4
0.2
0.0
0.0 0.05 0.10 0.15 0.20
Fig. 17: OC-curve for single sampling plan with n = 21, k = 1.91, smax =
0.075; plim = 0.0278.
acceptance probability for plim (lower curve) and trap. region (upper curve)
1.0
0.8
0.6
Pacc
0.4
0.2
0.0
0.0 0.05 0.10 0.15 0.20
Fig. 18: OC-curve for single sampling plan with n = 22, k = 1.89, smax =
0.076; plim = 0.0299.
179
acceptance probability for plim (lower curve) and trap. region (upper curve)
1.0
0.8
0.6
Pacc
0.4
0.2
0.0
0.0 0.05 0.10 0.15 0.20
Fig. 19: OC-curve for single sampling plan with n = 24, k = 1.90, smax =
0.075; plim = 0.0278.
8 Discussion
values. Thus, considerations on the sampling uncertainty have mainly been im-
plicitly considered in the specification of the limiting values, and the assurance
provided by the acceptance criteria has not been very transparent.
In the paper we have discussed the acceptance criteria in terms of the statistical
hypothesis concerning population values that is implicitly underlying the cri-
teria. Using the concepts from statistical theory of hypothesis testing, a more
transparent description of the assurance provided by the criteria, and the con-
siderations on the sampling uncertainty is obtained.
9 List of symbols
Symbol
E[X] Mean of the population distribution of X
V[X] Variance of the poulation distribution of X
D Relative dose i.e. mass of drug (in blend sample, or in
tablet) as fraction, or percentage of target value (ran-
dom variable)
d Relative dose in sample unit (actual value)
µd Mean relative dose (population value)
σd Standard deviation in distribution of relative doses
(population value)
Cd Coefficient of variation in distribution of relative doses
(population value)
Cd = σd /µd
n Number of units in sample
D sample average relative dose per sample unit (random
variable)
d¯ sample average relative dose per sample unit (actual
value)
Sd sample standard deviation of relataive doses in sample
(random variable)
sd sample standard deviation of doses in sample (actual
value)
LC Required dose, Label Claim. In Section 5 and 6
measurements are relatively to LC, i.e. LC=1 (100%).
χ2f Random variable distributed according to a χ2 -
distribution with f degrees of freedom
χf , p 2 p’th quantile in χ2 -distribution with f degrees of free-
dom, (i.e with probability mass p to the left of this
value)
Some (mainly US) textbooks use the notation χ2α,ν to
denote the so-called χ-squared critical value, denot-
ing the number on the measurement axis such that the
probability mass for the χ2ν -distribution to the right of
this value is α.
F(f1 , f2 ) Random variable distributed according to a F-
distribution with (f1 , f2 ) degrees of freedom
F(f1 , f2 )p p’th quantile in F-distribution with (f1 , f2 ) degrees of
freedom, (i.e with probability mass p to the left of this
value)
182 Paper D
Symbol
tf (δ) Random variable distributed according to a noncen-
tral t-distribution with f degrees of freedom and non-
centrality parameter δ
t(f, δ)p p’th quantile in noncentral t-distribution with f degrees
of freedom and non-centrality parameter δ, (i.e with
probability mass p to the left of this value)
zp p’th quantile in standard normal distribution, (i.e with
probability mass p to the left of this value)
∆, ∆1 quantity serving to specify the limits for individual val-
ues (usually in the form of LC ± ∆ )
P3−c (µd , σd ) Acceptance probability under a 3-class attribute one-
stage sampling plan when mean and standard deviation
in the population are (µd , σd )
pnonc (µ, σ; ∆) Generic function (34) expressing the probability mass
outside the limits LC ± ∆ in a normal distribution with
mean µ and standard deviation σ.
Pusp (µd , σd ) probability of passing the USP-21 test when mean and
standard deviation in the batch is (µd , σd )
pa (µd , σd ) fraction of units outside limits LC ± ∆ when mean and
standard deviation in the batch is (µd , σd )
p+ (µd , σd ) fraction of units violating upper limit, LC + ∆ when
mean and standard deviation in the batch are (µd , σd )
p(µd , σd ) fraction of units violating lower limit, LC − ∆ when
mean and standard deviation in the batch are (µd , σd )
¯ sd )
A(d, Acceptance value used for determining acceptability,
see (80)
Paper E
183
185
The statistical properties of such tests of content uniformity are usually as-
sessed assuming a normal distribution of content in the tablets. However, ac-
tual distributions of particle sizes are often seen to be skewed, and it is there-
fore conceivable that this feature will also affect the shape of the distribution of
content in the dosage units. In this paper the particles referred to are the drug
particles in the blend.
In the paper we investigate the effect of relative variation, skewness and ex-
cess (heavy-tailedness) of the distribution of particle diameters on the result-
ing distribution of particle volume and particle mass under the assumption that
particles are spherical. For a log-normal distribution of particle diameters, the
resulting distribution of particle volume and mass is also a log-normal distribu-
tion with a mean that is larger than the mass corresponding to the mean radius,
and a coefficient of variation that is larger than the coefficient of variation for
the distribution of the radii. It is shown that this implies that the skewness
and excess of the distribution of particle radii is amplified when transformed
to the distribution of particle mass. The larger the coefficient of variation in
the distribution of particle radii, the more these departures from normality are
amplified.
Along with the variation in particle mass, an important source of dose varia-
tion for low-dose tablets is the variation in the number of drug particles in the
tablets. We investigate the transfer of the variation of particle sizes and of num-
ber of particles in the tablets to dose variation in tablets under the assumption
of a homogeneous blend with a random scattering of particles over the blend,
and derive expressions for skewness and excess of the distribution of tablet
doses. It is demonstrated (as previously shown by Yalkowsky and Bolton [64])
that for a given distribution of particle sizes the variance in the distribution of
186
Although the results are presented in terms of content uniformity of tablets, the
statistical results are also valid for blend samples, under the assumption that
no systematic sampling error is introduced by the sampling process. The only
difference being that the size of individual blend samples usually is larger than
the size of the dosage units, and therefore in the interpretation of the results,
label claim shall be substituted by the target value of the blend sample.
where the parameters αR and βR are related to the so-called geometric mean,
µgR , and the geometric standard deviation by
2
µR = E[R] = exp(αR + (1/2)βR ) (4)
p q
σR = V[R] = µR × exp(βR 2 ) − 1. (5)
g
Hence, the parameter βR = ln(σR ) is in a one-to-one correspondence with the
coefficient of variation, CR
q
βR = log(1 + CR 2) (7)
1 2
αR = log µR − log(1 + CR ) (8)
2
and
2 2
exp(βR ) = 1 + CR . (9)
Expressed in terms of the mean, µR and the coefficient of variation, the mo-
ments about the mean are
Hence, the coefficients of skewness, γ1 , and excess, γ2 are (see e.g. [45])
3 2
γ1 (R) = µ3 (R)/σR = CR × (CR + 3) > 0 (13)
4
γ2 (R) = µ4 (R)/σR −3
2 4 2 3 2 2
= (1 + CR ) + 2(1 + CR ) + 3(1 + CR ) −6
2 2 4 6
= CR (16 + 15CR + 6CR + CR ) > 0. (14)
Thus, the distribution is positively skewed, and has heavier tails than the nor-
mal distribution (is leptokurtic).
CR γ1 γ2
0.1 0.30 0.16
0.2 0.61 0.66
0.3 0.93 1.57
0.4 1.26 2.97
0.5 1.63 5.04
E[R3 ] = µ3R CR
4 2
(CR 2
+ 3) + 3CR +1 (15)
V[R3 ] = E[R6 ] − (E[R3 ])2 . (16)
For a log-normal distribution of particle radii, the distribution for the particle
volume, V = (4π/3)R3 , is also a log-normal distribution with parameters
Assuming that all drug particles have the same mass density, ρ, the distribution
of particle mass M = ρV is also log-normal with parameters
Utilizing the fact that the distribution of particle mass is also a log-normal
distribution, the third and fourth moment about the mean are found in analogy
with (11) and (12)
µ3 (M ) = E[(M − µM )3 ] = µ3M CM
4 2
(CM + 3) (25)
4
µ4 (M ) = E[(M − µM ) ]
h i
= µ4M CM
4 2 4
(1 + CM 2 3
) + 2(1 + CM 2 2
) + 3(1 + CM ) − 3 (. 26)
191
In analogy with (13) and (14) one obtains the coefficients of skewness and
excess for the distribution of particle mass
3 2
γ1 (M ) = µ3 (M )/σM = CM × (CM + 3) > 0 (27)
4
γ2 (M ) = µ4 (M )/σM −3
2 4 2 3 2 2
= (1 + CM ) + 2(1 + CM ) + 3(1 + CM ) −6
2 2 4 6
= CM (16 + 15CM + 6CM + CM )> 0. (28)
Inserting µM and CM from (21) and (22) the moments of the distribution of
particle mass may be expressed in terms of the parameters in the distribution
of particle radii as
µ3 (M ) = E[(M − µM )3 ]
= [(4π/3)ρµ3R ]3 (1 + CR
2 9 2 9
) [(1 + CR ) − 1]2 [(1 + CR
2 9
) + 2] (29)
µ4 (M ) = E[(M − µM )4 ]
= [(4π/3)ρµ3R ]4 [(1 + CR
2 9
) − 1]2 (1 + CR
2 12 2 9
) [(1 + CR ) − 1]2 ×
2 36 2 27 2 18
[(1 + CR ) + 2(1 + CR ) + 3(1 + CR ) − 3]. (30)
Further, the coefficients of skewness and excess for particle mass may be ex-
pressed in terms of the coefficient of variation, CR in the distribution of the
radii as
q
2 )9 2 9
γ1 (M ) = (1 + CR −1 (1 + CR ) +2 (31)
2 36 2 27 2 18
γ2 (M ) = (1 + CR ) + 2(1 + CR ) + 3(1 + CR ) − 6. (32)
Table 2.2 shows the coefficient of variation in the distribution of particle mass
and the corresponding coefficients of skewness and excess for different values
of the coefficient of variation in the distribution of particle radii. It is seen that
the relative variation in the distribution of the radii is amplified when consider-
ing the distribution of particle mass, resulting in a similar marked amplification
of the coefficients of skewness and excess.
192
CR CM γ1 γ2
0.1 0.31 0.95 1.64
0.2 0.65 2.23 9.95
0.3 1.08 4.52 50.89
0.4 1.67 9.72 356.55
0.5 2.54 24.00 4.07×103
Most of the probabilistic models for mixtures in the literature discuss statis-
tical properties of samples from mixtures of two (or more) components when
sampling a fixed number of particles, or a fixed volume from the mixture, see
e.g. the survey by Gjelstrup Kristensen [13] and concern the relative distribu-
tion (by mass) of the two components in the sample. In these models the two
components enter symmetrically and the main issue is to describe the overdis-
persion compared to a binomial distribution of particles of the key component.
For low-dose tablets focus is upon the distribution of the small fraction of key
component particles in the blend, and this symmetry between key component
particles and other particles is not necessarily of concern. Therefore, in the
following we shall investigate the distribution of dose content in tablets un-
der various assumptions on the distribution of key component particles in the
blend. As the resulting variation in dose content in tablets is caused by varia-
tion of the number of drug particles in the sample as well as variation in particle
sizes, the discussion aims at separating the contribution from these two sources
of variation, thereby allowing for more transparent analytical formulae for the
variation.
The dose content of a tablet is the sum of the masses of the N drug particles in
that tablet
XN
D= Mi (33)
i=1
If all tablets (or samples) contained the same number of drug particles, N , dose
194
variation would only be caused by the variation in the mass of the individual
drug particles selected by the tabletting process. Assume that particles are
sampled at random from the distribution (by number) of particle radii.
For later reference the moments in the distribution of doses under this simpli-
fying assumption are given below.
P
µD|N= E[ Ni=1 Mi ] = N × µM (34)
2 P 2
µ2 (D|N ) = σD|N = V[ N i=1 Mi ] = N × σM (35)
3 3
µ3 (D|N ) = E[(D − µD|N ) ] = E[(D − N µM ) ] = N µ3 (M ) (36)
µ4 (D|N ) = E[(D − µD|N )4 ] = N µ4 (M ) + 3N (N − 1)σM
4
(37)
such that
µ4 (D|N ) = κ4 (D|N ) + 3(µ2 (D|N ))2
but
κ4 (D|N ) = N κ4 (M ) = N (µ4 (M ) − 3µ2 (M )2 )
and
µ2 (D|N ) = N µ2 (M )
such that
The coefficient of variation in the distribution of doses is found from (34) and
(35) as
σD|N 1
CD|N = = √ × CM (40)
µD|N N
195
Spherical particles
For spherical particles with a log-normal distribution of particle radii, the co-
efficient of variation in the distribution of doses may be expressed in terms of
the coefficient of variation, CR , in the distribution of radii using (22) such that
q
1 2 )9 − 1
CD|N = √ × (1 + CR (43)
N
and γ1 (M ) and γ2 (M ) are given by (31) and (32), respectively.
The expressions (41) and (42) show that the effect of skewness and excess
of the distribution of particle masses is diminished when particle masses are
added to form tablets. The larger the number of particles, the smaller the skew-
ness and excess, and the better will the distribution of doses be approximated
by a normal distribution. This is also a consequence of the central limit theo-
rem in probability theory.
When the intensity of drug particles is small, perfect mixing should imply
that the distribution of particle volume in a sample unit is independent of the
number of particles in the unit. For large particles, however, this assumption
is not necessarily satisfied, and more structured random measures need to be
invoked, see Stoyan et al [65].
In the present discussion we shall assume that the variation in the number of
particles in tablets may be described by independent samples from a Poisson
distribution with mean N , and that the distribution of particle sizes in a sam-
pled unit correspond to a random sample from the (number) distribution of
particle sizes.
Thus, the probability mass function for the number, N , of particles in a tablet
is n
N
fN (n) = exp − N . (44)
n!
For a specified absolute tablet dose, LC, and a drug with mean particle mass,
µM , the mean number of particles will be (assuming that the mean dose equals
label claim, LC)
N = LC/µM . (45)
Taking the variation of the number N of particles into account, and utilizing
that for the Poisson distribution V[N ] = N , the mean and variance of the dose
under an arbitrary distribution of particle masses are found as
We note that the coefficient of variation, CD , of dose content under this as-
sumption of a random spatial distribution of particles mixing satisfies
q
CD = CD|N × 1 + 1/CM 2 (50)
When relative doses, d = D/LC, are considered, the variance, σd2 , of the
distribution of relative doses is
µM
σd2 = V[D/LC] = CD 2
= (CM2
+ 1) (53)
LC
with the coefficient of variation, Cd , in the distribution of relative doses given
by (52) as Cd = CD .
198
This is in line with the general observation from theories on blend uniformity
that the standard deviation of relative content decreases as the square root of
sample weight (or volume).
The relation (53) has previously been established by Yalkowsky and Bolton
[64]. The relation is in agreement with the well known results on variance-
sample size relations for random binary mixtures. Noting that µM /LC = 1/N
the relation (53) simply states that the variance in the distribution of relative
doses is proportional to 1/N .
Spherical particles
0.5
0.4
5
0.3
4
C_ 0 3
R .2
2 R
0.1 mu_
1
0
When relative doses, d = D/LC are considered, the variance, σd2 (53) under
the assumption of spherical particles may be expressed as
4π × ρ
σd2 = × µ3R (1 + CR
2 12
) . (61)
3LC
For a specified maximum value, Cmax of CD , it follows from (52) that under
the assumption of perfect mixing, the mean particle mass, µM , shall satisfy
2
Cmax
µM ≤ LC × 2 (62)
CM + 1
Thus, the limit for acceptable particle masses increases proportional to LC, and
decreases proportional to (1 + CM 2 )−1 as C
M increases.
Thus, the limit for acceptable particle radii increases proportional to the third
root of LC, and decreases with increasing coefficient of variation, CR , for the
2 )−4 .
distribution of particle radii, the increase being proportional to (1 + CR
Reducing the label claim by 50% implies that the maximum allowable average
particle radius is reduced by 20% (if the coefficient of variation in the distribu-
tion of particle radii remains unchanged). Reducing the label claim to a tenth
implies that the maximum allowable average particle radius is reduced by 50%.
201
25
20
0.8
15 0.7
LC 0.6
10 0.5 C_r
0.4
5
Thus, a blend that might produce a satisfactory distribution of doses (in terms
of the coefficient of variation in the distribution of relative doses) for large dose
tablets need not be satisfactory for smaller dose tablets.
Figure 2 shows the limit (63) for particle radii necessary to assure that the
coefficient of variation in the distribution of doses does not exceed Cmax =
0.05. The limit is shown as function of label claim, LC, and coefficient of
variation, CR , in the distribution of particle radii for ρ = 1.3 × 10−6 µg/µm3 .
It is seen that the maximal radius increases rather slowly with the label claim
(proportional to the third root of label claim).
The requirement (62) to the mean drug particle size may be expressed as a
202
Thus, for low-dose tablets, assuming perfect mixing, and a log-normal distri-
bution of particle sizes, the requirement on the coefficient of variation in the
distribution of dosage units is essentially a general requirement on the mini-
mum average number of particles in a dosage unit. This minimum average
number of particles does not depend on label claim.
2
σD = N µ2M × (CM
2
+ 1).
But
(D − N µM )3 = (D − N µM )3 + 3(D − N µM )2 (N − N )µM
+3(D − N µM )(N − N )2 µ2M + (N − N )3 µ3M (67)
203
and therefore
µ3 (M ) + µM σM2 + µ3
1
= √ × M
.
N µ3M (CM2 + 1)3/2
1 C 4 × (CM
2 + 3)
γ1 (D) = √ × M 2 (71)
N (CM + 1)3/2
which shows that also in this case, the skewness tends to zero when the number
of particles increase.
(D − N µM )4 = [(D − N µM ) + (N − N )µM ]4
= (D − N µM )4 + 4(D − N µM )3 (N − N )µM
+6(D − N µM )2 (N − N )2 µ2M
+4(D − N µM )(N − N )3 µ3M + (N − N )4 µ4M
204
such that
and hence
µ4 (D) = EN E[(D − N µM )4 ]|N
= Nµ4 (M ) + 3 V[N ] + N (N − 1) µ2 (M )2 + 4V[N ]µM µ3 (M )
2 2
+6 µ3 (N ) + N V[N ] σM µM + µ4 (N )µ4M .
2
For the Poisson distribution we have µ4 (N ) = 3N + N , and therefore
2
µ4 (D) = N µ4 (M ) + 3N µ2 (M )2 + 4N µM µ3 (M )
2 2 2
+6N (N + 1)σM µM + 3N + N µ4M .
which shows that the excess of the distribution of the doses tends to zero when
the number of particles increase.
Thus, the effect of the deviation from the normal distribution of particle sizes
becomes less pronounced the larger the average number of particles.
Invoking (45) to express the average number of particles in terms of the label
claim, (71) may be expressed as
r
µM C 4 × (CM2 + 3)
γ1 (D) = × M 2 (73)
LC (CM + 1)3/2
However, instead of the random dispersion of dose particles over the blend
modelled by a distribution of the number of particles in the blend as a homoge-
neous Poisson point process in the three-dimensional space, we shall consider
a model for a nonhomogeneous blend.
Let Λ(·) denote the random intensity measure defined over the blend, and let
Z
Λv = Λ(u)du
u∈v
When a sample of a fixed volume, v, is selected from the blend, the sampling
process may similarly be thought of as a two-step random mechanism. First,
a random intensity Λv corresponding to that volume is selected from a distri-
bution of volume-intensities, and secondly a number of particles is generated
as a Poisson distributed random variable with mean λv corresponding to that
intensity.
E[Λv ] = v × ν (75)
where ν denotes the overall average number of particles in the blend per unit
volume ([µm3 ]) and where the measure, γv , of the clustering tendency is re-
lated to the coefficient of variation in the distribution of λv ,
p
V[Λv ] γv
CΛ = =√ . (77)
E[Λv ] v×ν
The suffix v on γv2 serves to emphasize that the variance in the distribution of
Λv may depend on the sample amount v. Moreover, γv2 depends on the second
207
Under this assumption it may be shown (see e.g. Johnson and Kotz [66]) that
the marginal distribution of the number of particles in a sample of volume v is
a negative binomial distribution with mean and variance
N = E[N ] = v × ν (78)
V[N ] = EΛ [V[N |Λv ]] + VΛ [E[N |Λv ]] (79)
= E[Λv ] + V[Λv ] = vν(1 + γv2 ) (80)
= E[N ] × (1 + γv2 ) (81)
i.e. γv2 expresses the overdispersion of counts as compared with the Poisson
distribution. For γv = 0 the negative binomial distribution of particle counts
degenerates to a Poisson distribution with mean N = vν.
particle sizes, then the moments in the conditional distribution of doses given
the number of particles in the tablet are given by (34) to (37).
It then follows from (46) and (47), that the (marginal) mean and variance in
the distribution of doses are
µD = E[D] = N µM = v × ν × µM (83)
2 2
σD = V[D] = N σM + µ2M V[N ] (84)
2
= N × (σM + µ2M (1 + γv2 ))
= N µ2M × (CM
2
+ 1 + γv2 ) (85)
and
q
1 2 + γ 2.
CD = √ 1 + CM v (86)
N
Comparing (87) and (49) for a given number distribution of particle masses,
it is seen that the overdispersion of the distribution of the number of particles
under imperfect mixing implies an overdispersion of doses as compared to the
distribution of doses under perfect mixing (Poisson distribution of number of
particles).
For spherical particles, µM and CM are given by (21) and (22) and one obtains
2
σD = = N µ2M × (CM 2
+ 1 + γv2 ) = N µ2M × (1 + CR 2 9
) + γv2 (91)
h i2
= N × (4π/3)ρµ3R × (1 + CR 2 3
) × (1 + CR2 9
) + γv2 . (92)
Comparing (93), (58) and (40) for a given number distribution of particle radii,
it is seen that the overdispersion of the distribution of the number of particles
(as compared to the Poisson distribution) under imperfect mixing implies an
overdispersion of doses as compared to the distribution of doses under perfect
mixing.
2 4πLC × ρ
σD = V[D] = × µ3R (1 + CR
2 3
) (1 + CR 2 9
) + γv2 (94)
s 3
4πρµ3R (1 + CR
2 )3
2 )9 + γ 2
CD = × (1 + CR (95)
3LC v
which shows that under the assumption of perfect mixing and a given distribu-
tion of particle sizes does the absolute standard deviation increase proportional
210
to the square root of the magnitude of the dose (LC), and hence the coefficient
of variation decreases proportional to the square root of the dose. The larger
the dose, the smaller a coefficient of variation in the distribution of doses.
5 Discussion
transfer to tablet press and tabletting process: ideally, the transfer aims at
transferring the spatial distribution in the blend at tablet size level.
In the paper we have investigated the transfer of variation resulting from the
sieving/milling process through the mixing process to samples from the blend,
or to tablets in ideal situations where only natural variation is present, i.e.
no sampling bias, no deblending etc., (only some clustering tendencies in the
blend have been considered).
been provided. It was found that the effect of skewness and excess in the
distribution of particle masses is diminished when particle masses are added to
form tablets. The larger the number of particles, the smaller the skewness and
excess in the distribution of doses, and the better will the distribution of doses
be approximated by a normal distribution.
In Section 3.3 the further variation introduced by the variation of the number
of particles in the tablets has been investigated under the assumption of a ho-
mogeneous blend, i.e. a random spatial distribution of dose particles in the
blend. Expressions for the moments in the distribution of dose content have
been provided. As could be expected, the variation of the number of particles
amplifies the variation in dose due to varying particle sizes. However, unless
the distribution of particle masses is very narrow, the major contribution to the
variation in dose content originates from the variation in particle masses, and
hence, also in this case the coefficient of variation in the distribution of dose
content is inversely proportional to the square root of the number of particles.
In Section 3.4, explicit expressions for the minimum average number of par-
ticles per tablets necessary to secure a specified coefficient of variation in the
distribution of doses have been provided under the assumption of a homoge-
neous blend, and in Section 3.5 expressions for coefficients of skewness and
excess for the distribution have been provided.
Finally, in Section 4, the results are extended to cover also such nonrandom
mixtures that may be modelled by a hierarchical model representing a spatial
variation of the intensity of particles.
Results have been derived under the ideal assumption of spherical particles.
Therefore, for practical use, a “nonsphericity factor” might be introduced in
the transformation from distribution of particle radii to particle mass and vice
versa. In the paper we have allowed for this possibility by providing separate
expressions for moments in the distribution of doses in terms of properties of
the distribution of particle masses, valid regardless of shapes, supplemented
by expressions in terms of properties of the distribution of particle radii, valid
only under the further assumption of spherical particles.
6 List of symbols
Symbol Units
E[X] Mean of the distribution of X
V[X] Variance of the distribution of X
EN [X] Mean (under variation of N ) of the distribution of X
VN [X] Variance (under variation of N ) of the distribution of X
µ3 (X) Third moment about the mean in the distribution of X
µ3 (X) = E[(X − E[X])3 ]
µ4 (X) Fourth moment about the mean in the distribution of X
µ4 (X) = E[(X − E[X])4 ]
γ1 (X) - Coefficient of skewness of the distribution of X (fraction)
- γ1 (X) = µ3 (X)/(V[X])3/2
γ2 (X) - Coefficient of excess of the distribution of X (fraction)
- γ2 (X) = µ4 (X)/(V[X])2 − 3
R µm Radius of a particle
µR µm Mean particle radius
σR µm Standard deviation in particle radius distribution
CR - Coefficient of variation in particle radius distribution (fraction)
CR = σR /µR
βR Standard deviation in distribution of log particle radius
αR log(Median) in particle radius distribution
V µm3 Particle volume
ρ µg/µm3 Mass density of particles
M µg Particle mass
µM µg Mean particle mass
σM µg Standard deviation in particle mass distribution
CM - Coefficient of variation in particle mass distribution (fraction)
CM = σM /µM
N Number of particles in tablet
N Average number of particles in tablet
D µg Dose (Total mass of drug in tablet)
P
D= N i=1 Mi
µD µg Mean dose
σD µg Standard deviation in dose distribution
CD - Coefficient of variation in dose distribution (fraction)
CD = σD /µD
LC µg Required dose, Label Claim
d - Relative dose (fraction)
d = D/LC
µd - Mean relative dose
σd - Standard deviation in distribution of relative doses
215
216
Paper F
Case: Analysis of
homogeneity in production
scale batches
217 F
219
1 Purpose
This case relates to the sampling from a low dose (25µg) product. An experi-
mental design was set up with the purpose to investigate:
1. whether the type of thief shown to the left in Figure 2.3 in Chapter 2 is
qualified as sampling device for the product. Tablet samples from the
same batch are used to evaluate this problem.
(a) whether samples taken with respectively a long and a short thief
lead to the same result (it might be more difficult to handle a long
thief, or the samples taken with the long thief may be biased be-
cause they are primarily sampled from the bottom of the batch).
(b) whether the size (1X or 3X unit dose, 80 mg) influences the result.
(c) whether the sampling procedure is sensitive of the person collect-
ing the sample.
2 Experimental
The original experimental design included twenty-six blend samples, fifty tablet
samples and a larger blend sample for sieving analysis from each of three
batches of the product.
However, blend samples were taken from the first two batches, but the third
batch was transferred to the tablet press before the blend samples were taken.
220 Paper F
From each of the first two batches ten tablet samples were collected after one
hour of tabletting.
In total 26 samples were collected from each of two batches as shown in Ta-
ble 1.
Nr. Batch Layer Area Theif Pers. Nr. Batch Layer Area Theif Pers.
1 1 Top C 1 X 1 2 Top A 1 Y
2 1 Top C 1 X 2 2 Top A 1 Y
3 1 Top B 1 X 3 2 Top C 1 Y
4 1 Top B 1 Y 4 2 Top B 1 X
5 1 Top A 1 X 5 2 Top D 1 X
6 1 Top D 1 Y 6 2 Top D 1 Y
7 1 Middle B 2 X 7 2 Middle D 3 Y
8 1 Middle B 2 X 8 2 Middle D 2 Y
9 1 Middle B 3 X 9 2 Middle D 2 Y
10 1 Middle C 3 Y 10 2 Middle A 3 X
11 1 Middle C 2 Y 11 2 Middle A 1 X
12 1 Middle C 1 Y 12 2 Middle A 3 X
13 1 Middle D 3 Y 13 2 Middle B 2 Y
14 1 Middle D 1 Y 14 2 Middle B 1 Y
15 1 Middle D 3 Y 15 2 Middle B 1 Y
16 1 Middle A 1 X 16 2 Middle C 1 X
17 1 Middle A 1 X 17 2 Middle C 3 X
18 1 Middle A 2 X 18 2 Middle C 2 X
19 1 Bottom D 3 Y 19 2 Bottom B 2 Y
20 1 Bottom C 3 Y 20 2 Bottom B 2 Y
21 1 Bottom C 2 Y 21 2 Bottom B 3 Y
22 1 Bottom A 2 X 22 2 Bottom B 3 Y
23 1 Bottom A 2 X 23 2 Bottom A 3 X
24 1 Bottom A 3 X 24 2 Bottom D 2 Y
25 1 Bottom A 3 X 25 2 Bottom C 2 X
26 1 Bottom B 2 X 26 2 Bottom C 3 X
Table 1: The table shows the samples collected from two blend batches.
Samples were collected from three layers, and four areas within each layer by
two persons using three different sampling thieves.
Regarding areas, four areas are identified in each layer as shown in Figure 1. In
the middle layer the orientation of the areas are rotated 60 degrees compared to
the top and the bottom layer. If more than one sample is taken in the same area
221
and layer, the samples are sampled as close as possible - but not from exactly
the same position. Estimates of the small scale variation in the blend are based
on these samples.
B D
Fig. 1: A sectional plane of a layer in the blend. Each layer contains four
areas: A, B, C and D.
The same two persons are sampling in the two batches. Person X is a person
with above average skills in sampling techniques.
Because of several physical restrictions the blend samples are not collected in
a completely randomized order.
3 Statistical Analysis
The analyses are performed in SAS using procedures for Generalized Linear
Models (GENMOD) and General Linear Mixed Models (MIXED).
222 Paper F
The statistical analysis is performed on the relative dose, i.e. content of the
active component divided by label claim.
The mean relative content is 0.912 in blend samples from batch 1 and 1.016
in blend samples from batch 2. The relative dose in the individual samples are
shown in Figure 2.
Batch 1 Batch 2
1.15
1.15
1.10
1.10
1.05
1.05
Relative Dose
Relative Dose
1.00
1.00
0.95
0.95
0.90
0.90
0.85
0.85
Figure 2 indicates that the relative content in samples from batch 1 is less
than the relative content in samples from batch 2. A Welch modified two-
223
Table 2: Mean, variance and 95% confidence interval for the dose and the
relative dose for tablet samples.
sample t-test on the tablet samples shows a significant difference between the
two samples. The plot also reveals that in general the variation in batch 2 is
larger than the variation in batch 1.
Figure 3 shows QQ-plot for the tablets from each batch. The content of active
component in the tablets may be considered to be normally distributed, with
mean and variance depending on the batch from which the tablets are sampled.
1.10
1.05
1.05
Relative Dose
Relative Dose
1.00
1.00
0.95
0.95
0.90
0.90
-1 0 1 -1 0 1
Quantiles of Standard Normal Quantiles of Standard Normal
Fig. 3: QQ-plot for ten tablets sampled from each of the two batches. The
tablets were sampled when the tablet press had been running for a period of 1
hour. The Normal distribution gives a satisfactory fit of the data.
An F-test shows that the variation between tablets from batch 2 is significantly
larger than the variation between the tablets from batch 1. This indicates that
224 Paper F
batch 1 is more homogeneous i.e. that the small scale variation in batch 1 is
less than the small scale variation in batch 2.
3.2 Repeatability
The repeatability variance is the variance between replicate samples from the
same batch, layer and area by the same person and sampling thief. The re-
peatability variance incorporates contributions from small scale variation in
the blend, variation due to sampling and due to the chemical analysis. Further,
as blend samples in contrast to tablet samples are adjusted for the weight of
the sample, uncertainty on the weight measurement also has an influence on
the repeatability variance between blend samples.
With 6 pairs of replicates in each batch it has been possible to assess the effect
of different sampling thieves and layers on the repeatability variance. The
number of actual collected samples is too small to include the effect of persons
in this statistical analysis. Figure 4 indicates that the overall variation between
samples collected by each person do not differ, indicating that the persons are
equally consistent in the way they take a sample.
2
ln σlayer,thief = µ∗ + α∗layer + βthief
∗ ,
2
where Slayer,thief 2
∈ σlayer,thief χ2 (1).
Thieves
The statistical analysis shows that use of different sampling thieves have no
significant influence on the repeatability variance. This means that the repeata-
bility variance introduced by the three thieves do not differ significantly. How-
225
Batch 1 Batch 2
1.15
1.15
Top Top
Midt Midt
Bund Bund
1.10
1.10
1.05
1.05
Relative Dose
Relative Dose
1.00
1.00
0.95
0.95
0.90
0.90
0.85
0.85
X Y Tablets X Y Tablets
Person Person
Fig. 4: Variation between replicate samples collected by each of the two per-
sons. This variation includes variation due to variations in the blend, varia-
tion among sampling thieves, analytical error and variation from adjusting the
blend samples for the weight. Results from the tablet samples are included as
reference in the plot.
226 Paper F
ever, the tendency for both batch 1 and batch 2 is that using sampling thief
1 introduces less sampling error than using thief 3. As seen in Table 3 the
estimated standard deviation between samples collected by thief 3 is approx-
imative four to five times as big as the estimated standard deviation between
samples collected by thief 1. It is conceivable that an analysis of the results
from an experimental design with more experiments will find the tendency
significant.
Table 3: Variation between samples collected with various thieves in the mid-
dle layer in batch 1 and batch 2.
Layers
The repeatability variance is not found to differ significantly in the three layers
in a batch. However for both batches the tendency is that the variation between
replicates is less in the top, than in the bottom layer. This can be interpreted
as the batches not being homogeneous on the small scale level. It is conceiv-
able that the tendency will be significant in an experimental design with more
observations.
Ten tablets from each batch were sampled as closely as possible in time after
one hour of tabletting, i.e. at the beginning of the tabletting process. The
variation between the tablets from a batch includes the variation in the blend,
the variation introduced by the tablet press (deblending and weight variation)
and the chemical analysis. However, it is assumed, that the weight variation
and the variation due to deblending introduced at the tablet press are negligible
and therefore the variation between tablets can be regarded as an estimate of
227
the small scale variation in the blend, i.e. a measure of the minimum obtainable
variation between samples from the blend. It is therefore relevant to compare
the variation among tablets to variations in the blends.
Even though not statistically significant the tendency is that the variation be-
tween replicates in the top layer is smaller than the variation between replicates
in the bottom layer. Therefore, comparing variation between tablets to varia-
tion between replicates in respectively the top and the bottom layer may not
lead to the same result regarding an estimate of the small scale variation in the
blend.
To assess whether an estimate of the small scale variation in the blend should
be based on the top or the bottom layer, assumptions have to be made about a
physically explanation for the tendency for the variation between replicates to
be smaller in the top layer than in the bottom.
One explanation is that the real small scale variation in the blend differs among
layers. Another explanation is that it is more difficult to hit the same spot in a
lower layer of a blend when sampling replicates. It is not possible due to the
experimental design to decide which explanation is correct.
The two physically explanations lead to the following estimates of the small
scale variation in the blend:
1. Difficulties with hitting the same spot is the main reason for the ten-
dency of the variation between replicates being smaller in the top
than in the bottom of a batch. The best estimate of the small scale
variation in the blend is an estimate based on replicates sampled with
the thief that introduces the smallest sampling variance, i.e thief 1. The
estimate is based on the top layer because it is easier to hit the same
spot in this layer than in the middle and the bottom layer. A 95% con-
2
fidence interval for the small scale variation, σsmall scale , in batch 1 is
[0.0000;0.0001] and [0.0000;0.0005] for batch 2.
228 Paper F
2. The real small scale variation is not constant among the three layers
in the batch. As the tablets are sampled at the beginning of the tabletting
process, they correspond to the bottom layer of the blend. Therefore the
best estimate of the small scale variation in the blend corresponding to
the variation among the collected tablets is based on the replicates sam-
pled with thief 2 in the bottom of the blend. Thief 2 is chosen because
it is the thief used in the bottom layer that introduces the smallest sam-
pling variance. A 95% confidence interval for the small scale variation
in the bottom layer is [0.0001;0.0063] for batch 1 and [0.0004;0.0272]
for batch 2.
95% confidence interval for the small scale variation based on the tablets are
[0.0001;0.0004] for batch 1 and [0.0004;0.0025] for batch 2. Confidence in-
tervals for the small scale variation based on the tablets are more overlapping
to the confidence interval corresponding to situation 2 than the confidence in-
terval corresponding to situation 1.
If the small scale variation estimated from the tablets represents the true small
scale variation in the blend, the small scale variation estimated from blend
samples must be either of the same size or (when variation due to sampling
is not negligible) larger. However, in situation 1 the estimate based on blend
samples is smaller than the estimate based on tablet samples, thus implying
that the true small scale variation in the bottom layer actually is larger than in
the top layer.
The statistical model for the relative content in a sample from a given batch is
2
where Darea ∈ N (0, σarea 2 ).
) and Elayer,area,person,thief ∈ N (0, σE
The model is a General Linear Mixed Model. Under this model it has been
tested whether the mean content of the active component differs in the three
229
layers in a blend and whether the mean content in a sample depends on the
thief and the person collecting the sample.
The residual variance from this analysis is an estimate of the variance between
replicates in the batch. The estimates are 0.000227 for batch 1 and 0.005141
for batch 2. These estimates are average of the variance between replicates
regardless of the actual person, thief, layer and area from which the sample
is taken. Estimates of the small scale variation (variation between replicates)
from the analysis of repetability (Section 3.2) take into account the the thief
used to collect the sample and the layer from which the sample is collected.
Layers
For both batches the tendency is that the mean content in the top layer is less
than the mean content in the bottom layer.
The analysis also gives an estimate for the variance between the mean content
of areas within a layer. For batch 1 the estimate is 0.00036 = (0.01897)2 . For
batch 2 it has not been possible to distinguish the variation between areas from
the variation between replicates within an area.
Thieves
Thief 1 is a short thief used in the top and the middle layer, whereas both thief
2 and 3 are long thieves used in the middle and the bottom layer. Thief 1
and thief 2 collect samples of the size 1X dose and thief 3 collect samples of
the size 3X dose. Thus, the tip used to thief 1 and thief 2 are similar but not
identical. Because both thief 2 and 3 are long thieves they are interchangeable
and for future sampling it is relevant to know which thief to choose.
In both batches the tendency is that the mean content of the active component
is less in samples collected with thief 2 than in samples collected with thief 3.
230 Paper F
The difference is significant on the 10% level in batch 1 and 16% level in batch
2.
Batch 1 Batch 2
1.15
1.15
Top Top
Midt Midt
Bund Bund
1.10
1.10
1.05
1.05
Relative Dose
Relative Dose
1.00
1.00
0.95
0.95
0.90
0.90
0.85
0.85
1 2 3 Tablets 1 2 3 Tablets
Thief Thief
Regarding content the tablet samples are in general more similar to samples
collected by thief 3 than to samples collected by thief 2. This indicates that
thief 2 introduces bias. The fact that thieves may introduce bias causing the
content in blend samples to differ from the content in tablet samples is in agree-
ment with the literature [5] as well as with Figure 2.4 on page 12. The literature
(e.g. [21]) also gives several examples that the risk of bias is larger when the
blend sample is small, i.e. close to unit dose. This is in agreement with thief
3 taking samples of 3X dose, whereas thief 2, which is under suspicion of
sampling bias, takes samples of 1X dose.
In batch 1 the estimated bias on the mean content in samples collected with
thief 2 is −0.015 ± 0.0184 (95% confidence interval) relatively to the mean
content in samples collected with thief 3. That is, if the relative content in
a sample collected with thief 3 is 0.98, the relative content would have been
0.965 ± 0.0184 had thief 2 been used to collect the sample.
231
Conclusion
From blend- and tablet samples it appears that the mean content in batch 1 is
less than the mean content in batch 2, and that in general the variation in batch
2 is larger than the variation in batch 1.
The mean content in the three layers in a batch does not differ significantly.
However, there is a tendency that the small scale variation in a top layer is
smaller than the small scale variation in the bottom.
In batch 1 the variation between replicates (small scale variation) are smaller
than the variation between areas within a layer. In batch 2 the variation be-
tween replicates is so large that it has not been possible to separate the variation
between replicates from the variation between areas within a layer.
Persons
The mean content in samples collected by person 1 does not differ significantly
from the mean content in samples collected by person 2. It has not been possi-
ble to test whether sampling error (variation) introduced by the two persons is
of the same order of magnitude, but a plot indicates that neither person intro-
duces significantly larger sampling error than the other.
It can not be rejected that the persons have similar skills in sampling.
232
Thieves
The tendency is that thief 3 introduces the largest sampling error (variation)
and that thief 1 introduces the smallest sampling error. A possible explanation
is that it is easier to handle a short thief than a long thief.
Even though thief 2 generally introduces less sampling error than thief 3, thief
3 is recommended for sampling in the bottom layer, as thief 2 is under suspi-
cion of sampling bias.
It is noteworthy that the long thief with the small chamber in the tip (thief 2,
1X dose) is under suspicion of sampling bias, whereas there is no indication
of sampling bias for the long thief with the large chamber in the tip (thief
3, 3X dose). This conclusion is based on a comparison of variation between
replicates from the blends with variation between tablets.
Bibliography
[1] Muzzio, F. J., Roddy, M., Brone, D., Alexander, A. W., and Sudah, O.
An improved Powder-Sampling Tool. Pharmaceutical Technology, pages
92–110, April 1999.
[3] United States of America vs. Barr Laboratories Inc. Civil action for the
District of New Jersey, February 1993.
[4] Center for Drug Evaluation FDA and Research. 21 Code of Federal Reg-
ulations Parts 210 and 211.
[5] Berman, J. and Planchard, J. A. Blend Uniformity and Unit Dose Sam-
pling. Drug Development and Industrial Pharmacy, 21(11), 1995.
[10] Department of Health and FDA Human Services. 21 CFR Parts 210
and 211 Current Good Manufacturing Practice: Amendment of Certain
Requirements for Finished Pharmaceuticals; Proposed Rule. In Federal
Register, Proposed Rules, volume 61, pages 20103 – 20115. FDA.
233
234
[11] Draft Guidance for Industry ANDAs: Blend Uniformity analysis, August
1999.
[14] Fan, L. T., Chen, S. J., and Watson, C. A. Solids Mixing. Industrial and
Engineering Chemistry, 62(7):53–69, 1970.
[17] SAS version 8.02. SAS Institute, Inc., Cary, NC., 1999.
[18] PQRI. The use of stratified sampling of blend and dosage units to demon-
strate adequacy of mix for powder blends. www.pqri.org, September 3.
2001. DRAFT.
[20] Guide to Inspections of oral Solid Dosage forms Pre/Post Approval Is-
suses for Development and Validation. FDA Division of Field Investiga-
tions, January 1994.
[21] Kræmer, J., Svensson, J. R., and Melgaard, H. Sampling Bias in blending
Validation and a Different Approach to Homogeneity Assessment. Drug
Development and Industrial Pharmacy, 25(2), 1999.
[22] Berman, J., Elinski, D. E., Gonzales, C. R., Hofer, J. D., Jimenez, P. J.,
Planchard, J. A., Tlachac, R. J., and Vogel, P. F. Technical Report No. 25
235
[30] S-PLUS. Insightful Corp., 1700 West Lake Avenue North, Suite 500,
Seattle, WA 98109, 2000.
[32] Wilrich, P.-Th. Part 1: Sampling inspection, Single sampling plans for in-
spection by variables in the presence of measurement error. Allgemeines
Statistisches Archiv, 84:239–250, 2000.
[34] ISO 2859: Sampling procedures and charts for inspection by attributes,
Part 1 to 4. International Organization for Standardization, Geneva, 1999.
236
[35] ISO 3951: Sampling procedures and charts for inspection by variables for
percent nonconforming. International Organization for Standardization,
Geneva, 1989.
[38] EEC Council Directive No. 75/106/EEC of 19. December 1974, pp. 1-13
. Technical report, Official Journal of the European Communities L42,
15. february 1975.
[39] EEC Council Directive of 20. January 1976, pp. 1-11. Technical report,
Official Journal of the European Communities L46, 21. February 1999.
[41] ISO 10576-1: Statistical methods - Guidelines for the evaluation of con-
formity with specified requirements - Part 1: General principles. Inter-
national Organization for Standardization. Geneva, Final Draft submitted
for voting 2002.
[42] < 905 > USP Uniformity of Dosage Units. “USP XXI”, Mack Printing
Company, 1984.
[43] Holst, E., Thyregod, P., and Wilrich, P-Th. On Conformity Testing
and the use of Two-stage Procedures. International Statistical Review,
69:419–432, 2001.
[51] Bray, D. F., Lyon, D. A., and Burr, I. W. Three Class Attributes Plans in
Acceptance Sampling. Technometrics, 15, 1973.
[56] American National Standard: Sampling Procedures and Tables for In-
spection by Variables for Percent Nonconformance, ANSI/ASQC Stan-
dard Z1.9. American National Standards Institute, 1980.
238
[58] Boulanger, M., Johnson, M., Perrouchet, C., and Thyregod, P. Interna-
tional statistical standards. Their place in the life-cycle of products and
services. International Statistical Review, (67):151–173, 1999.
[59] Bruhn-Suhr, M., Krumbholz, W., and Lenz, H.-J. On the Design of Exact
Single Sampling Plans by Variables. In Wilrich P.-Th. Lenz H.-J., editor,
Frontiers in Statistical Quality Control, volume 4, pages 83–90. Physica
Verlag, Heidelberg, 1992.
[63] Wilrich, P.-Th. Single sampling plans for inspection by variables under
a variance component situation. In Proceedings of VIIth International
Workshop on Intelligent Statistical Quality Control September 5-7, 2001.,
Waterloo, Canada. Institute for Improvement in Quality and Productivity,
University of Waterloo.
[64] Yalkowsky, S.H. and Bolton, S. Particle Size and Content Uniformity.
Pharmaceutical Research, 7:962–966, 1990.
[65] Stoyan, D., Kendall, W.S, and Mecke, J. Stochastic Geometry and its
Applications. Wiley, second edition, 1995.
[66] Johnson, N.L., Kemp, A.W., and Kotz, S. Univariate Discrete Distribu-
tions. Wiley, New York, 2 edition, 1993.
239
16. Kohl, Niklas. (1995). Exact methods for time constrained routing and
related scheduling problems. xviii + 234 pp.
24. Skands, Ulrik. (1996). Quantitative methods for the analysis of elec-
tron microscope images. xvi + 198 pp.
26. Bendtsen, Claus. (1996). Parallel numerical algorithms for the solution
of systems of ordinary differential equations. viii + 79 pp.
32. Ma, Xin. (1997). Adaptive extremum control and wind turbine control.
xix + 293 pp.
42. Hansen, Peter S.K. (1997). Signal subspace methods for speech en-
hancement. x + 226 pp.
46. Riis, Søren Kamaric. (1998). Hidden markov models and neural net-
works for speech recognition. x + 223 pp.
48. Frydendal, Ib. (1998.) Quality inspection of sugar beets using vision.
iv + 97 pp. + app.
52. Fog, Torben. (1998). Condition monitoring and fault diagnosis in ma-
rine diesel engines. xii + 178 pp.
60. Jensen, Kåre Jean. (1999). Signal processing for distribution network
monitoring. xv + 199 pp.
64. Kvist, Trine. (1999). Statistical modelling of fish stocks. xiv + 173 pp.
66. Sørensen, Per Settergren. (1999). Spatial distribution maps for benthic
communities.
73. Larsen, Allan. (2000). The dynamic vehicle routing problem. xvi + 185
pp.
75. Larsen, Theis Leth. (2000). Phosphorus diffusion in float zone silicon
crystal growth. viii + 119 pp.
84. Nielsen, Torben Skov. (2001). On-line prediction and control in non-
linear stochastic systems. xviii + 242 pp.
88. Kidmose, Preben. (2001). Blind separation of heavy tail signals. viii +
136 pp.
90. Antonov, Anton. (2001). Object-oriented framework for large scale air
pollution models. 156 pp. + (flere pag).
98. Vistisen, Dorte. (2002). Models and methods for hot spot safety work.
ix + 168 pp.
103. Andersen, Irene Klærke. (2002). Dynamic contrast enhanced MRI for
perfusion quantification. iv + 181 pp.
106. Nyeland, Martin Erik. (2002). Sampling strategy and statistical mod-
elling of exposure. viii + 39 + appendix.