Encyclopedia of Pharmaceutical Technology, Optimization
Encyclopedia of Pharmaceutical Technology, Optimization
Gareth A. Lewis
Sanofi-Synthelabo Research, Chilly Mazarin, France
of search. Thus, the type and components of a formu- cles, and the numbers have continued to rise. A con-
Outsourcing
lation may be selected, according to previous experi- ception or presupposition of the difficulty or
ence, by expert knowledge (possibly using an expert complexity of experimental design had to be overcome.
system), or by systematic screening as described later. The change has been attributed of course to a great
Then the relative and/or total proportions of the exci- extent to the availability of computing power and of
pients are varied to obtain the best endpoint, or a pro- relatively inexpensive high-performance software that
cess is chosen, and a study is carried out to determine allows previously difficult or advanced methods to be
the best operating conditions to obtain the desired for- applied. In particular, much attention is now being
mulation properties. Both of these are optimization given to robust processes and formulation, and
studies. This article concentrates on statistical experi- there are developments in treating non-linear and
mental design-based optimization. highly correlated responses.[4]
Systematic screening and factor influence studies are There are four primary methods. First, there is the stat-
closely related to optimization, being often sequential istically designed experiment, in which experiments are
stages in the development process and involving statisti- set up in a (normally regular) matrix to estimate the
cal experimental design methods. Screening methods coefficients in a mathematical model that predicts
are used to identify important and critical effects, for responses within the limits of formulation or operating
example, in the manufacturing process. Factor studies conditions being studied. This is generally the most
are quantitative studies of the effects of changing poten- powerful method, provided the experimentation zone
tially critical process and formulation parameters. They has been correctly identified, and is the subject of most
involve factorial design and are also quite often referred of this article.
to as screening studies; however, the resulting relation- Second, the direct optimization method, the best
ships have just as often been used for optimization. known being the sequential simplex, is a rapid and
The type of study carried out will depend on the powerful method for determining an experimental
stage of the project. In particular, experimental design domain, best combined with experimental design for
may be carried out in stages, and the experiments of a the optimization itself.
factor study may be augmented by further experiments Third, there is the one-factor-at-a-time method in
to a design giving the detailed information needed which the experimenter varies first one factor to find the
for true optimization. It cannot be stressed to highly best value, then another. Its disadvantages are that it can-
that the quality of a statistically designed experiment not be used for multiple responses and that it will not
depends on the choice of experimental run with respect work when there are strong interactions between factors.
to an a priori model, and this quality can and must be Finally, the non-systematic approach in which the
assessed before starting the experiments. knowledge and intuition of the developer allow him
Encyclopedia of Pharmaceutical Technology DOI: 10.1081/E-EPT-100200031
2452 Copyright # 2007 by Informa Healthcare USA, Inc. All rights reserved.
Optimization Methods 2453
to improve results, changing a number of factors at the It is assumed that there are no interactions between
same time is often surprisingly successful in the hands factors; that is to say, the effect of a given excipient on
of a skilled worker. Where he is less skilled or less stability does not depend on what other excipients are
lucky, he can waste a remarkable amount of time found in the formulation. (The same reasoning applies
and resources. to other kinds of factors or responses.) This can only
The use of artificial intelligence and expert systems be an approximation; however, if it should be neces-
is treated elsewhere in this work. sary to take interactions into account, many more
experiments would be needed, and it would probably
be necessary to limit the number of levels for each
SCREENING factor to two for the number of experiments to be
manageable.
Obtaining a Formulation Suitable The choice of excipients may be considered a quali-
for Optimization tative optimization, their quantitative compositions
not having yet been optimized. This and the fact that
Once the dosage form has been selected, the excipients the process used will most likely be on a small labora-
must be identified, their choice often limited by practi- tory scale may affect the affect the choice of excipients.
cal considerations of time and resources determined However, it is in most circumstances an unavoidable
by patents, company practice, or according to expert limitation.
knowledge. However, it may be possible or necessary An example of such a qualitative screening is shown
to test a number of different excipients for each func- in Table 1. This is an experimental design for testing
Non-Prescription–
tion, for example, several diluents, lubricants, binders. the compatibilities of experimental drug (at two con-
Outsourcing
This approach has proved useful in drug–excipient centrations) with a number of number of excipients.
compatibility testing in which protoformulations are The samples, which were wet granulated, were stored
set up according to a statistical screening design to for 3weeks at 50 C/50% relative humidity. The results
assess stability and compatibility. are also given in Table 1. The mean degradation level
Here the factor is the excipient’s function. This can was high, at 6.2%, indicating a fairly unstable drug.
be set at different levels, the level being the excipient The effects of each excipient were calculated by linear
itself. So the factor may be ‘‘binder,’’ and the levels regression, or, because the design is orthogonal, by
are, for example, HPMC, povidone, polyvinylacetate, linear combinations of the responses, and plotted in
and no disintegrant present. A mathematical model Fig. 1. There, the degradation for each excipient is
relates the response (in this case, degradation) to com- calculated in each excipient type (e.g., disintegrant),
position. It includes variables corresponding to each setting the excipients in the remaining type to a hypo-
factor with (qualitative) levels corresponding to each thetical mean value. Thus, the value for magnesium
excipient. Plackett and Burman[2] described designs is the mean response for all mixtures containing mag-
suitable for treating this kind of problem. Designs with nesium stearate, and the effect of stearate on the
the factors at only two levels are widely used. However, response is the difference between this figure and the
there are other designs at 3, 4, and 5 levels as well as global mean.
asymmetric designs derived from them in which the Inspecting the results shows that the disintegrant
various factors take a different number of levels.[5,6] and binder have major effects, and mixtures containing
sodium starch glycolate and HPMC are more stable Methods for screening factors
than those containing croscarmellose sodium and povi-
done, respectively. Diluents had only small effects here, Because a large number of factors may need to be
Non-Prescription–
however, these were much greater in the mixtures screened, the postulated model must be simple. It is
Outsourcing
stored at low humidity, where mixtures containing usually assumed that the response(s) y depends only
microcrystalline cellulose or, especially, calcium phos- on the level (value or state) of each factor xi separately
phate were less stable than those containing lactose and not on combinations of levels. The model is thus
or mannitol. Thus, a capsule based on lactose, sodium first-order, for example:
starch glycolate, HPMC, and magnesium stearate (the
last being selected for reasons of feasibility, there being y ¼ b0 þ b1 x1 þ b2 x2 þ b3 x3 þ b4 x4
no difference in stability between it and glyceryl behe- þ b5 x5 þ e
nate) was formulated and gave satisfactory stability.
If the factors are quantitative, they are set at their
extreme values. Thus, if the factor is granulation time,
Before Optimizing a Process and the possible range is 1.5–7 min, the normal values
tested are 1.5 min and 7 min. They are expressed in
The major choice to be made here is that of equipment, terms of dimensionless coded variables, normally tak-
and that will depend on what is available in the labora- ing values 1 and þ1. Thus, on transformation to the
tory and also in the factory. There may be a very large coded variable x1, 1.5 min corresponds to x1 ¼ 1,
number of factors to be studied, and it will probably be and 7 min corresponds to x1 ¼ þ1.
If the factors are quantitative, they may take any complex situations, it is advisable to carry out a more
number of levels. Only two-level designs are described detailed study between the screening and optimization
here. Qualitative levels are set arbitrarily at the coded (response surface studies). This could be a completion
levels. If, for example, the screening method was one of the screening study by means of a complementary
of the factors tested, wet screening could be set at 1 foldover design[3,7] or by a separate quantitative study
and dry screening at þ1 (or vice versa). to allow individual effects of the factors and/or their
Quite wide limits are generally chosen for screening binary interactions to be calculated separately (shown
quantitative factors. They are then often narrowed for in factorial designs, later).
more detailed quantitative study of the influence of All these studies on the process are generally done
factors where interactions between factors them are after the optimization of the formulation. However,
taken into account and for determining a predictive because the effects of formulation and process changes
model for optimization. are not generally independent, it may become neces-
The designs, proposed by Plackett and Burman in sary to carry out some sort of process study at the
1946,[2] comprise experiments in multiples of four. same time as the formulation optimization.
They will allow screening of up to one less factor than
the number of experiments. Those with 2n experiments
(4, 8, 16, 32, . . . ) are also fractional factorial designs.
The non-factorial designs have particular properties QUANTITATIVE PROCESS STUDIES USING
and complex aliasing, which has been held to make FACTORIAL DESIGNS
their interpretation difficult but also gives them certain
Purpose
Non-Prescription–
advantages over the fractional factorial designs. The
Outsourcing
12-experiment design, shown in coded variables
(Table 2), is such a design, and is useful for about Whereas the purpose of a screening study is to deter-
7–11 factors. mine which of a large number of factors have an influ-
The structure of the design is shown clearly in the ence on the formulation or process, that of a factor
table because the experiments are in their standard study is to determine quantitatively the influence of
order. However, they should be carried out in a ran- the different factors together on the response variables.
dom order, as should all the designs described here, The number of levels is usually again limited to two,
as much as is practicable. but sufficient experiments are carried out to allow for
The coefficient bi is the effect of the factor Xi, and interactions between factors.
is equal to half the average change in the response y
when the level of the factor is changed from xi ¼ 1 Two-level full factorial designs
to xi ¼ þ1. It is estimated (as bi) in the Plackett–
Burman design by subtracting the sum of the responses The simplest such designs are the 2k full factorial
for experiments for which xi ¼ 1 from those for designs, in which the experiments are all the 2k possible
which xi ¼ þ1 and dividing by the number of experi- combinations of two levels of k factors variables.
ments. Important and unimportant effects can then be Therefore, they consist of 4, 8, 16, 32, 64, . . . experi-
identified according to their absolute values. (Deter- ments for 2, 3, 4, 5, 6, . . . factors. Examples are given
mining active factors from the results of a factorial in Table 3 of the 22, 23, and 24 designs (each enclosed
design are shown later.) at the right and below by the solid lines). Thus, lines
1–4 of columns 1 and 2 show a 22 design for two fac-
Use of results of a screening design tors, and lines 1–16 of columns 1–4 a 24 design for four
factors.
Estimation of the effects allows influential or possibly The design is transformed into an experimental plan
influential factors to be identified. Non-influential fac- (with the natural or experimental values of the factor
tors (small effects) will not require further study. They variable at each level /þ1. The mathematical model
may be set at their midpoints, at their most economical associated with the design consists of the main effects
values (e.g., a short mixing time), or at their apparently of each variable plus all the possible interaction
best value even if the measured effect is apparently effects, interactions between two variables, but also
non-significant. between three and four factors and, in fact, between
After elimination of these non-influential factors, as many as there are in the model. However, although
there may still be too many factors to optimize in terms two-factor interactions are important, three-factor
of the resources available (time, raw material, opera- interactions are normally far less so. Higher-order
tors, availability of equipment, etc.). Generally, these interactions are invariably ignored and the values
less influential factors are kept constant, equal to determined for them attributed to the random vari-
their best level and the remainder optimized. In more ation of the experimental system.
2456 Optimization Methods
Table 3 Some full and fractional factorial designs for two to The responses are usually treated separately;
five factorsa however, when there are a number of more or less cor-
X1 X2 X3 X4 X5 Response related responses being studied, appropriate combina-
1 1 1 1 1 1 189
tions (principal components) may be analyzed instead
of the original responses.[9]
2 1 1 1 1 1 56
Once the important effects have been identified, a
3 1 1 1 1 1 94 simplified model can be written. If an interaction term
4 1 1 1 1 1 80 has been identified, the corresponding main effects
5 1 1 1 1 1 212 should also be included in the model even if they are
6 1 1 1 1 1 212 not all found active. Thus, if the interactions between
7 1 1 1 1 1 76
the factors X1 and X2 and the main effect of the factor
X1 are active, b2x2 should be included in the model
8 1 1 1 1 1 125
as well as b1x1 and b12x1x2.
9 1 1 1 1 1 351
10 1 1 1 1 1 534
11 1 1 1 1 1 275 Two-Level Fractional Factorial Designs
12 1 1 1 1 1 219
13 1 1 1 1 1 154
The number of experiments needed to study five or
more factors in a full factorial design is large, and to
14 1 1 1 1 1 752
determine the main effects and their interactions, a
1 1
Non-Prescription–
2.88
99
D
95
90 AC 2.69
Normal % probability
AD High spray rate
80 AE
70 C
A
50 E 2.50
log10(particle size)
30
20
10 B 2.31
5
CE
A: Povidone 1
2.12
B: Binder
C: Atomisation
D: Spray rate Low spray rate
E: Temperature –0.19 –0.02 0.14 0.31 0.48 1.94
Effect on log10(particle size)
Non-Prescription–
(From Ref.[8].)
Outsourcing
–1 +1
% PVP (coded values)
effects should be included also, even if they are small.
Fig. 3 Calculated effects from a two-level factorial design.
Here, we have:
Interaction diagram for spray rate and % povidone on par-
y ¼ b0 þ b1 x1 þ b2 x2 þ b3 x3 þ b4 x4 þ b5 x5 ticle size (logarithmic scale).
þ b13 x1 x3 þ b14 x1 x4 þ b15 x1 x5 þ b35 x3 x5
Mathematical Models
Statistical Experimental Designs for
The design used is a function of the model proposed. First-Order Models
Thus, if it is expected that the important responses
Non-Prescription–
vary relatively little over the domain, a first-order poly- The design must enable estimation of the first-order
Outsourcing
nomial will be selected. This will also be the case if the effects, preferably free from interference by the inter-
experimenter wishes to perform rather a few experi- actions between factors other variables. It should also
ments at first to check initial assumptions. He may allow testing for the fit of the model and, in particular,
then change to a second-order (quadratic) polynomial for the existence of curvature of the response surface
model. Second-order polynomials are those most com- (center points). Two-level factorial designs may be
monly used for response surface modeling and process used for this (shown earlier).
optimization for up to five variables. Important points to note when using a first-order
Examples of polynomial models are a first order model, with or without interactions, are that:
model for five factors:
1. Maximum and minimum values of responses
are of necessity predicted at the edge of the
y ¼ b0 þ b1 x1 þ b2 x2 þ b3 x3 þ b4 x4 experimental domain;
þ b5 x5 þ e 2. The first-order model should normally be used
only in the absence of curvature of the response
surface. If the experimental values of the center
points are different from the calculate values
(i.e., there is lack of fit), then the response sur-
face is curved and a second-order design and
model should be used; and
3. The experimenter should test for interaction
terms between two factors in the model. If inter-
actions seem to be important he should make
sure that they are properly identified.
number of experimental runs, which can be a disadvan- does the central composite design but cannot be set up
tage if resources are limited. However, it can be carried by augmenting a factorial design.
out in two stages: the factorial design first then the The design for three factors is shown in Table 5.
axial design if the results are satisfactory. It can be seen that the hexagonal design for two factors
If we wish to study the system by varying the para- is the first seven rows and the first two columns. Thus,
meters around a point of interest, the domain is a it possible to add a factor to a design. Another advan-
sphere, and the coordinates of axial experiments are tage is that because it is part of a continuous network,
outside those of the factorial ones. a is chosen to give it allows the experimental domain to be shifted in any
the best statistical properties (e.g., constant prediction direction by adding experiments at one side of the
precision) and lies between 2 and approximately 2.4. domain and eliminating them at the other (Fig. 6).
The design for three factors, where a is set at 1.682, Vojnovic et al.[13] give an example of its use in granu-
is shown in Fig. 5 and Table 4. lation.
Center-point experiments must be done as part of Hybrid designs are saturated or almost saturated
both stages. Another advantage is that each factor is designs; that is, they have only enough experimental
at five levels, thus allowing testing of lack of fit and runs to calculate the coefficients of the quadratic model
for the possible need for cubic terms in the model. (10 runs for 3 factors 16 runs, for 4 factors, and 28 runs,
Fig. 4 shows response surfaces calculated from the for 6 factors). They are useful when the responses are
data of Senderak, Bonsignore, and Mungan[11] obtained not expected to vary enormously but where the quad-
using such a design at a constant value of the third factor. ratic model is esteemed necessary and resources (in
possible numbers of experiments) are low.[6,14]
Non-Prescription–
Other standard designs If the experimental region is defined by maximum
Outsourcing
and minimum values of each factor, then the domain
The central composite design is most often used, how- is ‘‘cubic.’’ The central composite design can be
ever, there are others whose particular properties make applied to such a situation, the axial points being set
them particularly useful. One of these is the Doehlert then at 1, coded values corresponding to the mini-
design, which is part of a continuous hexagonal mum and maximum allowed values. Other designs
network.[12] It requires slightly fewer experiments than for the cubic domain are reviewed in Ref.[6].
A B
65 1.5 61
5
70
2
4.5
60
58
Sucrose invert medium
4
% Sucrose invert medium
72
2.5
3.5
55
3 55 74
3
3.5
4
50 52 76
4.5
5
78
5.5
45 49
15 17.5 20 22.5 25 17 18.5 20 22.5 25
% Propylene glycol % Propylene glycol
Fig. 5 Contour diagrams of (A) turbidity and (B) cloud point as function of % propylene glycol and sucrose invert medium (slice
taken at constant value of 4.3% polysorbate 80). (From Ref.[11].)
2460 Optimization Methods
16 0 0 0
Outsourcing
17 0 0 0
a
To be included with both axial and factorial designs if carried out separately.
Mixed and Irregular Domains— discrete but numerical values or may even be qualitat-
D-Optimal Designs ive in nature.
There are no classic experimental designs that exist
If the experimental domain is cubic and spherical or for such circumstances, and a purely empirical
spherical, the standard experimental designs can nor- approach is required: 1) to postulate a mathematical
mally be used. However, the domain may be irregular model that is expected to describe the response and
in shape as certain combinations of values variable 2) to then select from among the many possible experi-
may be excluded a priori for technical reasons or ments a design that will determine the model coeffi-
may even have been tried and failed to give a result, cients with maximum efficiency.
or certain factors may be forced to take either fixed There are various ways of obtaining such a design,
by far the most common being based on the exchange
algorithm of Fedorov. There are also a number of
Table 5 Doehlert design for three factors criteria for describing how good the design is, the
k X1 X2 X3
1 0 0 0 1
2 1 0 0
3 0.5 0.866 0
4 0.5 0.866 0 0
5 0.5 0.866 0 X2
6 0.5 0.866 0
7 1 0 0 –1 Initial design
Additional experiments
8 0.5 0.289 0.816 X3 = 0
9 0.5 0.289 0.816 X3 = 0.816
X3 = –0.816
10 0 0.577 0.816 –2
–2 –1 0 1 2
11 0.5 0.289 0.816 X1
12 0.5 0.289 0.816
Fig. 6 Doehlert design in three dimensions (factors) show-
13 0 0.577 0.816
ing extension to a new experimental domain.
Optimization Methods 2461
D-optimal criterion being the most usual, based on the fractions x1, x2, x3, because x1 þ x2 þ x3 ¼ 1,
optimization of the overall precision of estimation of
the coefficients of the model.[6,15,16] This method and y ¼ a0 þ a1 x1 þ a2 x2 þ a3 x3 þ e
type of design is extremely flexible because:
becomes
1. It allows experiments within irregular experi-
mental domains; y ¼ b1 x1 þ b2 x2 þ b3 x3 þ e
2. Previous experiments carried out within the
experimental domain may be taken into The variables cannot be varied independently. If
account; there are no upper and lower restraints on the propor-
3. Classical designs in which experiments have tions of the components, the domain for three factors
failed to give a result may be repaired by rede- can be described as a equilateral triangle whose apices
fining the domain and finding the best experi- represent the pure components. A four-component
ments (according to the D-optimal criterion) mixture is described by a regular tetrahedron. For
to replace the experiment(s) which failed; five components, the equivalent 4D figure must be
4. The models may be polynomials with missing imagined.
coefficients, or even non-polynomials; Just as the first-order mixture model has a different
5. The experiments may be carried out in two or form from that for independent variables, so does the
more stages, with models of increasing com- second-order design:
plexity;
y ¼ b0 þ b1 x1 þ b2 x2 þ b3 x3 þ b12 x1 x2
Non-Prescription–
6. Further experiments may be added to a D-
Outsourcing
optimal design to validate the model (lack of þ b13 x1 x3 þ b23 x2 x3 þ e
fit); and
7. They can be used for mixture models with The special cubic model describes a certain third-
constraints (see below). order curvature in the response surface:
A B
64.00
Cellulose
1' 1'
2' 2'
3 5 5.00 5.00
10
9 11
4 6 12 13
7' 7'
8' 8'
Lactose Phosphate
Non-Prescription–
Outsourcing
Polymer 64.00 5.00 64.00
Lactose Phosphate
Fig. 8 D-optimal mixture design. (A) definition of the design space. (B) Contour plot of mean dissolution time at 25% polymer
content. (From Ref.[6].)
55 Limitations of Response
Turbidity: 3 Surface Methodology
Turbidity: 3
X1
Sequential Simplex Optimization
Fig. 10 Summary of the expanded simplex method of
Introduction Nelder and Mead.
Unlike the other optimization methods described here, is carried out opposite point W to give a new simplex
the sequential simplex method for optimization neither reflecting the original one. Depending on the value of
assumes nor determines a mathematical model for the the response at R relative to that at W, N, and B, the
phenomena studied. step size may be expanded to arrive quickly at the
A simplex is a convex geometric figure of kþ1 non- region of the optimum, and then be contracted around
Non-Prescription–
planar vertices in k dimensional space, the number of the optimum. The various possibilities are shown in
Outsourcing
dimensions corresponding to the number of inde- Fig. 10 for two factors. ‘‘R > W’’ means that point
pendent factors. Thus, for two factors, it is a triangle, R is better than point W, etc.
and for three factors, it is a tetrahedron. The method
is sequential because the experiments are analyzed
one by one as each is carried out. The basic method R replaces W if : N R B Reflection
used a constant step size,[25] allowing the region of or: R > B and E B
experimentation to move at a constant rate toward E replaces W if : R > B and E B Expansion
the optimum. However, a modification that allows
CR replaces W if : W<R N Contraction
the simplex to expand and contract, proposed by
(exterior)
Nelder and Mead[26] in 1965, is more generally used.
It has been reviewed recently by Waters.[27] CW replaces W if : W > R Contraction
(interior)
temperature and humidity, the machine used, the exact 10. Box, G.E.P.; Draper, N.R. Empirical Model-Building and
Outsourcing
26. Nelder, J.A.; Mead, R. A simplex method for function mini- BIBLIOGRAPHY
mization. Comput. J. 1965, 1, 308.
27. Walters, F. Sequential simplex optimization—an update.
Anal. Lett. 1999, 32 (2), 193. Anderson, V.L.; McLean, R.A. Design of Experiments:
28. Taguchi, G. System of Experimental Design: Engineering A Realistic Approach; Marcel Dekker, Inc.: New York,
Methods to Optimize Quality and Minimize Cost; UNI- 1974.
PUB/Fraus International White Plain, 1987. Grove, D.M.; Davis, T.P. Engineering Quality and Experi-
29. Nair, V. Taguchi’s parameter design: a panel discussion. mental Design; Longman Scientific and Technical: Harlow,
Technometrics 1992, 34, 127–161. UK, 1992.
30. Shoemaker, A.C.; Kwok, L.; Wu, C.F.J. Economical exper- Haaland, P.D. Experimental Design in Biotechnology; Marcel
imentation methods for robust design. Technometrics 1991, Dekker, Inc.: New York, 1989.
33 (4), 415–427. Myers, R.H.; Montgomery, D.C. Response Surface Method-
31. Montgomery, D.C. Using fractional factorial designs for ology; Wiley-Interscience: New York, 1995.
robust process development. Qual. Eng. 1990, 3, 193–205. Schmidt, S.R.; Launsby, R.L. Understanding Industrial
32. Box, G.E.P.; Meyer, R.D. Dispersion effects from fractional Designed Experiments, 3rd Ed.; Air Academic Press: Color-
designs. Technometrics 1986, 28, 19–27. ado Springs, 1993.
Non-Prescription–
Outsourcing