Design of Experiments
Design of Experiments
Experimen A B C AB AC BC ABC
t no
1 - - -
2 + - -
3 - + -
4 + + -
5 - - +
6 + - +
7 - + +
8 + + +
Geometric view of two-level factorial designs: (a) two factors
and (b) three factors.
Fractional Factorial Design
It is a powerful experimental design technique used in statistics
and research to study the effects of multiple factors on a response
variable while minimizing the number of experimental runs.
This approach is particularly useful when dealing with a large
number of factors, as it allows researchers to focus on the most
significant effects without the need for a full factorial design, which
can become impractical due to the exponential increase in the
number of runs required.
Fractional factorial designs are often denoted as 2k-p where k is the
number of factors and p indicates the fraction of the full factorial
design used.
For example, when examining four input factors, a half factorial
design 24-1 = 8experiments may be implemented.
A quarter fraction factorial design 25-2 = 8 experiments may be
adopted to study 5 input factors.
Loss of resolution – Aliase
Resolution – a measure of confounding
2III3-1 Design matrix 2IV 4-1 Design matrix
X1 X2 X3 X1 X2 X3 X4
1 -1 -1 +1 1 -1 -1 -1 -1
2 -1 +1 -1 2 +1 -1 -1 +1
3 +1 -1 -1 3 -1 +1 -1 +1
4 +1 +1 +1 4 +1 +1 -1 -1
5 -1 -1 +1 +1
6 +1 -1 +1 -1
7 -1 +1 +1 -1
8 +1 +1 +1 +1
Where,
X3=X1*X2 (X3 is confounding with X1 and X2)
X4=X1*X2*X3
Mixed Factorial Design:
This design involves at least one independent variable manipulated
within subjects and at least one between subjects.
It combines the strengths of both within-subjects and between-
subjects designs.
1.Between-Subjects Factors: These are factors where different
groups of participants are exposed to different levels of the factor.
For example, if you are studying the effect of a new teaching
method, one group of students might receive the new method
(treatment group), while another group continues with the traditional
method (control group).
2.Within-Subjects Factors: These are factors where the same
participants are exposed to all levels of the factor. For instance, if
you are measuring the effect of different types of study
environments (quiet vs. noisy), each participant would experience
both environments.
Advantages of Full factorial design
1. Efficiency
Fewer Experiments: By examining multiple factors at once, researchers
can gather more information from fewer experimental runs compared to
studying one factor at a time.
2. Interaction Effects
Understanding Interactions: Factorial design allows researchers to assess
not only the main effects of each factor but also how they interact with
each other, providing a deeper understanding of the system being studied.
3. Comprehensive Analysis
Detailed Insights: It helps in identifying not just individual effects but also
combinations that might lead to unexpected results, which can be crucial
for applications like pharmaceuticals.
4. Randomization
• Reduced Bias: Randomization of treatment assignments helps control
for potential confounding variables, leading to more reliable results.
5. Flexibility
• Adaptable to Different Scenarios: Factorial designs can be easily
adapted to various fields, including agriculture, pharmaceuticals, and
manufacturing, making them widely applicable.
6. Statistical Power
• Increased Power: More comprehensive data collection can increase
the statistical power of the analysis, making it easier to detect
significant effects.
7. Robustness
• Exploring Robustness: Factorial designs help in assessing how robust
findings are to changes in experimental conditions, which is critical in
fields like drug development.
8. Cost-Effective
• Resource Efficiency: By optimizing the number of experiments,
factorial designs can reduce costs associated with time, materials, and
labor.
9. Facilitates Screening
• Screening Multiple Factors: Researchers can screen multiple factors
and levels in early phases of research to identify the most promising
ones for further study.
Surface response methodology(RSM)
Response Surface Methodology is a statistical and
mathematical technique used for modeling and optimizing
complex relationships between multiple independent variables
(factors) and one or more dependent variables (responses). It is
particularly useful in situations where the relationship between
the factors and the response is not well understood or is nonlinear.
Advantages of RSM
• Efficient Exploration: Allows for the investigation of multiple factors
simultaneously.
• Predictive Modeling: Helps create predictive models for complex systems.
• Optimization: Facilitates finding optimal conditions for desired outcomes.
• Visualization: Provides graphical representations of the relationships between
factors and responses.
For example, the growth of a plant is affected by a certain amount of
water x1 and sunshine x2. The plant can grow under any combination
of treatment x1 and x2. Therefore, water and sunshine can vary
continuously. When treatments are from a continuous range of values,
then a Response Surface Methodology is useful for developing,
improving, and optimizing the response variable. In this case, the plant
growth y is the response variable, and it is a function of water and
sunshine. It can be expressed as
y = f (x1, x2) + ε
The variables x1 and x2 are independent variables where the response
y depends on them.
The dependent variable y is a function of x1, x2, and
the experimental error term, denoted as ε.
The error term ε represents any measurement error on the response,
as well as other type of variations not counted in f.
It is a statistical error that is assumed to distribute normally with zero
mean and variance σ 2 .
In most RMS problems, the true response function f is unknown. In
order to develop a proper approximation for f, the experimenter
usually starts with a low-order polynomial in some small region.
If the response can be defined by a linear function of independent
variables, then the approximating function is a first-order model.
A first-order model with 2 independent variables can be expressed as
Y= β0 + β1X1 + β2X2 + ε
If there is a curvature in the response surface, then a higher degree
polynomial should be used.
The approximating function with 2 variables is called a second-order
model:
Y= β0 + β1X1 + β2X2 + β2X212+ β11X212 + β22X222 + β12X1X2 + ε
In general all RSM problems use either one or the mixture of the both
of these models. In each model, the levels of each factor are
independent of the levels of other factors.
When the levels of each factor are not independent then a mixture
model is appropriate for designing an RMS model.
The response surface analysis is performed by using the fitted surface.
The response surface designs are types of designs for fitting response
surface.
Therefore, the objective of studying RSM can be accomplish by
(1) Understanding the topography of the response surface (local
maximum, local minimum, ridge lines), and
(2) Finding the region where the optimal response occurs. The goal is
to move rapidly and efficiently along a path to get to a maximum or
a minimum response so that the response is optimized.
Response Surface Methods and Designs
Response Surface Methods are designs and models for working with
continuous treatments when finding the optima or describing the
response.
The first goal for Response Surface Method is to find the optimum
response. When there is more than one response then it is important
to find the compromise optimum.
The second goal is to understand how the response changes in a given
direction by adjusting the design variables.
The graph is helpful to see the shape of a response surface; hills,
valleys, and ridge lines. Hence, the function f (x1, x2) can be plotted
versus the levels of x1 and x2.
y = f (x1, x2) + ε
In this graph, each value of X1 and X2 generates a y-value.
This three-dimensional graph shows the response surface from the side and it is called a
response surface plot.
The contour plots can show contour lines of X1 and X2 pairs that have the same response
value y.
When there are more than two independent variables, graphs are difficult or almost
impossible to use to illustrate the response surface, since it is beyond 3-dimension.
The most common designs used in response surface methodology are
Central composite design(CCD),
Box-Behnken design,
Dohlert design and
Mixture response surface methodology.
Central composite Design
Central Composite Design (CCD) is a type of experimental design used
in Response Surface Methodology (RSM) to build a second-order
(quadratic) model for the response variable without needing a full
three-level factorial experiment.
CCD is particularly effective for optimizing processes and exploring the
relationships between factors.
This design has three different design points namely,
Edge point (±1)
Star point (±α)
Centre point
The center of the design space to a factorial point is ±1 unit for each
factor.
The distance from the center of the design space to a star point is |α| >
1.
The distance of each axial point (star point) from the centre in a CCD is α.
A value <1 puts the axial point in the cube,
A value = 1 puts them on the faces of the cube and
A value >1 puts them outside the cube.
The precise value of α depends on certain properties desired for the
design and on the number of factors involved.
• Types of central composite design
• The Box and Wilson design or CCD model comprising of factorial, and
factorial3 design.
• The star point outside the domine and the center point, representing the
experimental domine, helps determine the response surface plot.
• By estimating the precision of surface responses, the value of α can be
determined; where star design is ± α.
• There are three types of CCD; the α can be determined according to the
calculation possibilities and the required precision, which can be obtained
from surface responses.
• The α value’s positioning determines the quality of the design or
estimation. The rate by design is identified by determining the position of
the points.
• Eventually, the center composite design was classified into three types:
Central Composite
Design Type Terminology Comments
CCC designs are the original form of the central composite design. The star
points are at some distance from the center based on the properties
desired for the design and the number of factors in the design. The star
points establish new extremes for the low and high settings for all factors.
circumscribed CCC (5 levels)
These designs have circular, spherical, or hyperspherical symmetry and
require 5 levels for each factor. Augmenting an existing factorial or
resolution V fractional factorial design with star points can produce this
design.
For those situations in which the limits specified for factor settings are
truly limits, the CCI design uses the factor settings as the star points and
CCI (5 levels) creates a factorial or fractional factorial design within those limits (in other
Inscribed
words, a CCI design is a scaled down CCC design with each factor level of
the CCC design divided by α to generate the CCI design). This design also
requires 5 levels of each factor.
In this design the star points are at the center of each face of the factorial
CCF (3 levels) space, so α = ± 1. This variety requires 3 levels of each factor. Augmenting
Face Centered
an existing factorial or resolution V design with appropriate star points can
also produce this design.
Experimental run using CCD