Unit-4 Chapter-6 Design of Experiments & Regression Analysis Part-1
Unit-4 Chapter-6 Design of Experiments & Regression Analysis Part-1
CHAPTER-6
DESIGN OF EXPERIMENTS &
REGRESSION ANALYSIS
Dr. Dhiren R. Patel
INTRODUCTION TO DESIGN OF EXPERIMENTS
Decide what phenomenon you wish to investigate. Specify how you can manipulate the factor and hold all other
conditions fixed, to insure that these extraneous conditions aren't influencing the response you plan to measure.
Then measure your chosen response variable at several (at least two) settings of the factor under study. If
changing the factor causes the phenomenon to change, then you conclude that there is indeed a cause-and-effect
relationship at work.
How many factors are involved when you do an experiment? Some say two - perhaps this is a comparative
experiment? Perhaps there is a treatment group and a control group? If you have a treatment group and a
control group then, in this case, you probably only have one factor with two levels.
You probably follow a recipe so there are many additional factors that control the ingredients - i.e., a
mixture.
What parts of the recipe did they vary to make the recipe a success? Probably many factors, temperature
and moisture, various ratios of ingredients, and presence or absence of many additives.
Now, should one keep all the factors involved in the experiment at a constant level and just vary one to see
what would happen?
The agricultural origins, 1918 – 1940s The second industrial era, late 1970s –
R. A. Fisher & his co-workers 1990
Profound impact on agricultural science Quality improvement initiatives in many companies
Factorial designs, ANOVA CQI and TQM were important ideas and became
management goals
The first industrial era, 1951 – late 1970s Taguchi and robust parameter design, process
Box & Wilson, response surfaces robustness
Applications in the chemical & process industries
The modern era, beginning circa 1990,
when economic competitiveness and
globalization are driving all sectors of the
economy to be more competitive.
DR. DHIREN R. PATEL 7
Immediately following World War II the first industrial era marked another
resurgence in the use of DOE.
It was at this time that Box and Wilson (1951) wrote the key paper in response
surface designs thinking of the output as a response function and trying to find the
optimum conditions for this function.
oHe came up with the concept of robust parameter design and process robustness.
•Now it is a company and they employ a technique which has been adopted by many of the
large manufacturing companies.
•This is a technique that uses statistics to make decisions based on quality and feedback
loops.
oThis evolved in the 1960s when medical advances were previously based on anecdotal data; a
doctor would examine six patients and from this wrote a paper and published it.
oThe incredible biases resulting from these kinds of anecdotal studies became known.
oThe outcome was a move toward making the randomized double-blind clinical trial the gold
standard for approval of any new product, medical device, or procedure.
If you are doing a comparative experiment where you have two treatments, a treatment and
a control, for instance, you need to include in your experimental process the assignment of those
treatments by some random process.
You need to have a deliberate process to eliminate potential biases from the conclusions, and
random assignment is a critical step.
of the mean is? It is the square root of the estimate of the variance of the sample mean, i.e.,
The width of the confidence interval is determined by this statistic. Our estimates of the mean become less
variable as the sample size increases.
Replication is the basic issue behind every method we will use in order to get a handle on how precise our
estimates are at the end. We always want to estimate or control the uncertainty in our results. We achieve this
estimate through replication. Another way we can achieve short confidence intervals is by reducing the error
variance itself. However, when that isn't possible, we can reduce the error in our estimate of the mean by increasing
n.
Another way is to reduce the size or the length of the confidence interval is to reduce the error variance - which
brings us to blocking.
DR. DHIREN R. PATEL 15
BLOCKING
Blocking is a technique to include other factors in our experiment which contribute to undesirable variation.
Much of the focus in this class will be to creatively use various blocking techniques to control sources of variation
that will reduce error variance.
For example, in human studies, the gender of the subjects is often an important factor.
Age is another factor affecting the response.
Age and gender are often considered nuisance factors which contribute to variability and make it difficult to
assess systematic effects of a treatment.
By using these as blocking factors, you can avoid biases that might occur due to differences between the
allocation of subjects to the treatments, and as a way of accounting for some noise in the experiment.
We want the unknown error variance at the end of the experiment to be as small as possible.
Our goal is usually to find out something about a treatment factor (or a factor of primary interest), but in
addition to this, we want to include any blocking factors that will explain variation.
DR. DHIREN R. PATEL 16
MULTI-FACTOR DESIGNS
Multi-factor experimental designs: designs, designs, response surface designs, etc.
The point to all of these multi-factor designs is contrary to the scientific method where everything is
held constant except one factor which is varied.
The one factor at a time method is a very inefficient way of making scientific advances.
It is much better to design an experiment that simultaneously includes combinations of multiple factors
that may affect the outcome.
Then you learn not only about the primary factors of interest but also about these other factors.
These may be blocking factors which deal with nuisance parameters or they may just help you
understand the interactions or the relationships between the factors that influence the response.
At the end of the study, they might realize that drug A had only been given to the male subjects
and drug B was only given to the female subjects.
This refers to the fact that if you analyze the difference between the average response of the
subjects on A and the average response of the subjects on B, this is exactly the same as the
average response on males and the average response on females.
You would not have any reliable conclusion from this study at all.
The difference between the two drugs A and B, might just as well be due to the gender of the
subjects since the two factors are totally confounded.
DR. DHIREN R. PATEL 19
Confounding is something we typically want to avoid but when we are building complex
experiments we sometimes can use confounding to our advantage.
We will confound things we are not interested in order to have more efficient experiments for
the things we are interested in.
We may be interested in main effects but not interactions so we will confound the interactions
in this way in order to reduce the sample size, and thus the cost of the experiment, but still
have good information on the main effects.