Lecture 5 Chapter 3
Lecture 5 Chapter 3
Chapter 3
DOX 6E Montgomery 2
Minitab
DOX 6E Montgomery 3
DOX 6E Montgomery 4
DOX 6E Montgomery 5
Standard
Mean Median deviation
Abs(0.34-0.520)= 0.18
DOX 6E Montgomery 6
DOX 6E Montgomery 7
DOX 6E Montgomery 8
DOX 6E Montgomery 9
DOX 6E Montgomery 11
The Box-Cox transformation is a statistical technique used to
stabilize variance and make data more closely resemble a normal
distribution, which can improve the accuracy of various statistical
analyses. It’s especially useful when data is positively skewed or
when the spread of the data increases with the mean.
DOX 6E Montgomery 17
DOX 6E Montgomery 18
Comparing Pairs of Treatment Means
Practical Interpretation of Results
• After conducting the experiment, performing the statistical analysis, and
investigating the underlying assumptions, the experimenter is ready to draw
practical conclusions about the problem he or she is studying. For this more
formal techniques need to be applied.
• A Regression Model
• The factors involved in an experiment can be either quantitative or qualitative.
• A quantitative factor is one whose levels can be associated with points on a
numerical scale, such as temperature, pressure, or time.
• Qualitative factors, on the other hand, are factors for which the levels cannot
be arranged in order of magnitude. Operators, batches of raw material, and
shifts are typical qualitative factors because there is no reason to rank them in
any particular numerical order
• With a quantitative factor such as time, the experimenter is usually
interested in the entire range of values used, particularly the response from a
subsequent run at an intermediate factor level. That is, if the levels 1.0, 2.0,
and 3.0 hours are used in the experiment, we may wish to predict the
response at 2.5 hours. Thus, the experimenter is frequently interested in
developing an interpolation equation for the response variable in the
experiment. This equation is an empirical model of the process that has
been studied.
• The general approach to fitting empirical models is called regression
analysis
• As a first approximation, we could try fitting a linear model to
the data, say
• R-squared, or the coefficient of determination, measures the
proportion of the variance in the dependent variable that is predictable
from the independent variables.
• It provides an indication of how well the model explains the data. The
value ranges from 0 to 1, with values closer to 1 indicating a better fit.
• Adjusted R-squared modifies R2 to account for the number of
predictors in the model.
• It penalizes the addition of irrelevant variables that don’t improve the
model.
• This makes it a more reliable metric for comparing models with
different numbers of predictors.
• Predicted R-squared measures how well the model predicts new,
unseen data, as opposed to how well it fits the current data.
• This metric is often calculated using cross-validation techniques,
where the model is trained on a portion of the data and then tested on
the remaining data.
• There isn’t a single formula for predicted R2 as it depends on the
cross-validation method, but it can be expressed as:
Prediction Error Sum of Squares (PRESS)