LVM Class 5
LVM Class 5
©
ConnectMV, 2011 1
Copyright, sharing, and attribution notice
© ConnectMV, 2011 2
We appreciate:
I if you let us know about any errors in the slides
I any suggestions to improve the notes
I telling us if you use the slides, especially commercially, so we
can inform you of major updates
I emailing us to ask about different licensing terms
© ConnectMV, 2011 3
Summary of Process Monitoring
©
ConnectMV, 2011 4
Review of assignment
©
ConnectMV, 2011 5
Why we use Hotelling’s T 2
© ConnectMV, 2011 6
Hotelling’s T 2
I After extracting components from X we accumulate A score
vectors in matrix T
©
ConnectMV, 2011 8
Hotelling’s T 2
a=A
X 2
ti,a
Ti2 = ≥0
sa
a=1
I Interpretation: directed
distance from the center to
where the point is projected
on the plane
I T 2 has an F -distribution
I Often show the 95%
confidence limit value, called
2
TA,α=0.05
© ConnectMV, 2011 9
Hotelling’s T 2
2 t12 t22
I If A = 2, equation for 95% limit = TA=2,α=0.05 = +
s12 s22
I An equation for an ellipse
I s1 and s2 are constant for a given model
I Points on ellipse have a constant distance from model center
© ConnectMV, 2011 10
Hotelling’s T 2
© ConnectMV, 2011 11
Why Euclidean distances don’t work
http://www.youtube.com/watch?v=ExoAbXPJ7NQ
© ConnectMV, 2011 12
Contribution plots
© ConnectMV, 2011 13
Diagnosing a problem
©
ConnectMV, 2011 14
LVM for troubleshooting: contribution plot
Example:
I 207: temperature on tray 129
in distillation column 3
I 158: a tag from distillation
column 3
I 33 and 277: related to
concentration of feed A
©ConnectMV, 2011 15
Contributions in the score space: one PC
©
ConnectMV, 2011 16
Contributions in the score space: one PC
Score = ti,a = xi pa = linear combination
I xi,1 p1,a xi,2 p2,a . . . xi,K pK ,a ←− there are K terms
©
ConnectMV, 2011 18
Contributions in more than 1 score
©
ConnectMV, 2011 19
Contribution plots in T 2
© ConnectMV, 2011 20
Contributions in the score space
©
ConnectMV, 2011 21
Contributions in the score space
©
ConnectMV, 2011 22
Contributions: modifying the starting point
We can modify the starting point, not necessary to use origin:
(to) (to)
I ti,a = xi pa
(from) (from) (from)
I t
i,a = xi pa ←− usually the origin: ti,a =0
Subtract:
(to) (from) (to) (from)
ti,a − ti,a = xi − xi pa
∆ti,a = ∆xi pa ←− plot as bar plot
In general:
v 2
u
(to) (from)
u t − t
(to) (from) u pk,a · i,a i,a
X
contrib(xk ) = xi,k − xi,k t
a
s a
© ConnectMV, 2011 23
Contributions in the residuals
©ConnectMV, 2011 24
Contribution plots: T 2 and SPE
©
ConnectMV, 2011 25
Leverage
I (T0 T) =
I Leveragei = scaled down version of Ti2
i=N
X
I Leveragei = A = the number of columns in T
i=1
A
I Cut off for Leveragei = 3 ·
N
I Points with Leveragei > cut off have large influence on model
©
ConnectMV, 2011 26
Leverage example
A
Cut off = 3 · = 3 × 3/184
N
©
ConnectMV, 2011 27
Variable importance to prediction
Characteristics of variables that have important role in model?
I Have large (absolute) weights: why?
I Come from a component that has a high R 2
©
ConnectMV, 2011 29
Jackknifing
We re-calculate the model G + 1 times during cross-validation:
I G times, once per group
I The “+1” is from the final round, where we use all
observations
©
ConnectMV, 2011 31
Wafer case study
I Data source: Silicon wafer thickness
I Nine thickness measurements from a silicon wafer.
I Thickness measured at the nine locations
©
ConnectMV, 2011 32
Wafer case study I
©
ConnectMV, 2011 33
Wafer case study II
9. What is the interpretation of p2 ? From a quality control
perspective, if you could remove the variability due to p2 , how
much of the variability would you be removing from the
process?
10. Plot the corresponding time series plot for t1 . What do you
notice in the sequence of score values?
11. Repeat the above question for the second component.
12. Use all the data as testing data (184 observations, of which
the first ≈ 100 were used to build the model).
13. Do the outliers that you excluded earlier show up as outliers
still? Do the contribution plots for these outliers give the
same diagnosis that you got before?
14. Are there any new outliers in points 101 to 184? If so, what
are is their diagnosis?
© ConnectMV, 2011 34
Monitoring analogy: your health
©
ConnectMV, 2011 35
Monitoring analogy: your health
Where did that intuitive model for your body’s health come from?
© ConnectMV, 2011 36
Monitoring analogy: making errors
Assume the doctor is always right and that the baseline hypothesis
is: “you are healthy”
I Type 1 error: you detect a problem (e.g. hard to breathe);
doctor says nothing is wrong
I You’ve raised a false alarm
I You feel outside your limits,
I but the truth is: “you are healthy”
I Type 1 error = raise an alarm when there isn’t a problem
© ConnectMV, 2011 37
Monitoring analogy: making errors
Assume the doctor is always right and that the baseline hypothesis
is: “you are healthy”
I Type 2 error: you feel OK; but go to doctors for physical and
they detect a problem
I You feel within your limits,
I but the truth is: “you are not healthy”
I Type 2 error = don’t raise an alarm when there is a problem
I The grid
© ConnectMV, 2011 38
Monitoring concept for a process
©
ConnectMV, 2011 39
Variability
© ConnectMV, 2011 40
Variability
More realistically:
I Sensor drift, spikes, noise, recalibration shifts, errors in our
sample analysis
I Operating staff: introduce variability into a process
I Raw material properties are not constant
I External conditions change (ambient temperature, humidity)
I Equipment breaks down, wears out, sensor drift, maintenance
shut downs
I Feedback control introduces variability
© ConnectMV, 2011 41
Variability in your product
Assertion
Customers expect both uniformity and low cost when they buy
your product. Variability defeats both objectives.
Remind yourself of the last time you bought something that didn’t
work properly
© ConnectMV, 2011 42
Variability costs you money
© ConnectMV, 2011 43
The high cost of variability in your raw materials
I Flip it around: you receive highly variable raw materials:
I That variability lands up in your product, or
I you incur additional cost (energy/time/materials) to process it
© ConnectMV, 2011 44
So what do we want
©
ConnectMV, 2011 45
Process monitoring: relationship to feedback control
©
ConnectMV, 2011 46
Other types of monitoring you will see
© ConnectMV, 2011 47
Shewhart chart (recap)
I Named for Walter Shewhart from Bell Telephone and Western
Electric, parts manufacturing, 1920’s
I A chart for monitoring variable’s location, shown with
I a lower control limit (LCL), usually at +3σ
I a upper control limit (UCL), usually at −3σ
I a target, at the setpoint/desired value
© ConnectMV, 2011 48
Judging the chart’s performance
I Type I error:
I value plotted is from common-cause operation, but falls
outside limits
I if values are normally distributed, how many will fall outside?
I ±2σ limits?
I ±3σ limits?
I Synonyms: false alarm, producer’s risk
I Type II error:
I value plotted is from abnormal operation, but falls inside limits
I Synonyms: false negative, consumer’s risk
© ConnectMV, 2011 49
Adjusting the chart’s performance
Key point
Control chart limits are not set in stone. Adjust them!
©ConnectMV, 2011 50
Discussion
©
ConnectMV, 2011 51
Discussion
©
ConnectMV, 2011 53
Discussion: monitoring only final quality data
© ConnectMV, 2011 54
Discussion: monitoring only final quality data
© ConnectMV, 2011 55
Process monitoring with PCA: scores
©
ConnectMV, 2011 56
Process monitoring with PCA: scores
©
ConnectMV, 2011 57
Process monitoring with PCA: Hotelling’s T 2
a=A
X 2
ta
Hotelling’s T2 =
sa
a=1
I The distance along the model plane
I Is a one-side monitoring plot
I What does a large T 2 value mean?
©
ConnectMV, 2011 58
Process monitoring with PCA: SPE
SPEi = (xi − x̂i )0 (xi − x̂i ) = e0i ei
I Distance off the model plane
I Is a one-side monitoring plot
I What does a large SPE value mean?
©
ConnectMV, 2011 59
Diagnosing a problem
©
ConnectMV, 2011 60
LVM for troubleshooting: contribution plot
Example:
I 207: temperature on tray 129
in distillation column 3
I 158: a tag from distillation
column 3
I 33 and 277: related to
concentration of feed A
©ConnectMV, 2011 61
Contribution plots
I Scores: ti,a = xi pa
I xi,1 p1,a xi,2 p2,a . . . xi,k pk,a . . . xi,K pK ,a
I Derivation on the board
©
ConnectMV, 2011 62
Contributions in the score space
©
ConnectMV, 2011 63
Contributions in the score space
©
ConnectMV, 2011 64
Contributions in the score space
©
ConnectMV, 2011 65
Contribution plots
I SPE = e0i ei
I where
e0i = x0i − b
x0i
I (xi,1 − x̂i,1 ) (xi,2 − x̂i,2 ) . . . (xi,K − x̂i,K )
©
ConnectMV, 2011 66
Industrial case study: Dofasco
© ConnectMV, 2011 67
Dofasco case study: slabs of steel
©
ConnectMV, 2011 69
Dofasco case study: breakout
©
ConnectMV, 2011 70
Dofasco case study: monitoring for breakouts
©
ConnectMV, 2011 71
Dofasco case study: monitoring for breakouts
©
ConnectMV, 2011 72
Dofasco case study: an alarm
©
ConnectMV, 2011 73
Dofasco case study: previous version
A previous version of the monitoring chart:
©
ConnectMV, 2011 75
Dofasco case study: economics of monitoring
©
ConnectMV, 2011 76
Lumber case study
Show video
©
ConnectMV, 2011 77
Lumber case study
©ConnectMV, 2011 79
General procedure to build monitoring models I
1. Identify variable(s) to monitor.
2. Retrieve historical data (computer systems, or lab data, or
paper records)
3. Import data and just plot it.
I Any time trends, outliers, spikes, missing data gaps?
4. Locate regions of stable, common-cause operation.
I Remove spikes and outliers
5. Building monitoring model
6. Model includes control limits (UCL, LCL) for scores, SPE and
Hotelling’s T 2
7. Test your chart on new, unused data.
I Testing data: should contain both common and special cause
operation
8. How does your chart work?
I Quantify the type I and II error.
©
ConnectMV, 2011 80
General procedure to build monitoring models II
I Adjust the limits;
I Repeat this step, as needed to achieve levels of error
9. Run chart on your desktop computer for a couple of days
I Confirm unusual events with operators; would they have
reacted to it? False alarm?
I Refine your limits
10. Not an expert system - will not diagnose problems:
I use your engineering judgement; look at patterns; knowledge
of other process events
11. Demonstrate to your colleagues and manager
I But go with dollar values
12. Installation and operator training will take time
13. Listen to your operators
I make plots interactive - click on unusual point, it drills-down
to give more context
©
ConnectMV, 2011 81
Challenges for real-time monitoring
©
ConnectMV, 2011 82
Important readings
© ConnectMV, 2011 83