Machine Learning Approaches To Condition Monitoring
Machine Learning Approaches To Condition Monitoring
net/publication/330452351
CITATIONS READS
0 3,885
1 author:
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Fault Diagnosis and Prognosis using Kernel Methods in Nonlinear Dynamic Processes View project
Statistical Machine Learning Applications in Chemical Process Operations, Design, and Control View project
All content following this page was uploaded by Karl Ezra Salgado Pilario on 25 July 2019.
January 2019
www.cranfield.ac.uk
Outline
2
What is Machine Learning?
A field of study concerned with giving computers the
ability to learn without being explicitly programmed.
(Arthur Samuel, 1959)
Images from:
https://www.regencychess.co.uk/checkers-game-p-1626.html
https://www.ibm.com/ibm/history/ibm100/us/en/icons/ibm700series/impacts/
3
What is Machine Learning?
IBM Watson
Jeopardy, 2011
AlexNet
ImageNet Large Scale Visual Recognition
Challenge 2012
Google DeepMind’s Alpha Go
March 2016
Images:
http://www.slate.com/blogs/future_tense/2016/03/09/google_deepmind_s_alphago_ai_beats_champion_lee_sedol_in_go.html
http://time.com/4281476/ibm-artificial-intelligence-watson-2016/
https://www.cc.gatech.edu/~hays/compvision/proj6/
4
Types of Machine Learning Problems
Supervised Learning Goal: Make a computer Unsupervised Learning Goal: Make a computer
learn from examples learn patterns from data
and non-examples. by itself.
These are dogs. These are cars. Here are some pictures.
5
Types of Machine Learning Problems
Labelled
Data
Unlabelled
Data
6
Other applications of Machine Learning
7
Why is ML taking off only now?
BIG DATA
Nowadays… (Industry 4.0)
Source: https://www.ibmbigdatahub.com/infographic/four-vs-big-data
8
Why is ML taking off only now?
9
Why is ML taking off only now?
10
Outline
11
General Condition Monitoring Methodology
Example:
Process Data +
Fault
Cases
Simulate the Process at Faulty
Condition
-
1. Air Line Blockage
2. Water Line Blockage
3. Top Separator Blockage
4. Open Direct Bypass
5. Slugging Conditions in Riser
Fault Detection,
6. And so on… Diagnosis, and
Prognosis
12
General Condition Experience of
Plant Operators
Physics & Chemistry
Monitoring Principles
Methodology Knowledge-
System of Experts
Differential
Equations
Process using based Process
Modelling Simulation Historical
Software Fault Data
Physics- Rules of Thumb
Multivariate
Statistics
Ge et al. (2013) Review of Recent Research on Data-based Process Monitoring. Industrial and Engineering Chemistry Research, 52.
13
Chiang et al. (2001). Fault Detection and Diagnosis in Industrial Systems. Springer-Verlag London.
Machine Learning for Condition Monitoring
Process Variable 2
? Faulty ?
Data
Classification
Normal Clustering
Data
? Dimensionality
Regression Reduction
Process Variable 2
Fault magnitude
?
Faulty Data
Density
Estimation
Normal Data
Process Variable 2
1.33 2.14 Normal Input data
PV 2
1.01 2.01 Normal
… … …
𝑓 𝒙 = the neural network itself
− output for faulty
In ANN, the aim is to find a = (+)
output for normal
decision boundary so that
the classification error is a
minimum. Process Variable 1 𝜎 ⋅ = 0 to 1, probability that
𝒙 is normal.
Faulty Data
Normal Data *The decision boundary consists of
point locations where 𝜎 ⋅ = 0.5.
17
Artificial Neural Networks (ANN)
for classification
Model: 𝑦 = 𝜎(𝑓 𝒙 )
where:
𝑥1 PV 1
𝒙= Input data
𝑓 𝒙 PV 2
18
Support Vector Machines (SVM)
for classification
Model: 𝑦 = 𝜎(𝒘𝑇 𝒙 + 𝑏)
Given: Support
Margin Vector
PV 1 PV 2 𝒚 where:
1.25 0.85 Faulty
PV 1
𝒙=
Process Variable 2
1.33 2.14 Normal Input data
1.01 2.01 Normal
PV 2
𝑦= −
… … … output for faulty
(+) output for normal
In SVM, the aim is to find a
decision boundary that 𝜎 ⋅ = 0 to 1, probability that
maximizes the margin of 𝒙 is normal.
separation between the Process Variable 1
two classes of data. *The decision boundary consists of
Decision point locations where 𝜎 ⋅ = 0.5.
Faulty Data
Boundary
Normal Data *Nonlinear boundaries are
achievable using kernels.
19
Solution by Optimization
NOTICE: In ANN, the aim is to find a In SVM, the aim is to find a decision
decision boundary so that the boundary that maximizes the margin of
classification error is a minimum. separation between the two classes of data.
Differential
Stochastic Evolution
Gradient Descent
Parameter 2
20
Outline
21
Nature of Plant Data
Multivariate
2. Plant Data is High-dimensional
(~100-1000 variables)
Cross-correlation
23 Ge et al. (2013) Review of Recent Research on Data-based Process Monitoring. Industrial and Engineering Chemistry Research, 52.
Challenge for supervised learning
• Performance is too low on the • The pattern was learned • Performance is too good to be true
training data set • Bias and variance are balanced on the training data set
• Patterns were not learned at all • Model is not too complex nor simple • Patterns “learned” from noise
• The model is too simple • The model may generalize well to • The model is too complex
(high bias, low variance) unseen test data (low bias, high variance)
• The model cannot generalize to • The model cannot generalize to
unseen test data unseen test data
24
Challenge for supervised learning
• Performance is too low on the • The pattern was learned • Performance is too good to be true
training data set • Bias and variance are balanced on the training data set
• Patterns were not learned at all • Model is not too complex nor simple • Patterns “learned” from noise
• The model is too simple • The model may generalize well to • The model is too complex
(high bias, low variance) unseen test data (low bias, high variance)
• The model cannot generalize to • The model cannot generalize to
unseen test data unseen test data
25
Challenge for supervised learning
Classification
Error
Model complexity
26
Further reading / studying
27
View publication stats