WK01 Lect-1
WK01 Lect-1
STATISTICS and
Machine Learning
Dr. Jian-Ping Li
C0.15b/ Chesham
Tel. 4539 Email: [email protected]
1
University of Bradford Home Page
1. Introduction of My Research
2. The reason to learn this module
3. Teaching Organisation
• Recommended reading
• Lectures
• Tutorials
• Lab organisation
4. Assessment
5. Support
6. Introduction of Contents
3
University of Bradford Home Page
Dr Jian-Ping Li
Shubert Functions
n 5
0.8
0.6
0.4
0.2
10
0
0 0.2 0.4 0.6 0.8 1 8
(#1)
6
-186~-184
4 -123~-121
-79~-78
2 -54~-50
-49~-44
0 -39~-30
-24~-19
-2 -17~-10
-9~-5
-4 >0.4
-6
-8
-10 3L
-10 -8 -6 -4 -2 0 2 4 6 8 10
1 3L
2
2D Shubert Function →18 global
6
solutions and 760 local solutions 3L 5 4L
10
Trap functions 3
4
200 9
160
z 4L
Fitness
200
160 120
Fitness
120
80
80
7
x y
40
40 8L
0 0
0 5 10 15 20
c 0 10 20 30
c 8L
8 L=0.635m(25in)
http://www.scholarpark.co.uk/mos
1. Why to learn Statistics
10
2. Teaching Organisation
Lectures
Time Room Demonstrator
9-10/Thursday B4.02 JP Li
Tutorial Sessions
Time/Date Room Program Demonstrator
10-11 am/ Thursday B4.02 All Zeixion Jin
/JP Li
11
3. Assessment
• Exam
• 70% weight Statistical formulae and tables provided
12
General Support
13
Introduction to
Engineering Statistics
14
Introduction to Engineering Statistics
https://www.youtube.com/watch?
v=886ahjgAxI8
15
Introduction to Engineering Statistics
17
Why do we need Statistics?
• Variability is the most fundamental law of nature;
• Variability affects performance of all systems;
• We need to take variability into account in all
aspects of engineering:
18
Why do we need Statistics?
19
How do you define “quality” of a
product?
• Quality is ultimately
judged on the basis of the
observed behaviour;
people always expect
consistent performance of
function around the ideal
/ target.
• Functions/Performance,
reliability, maintainability,
cost …
20
How do you define “quality” of a
product?
Cost
Performance / reliability
21
The Engineered System Model
Where do these “Noise Variables”
come from?
Noise Variables
Engineered
Input System / Output
Product /
Process
Controlled
Variables
Engineering design parameters 22
The Engineered System Model
Change
Customer / over time
Systems
operation/ (e.g. wear)
interactions
use
Manufacturing
/ part to part
Environment
Noise Variables
Engineered
Input System / Output
Product /
Process
Controlled
Variables
Engineering design parameters 23
Statistical Thinking
24
Engineering Statistics
26
Engineering Statistics
27
Topics Covered in the ES Module
29
ES Foundation Topics – Example
Problems
Example (1): Concrete strength
• A manufacturer of concrete structures is carrying out
concrete compressive strength tests on samples
collected from ongoing production at regular times.
The nominal specification is 60MPa.
• Sample of strength test results:
Sample 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Strength 64.5 66 66.8 67 67.7 67.7 67.8 68.2 68.7 69 69 69.9 70.5 71.5 71.9
[Mpa]
Normal
distribution
model for
concrete
strength
Predictions for the
mean concrete
strength
31
ES Foundation Topics – Example
Problems
32
ES Foundation Topics – Example
Problems
Example 2: Customer Evaluation of Opening Effort
Response (1-10)
Competitor Competitor Prototype Prototype Prototype
C1 (22N) C2 (3N) P1 (4.5N) P2 (10.5N) P3 (18N)
1 4 8 7 9 9
2 4 5 6 9 6
3 5 4 8 9 9
4 7 4 9 10 9
5 7 4 8 8 9
6 8 5 9 9 8
7 3 5 8 10 6
8 5 3 6 9 8
9 7 4 6 9 6
10 4 5 9 8 8
and so on… Which concept is best?
(from a customer viewpoint) 33
Topics Covered in the ES Module
RSM
Design of
Experiments
Linear Regression
(fitting empirical models)
35
36
Autonomous Car
37
Types of Learning
38
Supervised Learning: Regression
39
Supervised Learning: Classification
40
Unsupervised Learning
41
Reinforcement Learning
42
Reinforcement Learning
43
Machine Learning vs Artificial Intelligence
44
EXPLORATORY DATA ANALYSIS
Part 1: Introduction
45
Exploratory Data Analysis
Outline Learning Objectives
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
Population
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146
http://tbn0.google.com/images?q=tbn:NQbtKn6w7jxJoM:http://www.tayna.co.uk/getimage.php%3Fimage%3Dtayna/5368570146tayna.jpg
49
Types of Data
Variable / Continuous Data
• Continuous data assumes infinite
values in an interval;
• Are dependent on precision of
measurement!
• Measurement scale can be divided
into infinite values;
• Examples: temperature, pressure,
length, time, weight, strength, etc.
50
Quiz
What type of data are the following:
51
Describing Data with Numbers
• An engineer is developing rubber “O” rings for
plasma etching tools used in the semiconductor
industry. A tensile strength test is carried out
after the prototypes (sample of 8) have been
immersed in nitric acid for 25 minutes at 30⁰C.
• As expected, there is variability in the tensile
strength measurements.
Tensile strength measurements
1035 1040 1045 1050 1055 1060 1065 1070 1075 How can we describe
Tensile strength [psi] variability in the tensile
strength data? 52
Measures of Location
n
xi x 1 + x 2 + ... + x n
• We can calculate the sample mean x= i=1 =
n n
• For the “O” ring data
8
xi x1 + x 2 + ... + x 8 1037 + 1047 + ... + 1040 8440
i=1
x= = = = = 1055.0 [psi]
8 8 8 8
The sample mean can be thought as the “balance point” of the data set.
53
Measures of Spread
55
Measures of Spread
However, we
can sum the
squares of the
deviations!
n Population
size
• V(x) is still not very convenient to use from a
practical point of view because it has different units
(squared) than the data;
• We can calculate the standard deviation by taking
the square root of variance:
= V(x) =
i
(x − x )2
n 57
Sample Variance and Standard
Deviation
• In calculating the variance and standard deviation of a sample we
should use (n-1) as divisor for the sum of squares; this attempts to
correct the error in estimating the mean (i.e. if the true mean of the
population is different from the sample mean X , then the true
sum of squares around would be bigger).
s = s2 =
(x i − x )2
=
1348
= 13.9 [psi]
variance;
• and are used for
n−1 7
population mean and
standard deviation 58
Thanks!
60