Lecture 2
Lecture 2
1
Lecture 2: Modeling process quality
Additional Readings
Group values of the variable into bins, then count the number
of observations that fall into each bin
Plot frequency (or relative frequency) versus the values of
the variable
Chapter 3 Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. 10
Copyright (c) 2012 John Wiley & Sons, Inc.
Chapter 3 Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. 11
Copyright (c) 2012 John Wiley & Sons, Inc.
Chapter 3 Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. 12
Copyright (c) 2012 John Wiley & Sons, Inc.
Numerical Summary of Data
Sample average:
17
Interquartile Range
• In descriptive statistics, the interquartile range (IQR), also
called the midspread or middle 50%, or technically H-
spread, is a measure of statistical dispersion, being equal to
the difference between 75th and 25th percentiles.
-
• IQR helps figure out how precise data is. It
eliminates outliers.
- 1.5 ( - ), + 1.5 ( - )]
19
The Box Plot
(or Box-and-Whisker Plot)
• Box-and-Whisker Plots enclose the
interquartile range of the data that has the
median* displayed within. It also displays
“whiskers”; showing the extreme observations
in the sample.
*The fiftieth percentile of the data distribution is called the sample median.
The median can be thought of as the data value that exactly divides the sample
in half, with half of the observations smaller than the median and half of them
larger.
A probability distribution is a mathematical model that relates the value of the variable with
the probability of occurrence of that value in the population. In other words, we might
visualize layer thickness as a random variable because it takes on different values in the
population according to some random mechanism, and then the probability distribution of layer
thickness describes the probability of occurrence of any value of layer thickness in the
population. There are two types of probability distributions.
review
In fact, any random phenomenon that occurs on a per unit (or per unit area, per unit
volume, per unit time, etc.) basis is often well approximated by the Poisson distribution.
The probability distribution of the number x of Bernoulli trials needed to get one success.
Because the sequence of Bernoulli trials are independent, the count of the number of
trials until the next success can be started from anywhere without changing the
probability distribution.
For example, suppose we are examining a series of medical records searching for
missing information. If, for example, 100 records have been examined, the probability
that the first error occurs on record number 105 is just the probability that the next five
records are GGGGB, where G denotes good and B denotes an error.
If the probability of finding a bad record is 0.05, the probability of finding a bad record on
the fifth record examined is :
This is identical to the probability that the first bad record occurs on
record 5. This is called the lack of memory property of the geometric
distribution.