Data Description PDF
Data Description PDF
Umi Yuliatin,M.Sc
PEM Akamigas
25 Maret 2019
2 RANDOM SAMPLING
5 Boxplot
Umi Yuliatin,M.Sc
PEM Akamigas
25 Maret 2019
Definition
If the n observations in a sample are denoted by x1 , x2 , ..., xn , the sample mean is
x1 + x2 + ... + xn
x̄ =
n
∑ni=1 xi
=
n
Example:
suppose that an engineer is designing a nylon connector to be used in an automotive
engine application. The engineer is considering establishing the design specification on
wall thickness at 3/32 inch but is somewhat uncertain about the effect of this decision
on the connector pull-off force. If the pull-off force is too low, the connector may fail
when it is installed in an engine. Eight prototype units are produced and their pull-off
forces measured, resulting in the following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6,
13.5, 12.6, 13.1
x1 + x2 + ... + xn
x̄ =
n
∑8i=1 xi
=
8
12.6 + 12.9 + .. + 13.1
=
8
104
=
8
= 13.0
Example:
suppose that an engineer is designing a nylon connector to be used in an automotive
engine application. The engineer is considering establishing the design specification on
wall thickness at 3/32 inch but is somewhat uncertain about the effect of this decision
on the connector pull-off force. If the pull-off force is too low, the connector may fail
when it is installed in an engine. Eight prototype units are produced and their pull-off
forces measured, resulting in the following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6,
13.5, 12.6, 13.1
x1 + x2 + ... + xn
x̄ =
n
∑8i=1 xi
=
8
12.6 + 12.9 + .. + 13.1
=
8
104
=
8
= 13.0
Example:
suppose that an engineer is designing a nylon connector to be used in an automotive
engine application. The engineer is considering establishing the design specification on
wall thickness at 3/32 inch but is somewhat uncertain about the effect of this decision
on the connector pull-off force. If the pull-off force is too low, the connector may fail
when it is installed in an engine. Eight prototype units are produced and their pull-off
forces measured, resulting in the following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6,
13.5, 12.6, 13.1
x1 + x2 + ... + xn
x̄ =
n
∑8i=1 xi
=
8
12.6 + 12.9 + .. + 13.1
=
8
104
=
8
= 13.0
Definition
If x1 , x2 , ..., xn is a sample of n observations, the sample variance is
Gambar: Calculation of terms for the sample variance and sample standard deviation
Variance and sample standard deviation for the pull-off force data
When the population is finite and consists of N values, we may define the population
variance as
∑ N ( xi − µ ) 2
σ 2 = i=1
N
The sample variance is an estimate of the population variance.
Definition
If the n observations in a sample are denoted by x1 , x2 , .., xn the sample range is
r = max(xi ) − min(xi )
Definition
A population consists of the totality of the observations with which we are concerned.
Definition
A sample is a subset of observations selected from a population.
Definition
A population consists of the totality of the observations with which we are concerned.
Definition
A sample is a subset of observations selected from a population.
Example:
These data are the compressive strengths in pounds per square inch (psi) of 80
specimens of a new aluminum-lithium alloy undergoing evaluation as a possible
material for aircraft structural elements. The data were recorded in the order of testing,
and in this format they do not convey much information about compressive strength.
Gambar: Steam and leaf diagram for the compressive strenghth data from the table
Gambar: summary statistics for the compressive strength data from minitab
Exercises
1 The percentage of cotton in material used to manufacture men’s shirts follows.
Construct a stem-and-leaf display for the data.
Boxplot
A box plot displays the three quartiles, the minimum, and the maximum of the data on
a rectangular box, aligned either horizontally or vertically. The box encloses the
interquartile range with the left (or lower) edge at the first quartile,q1 , and the right (or
upper) edge at the third quartile, q3 . A line is drawn through the box at the second
quartile (which is the 50th percentile or the median),q2 = x̄.
This box plot indicates that the distribution of compressive strengths is fairly
symmetric around the central value, because the left and right whiskers and the
lengths of the left and right boxes around the median are about the same.
Exercise:
1 The ”cold start ignition time” of an automobile engine is being investigated by a
gasoline manufacturer. The following times (in seconds) were obtained for a test
vehicle: 1.75,1.92, 2.62, 2.35, 3.09, 3.15, 2.53, 1.91.
(a) Calculate the sample mean and sample standard deviation.
(b) Construct a box plot of the data.
A time series or time sequence is a data set in which the observations are recorded in
the order in which they occur. A time series plot is a graph in which the vertical axis
denotes the observed value of the variable (say x) and the horizontal axis denotes the
time (which could be minutes, days, years, etc.)
A time series or time sequence is a data set in which the observations are recorded in
the order in which they occur. A time series plot is a graph in which the vertical axis
denotes the observed value of the variable (say x) and the horizontal axis denotes the
time (which could be minutes, days, years, etc.)
The general impression from this display is that sales show an upward trend. There is
some variability about this trend, with some years’ sales increasing over those of the
last year and some years’ sales decreasing
Exercises:
1 The following data are the viscosity measurements for a chemical product
observed hourly (read down, then left to right).
Construct and interpret either a digidot plot or a separate stem-and-leaf and time
series plot of these data.
The pull-off force for a connector is measured in a laboratory test. Data for 40 test
specimens follow (read down, then left to right).