0% found this document useful (0 votes)
76 views38 pages

Data Description PDF

This document provides an overview of methods for summarizing and displaying data, including calculating measures of central tendency (mean) and variation (variance, standard deviation). It gives examples using data on the pull-off force of prototype connectors. The mean pull-off force was estimated as 13.0 pounds based on 8 observations. The sample variance was calculated as 0.2286 pounds squared and the standard deviation as 0.48 pounds. These statistical measures help analyze the data and understand the characteristics of the connector design.

Uploaded by

Ika Fanumby
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
76 views38 pages

Data Description PDF

This document provides an overview of methods for summarizing and displaying data, including calculating measures of central tendency (mean) and variation (variance, standard deviation). It gives examples using data on the pull-off force of prototype connectors. The mean pull-off force was estimated as 13.0 pounds based on 8 observations. The sample variance was calculated as 0.2286 pounds squared and the standard deviation as 0.48 pounds. These statistical measures help analyze the data and understand the characteristics of the connector design.

Uploaded by

Ika Fanumby
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 38

Data Description

Umi Yuliatin,M.Sc

PEM Akamigas

25 Maret 2019

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 1 / 33


Content:

1 Data Summary and Display

2 RANDOM SAMPLING

3 Steam and Leaf Diagram

4 Frequency Distributions and Histogram

5 Boxplot

6 Time Sequence Plot

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 2 / 33


Data Description

Umi Yuliatin,M.Sc

PEM Akamigas

25 Maret 2019

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 3 / 33


Data Summary and Display

Data Summary and Display

Well-constructed data summaries and displays are essential to good statistical


thinking, because they can focus the engineer on important features of the data or
provide insight about the type of model that should be used in solving the problem.
The computer has become an important tool in the presentation and analysis of data.
While many statistical techniques require only a hand-held calculator, much time and
effort may be required by this approach, and a computer will the tasks much more
efficiency

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 4 / 33


Data Summary and Display

Definition
If the n observations in a sample are denoted by x1 , x2 , ..., xn , the sample mean is

x1 + x2 + ... + xn
x̄ =
n
∑ni=1 xi
=
n

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 5 / 33


Data Summary and Display

Example:
suppose that an engineer is designing a nylon connector to be used in an automotive
engine application. The engineer is considering establishing the design specification on
wall thickness at 3/32 inch but is somewhat uncertain about the effect of this decision
on the connector pull-off force. If the pull-off force is too low, the connector may fail
when it is installed in an engine. Eight prototype units are produced and their pull-off
forces measured, resulting in the following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6,
13.5, 12.6, 13.1
x1 + x2 + ... + xn
x̄ =
n
∑8i=1 xi
=
8
12.6 + 12.9 + .. + 13.1
=
8
104
=
8
= 13.0

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 6 / 33


Data Summary and Display

Example:
suppose that an engineer is designing a nylon connector to be used in an automotive
engine application. The engineer is considering establishing the design specification on
wall thickness at 3/32 inch but is somewhat uncertain about the effect of this decision
on the connector pull-off force. If the pull-off force is too low, the connector may fail
when it is installed in an engine. Eight prototype units are produced and their pull-off
forces measured, resulting in the following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6,
13.5, 12.6, 13.1
x1 + x2 + ... + xn
x̄ =
n
∑8i=1 xi
=
8
12.6 + 12.9 + .. + 13.1
=
8
104
=
8
= 13.0

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 6 / 33


Data Summary and Display

Example:
suppose that an engineer is designing a nylon connector to be used in an automotive
engine application. The engineer is considering establishing the design specification on
wall thickness at 3/32 inch but is somewhat uncertain about the effect of this decision
on the connector pull-off force. If the pull-off force is too low, the connector may fail
when it is installed in an engine. Eight prototype units are produced and their pull-off
forces measured, resulting in the following data (in pounds): 12.6, 12.9, 13.4, 12.3, 13.6,
13.5, 12.6, 13.1
x1 + x2 + ... + xn
x̄ =
n
∑8i=1 xi
=
8
12.6 + 12.9 + .. + 13.1
=
8
104
=
8
= 13.0

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 6 / 33


Data Summary and Display

A physical interpretation of the sample mean as a measure of location is shown in the


dot diagram of the pull-off force data.

Gambar: The sample mean as a balance point for a system of weights.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 7 / 33


Data Summary and Display

For a finite population with N measurements, the mean is


x1 + x2 + ... + xN
µ=
N
The sample mean, x̄ , is a reasonable estimate of the population mean,µ. Therefore, the
engineer designing the connector using a 3/32-inch wall thickness would conclude, on
the basis of the data, that an estimate of the mean pull-off force is 13.0 pounds.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 8 / 33


Data Summary and Display

Definition
If x1 , x2 , ..., xn is a sample of n observations, the sample variance is

∑ni=1 (xi − x̄)2


s2 =
n−1
The sample standard deviation, s, is the positive square root of the sample variance.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 9 / 33


Data Summary and Display

Gambar: Calculation of terms for the sample variance and sample standard deviation

Variance and sample standard deviation for the pull-off force data

∑ni=1 (xi − x̄)2


s2 =
n−1
1.60
=
7
= 0.2286(pounds)2

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 10 / 33


Data Summary and Display

And the sample standard deviation is



s = s2

= 0.2286
= 0.48

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 11 / 33


Data Summary and Display

When the population is finite and consists of N values, we may define the population
variance as
∑ N ( xi − µ ) 2
σ 2 = i=1
N
The sample variance is an estimate of the population variance.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 12 / 33


Data Summary and Display

Definition
If the n observations in a sample are denoted by x1 , x2 , .., xn the sample range is

r = max(xi ) − min(xi )

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 13 / 33


RANDOM SAMPLING

Definition
A population consists of the totality of the observations with which we are concerned.

Definition
A sample is a subset of observations selected from a population.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 14 / 33


RANDOM SAMPLING

Definition
A population consists of the totality of the observations with which we are concerned.

Definition
A sample is a subset of observations selected from a population.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 14 / 33


RANDOM SAMPLING

Gambar: Relationship between a population and sample

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 15 / 33


Steam and Leaf Diagram

Steam and Leaf Diagram

Example:
These data are the compressive strengths in pounds per square inch (psi) of 80
specimens of a new aluminum-lithium alloy undergoing evaluation as a possible
material for aircraft structural elements. The data were recorded in the order of testing,
and in this format they do not convey much information about compressive strength.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 16 / 33


Steam and Leaf Diagram

Gambar: Compressive strenght (in psi) of 80 Aluminium-lithium Alloy specimens

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 17 / 33


Steam and Leaf Diagram

Step for constructing a steam and leaf diagram


1 Divide each number xi into two parts: a stem, consisting of one or more of the
leading digits and a B, consisting of the remaining digit.
2 List the stem values in a vertical column.
3 Record the leaf for each observation beside its stem.
4 Write the units for stems and leaves on the display.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 18 / 33


Steam and Leaf Diagram

Gambar: Steam and leaf diagram for the compressive strenghth data from the table

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 19 / 33


Steam and Leaf Diagram

Gambar: summary statistics for the compressive strength data from minitab

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 20 / 33


Steam and Leaf Diagram

Exercises
1 The percentage of cotton in material used to manufacture men’s shirts follows.
Construct a stem-and-leaf display for the data.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 21 / 33


Steam and Leaf Diagram

An important quality characteristic of water is the concentration of suspended solid


material. Following are 60 measurements on suspended solids from a certain lake.
Construct a stem-and-leaf diagram for this data and comment on any important
features that you notice. Compute the sample mean, sample standard deviation, and
the sample median.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 22 / 33


Frequency Distributions and Histogram

Frequency Distributions and Histogram

A frequency distribution is a more compact summary of data than a stem-and-leaf


diagram. To construct a frequency distribution, we must divide the range of the data
into intervals, which are usually called class intervals, cells, or bins.
The histogram is a visual display of the frequency distribution. The stages for
constructing a histogram follow:

Contructing a Histogram (Equal Bin Widths)


1 Label the bin (class interval) boundaries on a horizontal scale.
2 Mark and label the vertical scale with the frequencies or the relative frequencies.
3 Above each bin, draw a rectangle where height is equal to the frequency (or
relative frequency) corresponding to that bin.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 23 / 33


Frequency Distributions and Histogram

Frequency Distributions and Histogram

A frequency distribution is a more compact summary of data than a stem-and-leaf


diagram. To construct a frequency distribution, we must divide the range of the data
into intervals, which are usually called class intervals, cells, or bins.
The histogram is a visual display of the frequency distribution. The stages for
constructing a histogram follow:

Contructing a Histogram (Equal Bin Widths)


1 Label the bin (class interval) boundaries on a horizontal scale.
2 Mark and label the vertical scale with the frequencies or the relative frequencies.
3 Above each bin, draw a rectangle where height is equal to the frequency (or
relative frequency) corresponding to that bin.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 23 / 33


Frequency Distributions and Histogram

Gambar: Frequency Distribution for the Compressive Strength Data in

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 24 / 33


Frequency Distributions and Histogram

Gambar: Histogram of compressive strenght for 80 alumunium-lithium alloy specimens

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 25 / 33


Boxplot

Boxplot

A box plot displays the three quartiles, the minimum, and the maximum of the data on
a rectangular box, aligned either horizontally or vertically. The box encloses the
interquartile range with the left (or lower) edge at the first quartile,q1 , and the right (or
upper) edge at the third quartile, q3 . A line is drawn through the box at the second
quartile (which is the 50th percentile or the median),q2 = x̄.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 26 / 33


Boxplot

Gambar: Description of boxplot

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 27 / 33


Boxplot

Gambar: Boxplot for compressive strenght data

This box plot indicates that the distribution of compressive strengths is fairly
symmetric around the central value, because the left and right whiskers and the
lengths of the left and right boxes around the median are about the same.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 28 / 33


Boxplot

Exercise:
1 The ”cold start ignition time” of an automobile engine is being investigated by a
gasoline manufacturer. The following times (in seconds) were obtained for a test
vehicle: 1.75,1.92, 2.62, 2.35, 3.09, 3.15, 2.53, 1.91.
(a) Calculate the sample mean and sample standard deviation.
(b) Construct a box plot of the data.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 29 / 33


Time Sequence Plot

Time Sequence Plot

A time series or time sequence is a data set in which the observations are recorded in
the order in which they occur. A time series plot is a graph in which the vertical axis
denotes the observed value of the variable (say x) and the horizontal axis denotes the
time (which could be minutes, days, years, etc.)

Gambar: Company sales by year (a) and by quarter (b)

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 30 / 33


Time Sequence Plot

Time Sequence Plot

A time series or time sequence is a data set in which the observations are recorded in
the order in which they occur. A time series plot is a graph in which the vertical axis
denotes the observed value of the variable (say x) and the horizontal axis denotes the
time (which could be minutes, days, years, etc.)

Gambar: Company sales by year (a) and by quarter (b)

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 30 / 33


Time Sequence Plot

The general impression from this display is that sales show an upward trend. There is
some variability about this trend, with some years’ sales increasing over those of the
last year and some years’ sales decreasing

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 31 / 33


Time Sequence Plot

Exercises:
1 The following data are the viscosity measurements for a chemical product
observed hourly (read down, then left to right).

Construct and interpret either a digidot plot or a separate stem-and-leaf and time
series plot of these data.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 32 / 33


Time Sequence Plot

The pull-off force for a connector is measured in a laboratory test. Data for 40 test
specimens follow (read down, then left to right).

Construct a time series plot of the data.


Construct and interpret either a digidot plot or a stem-and-leaf plot of the data.

Umi Yuliatin,M.Sc (PEM Akamigas) Data Description 25 Maret 2019 33 / 33

You might also like