0% found this document useful (0 votes)
490 views86 pages

Minitab Introduction

Minitab is statistical software used for business analysis. It allows importing and storing data in worksheets, generating graphs and statistical results, and reporting outputs. Key features include storing all project components together, editing graphs and reports, and transferring data between applications. Minitab assists with statistical analysis through an intuitive interface and toolbars.
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
490 views86 pages

Minitab Introduction

Minitab is statistical software used for business analysis. It allows importing and storing data in worksheets, generating graphs and statistical results, and reporting outputs. Key features include storing all project components together, editing graphs and reports, and transferring data between applications. Minitab assists with statistical analysis through an intuitive interface and toolbars.
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 86

tab Introduction D M A I C Black Belt DM

Introduction to Minitab

Minitab
Is an Application Software
For Studying Statistical Tools and
Applying Them for Business Needs.
It Complements Six Sigma for its Features and
Ease of
Implementation.

Six Sigma Certification 1


tab Introduction D M A I C Black Belt DM

Objectives
At the end of this this topic, Introduction to Minitab, you
will be able to:
• Describe the basics of Minitab.
• Derive statistical parameters for a given set of data using
Minitab and otherwise.
• Analyze the given data graphically and statistically using
Minitab.

Six Sigma Certification 2


tab Introduction D M A I C Black Belt DM

Minitab Basics
In Minitab, there are projects (Minitab Project, *.mpj) in which
• Data is stored in Minitab Worksheet, *.mtw
• Graphs are stored in Minitab Graphs, *.mgf
• Results in Minitab Session, *.txt
• Reports in Minitab Project Report: *.rtf

To Start a New Project


• Choose File > New.
• Choose Project, then click OK.
To save a project, you save all your work at once: all the data, all the
output in the Session window, and all the open Graph windows. When
you reopen the project, all that information appears for you.

Six Sigma Certification 3


tab Introduction D M A I C Black Belt DM

Minitab Basics
To Save a Project:
• Choose File > Save Project As.
• In Save in, navigate to the location where you'd like to keep your
project.
• In File name, enter a name for your project, and click Save.
After this, simply choose File > Save Project to save all of your work.
Output or data can be used in another application or project by saving
Session window output, data, and graphs as separate files.
To Open a Worksheet
• Data in an empty project is written in the available worksheet.
• It can be copied from a different worksheet by opening it in the
project.

Six Sigma Certification 4


tab Introduction D M A I C Black Belt DM

Minitab Basics
To use the file <Pressure.mtw> available in the Data subdirectory or
folder.
– Choose File > Open Worksheet
– Move to the Data subdirectory and select the worksheet Pressure.mtw.
– Click Open.
Graphs allow you to display patterns, relationships, and distributions in
your data that are difficult to evaluate simply by looking at a worksheet.
We will study few graphs useful for data analysis.
Selecting Graph Items for Editing Cosmetic changes like the font of or
color, or structural changes like increasing the range of a scale can be
done.
• For making selection, there are three methods:
– Click the item.
– Choose Editor > Select Item, then choose the item from the list.
– Select the item from the list in the graph editing toolbar.

Six Sigma Certification 5


tab Introduction D M A I C Black Belt DM

Minitab Basics
• After selecting the intended item, editing can be done by either of the
ways:
– Double-click the item.
– Choose Editor > Edit.
– Right-click and choose Edit.
– Click, the Edit button, on the Graph Editing toolbar.
Graphs in Other Applications
• Graph may be added in Report Pad in Minitab or to Word or
PowerPoint.
– With the layout active, choose Edit > Copy Graph.
– In the Word document, place the cursor where you want to insert the
graph.
– Choose Edit > Paste Special.

Six Sigma Certification 6


tab Introduction D M A I C Black Belt DM

Minitab Basics
– In As, choose Mtb Graph Object. This ensures that the graph can be edited
in the Word document. With other options (picture or bitmap, for example),
the resulting image cannot be edited with Minitab graph editing tools.
– Click OK.
– Double-click the graph in the Word document. Minitab's graph annotation
toolbar appears, and you can double-click any graph item to edit it as you
would in Minitab.

Save and Exit


– Choose File > Save Project.
– Exit Minitab by choosing File > Exit.
– It also saves the Report Pad which can be referred for presentations.

Six Sigma Certification 7


tab Introduction D M A I C Black Belt DM

Minitab Basics : Window


• Multiple (max Four)
Interactive Windows.
• Only one can open at a time.
• Windows can be saved
separately.

Session Window:
Tool Bar Menu Bar • Analytical Output

Data Window:
• A Worksheet different from a Spreadsheet
• Column names are above first row
3 data types of Columns: Text/ Numeric/ Date
Info Window: • Everything in a column is considered to be the same
• Synopsis of worksheet variable

History Window:
• Stores Commands

Six Sigma Certification 8


tab Introduction D M A I C Black Belt DM

Minitab Basics : Toolbar


Session
SessionWindow
WindowToolbar
Toolbar

Previous Command Manage Graphs


Open File
Save File Next Command Close Graphs
Find
Print Window Find Next Cancel

Cut Help
Copy
Paste Last Dialog Box
Undo Session Window
Data Window

Six Sigma Certification 9


tab Introduction D M A I C Black Belt DM

Minitab Basics : Toolbar


Data
DataWindow
WindowToolbar
Toolbar
Manage Graphs
Open File Insert Cells
Save File Insert Rows Close Graphs
Insert Columns
Print Window Cancel
Move Columns
Clear Cells

Cut Previous Brushed Row Help


Copy Next Brushed Row
Paste Last Dialog Box
Undo Session Window
Data Window

Commands can also be accessed from drop down menus or Hot keys.

Six Sigma Certification 10


tab Introduction D M A I C Black Belt DM

Minitab Basics
To Transfer Data from Worksheet to Session Window:
Go to Data > Display Data

Six Sigma Certification 11


tab Introduction D M A I C Black Belt DM

Minitab Basics
To Change Data type:
Go to Data > Change Data Type
Change Data
Type:
Manip > Change Data Type >
Numeric to Text
Numeric to Text Manip > Change Data Type >
Text to Numeric
Manip > Change Data Type >
Date/Time to Text
Manip > Change Data Type >
Date/Time to
Numeric
Manip > Change Data Type >
Numeric to
Date/Time
Manip > Change Data Type >
Text to Date/Time

Six Sigma Certification 12


tab Introduction D M A I C Black Belt DM

Minitab Basics
To Code Data type:
Go to Data > Code

Text to Numeric Code

Six Sigma Certification 13


tab Introduction D M A I C Black Belt DM

Minitab Basics
To Extract data:
Go to Data > Extract from Date/ Time > Text
Data > Extract from Date/ Time > Numeric

Month extracted from Date

Six Sigma Certification 14


tab Introduction D M A I C Black Belt DM

Basic Statistics
Statistics is the collection, organization, analysis, interpretation and
presentation of data. The most common ones are:
Count Average
Minimum Maximum
Sum Percent Defects
We would be studying a few of the important Statistics such as:
• Statistics which defines the location:

Recall
Mean Median
Mode
• Statistics which defines the dispersion:
Range Variance
Standard Deviation Quartile

Six Sigma Certification 15


tab Introduction D M A I C Black Belt DM

Basic Statistics : Dispersion


Turn-Around-Time (TAT) of service calls for two brands of TVs ‘A’
and ‘B’ are shown. As a customer, which one would you prefer to buy?

I Turn-around-time A

Turn-around-time B

Debate
µ Bµ A

II Turn-around-time A

Turn-around-time B

µ Bµ A

Six Sigma Certification 16


tab Introduction D M A I C Black Belt DM

Basic Statistics : Dispersion


Only central - tendency of data cannot be the deciding factor
for judging performance. The variation- tendency (spread)
of data also needs to be known.
Range
A value obtained by finding the difference between maximum and
minimum values of a data set = Maximum – Minimum

Variance and Standard Deviation


Deviation is the distance of data from its mean, showing how much data is
distributed. If these deviations are summed up, it gives zero. In order to find
the data deviation, the individual differences from the mean are squared.
The sum of the values gives variance and when the square root is taken, its
gives the standard deviation.

Six Sigma Certification 17


tab Introduction D M A I C Black Belt DM

Population and Sample Statistics


Population Statistics Sample Statistics
Population mean (µ) Sample mean (X-bar)

µ= ∑ xi
X=
∑ Xi
N n
Population variance (σ2) Estimate for variance (s2)

σ 2
=
∑ (x − µ )
i
2
s2 =
∑ i
( x − x ) 2

N n −1
Population Std Deviation(σ) Estimate for Std Deviation(s)

σ = (σ 2 ) s = (s 2 )
Six Sigma Certification 18
tab Introduction D M A I C Black Belt DM

Basic Statistics : Dispersion


Quartile
Data is put in order and each quartile holds 25% of the total data
components. Quartile gives some idea of the dispersion of data. There are
three quartiles designated as Q1, Q2 and Q3
First quartile, Q1 = a value corresponding to 25%
Second Quartile (median), Q2 = a value corresponding to 50%
Third Quartile, Q3 = a value corresponding to 75%
Inter quartile Range, IQR = Q3-Q1

Example - Calculate quartile and IQR for the data:


1, 10, 20, 4, 9, 5, 4, 3
Q2 (Median) = 4.5

1 3 4 4 5 10 9 20 IQR = 7.5

Q1 = 3.25 Q3 = 10.75

Six Sigma Certification 19


tab Introduction D M A I C Black Belt DM

Exercise

Find mean, median, mode, range, variance, and standard


deviation (sigma) for the sample data given below:

2, -2, 0, 1, 5, 4, 3, 1, 0, -2, -4, -3 ,-2, -2, 0

Six Sigma Certification 20


tab Introduction D M A I C Black Belt DM

Solution
For the data set below, the values for various parameters is
2, -2, 0, 1, 5, 4, 3, 1, 0, -2, -4, -3 ,-2, -2, 0.
The sorted data is -4, -3, -2, -2, -2, -2, 0, 0, 0, 1, 1, 2, 3, 4, 5.
Q1 Q2 Q3
1. Mean = 0.0667
IQR = Q3 – Q1 = 0
2. Median = 0
3. Mode = -2
4. Range = 9-5 = 4
5. Variance = 6.92
6. Standard Deviation (sigma) = 2.63

Six Sigma Certification 21


tab Introduction D M A I C Black Belt DM

Solving Using Minitab


• Open a new Worksheet (in the existing or a new Project)
• Type in the numbers vertically in a column
• Name the column as DATA
• Go to Stat Basic
Statistics Display
Descriptive Statistics.
Statistics
Double- click C1 to be
chosen as “Variables”
• Go to Statistics to
select the required
values and click OK
• Find the answers
displayed in the
Session window.
Six Sigma Certification 22
tab Introduction D M A I C Black Belt DM

Solving Using Minitab

Six Sigma Certification 23


tab Introduction D M A I C Black Belt DM

Graphical Analysis
We have seen that Minitab can be used for statistical analysis
of data. Now let us study how we can analyze data
graphically using the tools listed below:
– Pareto Chart
– Histogram
– Dot Plot
– Box Plot
– Scatter Plot
– Matrix Plot
– Marginal Plot
– Time-Series Plot
– Run Chart
– Normality Test
Six Sigma Certification 24
tab Introduction D M A I C Black Belt DM

Pareto Analysis
Pareto chart is named after an Italian economist
Vilfredo Pareto whose theory states that 80% of
wealth is owned by 20% of the population. Based on
this theory, the chart is used to visually depict the
significance level of categories plotted.
• The theory that 80% of the problems come from 20%
of the causes is called Pareto’s principle. While the
percentages may not be always exactly 80/20, there
usually are “the vital few and the trivial many.”
• Generally, Pareto chart is used to plot a measurement Vilfredo Pareto
that reflects cost to the organization. 1848 - 1923

Example – Draw a Pareto chart for Customer Complaints received


from eight different zones of India – 54,25,75,12,65,42,12,41
<Pareto_complaints.mtw>

Six Sigma Certification 25


tab Introduction D M A I C Black Belt DM

Pareto Analysis
To draw a Pareto chart, follow the path –
Minitab> Stats > Quality Tools > Pareto Chart

Pareto Chart of Zone

400 100

80
300

60

Percent
Count
200
40

100
20

0 0
Zone One Three Five Eight Six Two Four Other
Count 132 102 52 41 32 25 12 12
Percent 32.4 25.0 12.7 10.0 7.8 6.1 2.9 2.9
Cum % 32.4 57.4 70.1 80.1 88.0 94.1 97.1 100.0

Six Sigma Certification 26


tab Introduction D M A I C Black Belt DM

Pareto Analysis
Conclusion:
The chart shows One, Three and Five as the major sources of customer
complaints. The improvements should be initially concentrated on these
major contributors.
Pareto Chart is used when:
• Trying to focus on the most significant problem or cause
• Relating cause and effect, by comparing a Pareto chart classified by
causes with one classified by effects
• Analyzing data by groups, to reveal unnoticed patterns
• Communicating with others about your data
• Evaluating improvement, by comparing before and after data

Six Sigma Certification 27


tab Introduction D M A I C Black Belt DM

Exercise
Draw a Pareto Chart for the following travel expenses for six months. As the approving authority for travel
expenses, which departments should you monitor first for controlling the expenses:

Deptt Travel Deptt Travel Deptt Travel


Expenses Expenses Expenses

A 5000 E 5200 C 700


C 1000 E 4500 E 4600
B 2500 A 4000 E 2500
B 2700 E 3500 B 2200
D 8000 D 5800 D 7500
A 1700 B 3200 C 200
C 300 B 3000 A 5400
D 6500 C 400 E 2800
A 8000 D 7500 B 1200
D 7000 A 6200 C 500

<Pareto_dept.mtw>
Six Sigma Certification 28
tab Introduction D M A I C Black Belt DM

Solution
Pareto Chart depicts (visually) the total expenses for six months for each
of the five departments.

Pareto Chart of Deptt


120000
100
Maximum
100000
Contributors
80
Travel Expenses

80000

Percent
60
60000

40
40000

20000 20

0 0
Department D A E B Other
Count 42300 30300 23100 14800 3100
Percent 37.2 26.7 20.3 13.0 2.7
Cum % 37.2 63.9 84.2 97.3 100.0

Six Sigma Certification 29


tab Introduction D M A I C Black Belt DM

Exercise
Reena is a quality personnel whose job involves inspecting TVs for sound
output on the scale 1 to 5, (1 being the lowest wattage and 5 the highest).
Following is the data for 20 TVs. Calculate the percentage of TVs that Reena
should reject. (Criterion for reject is that the sound output is between 1 – 3).

Sound Output Sound Output Sound Output


1 4 2
1 2 5
2 3 4
2 5 4
4 5 3
5 4 1
2 4

<Pareto_TV.mtw>
Six Sigma Certification 30
tab Introduction D M A I C Black Belt DM

Solution

Six Sigma Certification 31


tab Introduction D M A I C Black Belt DM

Solution
The percentage of rejected TVs = 25 + 15 + 10 = 50%

Maximum Pareto Chart of Sound Output


Contributors 20 100

80
15
Frequency

60

Percent
10
40

5
20

0 0
Sound Output Scale 4 2 5 1 3
Count 6 5 4 3 2
Percent 30.0 25.0 20.0 15.0 10.0
Cum % 30.0 55.0 75.0 90.0 100.0

Six Sigma Certification 32


tab Introduction D M A I C Black Belt DM

Histogram
• Is a chart that displays distribution, center location, and
variation of data by categorizing data.
• Unlike bar graph (commonly used in Excel), it can show
distribution of continuous data.
From the given data, draw a Histogram for the number of
working days (production) in a month for a three year period.
M onth Jan-02 F eb-02M ar-02 A pr-02M ay -02 Jun-02 Jul-02 A ug-02S ep-02 O c t-02 Nov-02Dec -02
W ork ing_day s 22 22 24 23 24 21 17 23 23 24 19 23

M onth Jan-03 F eb-03M ar-03 A pr-03M ay -03 Jun-03 Jul-03 A ug-03S ep-03 O c t-03 Nov-03Dec -03
W ork ing_day s 19 18 26 20 17 22 25 22 23 20 25 24

M onth Jan-04 F eb-04M ar-04 A pr-04M ay -04 Jun-04 Jul-04 A ug-04S ep-04 O c t-04 Nov-04Dec -04
W ork ing_day s 23 23 24 19 18 25 24 26 21 25 26 24

<histogram.mtw>
Six Sigma Certification 33
tab Introduction D M A I C Black Belt DM

Histogram

Six Sigma Certification 34


tab Introduction D M A I C Black Belt DM

Histogram
Histogram of W_ days
7 7
7

Data labels
6
may be defined
5
Frequency

4 4
4

3 3
3

2 2 2 2
2

0
18 20 22 24 26
W_ days

Six Sigma Certification 35


tab Introduction D M A I C Black Belt DM

Histogram

Histogram of W_ days
Normal
Mean 22.33
7
StDev 2.586
N 36
6

5
Frequency

4
Shows
distribution 3

in 2
comparison
to normal 1

curve of 0
same mean 18 20 22 24 26 28
W_ days
and standard
deviation Other options may be tried
Six Sigma Certification 36
tab Introduction D M A I C Black Belt DM

Dot Plot
• Is a chart that plots dots on a number line depicting
frequency and spread of the data. If the data size is large,
each dot on the chart represents more than one value.
Considering the same data used for plotting Histogram, let us
now draw a Dot Plot
• Go to
Minitab > Graph > Dotplot
• Choose the option “ Simple”
Simple

<histogram.mtw>
Six Sigma Certification 37
tab Introduction D M A I C Black Belt DM

Dot Plot
To find the corresponding values for dots,
• Right- click on the graph and choose
Brush.
Brush
• Right-click again and select the
columns required to be displayed in “Set
ID Values”.
• Choose the area and find the display.

Six Sigma Certification 38


tab Introduction D M A I C Black Belt DM

Box Plot
• Boxplot is used to obtain information about the shape, dispersion, and mid-value of a given
data.
• Spots outliers
• Used to assess the symmetry of the data
Draw a Boxplot the same working days example. (Change the working
days for Jun-2003 as 10 and Dec-2003 as 31).
• Go to
Minitab > Graph > Boxplot
• Choose the option “ Simple”
Simple
• Select the data column

<boxplot.mtw>
Six Sigma Certification 39
tab Introduction D M A I C Black Belt DM

Box Plot

Boxplot of W_ days OUTLIER


OUTLIER

30
UPPER LIMIT= Q3 + 1.5 (Q3- Q1) 3rd Quartile Q3

25
W_ days

MEDIAN
20

1st Quartile Q1
LOWER 15
LIMIT= Q1 - 1.5 (Q3- Q1)

10

OUTLIER
OUTLIER

Six Sigma Certification 40


tab Introduction D M A I C Black Belt DM

Box Plot : Analysis


• The line drawn through the box represents the median of the data.
• The edge above the median represents the first quartile (Q1), while the
edge below represents the third quartile (Q3). Thus the box portion of
the plot represents the interquartile range (IQR = Q3-Q1), or the
middle 50% of the observations.
• The lines extending from the box are called whiskers. The whiskers
extend outward to indicate the lowest and highest values in the data set
(they exclude outliers).
• Extreme values, or outliers, are represented by dots. A value is
considered an outlier if it is outside of the box (greater than Q3 or less
than Q1) by more than 1.5 times the IQR.
• Brush may be used on the graph to find the values of outliers.

Six Sigma Certification 41


tab Introduction D M A I C Black Belt DM

Scatter Plot
• The Scatter Plot illustrates the relationship between two variables.
• Though the variation in one variable with respect to the other is graphically
shown, the graph cannot be relied on making judgments on the same.
Example- Relationship between population in India to Growth rate of China.
Draw the Scatter Plot for the data
Year Month Sales Adv Exp AdAgency Year Month Sales Adv Exp AdAgency
2000 January 210 30 Wise 2001 January 368 30 Smart
2000 February 205 25 Wise 2001 February 358 25 Smart
2000 March 202 55 Wise 2001 March 345 36 Smart
2000 April 245 43 Wise 2001 April 380 36 Smart
2000 May 237 60 Wise 2001 May 391 30 Smart
2000 June 290 50 Wise 2001 June 403 35 Smart
2000 July 299 60 Wise 2001 July 410 45 Smart
2000 August 345 43 Wise 2001 August 430 29 Smart
2000 September 326 34 Wise 2001 September 410 33 Smart
2000 October 355 36 Wise 2001 October 415 35 Smart
2000 November 359 38 Wise 2001 November 435 37 Smart
2000 December 371 34 Wise 2001 December 450 40 Smart

<Scatterplot.mtw>

Six Sigma Certification 42


tab Introduction D M A I C Black Belt DM

Scatter Plot
• For the given data, we may find the relationship of Sales w.r.t. the
expenses on Advertisement cost, its variation with month/ Ad agency
• Go to Graph > Scatterplot > Simple
• All three plots can be made together by choosing Sales Exp as y and
Adv Exp, month, and Ad agency as x(s)

Six Sigma Certification 43


tab Introduction D M A I C Black Belt DM

Scatter Plot
Positive
Scatterplot of Sales vs Adv Exp, Month, Year relation, but
Adv Exp Month groups

No 400
relation

300

200
Sales

20 30 40 50 60
37600 37700 37800 37900 38000
Year

400

300

High sales expenses


200 in the year 2001
2000.00 2000.25 2000.50 2000.75 2001.00

Six Sigma Certification 44


tab Introduction D M A I C Black Belt DM

Scatter Plot (With Group)


Scatterplot of Sales vs Adv Exp
AdAgency
450
Smart
Wise

400

350

Sales
300

250

200

20 30 40 50 60
Adv Exp

Plot
suggests
Sales are
more for
Smart
Agency

Six Sigma Certification 45


tab Introduction D M A I C Black Belt DM

Matrix Plot
• Matrix Plot is used to find relationship between multiple variables
• Let us create a Matrix Plot for the Sales data discussed in Scatter Plot
• Go to Graph > Matrix Plot > With Groups and enter data as:

Six Sigma Certification 46


tab Introduction D M A I C Black Belt DM

Matrix Plot
Matrix Plot of Sales, Adv Exp, Month, Year
20 40 60 2000.0 2000.5 2001.0

AdAgency
400
Smart
Sales Wise
300
Mirror Image

200
60

40 Adv Exp

20 38000

37800
Month

37600
2001.0

2000.5 Year

2000.0
200 300 400 37600 37800 38000

Mirror Image
• Same result can be drawn from the plot
Six Sigma Certification 47
tab Introduction D M A I C Black Belt DM

Marginal Plot
• A marginal plot is a Scatter plot with graphs at the margins of the
x- and/or y-axes, which depict the distribution of the points in each
direction, or the sample marginal distributions.
Let us create a Marginal Plot for Failing_students Rich_students School_Code
3 130 1
the given data: 2 55 2
1 35 3
5 205 4
4 162 5
1 24 6
4 160 7
2 65 8
3 100 9
5 210 10
1 30 11
2 60 12
1 12 13
4 175 14
2 75 15
<Marginal_students.mtw> 3 125 16
Six Sigma Certification 48
tab Introduction D M A I C Black Belt DM

Marginal Plot
• Go to Graph > Marginal Plot > With Histograms

Six Sigma Certification 49


tab Introduction D M A I C Black Belt DM

Marginal Plot
Marginal Plot of Failing_ students vs Rich_ students

Shows
positive
relationship

5
Failing_students

0 50 100 150 200


Rich_ students

Expert Tip: The Marginal plot only indicates the relationship between entities.
The method of finding the exact equation between entities will be learnt later.
Six Sigma Certification 50
tab Introduction D M A I C Black Belt DM

Time Series Plot


• The Time Series Plot is used to:
– Detect seasonality in data
– Detect trends in data over time
– Compare trends across groups
• The time series data is plotted on the vertical y-axis versus time
on the horizontal x-axis.
• A Scatter plot can be used instead, if:
– the data is not in chronological order, or
– the data collection intervals are irregular, you may want to
create a Scatter plot instead.

Six Sigma Certification 51


tab Introduction D M A I C Black Belt DM

Time Series Plot


Date Owl Cat Tortoise Mouse

• Example – A toy manufacturer 1/1/05


1/2/05
22
23
23
23
22
21
22
21
1/3/05 24 24 24 24
in America has four production 1/4/05
1/5/05
23
25
25
19
22
21
20
24

lines working on all days. The 1/6/05


1/7/05
23
24
18
19
20
22
21
17
1/8/05 22 17 23 22
data shows number of defects in 1/9/05
1/10/05
23
21
23
24
21
20
23
24
1/11/05 19 26 22 18
each line for the month of 1/12/05
1/13/05
20
19
23
19
24
22
23
19

January, 2005. Find the stability 1/14/05


1/15/05
21
19
18
17
20
21
18
26
1/16/05 20 20 22 20
of the data. 1/17/05
1/18/05
24
22
17
25
23
22
17
22
1/19/05 25 25 21 25
• Go to 1/20/05
1/21/05
22
23
26
23
20
21
18
23
1/22/05 20 17 23 20
Stat > Time Series > Time 1/23/05
1/24/05
25
24
19
18
23
22
25
19

Series Plot 1/25/05 19 19 22 19


1/26/05 22 23 22 22
1/27/05 24 19 22 17
1/28/05 22 17 23 22
1/29/05 23 23 21 21
1/30/05 23 23 21 23
<Run.mtw> 1/31/05 21 24 20 24

Six Sigma Certification 52


tab Introduction D M A I C Black Belt DM

Time Series Plot : Multiple

Six Sigma Certification 53


tab Introduction D M A I C Black Belt DM

Time Series Plot : Multiple


Time Series Plot of Owl, Cat, Tortoise, Mouse
26 Variable
Owl
Cat
Tortoise
24
Mouse

22
Data

20

18

16
5 5 5 5 5 5 5 5 5 5
200 200 200 200 200 200 200 200 200 200
03/ 06/ 09/ 12/ 15/ 18/ 21/ 24/ 27/ 30/
/ / / / / / / / / /
01 01 01 01 01 01 01 01 01 01
Date

Six Sigma Certification 54


tab Introduction D M A I C Black Belt DM

Time Series Plot : Simple


Time Series Plot of Tortoise Time Series Plot of Mouse

24 26

24
23
Tortoise

22

Mouse
22

20

21
18

20
16
5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
3/2 6/2 9/2 2/2 5/2 8/2 1/2 4/2 7/2 0/2 3/2 6/2 9/2 2/2 5/2 8/2 1/2 4/2 7/2 0/2
/0 /0 /0 /1 /1 /1 /2 /2 /2 /3 /0 /0 /0 /1 /1 /1 /2 /2 /2 /3
01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01
Date Date

Time Series Plot of Cat Time Series Plot of Owl


26 25

24 24

23
22
Cat

Owl

22
20
21

18 20

16 19

05 05 05 05 05 05 05 05 05 05 5 5 5 5 5 5 5 5 5 5
20 20 20 20 20 20 20 20 20 20 00 00 00 00 00 00 00 00 00 00
3/ 6/ 9/ 2/ 5/ 8/ 1/ 4/ 7/ 0/
3/2 6/2 9/2 2/2 5/2 8/2 1/2 4/2 7/2 0/2
/0 /0 /0 /1 /1 /1 /2 /2 /2 /3 /0 /0 /0 /1 /1 /1 /2 /2 /2 /3
01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01 01
Date Date

Six Sigma Certification 55


tab Introduction D M A I C Black Belt DM

Time Series Plot : Result Analysis


• The time series plot consists of
– Time scale (index, calendar, clock, or stamp column) on the x-axis
– Data scale on the y-axis
– Lines displaying each time series
• For the data, four sets of data, one for each line are plotted
the same time series plot.
• The time series plot suggests no grouping or seasonal
effect in data. Though it gives some indication but
primarily suggests no trends/ some oscillations.
• The plot is not a good way to judge. To statistically show
the evidence of non- randomness, Run chart is used.

Six Sigma Certification 56


tab Introduction D M A I C Black Belt DM

Run Chart
• Run charts are used to monitor process changes
associated with characteristic of interest over time.
• It is used to find patterns in process data or randomness
or stability on the basis of –
– Test for number of runs about the median
• Clusters
• Oscillations
– Test for number of runs up and down
• Trends
• Mixtures
• Consider the same example of toys.
• Go to Stat > Quality Tools > Run Chart
<Run.mtw>
Six Sigma Certification 57
tab Introduction D M A I C Black Belt DM

Run Chart

If the data is in
single column If the data is in
multiple columns

Six Sigma Certification 58


tab Introduction D M A I C Black Belt DM

Run Chart
Run about the median
Run Chart of Owl, ..., Mouse
26

24
Owl, ..., Mouse

22

20

18

16
2 4 6 8 10 12 14 16 18 20 22 24 26 28 30
17>16.35 Sample
Number of runs about median: 17 Number of runs up or down: 19
Expected number of runs: 16.35484 Expected number of runs: 20.33333
19<20.33
Longest run about median: 5 Longest run up or down: 4
Approx P-Value for Clustering: 0.59407 Approx P-Value for Trends: 0.27916
Approx P-Value for Mixtures: 0.40593 Approx P-Value for Oscillation: 0.72084

Six Sigma Certification 59


tab Introduction D M A I C Black Belt DM

Run Chart : Graph Analysis


Run about the median
• A run about the median is one or more consecutive
points on the same side of the median. When the points
are connected by a line, a run ends when the line crosses
the median. A new run begins with the next plotted point.
• In the example, 17 runs about the median were observed.
The blue circles marked show the runs about the median.

Six Sigma Certification 60


tab Introduction D M A I C Black Belt DM

Run Chart : Graph Analysis


Test for number of runs about the median
• The test checks two types of non-random behavior - mixtures and
clusters.
• Mixtures are characterized by an absence of points near the median.
• Clusters are groups of points that have similar values.
• When the observed number of runs is
– statistically greater than the expected number of runs, then mixtures are
suggested. (In the example, 17>16.35, therefore mixtures).
– statistically less than the expected number of runs, then clusters are
suggested.
• In this example, though Mixtures are more than Clusters, the p-values
for clustering (0.594) and mixtures (0.405) are greater than the alpha-
level of 0.05. Therefore, you can conclude that the data does not
indicate mixtures or clusters.

Six Sigma Certification 61


tab Introduction D M A I C Black Belt DM

Run Chart
Run up or down
Run Chart of Owl, ..., Mouse
26

24
Owl, ..., Mouse

22

20

18

16
17>16.35 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30
Sample
Number of runs about median: 17 Number of runs up or down: 19
19<20.33
Expected number of runs: 16.35484 Expected number of runs: 20.33333
Longest run about median: 5 Longest run up or down: 4
Approx P-Value for Clustering: 0.59407 Approx P-Value for Trends: 0.27916
Approx P-Value for Mixtures: 0.40593 Approx P-Value for Oscillation: 0.72084

Six Sigma Certification 62


tab Introduction D M A I C Black Belt DM

Run Chart : Graph Analysis


Run up or down
• A run up or down is one or more consecutive points in
the same direction. A new run begins each time there is a
change in direction (either ascending or descending) in the
sequence of data.
• For example, when the preceding value is smaller, a run up
begins and continues until the proceeding value is larger
than the next point, then a run down begins.
• In the example below, 19 runs up or down were observed.
The blue lines marked show the runs up or down.

Six Sigma Certification 63


tab Introduction D M A I C Black Belt DM

Run Chart : Graph Analysis


Test for number of runs up and down
• This test is based on the number of runs up or down -
increasing or decreasing - and is sensitive to two types of
non-random behavior - oscillation and trends.
• When the observed number of runs is
– statistically greater than the expected number of runs, then
oscillation is suggested.
– statistically less than the expected number of runs, then a trend is
suggested. (In the example, 19<20.33, therefore trends).
• In this example, the p-values for trends (0. 279) and
oscillation (0. 720) are greater than the alpha-level of 0.05.
Therefore, you can conclude that the data does not indicate
a trend or oscillation.
Six Sigma Certification 64
tab Introduction D M A I C Black Belt DM

Normality Test
• A normality test is used to check if the given data is
normally distributed.
• It is important to check for normality because most of the
tools (tests of means and variances) to be discussed,
assume the data to be normal. The data, therefore should
first be checked because taking it through the tools.
• Though most of the data collected is normal (Central
Limit Theorem), as soon as the data is collected, it should
be checked for normality before it is treated by a statistical
tool. Normality may also be checked for any data at any
point of project cycle, which is relevant to the project.

Six Sigma Certification 65


tab Introduction D M A I C Black Belt DM

Normality Test
How to Check for Normality?
There are various methods to check for MONTH #ACCIDENTS

JAN 3
Normality. We will discuss the three most
FEB 4
commonly used methods:
MAR 0

Probability Plot APRIL 1

• Follow the path in Minitab: MAY 2

JUNE 4
– Minitab> Graph > Probability Plot
JULY 3
• Example – Given is the data of the
AUGUST 5
number of accidents in an year on a highly
SEPTEMBER 1
accident – prone highway. Check if the
OCTOBER 4
data is normal.
NOVEMBER 2

DECEMBER 0

Six Sigma Certification 66


tab Introduction D M A I C Black Belt DM

Normality Test: Probability Plot

Six Sigma Certification 67


tab Introduction D M A I C Black Belt DM

Normality Test: Probability Plot

Six Sigma Certification 68


tab Introduction D M A I C Black Belt DM

Normality Test: Probability Plot


Conclusion:
P – value = 0.436. Since p – value is greater than 0.05, the given data is
normal.
0.05 is the alpha Normality test for accidents
value (alpha risk/ Normal
99

type – I error or
Mean 2.417
StDev 1.676
95 N 12

the significance 90
AD
P-Value
0.339
0.436

level). Generally, 80
70
Percent

60
the value assigned 50
40

is 5% unless stated 30
20

otherwise or 10

5
demanded by the
1
process. -2 -1 0 1 2 3
ACCIDENTS
4 5 6 7

Six Sigma Certification 69


tab Introduction D M A I C Black Belt DM

Normality Test: Graphical Summary


• Minitab> Stat > Basic Statistics > Graphical Summary

Default is 95% unless


stated otherwise

Six Sigma Certification 70


tab Introduction D M A I C Black Belt DM

Confidence Level
• Confidence Interval gives you a range of likely values based on the
sample.Confidence Level is how sure you want to be that the
population mean or std. deviation falls in the confidence interval you
are going to calculate based on the sample!
• Six Sigma and industry typically use a 95% Confidence Level which
means:
– 95% chance that the population mean or std. deviation lies within
the confidence interval.
– 5% chance (alpha risk) that population mean is outside the
confidence interval.
– For highly sensitive processes which are working at a high sigma
level, this value of alpha risk may be reduced from 5% to ensure
less alpha error. Similarly, vice – versa also holds true.

Six Sigma Certification 71


tab Introduction D M A I C Black Belt DM

Normality Test: Graphical Summary

Confidence interval for


Mean

Conclusion:
Since p – value is greater than 0.05, the given data is normal.
Six Sigma Certification 72
tab Introduction D M A I C Black Belt DM

Normality Test
Minitab> Stat > Normality Test

Six Sigma Certification 73


tab Introduction D M A I C Black Belt DM

Normality Test
Conclusion:
P – value = 0.436. Since p – value is greater than 0.05, the given data is
normal.

The Prediction Probability Plot of ACCIDENTS


Normal - 95% CI

Interval is the 99
Mean
StDev
2.417
1.676

range in which the 95

90
N
AD
12
0.339
P-Value 0.436
new response value 80

is expected to fall. 70
Percent

60

That is, it provides 50


40
30
an interval of 20

possible response 10

5
values given a
combination of 1
-5.0 -2.5 0.0 2.5 5.0 7.5
predictor levels. ACCIDENTS

Six Sigma Certification 74


tab Introduction D M A I C Black Belt DM

Exercise
For the given data, check if it is normal. If so, analyze your data using all/
few of the following charts:
Durability Carpet_typeCom positionBrand Plastic_fibreColour Porosity_Inv App_tem p
18.95 1 A Strength 10 W hite 42.9 25
– Pareto Chart 12.62 1 B Strength 7 Red 30.2 30
11.94 1 A Strength 7 Red 28.9 30
– Histogram 14.42 1 B Strength 8 Red 33.8 30
– Dot Plot 10.06 2 A Strength 6 Red 25.1 30
7.19 2 B Strength 5 Black 19.4 35
– Box Plot 7.03 2 A Strength 5 Black 19.1 35
14.66 2 B Strength 8 Red 34.3 30
– Scatter Plot 10.92 3 A Power 6 Red 26.8 30
13.28 3 B Power 8 Red 31.6 30
– Matrix Plot 14.52 3 A Power 8 Red 34 30
12.51 3 B Power 7 Red 30 30
– Marginal Plot 10.46 4 A Power 6 Red 25.9 30
21.4 4 B Power 12 W hite 47.8 20
– Time-Series Plot 18.1 4 A Power 10 W hite 41.2 25
– Run Chart 22.5 4 B Power 12 W hite 50 20

<Charts_Exercise.mtw>

Six Sigma Certification 75


tab Introduction D M A I C Black Belt DM

Solution
• Here the response is Durability and rest are the input
variables. Let us do Normality Test to find if the data is
Normal.
Probability Plot of Durability
Normal
99
Mean 13.79
StDev 4.545
95 N 16
AD 0.337
90
P-Value 0.458
80
70
Percent

60
50
40
30
20 Since p > 0.05,
10 Data is Normal
5

1
5 10 15 20 25
Durability

<Charts_Exercise.mpj
Six Sigma Certification 76
tab Introduction D M A I C Black Belt DM

Solution
• Though not mentioned, the most desired value for
Durability would be the highest value
• Let us draw a dot plot to see the response.

As seen using Brush, the


lower values occur for the
combinations below.

Six Sigma Certification 77


tab Introduction D M A I C Black Belt DM

Solution
• Let us see if a Box plot can give some information
Boxplot of Durability vs Brand
Brand 4 gives more
22.5
variation but also
20.0
higher values
17.5
Brand 2 gives lower
Durability

15.0
Durability values
12.5

10.0 Boxplot of Durability vs Carpet_ type

22.5
7.5

20.0
5.0
Power Strength
17.5
Brand
Durability

15.0

12.5

Strength brand gives 10.0

lower Durability (also 7.5

seen from Dot plot) 5.0


1 2 3 4
Carpet_ type

Six Sigma Certification 78


tab Introduction D M A I C Black Belt DM

Solution

• We would code the text variables so that we can treat them


as Numeric – Composition (A=1,B=2) and Colour
(White=1, Red=2, Black =3)

Six Sigma Certification 79


tab Introduction D M A I C Black Belt DM

Solution
• Data > Code > Text to Numeric

Six Sigma Certification 80


tab Introduction D M A I C Black Belt DM

Solution
• In this case, we are interested only in the relationship of Xs
with y, Durability, we are using a Scatter plot

Six Sigma Certification 81


tab Introduction D M A I C Black Belt DM

Solution
Scatterplot of Durability vs Carpet_ type, Plastic_ fibr, Porosity, ...
Carpet_type Plastic_fibre Porosity Brand
Power
20 Strength

15

10
Durability

5
1.5 2.5 3.5 5.0 7.5 10.0 25 35 45
App_temp NComp NColour

20

15

10

5
24 30 361.0 1.5 2.0 1 2 3

Note: Matrix Plot can also be drawn instead of Scatter plot.


Six Sigma Certification 82
tab Introduction D M A I C Black Belt DM

Solution
Conclusions
Durability is undesirable if:
• There is less plastic fiber
• Less porosity_inv
• High temperature (beyond 30)
• High Ncolour (Black)
• Strength brand
• Carpet type 2
• Both compositions give mixed values.

Six Sigma Certification 83


tab Introduction D M A I C Black Belt DM

Self- Practice Exercise

Analyze the data for the defect causes


given by <Charts_Exercise2.mtw>,
using the discussed tools.
Discuss the answers
amongst your
teams.

Six Sigma Certification 84


tab Introduction D M A I C Black Belt DM

Solution
Pareto Chart of Defect type
7000
100
6000 Major Contributors
5000 80

Percent
Count

4000 60
3000
40
2000
20
1000
Run Chart of No. of Defects
0 0
Defect type
in
g
es
s
ar
e ol d re ken ken ts er
to ge w a ar th
iss htn w y a ro 100ro P O
M ig ar
d bl m
ar
d B B g
al Br d H em da H a rt lug sin
u s ng ing P P i s
an w e as gi er 80M
M Lo ag ng a iss No. of Defects
m i ck M Po
w
Da iss Pa
M 60
Count 2068 895 716 499 443 422 412 411 364 261
Percent 31.9 13.8 11.0 7.7 6.8 6.5 6.3 40 6.3 5.6 4.0
Cum % 31.9 45.6 56.7 64.4 71.2 77.7 84.0 90.4 96.0 100.0
20

0
1 20 40 60 80 100 120 140 160 180 200 220 240
Sample

p>0.05, Random data Number of runs about median:


Expected number of runs:
130
123.48163
Number of runs up or down:
Expected number of runs:
163
163.00000
Longest run about median: 5 Longest run up or down: 4
Approx P-Value for Clustering: 0.79806 Approx P-Value for Trends: 0.50000
Approx P-Value for Mixtures: 0.20194 Approx P-Value for Oscillation: 0.50000

Six Sigma Certification 85


tab Introduction D M A I C Black Belt DM

Minitab : Ready Reckoner

<Minitab_Shortcut_Guide>

Six Sigma Certification 86

You might also like