14-Pattern of Data

The document discusses various ways to describe patterns of data distribution including the center, spread, shape, and unusual features. It covers different charts and graphs used to visualize distributions such as dot plots, bar charts, histograms, stem-and-leaf plots, box plots, and scatter plots. Tables are also presented as an alternative way to display data. Key aspects like comparing distributions and Simpson's paradox are mentioned.

Uploaded by

Hoàng Lê

Pattern of data

Part 1 – section 4

Lecturer: Le Hoai Long (Ph.D.)

Center
• The center of a distribution is located at the
median of the distribution.
• This is the point where about half of the
observations are on either side.
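
A minimal sketch of locating the center with the median. The sample values are hypothetical and only illustrate that about half of the observations fall on each side.

```python
from statistics import median

# Hypothetical sample: nine observations, unsorted on purpose.
observations = [12, 7, 3, 15, 9, 8, 21, 5, 10]

center = median(observations)

# Count how many observations fall on each side of the median.
below = sum(1 for x in observations if x < center)
above = sum(1 for x in observations if x > center)

print(center, below, above)
```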

Spread
• The spread of a distribution refers to the
variability of the data.
• If the observations cover a wide range, the
spread is larger. If the observations are
clustered around a single value, the spread is
smaller.
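
As a rough illustration, spread can be quantified with the range and the standard deviation. Both data sets below are hypothetical, and the choice of these two measures is an assumption, since the slide does not name a specific measure.

```python
from statistics import pstdev

# Two hypothetical data sets with the same center but different spread.
clustered = [48, 49, 50, 50, 51, 52]
dispersed = [20, 35, 50, 50, 65, 80]

def value_range(data):
    """Range: distance between the largest and smallest observation."""
    return max(data) - min(data)

for name, data in (("clustered", clustered), ("dispersed", dispersed)):
    # pstdev is the population standard deviation of the data.
    print(name, value_range(data), round(pstdev(data), 2))
```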

Shape
• The shape of a distribution is described by the
following characteristics.
– Symmetry
– Number of peaks. Distributions can have few
or many peaks.
• Distributions with one clear peak are called
unimodal,
• and distributions with two clear peaks are
called bimodal.

Shape
• And by the following characteristics.
– Skewness. Distributions with most of their
observations on the left (toward lower values)
are said to be skewed right; distributions with
most of their observations on the right (toward
higher values) are said to be skewed left.
– Uniform. When the observations in a set of
data are equally spread across the range of the
distribution, the distribution is called a uniform
distribution.
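
One way to check the direction of skew numerically is the moment coefficient of skewness, sketched below. The slides do not prescribe this measure, so treat it as an assumption; the three data sets are hypothetical.

```python
from statistics import mean

def skewness(data):
    """Moment coefficient of skewness.

    Positive: long tail toward higher values (skewed right).
    Negative: long tail toward lower values (skewed left).
    """
    m = mean(data)
    n = len(data)
    m2 = sum((x - m) ** 2 for x in data) / n  # second central moment
    m3 = sum((x - m) ** 3 for x in data) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical data sets.
right_skewed = [1, 1, 2, 2, 2, 3, 3, 4, 6, 10]   # most values on the left
left_skewed = [1, 5, 7, 8, 8, 9, 9, 9, 10, 10]   # most values on the right
symmetric = [1, 2, 3, 4, 5, 6, 7]

for name, data in (("right", right_skewed), ("left", left_skewed), ("symmetric", symmetric)):
    print(name, round(skewness(data), 2))
```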

Gap and outlier
• Gaps: areas of a distribution where there are no
observations.
• Outliers: extreme values that differ greatly from
the other observations.
Chart and graph
Dotplot
• A dotplot is made up of dots plotted on a graph.
– Each dot can represent a single observation or a
specified number of observations.
– The dots are stacked in a column over a category.
– If the categories are quantitative, the pattern of data
in a dotplot can be described in terms of symmetry
and skewness.
• Dotplots are used most often to plot frequency
counts within a small number of categories,
usually with small sets of data.
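
A small text-only sketch of a dotplot follows. It is rotated by 90 degrees (dots are stacked along a row rather than a column) so that it prints in a terminal; the data and the function name `dotplot_rows` are hypothetical.

```python
from collections import Counter

# Hypothetical small data set: number of defects found per inspection.
defects = [0, 1, 1, 2, 2, 2, 3, 3, 5]

counts = Counter(defects)

def dotplot_rows(counts):
    """Return one text row per category, with one dot per observation."""
    lo, hi = min(counts), max(counts)
    # Include empty categories so gaps in the distribution stay visible.
    return [f"{value:>2} | {'.' * counts.get(value, 0)}" for value in range(lo, hi + 1)]

rows = dotplot_rows(counts)
print("\n".join(rows))
```

The empty row for the value 4 shows a gap, and the lone dot at 5 stands apart from the rest of the data.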

Dotplot
• In SPSS: Graphs => Legacy dialogs => Scatter/Dot

Chart and graph
Bar Charts
• A bar chart is made up of columns plotted on
a graph.
– The columns are positioned over a label that
represents a categorical variable.
– The height of the column indicates the size of the
group defined by the column label.

Chart and graph
Histograms
• Like a bar chart, a histogram is made up of
columns plotted on a graph. Usually, there is no
space between adjacent columns.
– The columns are positioned over a label that
represents a quantitative variable.
– The column label can be a single value or a range of
values.
– The height of the column indicates the size of the
group defined by the column label.
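
The grouping behind a histogram can be sketched as counting observations in adjacent ranges of equal width. The data, the bin width of 10, and the helper `histogram_counts` are illustrative assumptions.

```python
# Hypothetical quantitative data: task durations in days.
durations = [3, 7, 8, 12, 14, 15, 18, 21, 22, 29]

def histogram_counts(data, start, width, n_bins):
    """Count observations in n_bins adjacent ranges.

    Bin i covers the half-open range [start + i*width, start + (i+1)*width).
    """
    counts = [0] * n_bins
    for x in data:
        index = int((x - start) // width)
        if 0 <= index < n_bins:
            counts[index] += 1
    return counts

counts = histogram_counts(durations, start=0, width=10, n_bins=3)

# Each row is one column of the histogram, labelled with its range of values.
for i, c in enumerate(counts):
    print(f"{i * 10:>2}-{i * 10 + 9:<2} {'#' * c}")
```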

Bar chart and histogram
• In SPSS: Graphs => Legacy dialogs => Bar
(Histogram)

Chart and graph
Difference Between Bar Charts and Histograms
• With bar charts, each column represents a
group defined by a categorical variable; and
with histograms, each column represents a
group defined by a quantitative variable.
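• It is always appropriate to talk about the
skewness of a histogram, because its columns
follow the order of a quantitative scale. And how
about bar charts? (Hint: the categories of a bar
chart can be arranged in any order.)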

Chart and graph
Stemplots
• A stemplot is used to display quantitative
data, generally from small data sets (50 or
fewer observations).
• The entries on the left are called stems, and
the entries on the right are called leaves.
• Stemplots usually do not include explicit
labels for the stems and leaves.
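
A minimal sketch of building a stemplot, assuming two-digit values where the tens digit is the stem and the units digit is the leaf. The scores are hypothetical.

```python
from collections import defaultdict

# Hypothetical two-digit scores (a small data set, as recommended above).
scores = [52, 67, 61, 75, 78, 70, 83, 88, 85, 94, 79, 66]

def stemplot(data):
    """Group values by tens digit (stem) and list units digits (leaves) in order."""
    stems = defaultdict(list)
    for value in sorted(data):
        stems[value // 10].append(value % 10)
    return {stem: leaves for stem, leaves in sorted(stems.items())}

plot = stemplot(scores)
for stem, leaves in plot.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```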
Chart and graph
Boxplot Basics
• A boxplot splits the data set into quartiles. The body of
the boxplot consists of a "box" which goes from the
first quartile (Q1) to the third quartile (Q3).
• Within the box, a vertical line is drawn at Q2, the
median of the data set.
• Two horizontal lines, called whiskers, extend from the
front and back of the box. The front whisker goes from
Q1 to the smallest non-outlier in the data set, and the
back whisker goes from Q3 to the largest non-outlier.
• If the data set includes one or more outliers, they are
plotted separately as points on the chart.
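
The elements of a boxplot can be computed as sketched below. The slides do not say how outliers are identified, so the 1.5 × IQR fence used here is an assumption (a common convention), and the data are hypothetical. Quartile conventions also differ between software packages, so SPSS may report slightly different values.

```python
from statistics import quantiles

# Hypothetical data set with one extreme value.
data = [2, 4, 5, 6, 7, 8, 9, 10, 30]

# Q1, Q2 (median), Q3 using the "inclusive" quartile convention.
q1, q2, q3 = quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

# Assumed outlier rule: anything beyond 1.5 * IQR from the box.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

outliers = [x for x in data if x < lower_fence or x > upper_fence]
non_outliers = [x for x in data if lower_fence <= x <= upper_fence]

# Whiskers end at the smallest and largest non-outliers.
whisker_low, whisker_high = min(non_outliers), max(non_outliers)

print(q1, q2, q3, whisker_low, whisker_high, outliers)
```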
Boxplot
• In SPSS: Graphs => Legacy dialogs => Boxplot

Chart and graph
Scatterplot
• A scatterplot is a graphic tool used to display
the relationship between two quantitative
variables.
• A scatterplot consists of an X axis (the
horizontal axis), a Y axis (the vertical axis), and
a series of dots.
• Each dot on the scatterplot represents one
observation from a data set.
Chart and graph
Scatterplot
• Scatterplots are used to analyze patterns in
bivariate data.
• These patterns are described in terms of
linearity, slope, and strength.
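
Slope direction and strength are often summarized with the Pearson correlation coefficient, sketched below. This is an added illustration rather than something the slides specify: r only measures linear association, so linearity itself still has to be judged from the plot. The paired data are hypothetical.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation: sign gives slope direction, magnitude gives strength."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical bivariate data: one (x, y) pair per observation.
hours = [1, 2, 3, 4, 5, 6]
output = [2.1, 3.9, 6.2, 7.8, 10.1, 12.0]

r = pearson_r(hours, output)
print(round(r, 3))
```

A value of r close to +1 suggests a strong positive linear pattern, a value close to -1 a strong negative one, and a value near 0 a weak linear pattern.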

Compare distributions
• Focus on four features:
– Center.
– Spread.
– Shape.
– Unusual features.

Table
• Alternatively, data can be presented in table
form
– One-way table
– Two-way table

Table
• A one-way table is the tabular equivalent of a bar
chart. Like a bar chart, a one-way table displays
categorical data in the form of frequency counts
and/or relative frequencies.
– Frequency Tables: a one-way table shows frequency
counts for a particular category of a categorical
variable
– Relative Frequency Tables: a one-way table shows
relative frequencies for particular categories of a
categorical variable
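
A one-way table can be sketched as a frequency count per category plus the corresponding relative frequencies. The category labels below are hypothetical.

```python
from collections import Counter

# Hypothetical categorical variable: type of each contract.
contract_types = ["Civil", "Industrial", "Civil", "Civil",
                  "Industrial", "Road", "Civil", "Road"]

# Frequency table: count of observations per category.
frequency = Counter(contract_types)

# Relative frequency table: each count divided by the total.
total = sum(frequency.values())
relative = {category: count / total for category, count in frequency.items()}

for category in frequency:
    print(f"{category:<12}{frequency[category]:>3}{relative[category]:>8.2f}")
```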
Table
• A two-way table (also called a contingency
table) is a useful tool for examining
relationships between categorical variables.
The entries in the cells of a two-way table can
be frequency counts or relative frequencies,
just like a one-way table.
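
A two-way table of frequency counts can be sketched by counting pairs of categories. The records and variable names below are hypothetical.

```python
from collections import Counter

# Hypothetical records: (contractor, quality rating) for each contract.
records = [("A", "Good"), ("A", "Good"), ("A", "Poor"),
           ("B", "Good"), ("B", "Poor"), ("B", "Poor")]

# Count each (row category, column category) combination.
cells = Counter(records)
rows = sorted({r for r, _ in records})
cols = sorted({c for _, c in records})

# Nested dict: table[row][column] holds the frequency count of that cell.
table = {r: {c: cells[(r, c)] for c in cols} for r in rows}

print("  " + " ".join(f"{c:>5}" for c in cols))
for r in rows:
    print(f"{r} " + " ".join(f"{table[r][c]:>5}" for c in cols))
```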

Be careful,
Simpson’s paradox
• Simpson's paradox (or the Yule-Simpson
effect) is a paradox in which a correlation
present in different groups is reversed when
the groups are combined.
• It occurs when frequency data are hastily
given causal interpretations.
• Simpson's Paradox disappears when causal
relations are brought into consideration
(Wikipedia)

Be careful,
Simpson’s paradox
• Consider the situation of two contractors in the table
below (good-quality contracts / number of contracts).
• Who is better? (Long N.D. 2010)

                    Type of contract
               Civil     Industrial   Total
Contractor A   40/60     13/15        53/75
               66.7%     86.7%        70.7%
Contractor B   5/8       42/50        47/58
               62.5%     84%          81%
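
The reversal in the contractor table can be checked with a short calculation. The counts are copied from the table above; the helper names `rate` and `overall` are illustrative.

```python
# (good-quality contracts, total contracts) copied from the table above.
results = {
    "A": {"Civil": (40, 60), "Industrial": (13, 15)},
    "B": {"Civil": (5, 8), "Industrial": (42, 50)},
}

def rate(good, total):
    """Share of contracts completed with good quality."""
    return good / total

def overall(contractor):
    """Good-quality rate after combining both contract types."""
    good = sum(g for g, _ in results[contractor].values())
    total = sum(t for _, t in results[contractor].values())
    return rate(good, total)

# Within each contract type, Contractor A has the higher rate...
for kind in ("Civil", "Industrial"):
    a = rate(*results["A"][kind])
    b = rate(*results["B"][kind])
    print(f"{kind:<11} A={a:.1%} B={b:.1%}")

# ...but after combining the groups, Contractor B has the higher rate.
print(f"{'Total':<11} A={overall('A'):.1%} B={overall('B'):.1%}")
```

The reversal arises because the two contractors have very different mixes of contract types: most of A's contracts are civil, where both contractors' rates are lower, while most of B's are industrial.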