0% found this document useful (0 votes)

10 views22 pages

Stats Lecture-2

Uploaded by

zeyneperolmez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views22 pages

Stats Lecture-2

Uploaded by

zeyneperolmez

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Probability and

Statistics Lecture 2
Dr Sumeyye BAKIM
2024
1
Outline
• Histograms
• Shapes of Frequency Distributions
• Misleading Graphs
• Frequency Tables and Histograms in
Research Articles

2
Histograms
A graph is another good way to facilitate
understanding of a large group of scores. A result
can be described in a thousand words or a
thousand numbers. A simple approach is to create
a graph of the frequency table. A graph of the
information in a frequency table is called a
histogram, which is a type of bar chart. In a
histogram, the height of each bar represents the
frequency of each value in the frequency table.
Normally, in a histogram, all the bars are placed
side by side without any gaps in between

A histogram is a bar-like graph of a frequency distribution

where the values are plotted along the horizontal axis, and the
height of each bar represents the frequency of that value; since
the bars are typically placed side by side without gaps, it gives

3
the appearance of a city skyline.
Stress Level Example
Histogram

4
Social Interaction Frequency
Histogram

Histogram for number of social interactions during a week for 94 college students,
based on grouped frequencies. (Data from McLaughlin-Volpe et al., 2001.
5
How to Make a Histogram

❶ Make a frequency table (or grouped frequency table).

❷ Put the values along the bottom of the page, from left to right, from lowest
to highest

Attention!! When creating a histogram from a grouped frequency table, the values you place at the
bottom of the page are the midpoints of the intervals. The midpoint of an interval is halfway between
the start of that interval and the start of the next highest interval (for the interval 0-4, the midpoint is
2.5).
❸ Make a scale of frequencies along the left edge of the page that goes from 0 at the bottom to the highest
frequency for any value.
❹ Make a bar above each value with a height for the frequency of that value.
For each bar, make sure that the middle of the bar is above its value.

If you have a nominal variable, the histogram is called a bar chart.

Since the values of a nominal variable are not ordered, a gap is left between the bars.
6
Closest Person Example
Bar Graph

Bar graph for the closest

person in life for 208 students
(see Table 4). (Data from Aron
et al., 1995.)

7
Shapes of Frequency Distributions
A frequency distribution shows the pattern of frequencies across various values. A frequency table
or histogram defines a frequency distribution because each illustrates how the frequencies are
spread or 'distributed.' Psychologists also describe this shape in words. It is important to define the
shape of a distribution.

Question: How many 'peaks' are there in a distribution?

Single?
Double?
Multiple? None?
8
FREQUENCY POLIGONS
Unimodal Distribution A frequency distribution in which one value has a
frequency that is clearly larger than the others.

Two frequency distributions that have approximately

Bimodal Distribution equal frequencies, each clearly larger than the others.

Multimodal Distribution Any distribution with two or more high points

Rectangular Distribution A distribution with values of all about the same frequency

Unimodal Bimodal Rectangular 9

In a frequency polygon, the line moves from point to point. The height of each
point represents the number of scores at that value, creating the silhouette of
a mountain. Scores obtained from most studies typically follow an
approximately unimodal (single-peaked) distribution. Bimodal and other
multimodal distributions may occasionally occur.

In the example of stress levels,

the most frequently occurring
value is 7 (the highest
frequency is 7). This is an
example of a unimodal
distribution.

10
Bimodal
(a) A bimodal distribution showing
the possible frequencies for people
of different ages in a toddler’s play
area.

Rectangular
(b) A regular distribution showing
the possible frequencies of
students at different grade
levels in an elementary school.

11
Symmetric and Skewed Distributions
Take another look at the histograms of the
stress rating example. The distribution is
balanced by an increase in scores towards
the ends, which is somewhat unusual.
Most things we measure in science have
equal numbers on both sides of the
middle. This means that in science, scores
often follow an approximately symmetric
distribution (if you fold the graph of a
symmetric distribution in half, the two
halves look the same).

12
A distribution that is not symmetric is called a skewed
distribution. The stress rating distribution is an example of
this. A skewed distribution has a long and stretched side,
resembling a tail. The side with fewer scores (the tail-like
side) is considered the direction of skewness. Thus, the stress
study example, which has very few scores at the lower end, is
skewed to the left.
The example of social interactions, which has very few scores
at the upper end, is skewed to the right (see the figure on
right). The figure below shows examples of approximately
symmetric and skewed distributions.

Approximately Positively Skewed Negatively Skewed

symmetrical
13
Strongly Skewed Distributions and Floor Effect
Strongly skewed distributions often arise in science when
there is an upper and lower limit on the measured
variable.

For example, a mechanical component cannot have a

negative tensile strength. The situation where tensile
strength cannot take a value lower than zero is called a
floor effect. The right-skewed distribution caused by this
lower limit can be seen in the figure

14
A distribution skewed to the right due to a floor effect: fictional
distribution of the number of children in families.
Ceiling Effect
The skewed distribution caused by the upper
limit can be seen in the figure on the right. This
distribution represents the results of an adults'
multiplication table test and is strongly skewed
to the left. This illustrates a ceiling effect. A
ceiling effect is also evident in the stress level
example, where the highest stress level is 10
and cannot exceed this value.
A distribution skewed to the left due to a ceiling
effect: fictional distribution of adults’ scores on
a multiplication table test.

Floor effect: Skewed to the right

Ceiling effect : Skewed to the left 15
Normal Distribution Social Interaction

Scientists define distributions by their peak points, which can

either rise or fall. The standard for this comparison is the bell-
shaped curve. In research and natural phenomena,
distributions typically resemble this bell curve, known as the
normal distribution. It's important to note that the normal
distribution has a single peak (unimodal) and is symmetrical in
shape. However, in examples of stress levels and social
interactions, the distributions can be skewed. In general, when
examining studies, results often closely align with the normal Stress Level
distribution, except in these two cases.

16
Kurtotic Distribution
Kurtosis measures how different the shape of a distribution is from a normal curve. Is it taller or
flatter than the normal curve? The term "kurtosis" comes from the Greek word "kyrtos," meaning
"curve."
The figure below (b) shows a kurtotic distribution with a more pronounced peak than the normal
curve. Figure (c) illustrates an extreme example of a kurtotic distribution that is very flat. (A
rectangular distribution would be an even more extreme example.)
Distributions that are taller or flatter than a normal curve also tend to have different shapes in
their tails. Distributions with a very tall curve typically have more data points in their tails
compared to the normal curve (see figure b).
In contrast, flatter distributions tend to have fewer data points in their tails than the normal curve
(see figure c).
By comparing kurtosis to the normal curve, we can determine how much it has become taller or
flatter. The key point here is the number of data points in the tails.

(a) normal, (b) heavy-tailed, and (c)

light-tailed distributions. The normal
distribution is shown as a dashed line
in (b) and (c).
17
Misleading Graphs

The most serious discussions about frequency

tables and histograms are not among scientists,
but among the general public.

Of course, people can lie with statistics and

often do. It's easy to lie with words, but you
may not always recognize the lies told with
numbers.

There are two main ways to explain how

frequency tables and graphs can be misused
and how to recognize such abuses.

18
Failure to Use Equal Interval Sizes
A fundamental requirement for a grouped frequency
table or graph is that the size of the intervals must
be equal. If they are not, the table or graph can be
very misleading. The table next to this text gives the
impression that commissions paid to travel agencies
dropped dramatically in 1978.

Upon closer inspection of the graph, it reveals that the

third bar for each airline only represents the first half
of 1978. Therefore, only half of the year is being
compared to complete previous years. If we assume
that the second half of 1978 is similar to the first half,
the information in this graph actually indicates an
increase rather than a decrease in 1978. For example,
Delta Airlines reached a total of $72 million in 1978,
significantly higher than the $57 million in 1977.

19
Histograms in Research Articles
Maggi and colleagues (2007) conducted a study on age-related
changes in smoking behavior among Canadian adolescents. As
shown in the figure, they created a histogram from a grouped
frequency table to display their results. Their histogram
represents the results from two samples (illustrated with dark
and light bars).

As can be seen from the figure, less than 10% of those aged 10-11
have tried smoking, while more than half of those aged 16-17
have attempted it. In this example, the researchers drew the
histogram with gaps between the bars, whereas gaps should be
avoided (unless you are drawing a bar graph for a nominal
variable).

Additionally, the differing sample sizes in each age group can lead
to misleading percentages.

20
Exaggeration of Proportions
The height of a histogram or bar graph (or
frequency polygon) typically starts at 0 or the
lowest value on the scale and extends to the
highest value on the scale.

Figure a shows a bar graph that does not

adhere to this standard. The bar graph
illustrates the average housing prices in a
specific area over four years (from 2008 to
2011). By starting the vertical axis at $150,000
(rather than 0), the graph appears to
exaggerate the changes in housing prices over
time.

Figure b, which starts the vertical axis at $0,

shows the same results. You can observe the
changes in housing prices year by year in
Figure b, and these changes are accurate.

21
The total ratio of a histogram or bar graph should be approximately 1 to 1.5 times its length, as
seen in Figure a for the stress rating example. However, consider what happens if we make the
graph much shorter or longer, as shown in Figures b and c. This change is akin to using
software to alter a person's photograph: the actual image is distorted. Any shape of a
histogram can be considered correct in some sense. However, a ratio of 1 to 1.5 has been
adopted as a standard for comparison purposes. Altering this ratio misleads the viewer.

12C Group 7 TIOC TITLE DEFENSE Compiled
86% (7)
12C Group 7 TIOC TITLE DEFENSE Compiled
6 pages
CONCLUSION Survey Theodolite
50% (6)
CONCLUSION Survey Theodolite
6 pages
C. A Review Engagement Focuses On Providing Limited Assurance On Financial Statement of A Private
No ratings yet
C. A Review Engagement Focuses On Providing Limited Assurance On Financial Statement of A Private
13 pages
Effectiveness of Study Habits
No ratings yet
Effectiveness of Study Habits
13 pages
Organisational Culture On Productivity - Namibia
No ratings yet
Organisational Culture On Productivity - Namibia
130 pages
Chapter 3: Organization, Utilization, and Communication of Test Results
No ratings yet
Chapter 3: Organization, Utilization, and Communication of Test Results
25 pages
Lesson 7 Organization of Test Data Using Tables and GraphsDDDD
No ratings yet
Lesson 7 Organization of Test Data Using Tables and GraphsDDDD
24 pages
Statistics For Managers Using Microsoft Excel: 6 Global Edition
No ratings yet
Statistics For Managers Using Microsoft Excel: 6 Global Edition
64 pages
Area Under Normal Curve Worksheet Answers
100% (1)
Area Under Normal Curve Worksheet Answers
4 pages
LLM Thesis Template
100% (3)
LLM Thesis Template
4 pages
Chapter 2 - Organization and Presentation of Data: Learning Outcomes
No ratings yet
Chapter 2 - Organization and Presentation of Data: Learning Outcomes
8 pages
Picturing Distributions With Graphs
No ratings yet
Picturing Distributions With Graphs
21 pages
Histograms, Frequency Polygons, and Ogives: Section 2.3
No ratings yet
Histograms, Frequency Polygons, and Ogives: Section 2.3
20 pages
Sample Test1
No ratings yet
Sample Test1
4 pages
BLM - Manual of Instructions - 1973
100% (1)
BLM - Manual of Instructions - 1973
332 pages
What Is It?
No ratings yet
What Is It?
3 pages
CH - 2 (Organizing and Graphing Data)
No ratings yet
CH - 2 (Organizing and Graphing Data)
83 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
59 pages
Analytical Techniques Lec 1
No ratings yet
Analytical Techniques Lec 1
42 pages
Psychological Statistics Lesson 2
No ratings yet
Psychological Statistics Lesson 2
41 pages
CH 2
No ratings yet
CH 2
39 pages
Psy 055 P1 Week 3 Module 3
No ratings yet
Psy 055 P1 Week 3 Module 3
67 pages
Data Summary 2
No ratings yet
Data Summary 2
69 pages
Week 1 - CH 2
No ratings yet
Week 1 - CH 2
49 pages
Job Satisfaction of The Employees in City Bank
No ratings yet
Job Satisfaction of The Employees in City Bank
17 pages
Chapter 2
No ratings yet
Chapter 2
51 pages
Data Organization
No ratings yet
Data Organization
69 pages
Chapter 2
No ratings yet
Chapter 2
74 pages
2. presenting of data - ١١١٠٥٩
No ratings yet
2. presenting of data - ١١١٠٥٩
39 pages
BIOL 2163 Lecture 2 - Summarizing and Graphing Data
No ratings yet
BIOL 2163 Lecture 2 - Summarizing and Graphing Data
59 pages
8614, Unit 3 5
No ratings yet
8614, Unit 3 5
48 pages
Per g01 Pub 585 Touchstone AssessmentQPHTMLMode1 GATE2451 GATE2451S2D3770 17402100628643803 CS25S23051153 GATE2451S2D3770E1.HTML#
No ratings yet
Per g01 Pub 585 Touchstone AssessmentQPHTMLMode1 GATE2451 GATE2451S2D3770 17402100628643803 CS25S23051153 GATE2451S2D3770E1.HTML#
30 pages
2 Frequency Distribution
No ratings yet
2 Frequency Distribution
32 pages
Lesson2 - Measures of Tendency
No ratings yet
Lesson2 - Measures of Tendency
65 pages
Descriptive Statistics FDT and Data Presentation
No ratings yet
Descriptive Statistics FDT and Data Presentation
60 pages
2-Organizing and Displaying Data
No ratings yet
2-Organizing and Displaying Data
65 pages
P03 - Tables and Charts
No ratings yet
P03 - Tables and Charts
30 pages
Prof Ed 6 Lesson 7
No ratings yet
Prof Ed 6 Lesson 7
31 pages
Organizing & Displaying of Data
No ratings yet
Organizing & Displaying of Data
22 pages
Ch04 Quantitative Data
No ratings yet
Ch04 Quantitative Data
48 pages
Chapter 2
No ratings yet
Chapter 2
40 pages
RVO-STATISTICS - Statistics - Introduction To Statistics IBBI
No ratings yet
RVO-STATISTICS - Statistics - Introduction To Statistics IBBI
93 pages
STUDY94@817302
No ratings yet
STUDY94@817302
18 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
21 pages
Lecture-02 Data Organization and Presentation
No ratings yet
Lecture-02 Data Organization and Presentation
36 pages
Geospatial Measurement of Urban Sprawl and Land Transformation Using Multi-Temporal Datasets: A Case Udy of Sonipat-Kundli Urban Agglomeration
No ratings yet
Geospatial Measurement of Urban Sprawl and Land Transformation Using Multi-Temporal Datasets: A Case Udy of Sonipat-Kundli Urban Agglomeration
23 pages
PSY123 Lecture 10-1
No ratings yet
PSY123 Lecture 10-1
31 pages
Organizing and Graphing Data
No ratings yet
Organizing and Graphing Data
83 pages
002 Frequency Distribution PSY102
No ratings yet
002 Frequency Distribution PSY102
59 pages
Office Hour: - Chu Chi Wing - Monday 2:30-3:30p.m - Chen Zirui - Tuesday 3:30-4:30p.m Thursday4:30-5:30p.m
No ratings yet
Office Hour: - Chu Chi Wing - Monday 2:30-3:30p.m - Chen Zirui - Tuesday 3:30-4:30p.m Thursday4:30-5:30p.m
28 pages
Sarcopenia HF SRMA
No ratings yet
Sarcopenia HF SRMA
9 pages
Aps 6 3 Notes
No ratings yet
Aps 6 3 Notes
6 pages
AAHL P3 D Questions
No ratings yet
AAHL P3 D Questions
3 pages
V2 Chapter3 Summer 2020 - 21 - Tagged
No ratings yet
V2 Chapter3 Summer 2020 - 21 - Tagged
36 pages
Effects of Change Orders On Cost Growth
No ratings yet
Effects of Change Orders On Cost Growth
9 pages
Week 2
No ratings yet
Week 2
15 pages
PP 42-49
No ratings yet
PP 42-49
8 pages
Effects of Work-Family and Family-Work Conflicts On Flexible Work Arrangements Demand: A Gender Role Perspective
No ratings yet
Effects of Work-Family and Family-Work Conflicts On Flexible Work Arrangements Demand: A Gender Role Perspective
22 pages
Business Report Presentation
No ratings yet
Business Report Presentation
14 pages
Part 1 Descriptive
No ratings yet
Part 1 Descriptive
42 pages
Frequency Distributions: Essentials of Statistics For The Behavioral Sciences
No ratings yet
Frequency Distributions: Essentials of Statistics For The Behavioral Sciences
45 pages
Davy Depth Based Classifier
No ratings yet
Davy Depth Based Classifier
33 pages
Biostatistics Module 3
No ratings yet
Biostatistics Module 3
9 pages
Summarizing and Graphing Data: 2-1 Review and Preview 2-2 Frequency Distributions
No ratings yet
Summarizing and Graphing Data: 2-1 Review and Preview 2-2 Frequency Distributions
12 pages
Basic Statistics
No ratings yet
Basic Statistics
4 pages
Principles For Sustainable Riverfront Development For Malaysia
No ratings yet
Principles For Sustainable Riverfront Development For Malaysia
16 pages
Unit 4 Quantitative Analysis and Interpretation
No ratings yet
Unit 4 Quantitative Analysis and Interpretation
10 pages
BADB1014 Quantitative Methods - Lesson 3
No ratings yet
BADB1014 Quantitative Methods - Lesson 3
23 pages
Chapter IV Data Exploration and Visualization
No ratings yet
Chapter IV Data Exploration and Visualization
3 pages
Session 3 - Organizing Graphing Data - MZS 2020
No ratings yet
Session 3 - Organizing Graphing Data - MZS 2020
30 pages
Psych Stats
No ratings yet
Psych Stats
6 pages
CE 201 - Surveying: Introduction To The Course Introduction To The Course
No ratings yet
CE 201 - Surveying: Introduction To The Course Introduction To The Course
18 pages
Siraj
No ratings yet
Siraj
13 pages
Ortho
No ratings yet
Ortho
6 pages
Missing Items in Lectures
No ratings yet
Missing Items in Lectures
2 pages
Norms and Basic Statistics For Testing
No ratings yet
Norms and Basic Statistics For Testing
4 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Frequency Distrobution & Graphs
No ratings yet
Frequency Distrobution & Graphs
18 pages
Psych Stats Sum
No ratings yet
Psych Stats Sum
10 pages
Sampling Techniques
No ratings yet
Sampling Techniques
1 page
Statistics 2
No ratings yet
Statistics 2
9 pages
Name: Ahmed Abd Almagieed Bujbara Number: 211519205 Group: 1
No ratings yet
Name: Ahmed Abd Almagieed Bujbara Number: 211519205 Group: 1
8 pages
Summary Chapter 2
No ratings yet
Summary Chapter 2
2 pages
A Systematic Review of Research On Flipped Language Classrooms: Theorical Foundations, Learning Activities, Tools, Research Topics and Finding
No ratings yet
A Systematic Review of Research On Flipped Language Classrooms: Theorical Foundations, Learning Activities, Tools, Research Topics and Finding
4 pages
00 Special English - Directional Drilling PDF
No ratings yet
00 Special English - Directional Drilling PDF
12 pages
CT Study Poster PAEA Oct 2018 PDF
No ratings yet
CT Study Poster PAEA Oct 2018 PDF
1 page
Frequency Distribution & Graghs
No ratings yet
Frequency Distribution & Graghs
28 pages
Math 140 Chapter 2 Notes
No ratings yet
Math 140 Chapter 2 Notes
5 pages
2-3 Histograms, Frequency Polygons, and Ogives: Chp.2 Page 1
No ratings yet
2-3 Histograms, Frequency Polygons, and Ogives: Chp.2 Page 1
2 pages
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
From Everand
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
Fouad Sabry
No ratings yet

Stats Lecture-2

Uploaded by

Stats Lecture-2

Uploaded by

Probability and

A histogram is a bar-like graph of a frequency distribution

❶ Make a frequency table (or grouped frequency table).

If you have a nominal variable, the histogram is called a bar chart.

Bar graph for the closest

Question: How many 'peaks' are there in a distribution?

Two frequency distributions that have approximately

Multimodal Distribution Any distribution with two or more high points

Unimodal Bimodal Rectangular 9

In the example of stress levels,

Approximately Positively Skewed Negatively Skewed

For example, a mechanical component cannot have a

Floor effect: Skewed to the right

Scientists define distributions by their peak points, which can

(a) normal, (b) heavy-tailed, and (c)

The most serious discussions about frequency

Of course, people can lie with statistics and

There are two main ways to explain how

Upon closer inspection of the graph, it reveals that the

Figure a shows a bar graph that does not

Figure b, which starts the vertical axis at $0,

You might also like