0% found this document useful (0 votes)

97 views23 pages

BADB1014 Quantitative Methods - Lesson 3

This document discusses descriptive statistics and how to organize and summarize data. It covers constructing frequency distributions to organize quantitative data, calculating measures of central tendency (mean, median, mode) and dispersion (range, variance, standard deviation). It also discusses displaying qualitative data using bar charts, histograms, and pie charts. The goal is to choose an appropriate method to summarize a data set and communicate patterns in the data.

Uploaded by

Prashant

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

97 views23 pages

BADB1014 Quantitative Methods - Lesson 3

Uploaded by

Prashant

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

LESSON 3 TOPIC: Descriptive Statistics • To construct a frequency

• Frequency distribution distribution

• Measures of Central Tendency • To calculate mean, median
Measures of dispersion mode for population and
sample
• To calculate range, variance
and standard deviation for
population & sample

Why This Lesson

To describe situations, draw conclusions, or make inferences about events, one must organise the data in some
meaningful way. The most convenient method of organising data is to construct a frequency distribution. After
organising the data, the researcher must present them so they can be understood by those who will benefit from
reading the study. The most useful method of presenting the data is by constructing statistical charts and graphs.
There are many different types of charts and graphs, and each one has a specific purpose. This lesson shows the
statistical methods that can be used to summarise data. The method is the finding of averages, median, mode,
range, variance and standard deviation will be discussed in this lesson.

Introduction

Statistics
Statistics is the mathematical science that deals with the collection, analysis, and presentation of data, which
can then be used as a basis for inference and induction.

Data
Values assigned to observations or measurements

Information
Data that are transformed into useful facts that can be used for a specific purpose, such as making a decision

The Two Main Types of Data

Data can be classified into two categories, namely qualitative and quantitative

© UNITAR International University 1 Prepared by: Zainora Hayat bin Hudi

Classifying Data by Level of Measurement

Branches of statistics

Descriptive statistics
• collecting, summarising, and displaying data
Inferential statistics
• making claims or conclusions about the data based on a sample

Population and Sample

Population
• represents all possible subjects that are of interest in a particular study
Sample
• refers to a portion of the population that is representative of the
population from which it was selected

Parameter and statistics

• Parameter – a described characteristic about a population

• Statistic – a described characteristic about a sample

© UNITAR International University 2 Prepared by: Zainora Hayat bin Hudi

Inferential Statistics

Making claims about a population by examining sample results

• Example:

Constructing a Frequency Distribution

A frequency distribution shows the number of data observations that fall into specific intervals.
• Graphically summarise information not readily observable by merely looking at data in a table
• A class is a category (row) in a frequency distribution.

Example: Number of iPads sold per day

© UNITAR International University 3 Prepared by: Zainora Hayat bin Hudi

Discrete data are values based on observations that can be counted and are typically represented by whole
numbers.
• Represent something that has been counted
• Take on whole numbers such as 0, 1, 2, 3

Continuous data are values that can take on any real numbers, including numbers that contain decimal points.
• Usually measured rather than counted
• Examples are weight, time, and distance.

Examples of Discrete data

• Number of children per family
• Number of cars listed per insurance policy
• Vacation days per month
Examples of Continuous data
• Time required to read chapter 2
• Thickness of paint applied to a car body
• Voltage of batteries produced in August

Relative frequency distributions display the proportion of observations of each class relative to the total number
of observations.
• Shows the fraction of observations in each class
• Found by dividing each frequency by the total number of observations
• The fractions in a relative frequency distribution add up to 1.00.

Example:

© UNITAR International University 4 Prepared by: Zainora Hayat bin Hudi

Cumulative Relative Frequency Distributions
A cumulative relative frequency distribution totals the proportion of observations that are less than or equal to
the class at which you are looking.
• Shows the accumulated proportion as values vary from low to high

Using a Histogram to Graph a Frequency Distribution

A histogram is a graph showing the number of observations in each class of a frequency distribution.

The Shape of Histograms

© UNITAR International University 5 Prepared by: Zainora Hayat bin Hudi

Constructing a Frequency Distribution Using Grouped Quantitative Data

Ideally, the number of classes in a frequency distribution should be between 4 and 20.
• Some data sets, particularly those with continuous data, require several values to be grouped together
in a single class.
• This grouping prevents having too many classes in the frequency distribution, which can make it difficult
to detect patterns.

Number of Classes
One method to determine the number of classes in a frequency distribution is the rule
2k ≥ n
where k = Number of classes
n = Number of data points
• Find the lowest value of k that satisfies the rule.
Suppose n = 50
25 = 32 < 50 (k = 5 is too small.)
26 = 64 > 50 (k = 6 is a good choice.)

Class Width

Once k is known, the width of each class can be found.

• The width is the range of numbers to put into each class.

© UNITAR International University 6 Prepared by: Zainora Hayat bin Hudi

• Round this estimation to a useful whole number that makes the frequency distribution more readable.

There is no one correct answer for the class width.

• The goal is to create a histogram to clearly and usefully show the pattern in the data.
• Often there is more than one acceptable way to accomplish this.

Class Boundaries

Class boundaries represent the minimum and maximum values for each class.
• Choose class boundaries that are easy to read.
☺🗹 ☹🗷
3 to less than 6 minutes 3.21 to less than 6.21 minutes
6 to less than 9 minutes vs. 6.21 to less than 9.21 minutes
9 to less than 12 minutes 9.21 to less than 12.21 minutes

Class Frequencies

Find class frequencies by counting and recording the number of observations in each class.
• This is easier when the data are sorted.
Example:

Rules for Classes for Grouped Data

1. Equal-size classes. All classes in the frequency distribution must be of equal width.
2. Mutually exclusive classes. Class boundaries cannot overlap.
3. Include all data values. Make sure all data values are accounted for in the total row of the frequency
distribution.

© UNITAR International University 7 Prepared by: Zainora Hayat bin Hudi

4. Avoid empty classes. It is undesirable for a histogram to display a class so narrow that there are no
observations in it.
5. Avoid open-ended classes (if possible). These violate the first rule of equal class sizes.

The Consequences of Too Few or Too Many Classes

Wide classes result in few class intervals:

• Can obscure important patterns
• Gives a “blocky” distribution graph
• Summarizes the data too much
• Tells us little about the true distribution shape

Too many narrow classes has consequences:

• Results in a “jagged” histogram
• Some classes may be empty
• Does not summarize the data enough

The Ogive
The ogive is a line graph that plots the cumulative relative frequency distribution.
It provides a simple representation of the frequencies that are less than or equal to a certain number.

© UNITAR International University 8 Prepared by: Zainora Hayat bin Hudi

Displaying Qualitative Data

Qualitative data are values that are categorical.

• Can be nominal or ordinal measurement level
• Describe a characteristic, such as gender or level of education
Frequency distributions help display qualitative data by indicating the number of occurrences of various
categories.

Bar Charts

Bar charts are a good tool for displaying qualitative data that have been organised in categories.

Vertical Bar Chart Horizontal bar chart

Pareto Charts

Pareto charts are bar charts that show the frequency of the categories that cause quality control problems.
Show quality problem categories in decreasing order
• The most problematic categories are shown first
Pareto charts also plot the cumulative relative frequency as a line on the chart known as an ogive.

© UNITAR International University 9 Prepared by: Zainora Hayat bin Hudi

Pie Charts
Pie charts are another excellent tool for comparing proportions for categorical data.
Each segment of the pie represents the relative frequency of one category.
• All categories in the data set must be included in the pie.
• Use a pie chart to compare the relative sizes of all possible categories.
• Bar charts are more useful when you want to highlight the actual data values and when the classes
combined don’t form a whole.

Stem and Leaf Display

A stem and leaf display splits the data values into stems (the larger place values) and leaves (the smaller place
value).

© UNITAR International University 10 Prepared by: Zainora Hayat bin Hudi

By listing all of the leaves to the right of each stem, we can graphically describe how the data are distributed.
• All the original data points are visible on the display
• Easy to construct by hand
• Provides a histogram-like view of the distribution

For this example, use the 10’s digit as the stem

Use the 1’s digit as the leaf

1. Sort the data from lowest to highest.

2. Determine the unique stem values.
7, 8, 9 are the different stem values in this example.
3. List the stems in a vertical column and then add the leaf values to the right of the appropriate stem, in
ascending order.

7|8 8 9 9 9
8|0 0 0 0 1 1 2 3 3 4 4 4 5 6 7 8
9|0 2 5

To get more detail the stems can be split in half

7(5) | 8 8 9 9 9
8(0) | 0 0 0 0 1 1 2 3 3 4 4 4
8(5) | 5 6 7 8
9(0) | 0 2
9(5) | 5

• The stem labeled 7(5) stores all the scores between 75 and 79.
• The stem 8(0) stores all the scores between 80 and 84.

Measures of Central Tendency

Central tendency is a single value used to describe the center point of a data set.

© UNITAR International University 11 Prepared by: Zainora Hayat bin Hudi

The Mean

The mean, or average, is the most common measure of central tendency.

• Calculate the mean by adding all the values in a data set and then dividing the result by the number of
observations.

Formula for the Sample Mean:

Formula for the Population Mean:

Example: suppose a sample of size n = 5 gives the following values:

6.2 7.1 4.8 9.0 3.3
The sample mean:

© UNITAR International University 12 Prepared by: Zainora Hayat bin Hudi

Advantages and Disadvantages of Using the Mean to Summarise Data

Advantages:
• Simple to calculate
• Summarizes the data with a single value
Disadvantages:
• With only a summary value you lose information about the original data.
• Sample 1 with n = 3: 999, 1000, 1001 𝑥̅ = 1000
• Sample 2 with n = 3: 0, 1000, 2000 𝑥̅ = 1000
• Just knowing the mean does not help you know what the underlying data looks
like.
• The value of the mean is sensitive to outliers (values that are much higher or lower than most of
the data).

The Median
The median is the value in the data set for which half the observations are higher and half the observations are
lower.
• First arrange the data in ascending order.

Example with sample of size n = 7:

21 27 27 28 34 45 50
The median value is, therefore, in the fourth position of our sorted data.
21 27 27 28 34 45 50

The median is not sensitive to outliers.

21 27 27 28 34 45 5000
• The median is still 28.
When there are odd numbers of data values, the median is always the middle value in the data set.

When there are even numbers of data values, the median is halfway between the two middle values.
Example with sample of size n = 6:
145 157 170 182 204 209

The Mode
The mode is the value that appears most often in a data set.
• If no data value or category repeats more than once, then we say that the mode does not exist.
• More than one mode can exist if two or more values tie for the most frequent.
The mode is a particularly useful way to describe categorical data.

Example with numerical data:

© UNITAR International University 13 Prepared by: Zainora Hayat bin Hudi

• Number of children per family in a sample of 24 families:
0,0,0,0,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,4,5
Number
of children Frequency
The value that appears most often is 2
0 4 (occurs 8 times), so the mode = 2
1 5 children.
2 8
3 4
4 2
5 1

Example with categorical data:

• The car that appears most often is Toyota (occurs 7 times), so the mode is the Toyota model.

Example:
Prices for 5 homes have been collected

House Prices:

Sum 3,000,000
Which Measure of Central Tendency Should You Use?

The mean is generally used as it is relatively easy to determine and most widely understood by people with little
statistical training.
If outliers are present, the median is often used, since the median is not sensitive to outliers
• For example, median home prices may be reported for a region; it is less sensitive to outliers.
For categorical data, the mode is the only choice

Mesures of Variability

Measures of variability show how much spread is present in the data.

The Range

Simplest measure of variation

Difference between the highest value and the lowest value in a data set

Advantages:

• Easy to calculate and understand
Disadvantages:
• Only based on two numbers in the data set
(Ignores the way in which data are distributed)
• Sensitive to outliers

Example:

The Variance and Standard Deviation

The Standard Deviation

The standard deviation is the square root of the variance.

• Has the same units as the original data
Sample standard deviation formula:

Calculating the Sample Standard Deviation

Short-Cut Formulas for the Sample Variance and Standard Deviation

Equivalent, but easier for hand calculations

The Variance and Standard Deviation for a Population

Used when the data set represents an entire population rather than a sample from a population

Short-Cut Formulas for the Population Variance and Standard Deviation

Example calculation using short-cut formula:

The standard deviation is a common measure of consistency in business applications, such as quality
control.
• The standard deviation measures the amount of variability around the mean.
The standard deviation is affected by the scale of the data.
• When sample means are very different, comparing standard deviations can be misleading.

The Coefficient of Variation

The coefficient of variation, CV, measures the standard deviation in terms of its percentage of the mean.
• A high CV indicates high variability relative to the size of the mean.
• A low CV indicates low variability relative to the size of the mean.
A smaller coefficient of variation indicates more consistency within a set of data values.

Example:

Working with Grouped Data

Suppose data has already been summarised by a frequency distribution.

• The individual data values are no longer shown.
• Only grouped data is available.
To estimate the average for the frequency distribution:
• Find the midpoint for each group.
(The midpoint is the halfway point in each group.)
• Use the midpoint as a representative value for that group.

Example: The Mean of Grouped Data
Example An online merchant has collected the following grouped data for the number of web pages viewed
by a sample of its customers:

Number of pages Frequency

1 to under 5 6

5 to under 9 12

9 to under 13 10

13 to under 17 4

The merchant would like to calculate the average number of viewed pages.

1. Find the midpoint of each class

Midpoint
Number of pages Frequency
(mi)

1 to under 5 3 6

5 to under 9 7 12

9 to under 13 11 10

13 to under 17 16 4

2. Calculate the mean

The average number of viewed pages is about 8.5.

The Variance and Standard Deviation of Grouped Data

- end of content –

Graduate Studies and Applied Research: College of Teacher Education
100% (2)
Graduate Studies and Applied Research: College of Teacher Education
31 pages
Bangladesh Map
No ratings yet
Bangladesh Map
1 page
Characteristics of Capital Market
No ratings yet
Characteristics of Capital Market
12 pages
CN Fin Mkts Syllabus BUSF-SHU 286 - Spring 2023
No ratings yet
CN Fin Mkts Syllabus BUSF-SHU 286 - Spring 2023
10 pages
Organizational Behaviour PPT-2
No ratings yet
Organizational Behaviour PPT-2
22 pages
9.15.16 Thermop Independent Record
No ratings yet
9.15.16 Thermop Independent Record
14 pages
Aib Journal Ijarbn2020
No ratings yet
Aib Journal Ijarbn2020
56 pages
National Horticulture Board: Year 2009-10
No ratings yet
National Horticulture Board: Year 2009-10
231 pages
Psych Assessment Practice Test
100% (2)
Psych Assessment Practice Test
24 pages
Best Ringtones Net - Ringtone Download - Best Ringtone Download MP3
No ratings yet
Best Ringtones Net - Ringtone Download - Best Ringtone Download MP3
182 pages
CAMS Initial (Kotak) 2
No ratings yet
CAMS Initial (Kotak) 2
50 pages
Ucsp - Module2 (Week3-4)
No ratings yet
Ucsp - Module2 (Week3-4)
12 pages
Motor Insurance - Two Wheeler Policy - Bundled
No ratings yet
Motor Insurance - Two Wheeler Policy - Bundled
3 pages
Classical Compartmental Modeling
No ratings yet
Classical Compartmental Modeling
33 pages
анг 10класс олимпиада
No ratings yet
анг 10класс олимпиада
8 pages
Mercantilist and Physiocrats
No ratings yet
Mercantilist and Physiocrats
10 pages
Ofi Factsheet
100% (1)
Ofi Factsheet
2 pages
Vut Pqec It
No ratings yet
Vut Pqec It
4,185 pages
CH 2
No ratings yet
CH 2
6 pages
فهم الامراض النفسيه
No ratings yet
فهم الامراض النفسيه
353 pages
Traditional Learning Vs Digital Learning: A Paradigm Shift.: Abstract
No ratings yet
Traditional Learning Vs Digital Learning: A Paradigm Shift.: Abstract
17 pages
Sample Paper PVTC: Building Standards in Educational and Professional Testing
No ratings yet
Sample Paper PVTC: Building Standards in Educational and Professional Testing
6 pages
Adv Unit9 ExtraPractice
No ratings yet
Adv Unit9 ExtraPractice
2 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
3 pages
Code, Standard, Specification and Procedure
No ratings yet
Code, Standard, Specification and Procedure
13 pages
Pre-Exam Notes - AI
No ratings yet
Pre-Exam Notes - AI
84 pages
The Development of The Self-Report Measure Instrument For Measuring M-TPACK
No ratings yet
The Development of The Self-Report Measure Instrument For Measuring M-TPACK
9 pages
Coaching Classroom Instruction
No ratings yet
Coaching Classroom Instruction
2 pages
Bravo Application Detail
No ratings yet
Bravo Application Detail
2 pages
Main
No ratings yet
Main
35 pages
A New Strategic Plan
No ratings yet
A New Strategic Plan
9 pages
AP 5 Integumentary
No ratings yet
AP 5 Integumentary
6 pages
12th NCEQ - Semifinals PDF
No ratings yet
12th NCEQ - Semifinals PDF
2 pages
Ummy Video Downloader User Manual English
No ratings yet
Ummy Video Downloader User Manual English
5 pages
AbbVie Builds A Global Systems Infrastructure
No ratings yet
AbbVie Builds A Global Systems Infrastructure
3 pages
Eq 140802022456 Phpapp01
No ratings yet
Eq 140802022456 Phpapp01
41 pages
Taxi Data Analysis Using K-Mean Clustering Algorithm
No ratings yet
Taxi Data Analysis Using K-Mean Clustering Algorithm
6 pages
Roadmap Interactive
No ratings yet
Roadmap Interactive
64 pages
Medel Sonata
No ratings yet
Medel Sonata
54 pages
ET Prime - HDFC Bank HDFC Merger - The HDFC LTD and HDFC Bank Merger - What Does It Mean For Investors - The Economic Times
No ratings yet
ET Prime - HDFC Bank HDFC Merger - The HDFC LTD and HDFC Bank Merger - What Does It Mean For Investors - The Economic Times
8 pages
Candidate Profile - Apprenticeship Training Portal
No ratings yet
Candidate Profile - Apprenticeship Training Portal
2 pages
Spoken Book PDF
No ratings yet
Spoken Book PDF
22 pages
Sathi A Das 2003
No ratings yet
Sathi A Das 2003
10 pages
Strategic Management
No ratings yet
Strategic Management
6 pages
Academy of Management The Academy of Management Review
No ratings yet
Academy of Management The Academy of Management Review
22 pages
Mamba-2 - Installation Manual
No ratings yet
Mamba-2 - Installation Manual
20 pages
Report 3
No ratings yet
Report 3
6 pages
HSL - End of The Day Summary 15072022-202207151712151972854
No ratings yet
HSL - End of The Day Summary 15072022-202207151712151972854
4 pages
570 ASM2 NguyenDangQuang GBS0909A
No ratings yet
570 ASM2 NguyenDangQuang GBS0909A
34 pages
Endemic Animals Info
No ratings yet
Endemic Animals Info
3 pages
Automatic Arabic Number Plate Recognition
No ratings yet
Automatic Arabic Number Plate Recognition
7 pages
NNSA Strategy
No ratings yet
NNSA Strategy
35 pages
Discover - . Empower: Learn
No ratings yet
Discover - . Empower: Learn
41 pages
Atos 2019 Financial Report PDF
No ratings yet
Atos 2019 Financial Report PDF
106 pages
IBPS PO Mains Memory Based Paper 4th Feb 2021 English
No ratings yet
IBPS PO Mains Memory Based Paper 4th Feb 2021 English
47 pages
Strategic Marketing Assignment
No ratings yet
Strategic Marketing Assignment
7 pages
Case Study Analysis
No ratings yet
Case Study Analysis
15 pages
1 50
No ratings yet
1 50
50 pages
UTILTS User Guide
No ratings yet
UTILTS User Guide
137 pages
Strategies For Leveraging Master Brands
No ratings yet
Strategies For Leveraging Master Brands
10 pages
Weka Book Questions
0% (1)
Weka Book Questions
2 pages
Operating Instructions: CD Stereo System SC-MAX370
No ratings yet
Operating Instructions: CD Stereo System SC-MAX370
20 pages
Employees Are Satisfied With Their Benefits, But So What? The Consequences of Benefit Satisfaction On Employees' Organizational Commitment and Turnover Intentions
No ratings yet
Employees Are Satisfied With Their Benefits, But So What? The Consequences of Benefit Satisfaction On Employees' Organizational Commitment and Turnover Intentions
25 pages
8th Quarterly Progress Report of JEEViKA
No ratings yet
8th Quarterly Progress Report of JEEViKA
19 pages
Theory Mcqs
No ratings yet
Theory Mcqs
102 pages
Topic 3
No ratings yet
Topic 3
22 pages
Benefits of Social Networking Sites On Students Education
No ratings yet
Benefits of Social Networking Sites On Students Education
6 pages
(WWW - Entrance Exam - Net) AFCAT PAPER 1
No ratings yet
(WWW - Entrance Exam - Net) AFCAT PAPER 1
15 pages
2. presenting of data - ١١١٠٥٩
No ratings yet
2. presenting of data - ١١١٠٥٩
39 pages
Boro Line
No ratings yet
Boro Line
75 pages
BA V Semester 2018-19 & Onwards PDF
No ratings yet
BA V Semester 2018-19 & Onwards PDF
125 pages
1st Mid
No ratings yet
1st Mid
19 pages
COL Project Manual Final 5598
100% (1)
COL Project Manual Final 5598
59 pages
Unit 6: Data Collection and Analysis: Lectured by Dr. Gautam Maharjan
No ratings yet
Unit 6: Data Collection and Analysis: Lectured by Dr. Gautam Maharjan
17 pages
Action Research Report - Converting The Lack of Interest of Students Through Captivating Classroom Strategies
No ratings yet
Action Research Report - Converting The Lack of Interest of Students Through Captivating Classroom Strategies
44 pages
VXMLRef 007-02542-0025 R4.21 v01
No ratings yet
VXMLRef 007-02542-0025 R4.21 v01
232 pages
Emotional Factors That Effect Managerial Decision Making in Organizations
No ratings yet
Emotional Factors That Effect Managerial Decision Making in Organizations
7 pages
Gerdes Segal Lietz 2010
No ratings yet
Gerdes Segal Lietz 2010
19 pages
Grade of Swallowing Toxicity For FEES
No ratings yet
Grade of Swallowing Toxicity For FEES
9 pages
IT Growth and Global Change: A Conversation With Ray Kurzweil
No ratings yet
IT Growth and Global Change: A Conversation With Ray Kurzweil
6 pages
An Assesment On The Impact of Converting Used Paper Into Flower Vase For Grade 12 Abm Students at Bestlink College of The Philippines S.Y. 2019-2020
No ratings yet
An Assesment On The Impact of Converting Used Paper Into Flower Vase For Grade 12 Abm Students at Bestlink College of The Philippines S.Y. 2019-2020
59 pages
Research Goals and Types of Research Designs
No ratings yet
Research Goals and Types of Research Designs
10 pages
Fairhurst 1989
No ratings yet
Fairhurst 1989
26 pages
Social and Emotional Loneliness and Self-Reported Difficulty Initiating and Maintaining Sleep (DIMS) in A Sample of Norwegian University Students
No ratings yet
Social and Emotional Loneliness and Self-Reported Difficulty Initiating and Maintaining Sleep (DIMS) in A Sample of Norwegian University Students
9 pages
EJC 9758 2023 Prelim P2
No ratings yet
EJC 9758 2023 Prelim P2
6 pages
Gee Fe Chapter7
No ratings yet
Gee Fe Chapter7
4 pages
44 Thieves
No ratings yet
44 Thieves
3 pages
Business Project - Assignment 2
No ratings yet
Business Project - Assignment 2
34 pages
Matida Statistics
No ratings yet
Matida Statistics
3 pages
Part-A Assignment No. 3
No ratings yet
Part-A Assignment No. 3
2 pages

BADB1014 Quantitative Methods - Lesson 3

Uploaded by

BADB1014 Quantitative Methods - Lesson 3

Uploaded by

LESSON 3 TOPIC: Descriptive Statistics • To construct a frequency

• Frequency distribution distribution

Why This Lesson

The Two Main Types of Data

© UNITAR International University 1 Prepared by: Zainora Hayat bin Hudi

Population and Sample

Parameter and statistics

• Parameter – a described characteristic about a population

© UNITAR International University 2 Prepared by: Zainora Hayat bin Hudi

Making claims about a population by examining sample results

Constructing a Frequency Distribution

Example: Number of iPads sold per day

© UNITAR International University 3 Prepared by: Zainora Hayat bin Hudi

Examples of Discrete data

© UNITAR International University 4 Prepared by: Zainora Hayat bin Hudi

Using a Histogram to Graph a Frequency Distribution

The Shape of Histograms

© UNITAR International University 5 Prepared by: Zainora Hayat bin Hudi

Once k is known, the width of each class can be found.

© UNITAR International University 6 Prepared by: Zainora Hayat bin Hudi

There is no one correct answer for the class width.

Rules for Classes for Grouped Data

© UNITAR International University 7 Prepared by: Zainora Hayat bin Hudi

The Consequences of Too Few or Too Many Classes

Wide classes result in few class intervals:

Too many narrow classes has consequences:

© UNITAR International University 8 Prepared by: Zainora Hayat bin Hudi

Qualitative data are values that are categorical.

Vertical Bar Chart Horizontal bar chart

© UNITAR International University 9 Prepared by: Zainora Hayat bin Hudi

Stem and Leaf Display

© UNITAR International University 10 Prepared by: Zainora Hayat bin Hudi

For this example, use the 10’s digit as the stem

1. Sort the data from lowest to highest.

To get more detail the stems can be split in half

Measures of Central Tendency

Measures of Central Tendency

© UNITAR International University 11 Prepared by: Zainora Hayat bin Hudi

The mean, or average, is the most common measure of central tendency.

Formula for the Sample Mean:

Formula for the Population Mean:

Example: suppose a sample of size n = 5 gives the following values:

© UNITAR International University 12 Prepared by: Zainora Hayat bin Hudi

Example with sample of size n = 7:

The median is not sensitive to outliers.

Example with numerical data:

© UNITAR International University 13 Prepared by: Zainora Hayat bin Hudi

Example with categorical data:

© UNITAR International University 15 Prepared by: Zainora Hayat bin Hudi

Measures of variability show how much spread is present in the data.

Simplest measure of variation

© UNITAR International University 16 Prepared by: Zainora Hayat bin Hudi

The Variance and Standard Deviation

© UNITAR International University 17 Prepared by: Zainora Hayat bin Hudi

The standard deviation is the square root of the variance.

Calculating the Sample Standard Deviation

© UNITAR International University 18 Prepared by: Zainora Hayat bin Hudi

Equivalent, but easier for hand calculations

The Variance and Standard Deviation for a Population

© UNITAR International University 19 Prepared by: Zainora Hayat bin Hudi

Example calculation using short-cut formula:

The Coefficient of Variation

© UNITAR International University 20 Prepared by: Zainora Hayat bin Hudi

Working with Grouped Data

Suppose data has already been summarised by a frequency distribution.

© UNITAR International University 21 Prepared by: Zainora Hayat bin Hudi

Number of pages Frequency

1. Find the midpoint of each class

2. Calculate the mean

The average number of viewed pages is about 8.5.

© UNITAR International University 22 Prepared by: Zainora Hayat bin Hudi

© UNITAR International University 23 Prepared by: Zainora Hayat bin Hudi

You might also like