Open navigation menu

Scribd

0% found this document useful (0 votes)

20 views13 pages

Freq. Distribution Characteristics

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views13 pages

Freq. Distribution Characteristics

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Characteristics of Frequency Distribution — Descriptive Statistics

Frequency Distribution — Frequency distribution in statistics, a list,

table, graph or data set organized to show the frequency of occurrence

of each possible outcome of a repeatable event observed many times.

For example, in the following list of numbers(1, 2, 3, 4, 6, 9, 9, 8, 5, 1,

1, 9, 9, 0, 6, 9). The frequency of the number 9 is 5 (because it occurs

5 times).

Frequency Distributions are classified into four types

Modality

Symmetry

Measure of Central Tendency

Measure of Dispersion or Variability

Modality
Modality — The modality of a distribution is determined by the number

of peaks it contains.

Types of Modality: Unimodal, Bimodal, Multimodal.

Types of Modality

Unimodal — A unimodal distribution has one values that occur

frequently (one peak)

Bimodal — A bimodal distribution has two values that occur frequently

(two peaks) and

Multimodal — A multimodal has two or several frequently occurring

values (more than two peaks)

ymmetry

Symmetry — Symmetry means that one half of the distribution is a

mirror image of the other half of the image.

Types of Symmetry: Symmetric, Asymmetric

Normal Curve(Symmetric) & Positive/Negative Skew(Asymmetric)

Symmetric — The normal distribution is a symmetric

distribution with no skew. The tails are exactly the same. A normal bell

curve equal on both sides.

Normal Bell Curve (Symmetric)

Asymmetric —Asymmetry is the absence of, or a violation of, symmetry.

which is not identical on both sides of a central line.

Types of Asymmetric: Positive Skewness, Negative Skewness

Positive Skewness — Positive Skewness is when the tail on the right

side of the distribution is longer or fatter than the tail on the left side. The

mean and median will be greater than the mode.

Negative Skewness — Negative Skewness is when the tail of the left

side of the distribution is longer or fatter than the tail on the right side.

The mean and median will be less than the mode.

Negative & Positive Skewness(Asymmetric)

Measure of Central Tendency

A measure of central tendency is a single value that attempts to

describe a set of data by identifying the central position within that set
of data. As such, measures of central tendency are sometimes

called measures of central location. They are also classed as summary

statistics.

In other words, central tendency computes the “center” around which

the data is distributed.

The mean, median and mode are all valid measures of central

tendency

Mean — (Average Value)

Mode — (Middle Value)

Median — (Value occurs maximum no of times)

Mean
The mean is equal to the sum of all the values in the data set divided by

the number of values in the data set.

For Example, We have 10 random numbers like (1,5,2,8,4,55,9,7,3,6) and

we need to add all the 10 numbers

sum of all 10 numbers →1+5+2+8+4+55+9+7+3+6 = 100

Mean

So, if we have n values in a data set and they have values x1, x2, …, xn,

the sample mean, usually denoted by x¯ (pronounced “x bar”), is:

Sample Mean

This formula is usually written in a slightly different manner using the

Greek capital letter, ∑, pronounced “sigma”, which means “sum

of…”:

Sample Mean Formula

You may have noticed that the above formula refers to the sample mean.

So, why have we called it a sample mean?

Please take a look on my previous post Population and Sample to get

better understanding in sample and population.

This is because, in statistics, samples and populations have very

different meanings and these differences are very important, even if, in

the case of the mean, they are calculated in the same way. To

acknowledge that we are calculating the population mean and not the

sample mean, we use the Greek lower case letter “mu”, denoted as μ:

Population Mean Formula

Disadvantages of Mean:

Let us take the above example for summarizing

We have 10 random numbers like (1,5,2,8,4,55,9,7,3,6)

Let us assume this 10 random numbers as 10 employee salary in

thousands

(1k,5k,2k,8k,4k,55k,9k,7k,3k,6k)

Outlier — Outliers are data points that are far from other data points.

In other words, they’re unusual values in a dataset.

So here one Employee has large amount of salary = 55k, So this

value is far from other data points and it affects the whole data, so it is

called as the outlier data

Note: Mean is highly affected by the outliers. The mean is

being skewed by the two large salaries. Therefore, in this situation, we

would like to have a better measure of central tendency. As we will find

out later, taking the median would be a better measure of central

tendency in this situation.

Median

The median is a simple measure of central tendency. To find the median,

we arrange the observations in order from smallest to largest value. If

there is an odd number of observations, the median is the middle value.

If there is an even number of observations, the median is the average of

the two middle values.

Simple way to remember: Middle Value is called Median

If data count is in odd:

1,7,6, 9, 8, 2, 3, 5,4 → Arrange it is ascending order

1,2,3,4,5,6,7,8,9 → Total Count = 9 (odd number)

Middle Value is the Median → Median = 5

If data count is in even:

1,7,6, 9, 8, 2, 3, 5,4,10 → Arrange it is ascending order

1,2,3,4,5,6,7,8,9,10 → Total Count = 10 (odd number)

Middle 2 Values is the Median → Average the 2 numbers to get the

median

Median Formula

Median = 5.5

Mode

·Mode is the number which appears most often in a set of number and

Mode is used for categorical data where we wish to know which is the

most common category.

Example: in {5, 4, 6, 5, 9, 5, 7, 3} the Mode is 5 (it occurs most often)

problem with the mode is that it will not provide us with a very good

measure of central tendency when the most common mark is far away

from the rest of the data in the data set

Note: To use the mode to describe the central tendency of this

data set would be misleading

Measure of Dispersion or Variability

Measures of dispersion describe the spread of the data. They include

the range, interquartile range, standard deviation and variance. The

range is given as the smallest and largest observations. This is the

simplest measure of variability.

Variability is also referred to as spread, scatter or dispersion. It is most

commonly measured with the following: Range — the difference

between the highest and lowest values.

Variability refers to how spread out a group of data is. The common
measures of variability are the range, IQR, variance, and standard

deviation.

Measures of variability or dispersion are descriptive statistics that

can only be used to describe the data in a given data set or study.
Range
Variance
Standard Deviation
Inter Quartile Range (IQR)

Range

The range is the difference between the lowest and highest values.

Range = Maximum Value — Minimum Value (Max — Min)

Example: In {2,4, 6, 9, 3, 7,10}, order in ascending order

lowest value is 2, and the highest is 10

Range = 10–2 = 8

Variance

The variance measures the average degree to which each point differs

from the mean. The average of all data points.

Variance measures variability from the average or mean. Therefore,

the variance statistic can help determine the risk an investor assumes

when purchasing a specific security. A large variance indicates that

numbers in the set are far from the mean and from each other, while a

small variance indicates the opposite

Unlike range and quartiles, the variance combines all the values in a

data set to produce a measure of spread. … It is calculated as the average

squared deviation of each number from the mean of a data set.

For example, for the numbers 1, 2, and 3 the mean is 2 and

the variance is 0.667

Variance Formula

Standard Deviation

The standard deviation is a statistic that measures the dispersion of a

dataset relative to its mean and is calculated as the square root of the

variance. If the data points are further from the mean, there is a

higher deviation within the data set; thus, the more spread out the data,

the higher the standard deviation.

Standard Deviation Formula

Inter Quartile Range (IQR)

Before going to IQR, let us know about the Quartile and Percentile

Percentile — Nth Percentile states that at least Nth % of values less

than or equal to this value and (100-N) is greater than equal to this value.

percentile simply states Nth Percentile of people are below me

Percentile Formula

Quartile — In statistics, a quartile is a type of quantile which divides the

number of data points into four parts, or quarters, of more-or-less equal

size. The data must be ordered from smallest to largest to compute

quartiles; as such, quartiles are a form of order statistic.

Dividing data in to ¼ parts

Q1 →1st Quartile — 25th Percentile

Q2 → 2nd Quartile — 50th Percentile

Q3 → 3rd Quartile — 75th Percentile

Inter Quartile Range — The IQR describes the middle 50% of values

when ordered from lowest to highest. To find the interquartile

range (IQR), first find the median (middle value) of the lower and upper

half of the data. These values are quartile 1 (Q1) and quartile 3 (Q3).

IQR = Q3-Q1

You might also like

FINAL ASSESSMENT ASSIGNMENT MBA Stats - Maths 28062020 105406pm PDF
No ratings yet
FINAL ASSESSMENT ASSIGNMENT MBA Stats - Maths 28062020 105406pm PDF
3 pages
Probabilistic Reliability Engineering (PDFDrive)
No ratings yet
Probabilistic Reliability Engineering (PDFDrive)
536 pages
What Are The Characteristics of A Good Research Design
50% (2)
What Are The Characteristics of A Good Research Design
1 page
Sobol The Monte Carlo Method LML PDF
No ratings yet
Sobol The Monte Carlo Method LML PDF
81 pages
Answer Chap 08 PDF
No ratings yet
Answer Chap 08 PDF
10 pages
Generalized Kappa
No ratings yet
Generalized Kappa
11 pages
Business Statistics by Gupta 365 379
No ratings yet
Business Statistics by Gupta 365 379
15 pages
Solution To DMOP Make Up Exam 2016
No ratings yet
Solution To DMOP Make Up Exam 2016
5 pages
ch04 Sampling Distributions
No ratings yet
ch04 Sampling Distributions
60 pages
Chapter 5. Regression Models: 1 A Simple Model
No ratings yet
Chapter 5. Regression Models: 1 A Simple Model
49 pages
PTSP Notes Final
No ratings yet
PTSP Notes Final
57 pages
Chapter12 Datahandling
No ratings yet
Chapter12 Datahandling
42 pages
UKP6053 L3 Descriptive Statsitcs
100% (1)
UKP6053 L3 Descriptive Statsitcs
92 pages
Ch3 Numerically Summarizing Data
No ratings yet
Ch3 Numerically Summarizing Data
35 pages
Statistics For Health Research: Non-Parametric Methods
No ratings yet
Statistics For Health Research: Non-Parametric Methods
56 pages
Lectorial Week 6b NEW
No ratings yet
Lectorial Week 6b NEW
16 pages
Statistics Unit 6 Notes
No ratings yet
Statistics Unit 6 Notes
10 pages
Discrete Random Variables
No ratings yet
Discrete Random Variables
15 pages
Statical Data 1
No ratings yet
Statical Data 1
32 pages
Chapter 10 Test Questions
No ratings yet
Chapter 10 Test Questions
3 pages
Classification L12
No ratings yet
Classification L12
20 pages
Data Description Analysis
No ratings yet
Data Description Analysis
40 pages
Math2101Stat 2 2
No ratings yet
Math2101Stat 2 2
23 pages
Anova (Analysis of Variance) : Test Statistic For ANOVA
No ratings yet
Anova (Analysis of Variance) : Test Statistic For ANOVA
2 pages
Learning Sheet No. 8
No ratings yet
Learning Sheet No. 8
4 pages
DS Module 2
No ratings yet
DS Module 2
113 pages
Descriptive Statsistics
No ratings yet
Descriptive Statsistics
34 pages
Dsbda Unit 2
No ratings yet
Dsbda Unit 2
155 pages
2 - Central Tendency and Dispersion - SFB
No ratings yet
2 - Central Tendency and Dispersion - SFB
69 pages
Introduction To Descriptive Statistics
No ratings yet
Introduction To Descriptive Statistics
73 pages
Bus Stats Ch14 PDF
No ratings yet
Bus Stats Ch14 PDF
77 pages
S2 Big Data Week 5 Quiz - Attempt Review
No ratings yet
S2 Big Data Week 5 Quiz - Attempt Review
9 pages
Unit 3
No ratings yet
Unit 3
42 pages
Fitting With R
No ratings yet
Fitting With R
22 pages
Descriptive Statistics PDF
100% (1)
Descriptive Statistics PDF
40 pages
Topic 3
No ratings yet
Topic 3
49 pages
Profed 10
No ratings yet
Profed 10
4 pages
Chapter 3 - Numerical Technique - Send
No ratings yet
Chapter 3 - Numerical Technique - Send
49 pages
Introduction To Statistics PDF
No ratings yet
Introduction To Statistics PDF
32 pages
f592b059 1643454320549
No ratings yet
f592b059 1643454320549
39 pages
103 Sept 2002 Solution
No ratings yet
103 Sept 2002 Solution
12 pages
Bandit
No ratings yet
Bandit
8 pages
Descriptive Stat
No ratings yet
Descriptive Stat
13 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
83 pages
Measure of Central Tendency Dispersion A
No ratings yet
Measure of Central Tendency Dispersion A
8 pages
Unit-3 DS Students
No ratings yet
Unit-3 DS Students
35 pages
SSC CGL Tier 2 Statistics - Last Minute Study Notes: Measures of Central Tendency
No ratings yet
SSC CGL Tier 2 Statistics - Last Minute Study Notes: Measures of Central Tendency
10 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
Jerome Statistics
No ratings yet
Jerome Statistics
12 pages
Basic Statistics
No ratings yet
Basic Statistics
24 pages
Ids Unit 2 Notes Ckm-1
No ratings yet
Ids Unit 2 Notes Ckm-1
30 pages
Unit 6 Interpreting Evaluation Results
No ratings yet
Unit 6 Interpreting Evaluation Results
54 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
Probability and Statistics Lecture Notes
100% (1)
Probability and Statistics Lecture Notes
9 pages
2 - Introduction To Statistics
No ratings yet
2 - Introduction To Statistics
97 pages
Julian Requiestas - 1. Introduction To Statistics Levels of Measurement and The Grid
No ratings yet
Julian Requiestas - 1. Introduction To Statistics Levels of Measurement and The Grid
38 pages
Measures of Central Tendency
100% (1)
Measures of Central Tendency
48 pages
Descreptive Statistics 1
No ratings yet
Descreptive Statistics 1
74 pages
Statistics ClassNotes - 2
No ratings yet
Statistics ClassNotes - 2
10 pages
المحاضرة رقم 3
No ratings yet
المحاضرة رقم 3
44 pages
Business Statistics - KMBN104
No ratings yet
Business Statistics - KMBN104
25 pages
Lesson 3.2 Measures of Central Tendency Position and Variation
No ratings yet
Lesson 3.2 Measures of Central Tendency Position and Variation
62 pages
Describing Data - Numerical Measure
No ratings yet
Describing Data - Numerical Measure
33 pages
Unit - 2 Biostatistics
No ratings yet
Unit - 2 Biostatistics
9 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
Stats Prac 1
No ratings yet
Stats Prac 1
10 pages
Line of Regression Part 1
No ratings yet
Line of Regression Part 1
27 pages
Slides For IT SKill
No ratings yet
Slides For IT SKill
63 pages
Milestone FMT
No ratings yet
Milestone FMT
2 pages
2nd Unit - Statistics
No ratings yet
2nd Unit - Statistics
15 pages
Ib A&i 3.1
No ratings yet
Ib A&i 3.1
38 pages
Psychology Project
No ratings yet
Psychology Project
14 pages
Measures of Central Tendency
100% (15)
Measures of Central Tendency
15 pages
Biostatistics (Descriptive Statistics)
No ratings yet
Biostatistics (Descriptive Statistics)
30 pages
Unit 3 Measure of Central Location
No ratings yet
Unit 3 Measure of Central Location
29 pages
Midterms Day 4
No ratings yet
Midterms Day 4
51 pages
(Ebook PDF) Knowing The Odds An Introduction To Probability Instant Download
No ratings yet
(Ebook PDF) Knowing The Odds An Introduction To Probability Instant Download
58 pages
DHS 9758 2023 Prelim P2 Solution
No ratings yet
DHS 9758 2023 Prelim P2 Solution
13 pages
Research pt-1
No ratings yet
Research pt-1
17 pages
DDDDDD 2
No ratings yet
DDDDDD 2
5 pages
Week 3
No ratings yet
Week 3
37 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
23 pages
Share MBBS - Lecture 4 (1) - 1
No ratings yet
Share MBBS - Lecture 4 (1) - 1
68 pages
Chapter 3 (Technical English For Statistics)
No ratings yet
Chapter 3 (Technical English For Statistics)
8 pages
Statistics 1
No ratings yet
Statistics 1
10 pages
Chapter 3
No ratings yet
Chapter 3
17 pages
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet