STATISTICS (Averages and Variation)

Uploaded by

roannicolesibya21


CHAPTER 3

Averages and Variation

PART 1

MODE - for discrete data, the mode is the value that occurs most often; a data set may have one, two, or even three modes. For continuous data, it is (they are) the peak(s) of the distribution.

Example:
✔ 1,1,2,2,2,3,4,5,6,6 mode = 2
✔ -1,-1,0,0,0,1,2,3,3,4,4,4 mode = 0, 4
✔ 5,6,8,10,12,15,20 no mode
✔ 8,8,9,9,10,10,11,11,12,12 no mode
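
The counting rule for the mode can be sketched in Python. This is a minimal illustration, not part of the original notes: the function name `modes` is hypothetical, and it assumes (as the examples do) that data in which every value occurs equally often has no mode.

```python
from collections import Counter

def modes(values):
    """Return the sorted list of modes, or [] when there is no mode."""
    counts = Counter(values)          # value -> number of occurrences
    highest = max(counts.values())    # assumes values is not empty
    # Assumption from the notes' examples: if all distinct values
    # occur equally often, the data set has no mode.
    if len(counts) > 1 and all(c == highest for c in counts.values()):
        return []
    return sorted(v for v, c in counts.items() if c == highest)

print(modes([1, 1, 2, 2, 2, 3, 4, 5, 6, 6]))        # [2]
print(modes([5, 6, 8, 10, 12, 15, 20]))             # []
print(modes([8, 8, 9, 9, 10, 10, 11, 11, 12, 12]))  # []
```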

Advantages:
⮚ Easy to find
⮚ Not sensitive to extreme values
⮚ Only measure of central tendency for categorical data

Disadvantages:
⮚ Only uses some of the data

MEDIAN - the central value of an ordered distribution: half of the data is below the median and half of the data is above the median
1. Order the data from the smallest to largest
2. For an odd number of values, the median is the middle value
3. For an even number of values, the median is the average of the two middle values

Example:
5,6,6,8,10 median = 6
5,6,8,10,12,14 median = 9 ((8+10)/2)

For large data sets, it is handy to know that the position of the median is (n + 1)/2
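
The median procedure can be sketched as follows. This is an illustrative sketch only; the function name `median` is an assumption. The position (n + 1)/2 is 1-based: for n = 5 it points at the 3rd value, and for n = 6 it gives 3.5, meaning the average of the 3rd and 4th values.

```python
def median(values):
    """Median of a list of numbers, following the ordered-data procedure."""
    ordered = sorted(values)   # step 1: order from smallest to largest
    n = len(ordered)
    mid = n // 2               # 0-based index of the upper middle value
    if n % 2 == 1:
        return ordered[mid]    # odd n: the single middle value
    # even n: average of the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median([5, 6, 6, 8, 10]))       # 6
print(median([5, 6, 8, 10, 12, 14]))  # 9.0
```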
Advantage:
⮚ Not sensitive to extreme values
Disadvantage:
⮚ Only includes one or two data values

MEAN - the average value. For discrete data:
mean = (sum of all values) / (number of values)
sample mean: x̄ = Σx / n
population mean: μ = Σx / N
Where:
● n is the sample size
● N is the population size

Example:
1,1,1,2,2,3,3,4,5,5 mean = 2.7 (27/10)
1,1,1,2,2,3,3,4,5,100 mean = 12.2 (122/10)
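
A short sketch of the same calculation in Python, using the two data sets above (the function name `mean` is illustrative). It also shows how a single extreme value (100) pulls the mean far from the bulk of the data.

```python
def mean(values):
    """Arithmetic mean: sum of all values divided by the number of values."""
    return sum(values) / len(values)

print(mean([1, 1, 1, 2, 2, 3, 3, 4, 5, 5]))    # 2.7
print(mean([1, 1, 1, 2, 2, 3, 3, 4, 5, 100]))  # 12.2
```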

Advantages:
⮚ Every data value is used
⮚ Reliable: means of samples from the same population do not vary much (relatively speaking)
Disadvantage:
⮚ Sensitive to extreme values

TRIMMED MEAN - we trim k% from both “ends” of the data to remove extreme values.

Procedure:
1. Put the data in order from the smallest to largest
2. Calculate how many values make up k%: n × k / 100
3. Discard the number of values from (2) from the top and the bottom of the data
4. Calculate the mean of the remaining values

Example:
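Calculate a 5% trimmed mean

1,1,2,3,4,4,5,5,5,6,6,6,6,7,7,8,9,10,18   n = 19
5% of 19 = 0.95, which rounds to 1, so discard one value from each end
Mean of the remaining 17 values = 94/17 ≈ 5.53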
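
The trimmed-mean procedure can be sketched as below. The function name `trimmed_mean` is hypothetical, and the code assumes that the number of values to discard is k% of n rounded to the nearest whole number; the notes do not state a rounding rule.

```python
def trimmed_mean(values, k_percent):
    """Mean after discarding k% of the values from each end of the ordered data."""
    ordered = sorted(values)                  # step 1: order the data
    n = len(ordered)
    # Step 2 (assumed rounding): k% of n, rounded to the nearest integer.
    cut = int(n * k_percent / 100 + 0.5)
    kept = ordered[cut:n - cut]               # step 3: drop `cut` values per end
    return sum(kept) / len(kept)              # step 4: mean of what remains

data = [1, 1, 2, 3, 4, 4, 5, 5, 5, 6, 6, 6, 6, 7, 7, 8, 9, 10, 18]
print(round(trimmed_mean(data, 5), 2))  # 5.53
```

For this data, 5% of 19 is 0.95, so one value is dropped from each end (the 1 and the 18), leaving 17 values whose sum is 94.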

WEIGHTED MEAN - gives more weight or importance to some values, as with grades

Example:
You want to know your grade in statistics before the final exam. You currently have a homework grade (20%) of 92, three test grades (12% each) of 100, 85, and 96, and a participation grade (20%) of 98.
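
One way to work this example is sketched below. The function name `weighted_mean` is illustrative. Note the assumption: the listed weights add up to only 76% (the remainder presumably belongs to the final exam), so the sketch divides by the sum of the weights earned so far rather than by 100.

```python
def weighted_mean(values, weights):
    """Sum of value * weight, divided by the sum of the weights."""
    total = sum(v * w for v, w in zip(values, weights))
    return total / sum(weights)

grades = [92, 100, 85, 96, 98]    # homework, three tests, participation
weights = [20, 12, 12, 12, 20]    # percentages from the example (76 in total)
print(round(weighted_mean(grades, weights), 2))  # 94.37
```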
PART 2

RANGE - the overall spread of the data between the minimum and maximum values
R = max - min

Example:
-1,-1,0,0,0,1,2,3,3,4,4,4
5,6,8,10,12,15,100
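
Applying R = max - min to the two data sets above, as a small sketch (the function name `data_range` is illustrative). The second result shows how one extreme value dominates the range.

```python
def data_range(values):
    """Range: maximum value minus minimum value."""
    return max(values) - min(values)

print(data_range([-1, -1, 0, 0, 0, 1, 2, 3, 3, 4, 4, 4]))  # 5
print(data_range([5, 6, 8, 10, 12, 15, 100]))              # 95
```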

Advantage:
⮚ Easy to find
Disadvantage:
⮚ Very sensitive to extreme values
⮚ Does not provide information about the shape

STANDARD DEVIATION - it measures the variation of all values from the mean.

Advantages:
⮚ Uses all values
⮚ Same units as the data
Disadvantages:
⮚ Difficult to calculate
⮚ Sensitive to extreme values

Note: the variance is the square of the standard deviation
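
The notes do not show the formula, so the sketch below assumes the usual textbook definitions: the sample variance divides the sum of squared deviations from the mean by n - 1, the population variance divides by N, and the standard deviation is the square root of the variance. The function names and the data set are hypothetical.

```python
import math

def variance(values, sample=True):
    """Sum of squared deviations from the mean, divided by n - 1 (sample) or n (population)."""
    n = len(values)
    m = sum(values) / n
    squared_deviations = sum((x - m) ** 2 for x in values)
    return squared_deviations / (n - 1 if sample else n)

def std_dev(values, sample=True):
    """Standard deviation: the square root of the variance."""
    return math.sqrt(variance(values, sample))

data = [2, 4, 4, 4, 5, 5, 7, 9]     # hypothetical data, mean = 5
print(std_dev(data, sample=False))  # 2.0
print(variance(data, sample=False)) # 4.0
```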

* The round-off rule for science states that you include one more decimal place than you have in your data. But you do not round until the final answer.

PART 3

COEFFICIENT OF VARIATION (CV) - it is a measure of relative variation. We use it to compare the variation in two or more samples or populations.
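
The notes do not give the formula; the sketch below assumes the common definition for a sample, CV = (s / x̄) × 100%, where s is the sample standard deviation. The function name and both data sets are hypothetical. The two samples have the same standard deviation but very different relative variation.

```python
import statistics

def coefficient_of_variation(values):
    """Sample standard deviation as a percentage of the sample mean."""
    return statistics.stdev(values) / statistics.mean(values) * 100

small_values = [10, 12, 14]      # hypothetical sample: mean 12, s = 2
large_values = [100, 102, 104]   # hypothetical sample: mean 102, s = 2
print(round(coefficient_of_variation(small_values), 2))  # 16.67
print(round(coefficient_of_variation(large_values), 2))  # 1.96
```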

Note: It is always better to have less variation

PART 4
CHEBYSHEV’S THEOREM

● Used to determine the minimum proportion of the data (or the population) that must lie within k standard deviations (k greater than 1) on either side of the mean
● For any set of data (either population or sample) and for any constant k greater than 1, the proportion of the data that must lie within k standard deviations on either side of the mean is at least 1 - 1/k²
● It applies to any distribution as long as the mean and standard deviation are defined (finite)
● Tells us the minimum proportion (percentage) of the data (or the population) that falls within k standard deviations of the mean (either side of the mean)
● A minimum of 88.9% of the data falls between the values 3 standard deviations below the mean and 3 standard deviations above the mean.
⮚ This implies that a maximum of 11.1% of the data fall beyond 3 standard deviations of the mean
⮚ Such values might be suspect outliers, particularly for a mound-shaped symmetric distribution
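
Chebyshev's bound, 1 - 1/k², can be tabulated with a few lines of Python. This is a sketch; the function name is hypothetical. For k = 3 it reproduces the 88.9% figure quoted above.

```python
def chebyshev_min_proportion(k):
    """Minimum proportion of data within k standard deviations of the mean."""
    if k <= 1:
        raise ValueError("k must be greater than 1")
    return 1 - 1 / k ** 2

for k in (2, 3, 4):
    print(k, round(chebyshev_min_proportion(k) * 100, 2))
# 2 75.0
# 3 88.89
# 4 93.75
```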
PERCENTILES, QUARTILES & 5-NUMBER SUMMARY

PERCENTILE - the Pth percentile (1 ≤ P ≤ 99) of a distribution is a value such that P% of the data fall below it and (100-P)% of the data fall at or above it.

Example:
If you are in the 89th percentile of math scores, what % of students have scores:
a. Below yours? 89%
b. Above yours? 11% (100% - 89%)

Note: There is no 100th percentile, because every person is part of the 100%, so 100% of the scores can’t be below that person’s own score.

QUARTILES
Q1 = 25th percentile
Q2 = 50th percentile (median)
Q3 = 75th percentile

Procedure:
1. Put the data in order from the smallest to largest
2. Find the median (Q2)
3. Find the median of the values below (not equal to) the median - Q1
4. Find the median of the values above (not equal to) the median - Q3

5 NUMBER SUMMARY

1. Minimum value = 111
2. Q1 = 182
3. Q2 = 221.5
4. Q3 = 319
5. Maximum value = 439

The 5 number summary for example 2 is:

111, 182, 221.5, 319, 439
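
The quartile procedure and the 5 number summary can be sketched as below. The data for example 2 is not included in these notes, so the data set here is hypothetical, as are the function names. The sketch reads "values below/above the median" as the lower and upper halves by position, leaving out the middle value when n is odd, and it assumes at least two values.

```python
def median_of_sorted(ordered):
    """Median of an already ordered list."""
    n = len(ordered)
    mid = n // 2
    return ordered[mid] if n % 2 else (ordered[mid - 1] + ordered[mid]) / 2

def five_number_summary(values):
    """Return (minimum, Q1, Q2, Q3, maximum)."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]          # values below the median position
    upper = ordered[half + n % 2:]  # values above the median position
    return (ordered[0], median_of_sorted(lower), median_of_sorted(ordered),
            median_of_sorted(upper), ordered[-1])

data = [2, 3, 5, 6, 8, 10, 11, 15, 19, 33]  # hypothetical data
print(five_number_summary(data))  # (2, 5, 9.0, 15, 33)
```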

BOX AND WHISKER PLOTS (BOX PLOTS) - a useful technique from exploratory data analysis for describing data

Procedure:
1. Draw a horizontal scale
2. Above the scale, draw a box from Q1 to Q3 (height of box can vary)
3. Draw a solid vertical line from the top to the bottom of the box at Q2
4. Draw horizontal lines (whiskers) from the left end of the box (Q1) to the minimum (lowest) value (located vertically near the center of the box) and from the right end of the box (Q3) to the maximum (highest) value

Symmetric Distribution - if the line for Q2 is approximately at the center of the box, the distribution is symmetric

Skewed to the left - the line is closer to Q3; left (horizontal) or lower (vertical) side of box is bigger
Skewed to the right - the line is closer to Q1; right (horizontal) or upper (vertical) side of box is bigger
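
The rule of thumb for reading skewness off the box can be expressed as a small check on the quartiles. This is only a sketch: the function name is hypothetical, and the 10% tolerance used to decide what counts as "approximately at the center" is an arbitrary assumption. It looks at the box only and ignores the whiskers.

```python
def box_shape(q1, q2, q3, tolerance=0.1):
    """Classify the shape suggested by where Q2 sits inside the box."""
    left = q2 - q1    # width of the box to the left of the median line
    right = q3 - q2   # width of the box to the right of the median line
    width = q3 - q1
    # Assumed tolerance: sides differing by at most 10% of the box width
    # are treated as approximately equal.
    if width == 0 or abs(left - right) <= tolerance * width:
        return "approximately symmetric"
    return "skewed left" if left > right else "skewed right"

# Quartiles from the 5 number summary above: Q2 is closer to Q1.
print(box_shape(182, 221.5, 319))  # skewed right
```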
