Unit 2 Notes

Here are the key points about describing spread in a distribution:
- Range is the difference between the maximum and minimum values. It gives the span of the data but does not account for how the data are distributed within that span.
- Standard deviation measures how far the values are spread out from the mean. A low standard deviation indicates data points close to the mean, while a high standard deviation indicates values spread over a wider range.
- Interquartile range (IQR) is the difference between the third (Q3) and first (Q1) quartiles. It describes the spread of the middle 50% of the data.

Uploaded by

eva.pickerell1306

2.1 Frequency Distributions and Graphs

Frequency distribution: a table that shows classes (intervals) of data entries with a count of the entries in each class.
Frequency (count): how often a data entry belongs to the class.
Lower class limit: the least number that belongs to the class (the starting value for the class).
Upper class limit: the greatest number that belongs to the class (the ending value for the class).
Class width: the same for all classes.

How to create:
● Decide how many classes you want (you pick, 5 to 20)
● Find the class width: the distance between the lower (or upper) limits of consecutive classes
● Find the range: largest − min
● Divide: range / number of classes
● Round up to the next convenient whole number

Example: test grades
Table columns: class, tally, frequency, relative frequency
Data entries include 60, 70, 75, 85, 97, 88
Range: 97 − 60 = 37
Width: 37 / 3 ≈ 12.3, rounded up to 13
● Find the class limits: the min value is the first lower limit, then add the width to each lower limit to get the next one
● Tally the data entries and count the tallies to get each frequency
Other features:
Midpoint: the middle of the class, (lower class limit + upper class limit) / 2
Relative frequency: the percent (proportion) of the data that falls in the class, class frequency / sample size, written as a percentage
Cumulative frequency: the sum of the frequency of the class and every frequency before it
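A minimal Python sketch of the table-building steps, assuming whole-number data. The data are the CEO ages from the stem-and-leaf example in these notes; the choice of 4 classes and the helper name `frequency_distribution` are assumptions made only for illustration.

```python
import math

def frequency_distribution(data, num_classes):
    """Build a frequency distribution as a list of row dictionaries."""
    lowest, highest = min(data), max(data)
    # Class width: range / number of classes, moved up to the next whole number.
    width = math.floor((highest - lowest) / num_classes) + 1
    rows = []
    cumulative = 0
    for i in range(num_classes):
        lower = lowest + i * width      # lower class limit
        upper = lower + width - 1       # upper class limit (whole-number data)
        frequency = sum(lower <= x <= upper for x in data)
        cumulative += frequency
        rows.append({
            "class": (lower, upper),
            "midpoint": (lower + upper) / 2,
            "frequency": frequency,
            "relative": frequency / len(data),
            "cumulative": cumulative,
        })
    return rows

ages = [53, 72, 55, 67, 59, 57, 55, 59, 61, 60, 59, 56, 63, 58, 58]
table = frequency_distribution(ages, 4)
for row in table:
    print(row)
```

With these inputs the range is 19, so the width works out to 5 and the classes are 53-57, 58-62, 63-67, and 68-72.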
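Frequency histogram:
● Bars touch unless there is a gap in the data
● Quantitative data ONLY
● Each bar starts and stops at the class boundaries: 0.5 below the lower limit to 0.5 above the upper limit
Relative frequency histogram: the same graph with relative frequency on the y-axis
● Quantitative data ONLY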
2.2 More Graphs and Displays
Pie chart: a circle divided into sectors that represent categories. The area of each sector is proportional to the frequency of each category.
[Sketch: circle = 100%, split into sectors of 50%, 25%, and 25%]
Frequency Histogram:
Construct a frequency distribution and a frequency histogram for the data set using the
indicated number of classes
Class width: 5
Classes: 37-41, 42-46, 47-51, 52-56
Displaying qualitative data:
● Tables
● Data can be numerical and still qualitative if the number is a label: zip codes, student ID, jersey numbers
Pie Chart:
● A circle that is divided into sectors that represent categories

● The area of each sector is proportional to the frequency of each category

Pie Chart: our example
The data represent the results of an online survey that asked adults how they will invest their
money.
Response                        Frequency   Relative frequency
Invest more in stocks           50          50%
Hold on to more cash            25          25%
Invest more in bonds            15          15%
Invest the same as last year    10          10%
Total                           100
The relative frequencies give the size of each sector of the pie.
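A short Python sketch of the arithmetic behind the pie chart, assuming each sector's central angle is its relative frequency times 360 degrees. The function name `pie_sectors` and the dictionary layout are illustrative choices, not part of the notes.

```python
def pie_sectors(frequencies):
    """Return {category: (relative frequency, central angle in degrees)}."""
    total = sum(frequencies.values())
    sectors = {}
    for category, count in frequencies.items():
        relative = count / total
        sectors[category] = (relative, relative * 360)
    return sectors

survey = {
    "Invest more in stocks": 50,
    "Hold on to more cash": 25,
    "Invest more in bonds": 15,
    "Invest the same as last year": 10,
}
sectors = pie_sectors(survey)
for category, (relative, angle) in sectors.items():
    print(f"{category}: {relative:.0%}, {angle:.0f} degrees")
```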
Pareto Chart:
● Vertical bar graph

● Height of each bar represents frequency or relative frequency

● The bars are positioned in order of decreasing height, with the tallest bar
positioned at the left.

Quantitative data:

Numerical measurements or counts

Example: histogram
Stem and Leaf:
● Each number is separated into a stem and a leaf: the stem is the first digit (10's place) and the leaf is the last digit (1's place)
● As many leaves as entries in the data set
● Leaves are single digits
Key: 1 | 2 = 12
Stem and Leaf: our example
Use a stem-and-leaf plot to display the data. The data represent the ages of the top
15 highest-paid CEOs
53 72 55 67 59 57 55 59 61 60 59 56 63 58 58

Key: 5 | 3 = 53
5 | 3 5 5 6 7 8 8 9 9 9
6 | 0 1 3 7
7 | 2
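The same plot can be sketched in Python, assuming two-digit whole numbers so that the stem is the tens digit and the leaf is the ones digit. The helper name `stem_and_leaf` is made up for this example.

```python
from collections import defaultdict

def stem_and_leaf(data):
    """Group each value's ones digit (leaf) under its tens digit (stem)."""
    plot = defaultdict(list)
    for value in sorted(data):
        stem, leaf = divmod(value, 10)
        plot[stem].append(leaf)
    return dict(plot)

ages = [53, 72, 55, 67, 59, 57, 55, 59, 61, 60, 59, 56, 63, 58, 58]
plot = stem_and_leaf(ages)
for stem in sorted(plot):
    print(stem, "|", " ".join(str(leaf) for leaf in plot[stem]))
```

Sorting first keeps the leaves in order, and the number of leaves always equals the number of data entries.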
Dot plot:
● Each data entry is plotted, using a point, above a horizontal axis

Dot plot: our example
Use a dot plot to display the data. The data represent the life spans (in days) of 30
houseflies.
9 9 4 11 10 5 13 9 7 11 6 8 14 10 6
10 10 7 14 11 7 8 6 13 10 14 14 8 13 10
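A text-only Python sketch of the dot plot, assuming one dot per data entry. It prints one row per value instead of stacking dots above a horizontal axis, which is a simplification chosen for plain text; `dot_plot_rows` is an illustrative name.

```python
from collections import Counter

def dot_plot_rows(data):
    """Return (value, dots) pairs for every whole number from min to max."""
    counts = Counter(data)
    return [(value, "." * counts[value])
            for value in range(min(data), max(data) + 1)]

life_spans = [9, 9, 4, 11, 10, 5, 13, 9, 7, 11, 6, 8, 14, 10, 6,
              10, 10, 7, 14, 11, 7, 8, 6, 13, 10, 14, 14, 8, 13, 10]
rows = dot_plot_rows(life_spans)
for value, dots in rows:
    print(f"{value:>2} {dots}")
```

Values with no entries (here 12 days) still get a row, so gaps in the data stay visible.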

Scatter plot:
● Ordered pairs are graphed as points in a coordinate plane

● Used to show the relationship between two quantitative variables

[Scatter plot sketch: horizontal axis is Time (hrs)]
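Describing a distribution: center
Mean: average

Population mean: $\mu = \frac{\sum x}{N}$, where $N$ is the population size

Sample mean: $\bar{x} = \frac{\sum x}{n}$, where $n$ is the sample size

Median: middle value when data is in order

Example: 1, 2, 3, 4 (median is 2.5)
Describing a distribution:
Mode: most occurring value

Example: 1, 2, 3, 3, 4 (mode is 3)

Outlier: a data entry far removed from the other entries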
Welcome!
Agenda:
● 2.3: Measures of Central Tendency
● How many pairs of shoes do you own?
● Work time
Due dates/upcoming:
● You’ll need your calculators starting next class
Describing distributions - SCUFS
● Shape: modes, skew
● Center: mean, median
● Unusual features: outliers, gaps, clusters
● Spread: range, standard deviation
Always describe in context.
Displays for qualitative data: pie chart, Pareto chart
Displays for quantitative data: dot plot, scatterplot, stem and leaf, histogram
Describing distributions - Shape
● Modes: peaks (mounds), how many?
Unimodal: one peak. Bimodal: two peaks. Multimodal: 3 or more.
● Skew: direction of the tail
Symmetric: both sides mirror each other
Negative skew: tail on the left
Positive skew: tail on the right
Describing distributions - center
Mean: average
Population mean: $\mu = \frac{\sum x}{N}$
Sample mean: $\bar{x} = \frac{\sum x}{n}$

Median: middle value when data is in order

The mean is affected by outliers that do not influence the median.
● When the distribution is skewed to the left, the mean is often less than the
median
● When the distribution is skewed to the right, the mean is often greater than the median

Skewed: use the median. Symmetric: use the mean.
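A quick Python check of how one outlier moves the mean but barely moves the median. The data values are hypothetical, picked only to make the effect easy to see.

```python
from statistics import mean, median, mode

values = [2, 3, 3, 4, 5]        # hypothetical data, no outlier
with_outlier = values + [40]    # same data plus one far-removed entry

print("no outlier:  ", mean(values), median(values), mode(values))
print("with outlier:", mean(with_outlier), median(with_outlier))
```

The mean jumps from 3.4 to 9.5 while the median only moves from 3 to 3.5, which is why the median is the safer center for skewed data.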
Unusual features

Gaps, Outliers, Clusters

Gap: empty space in the data
Cluster: values grouped together
Outlier: a value far removed from the rest
Spread (variability)

Range: max − min
Describing distributions - SCUFS
● Shape: skew, modes

● Center: mean, median

● Unusual features: outliers, gaps, clusters

● Spread: range, IQR, st. dev
Describing distributions - spread
The deviation of an entry x in a population data set is the difference between the
entry and the mean of the data set

deviation = $x - \mu$ (value − mean)

Squared deviations: $(x - \mu)^2$, summed as $\sum (x - \mu)^2$
Describing distributions - spread
Variance: distance between the values and the mean, in square units

Population variance: $\sigma^2 = \frac{\sum (x - \mu)^2}{N}$
Sample variance: $s^2 = \frac{\sum (x - \bar{x})^2}{n - 1}$
Describing distributions - spread
Standard deviation: distance between the values and the mean

Population st. dev: $\sigma = \sqrt{\frac{\sum (x - \mu)^2}{N}}$

Sample st. dev: $s = \sqrt{\frac{\sum (x - \bar{x})^2}{n - 1}}$
Spread: Standard deviation
Measures how far each value is from the mean
Describing distributions - spread
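Standard deviation by hand: calculate the standard deviation of the data set:

6, 2, 3, 1

Step 1, find the mean: $\mu = \frac{6 + 2 + 3 + 1}{4} = 3$
Step 2, sum of squared deviations: $\sum (x - \mu)^2 = (6 - 3)^2 + (2 - 3)^2 + (3 - 3)^2 + (1 - 3)^2 = 9 + 1 + 0 + 4 = 14$
Step 3, divide by $n$: $\frac{14}{4} = 3.5$
Step 4, square root: $\sqrt{3.5} \approx 1.87$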
Describing distributions - spread
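Standard deviation by hand: calculate the standard deviation of the data set:

4, 3, 5, 2
Mean: $\mu = \frac{4 + 3 + 5 + 2}{4} = 3.5$
Sum of squared deviations: $(4 - 3.5)^2 + (3 - 3.5)^2 + (5 - 3.5)^2 + (2 - 3.5)^2 = 0.25 + 0.25 + 2.25 + 2.25 = 5$
Divide by $n$ and take the square root: $\sqrt{\frac{5}{4}} \approx 1.12$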
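The by-hand steps (mean, sum of squared deviations, divide by n, square root) can be sketched in Python as below. This assumes each data set is treated as a population, so the division is by n; the function name `population_std_dev` is an illustrative choice.

```python
import math

def population_std_dev(data):
    """Standard deviation by the four by-hand steps, dividing by n."""
    n = len(data)
    mean = sum(data) / n                              # Step 1: find the mean
    total = sum((x - mean) ** 2 for x in data)        # Step 2: sum of squared deviations
    variance = total / n                              # Step 3: divide by n
    return math.sqrt(variance)                        # Step 4: square root

first = population_std_dev([6, 2, 3, 1])
second = population_std_dev([4, 3, 5, 2])
print(round(first, 2), round(second, 2))
```

For a sample instead of a population, Step 3 would divide by n − 1.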
Describing distributions - spread
Standard deviation on a calculator:
Describing distributions - spread
In a study of high school football players that suffered concussions, researchers
placed the players in two groups. Players that recovered from their concussions
in 14 days or less were placed in Group 1. Those that took more than 14 days
were placed in Group 2. The recovery times (in days) for Group 1 are listed
below.

Find the sample variance and standard deviation of the recovery times.

4 7 6 7 9 5 8 10 9 8 7 10
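One way to check this exercise is with Python's standard `statistics` module, where `variance` and `stdev` use the sample formulas (dividing by n − 1). The variable names are illustrative.

```python
from statistics import mean, stdev, variance

recovery_days = [4, 7, 6, 7, 9, 5, 8, 10, 9, 8, 7, 10]

sample_mean = mean(recovery_days)
sample_variance = variance(recovery_days)   # sum of squared deviations / (n - 1)
sample_std_dev = stdev(recovery_days)       # square root of the sample variance

print(sample_mean, round(sample_variance, 1), round(sample_std_dev, 1))
```

The sum of squared deviations here is 39, so the sample variance is 39 / 11, about 3.5, and the sample standard deviation is about 1.9 days.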
Describing distributions - spread
Standard deviation in your name:
1. Write down the letters in your preferred first name and convert them to
numbers.
Example: Pickerell → 16, 9, 3, 11, 5, 18, 5, 12, 12
2. Using the values, calculate the st. dev.
of your name

3. Interpret the results

Mean ≈ 10
Population st. dev ≈ 4.8
Sample st. dev ≈ 5.1
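A small Python sketch of the name activity, assuming letters convert to their position in the alphabet (A = 1, ..., Z = 26). The name below is only sample input and `letters_to_numbers` is a made-up helper.

```python
from statistics import mean, pstdev, stdev

def letters_to_numbers(name):
    """Map each letter to its alphabet position (assumed rule: a=1, ..., z=26)."""
    return [ord(ch) - ord("a") + 1 for ch in name.lower() if "a" <= ch <= "z"]

values = letters_to_numbers("Pickerell")
print(values)
print(round(mean(values), 1), round(pstdev(values), 1), round(stdev(values), 1))
```

`pstdev` divides by n (population) and `stdev` divides by n − 1 (sample), so the sample value comes out slightly larger.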
Describing distributions - spread
Outliers: if a value is more than 2 st. dev above or below the mean, it is an outlier
Describing distributions - spread
Why the standard deviation as a measure of variation is valuable:

[Sketch: bell curve centered at $\mu$ with marks from $-3\sigma$ to $3\sigma$]
Empirical rule:
Within 1 standard deviation of the mean: about 68% of the data
Within 2 standard deviations of the mean: about 95% of the data
Within 3 standard deviations of the mean: about 99.7% of the data
Describing distributions - spread
The monthly utility bills for eight more households are listed. Are any of the data
entries very unusual? Explain your reasoning.

$65, $52, $63, $83, $77, $98, $84, $70
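A hedged Python sketch of the 2-standard-deviation check. The wording "eight more households" suggests the exercise relies on a mean and standard deviation given earlier, which these notes do not show. As a stand-in, the sketch uses the sample's own mean and sample standard deviation; pass the reference values to `unusual_entries` (a hypothetical helper) if you have them.

```python
from statistics import mean, stdev

def unusual_entries(data, center, spread, cutoff=2):
    """Return entries more than `cutoff` standard deviations from the center."""
    return [x for x in data if abs(x - center) / spread > cutoff]

bills = [65, 52, 63, 83, 77, 98, 84, 70]

# Assumption: no reference statistics are available, so use the data's own.
center = mean(bills)
spread = stdev(bills)
flagged = unusual_entries(bills, center, spread)
print(center, round(spread, 1), flagged)
```

Under that assumption the largest entry, $98, sits about 1.7 standard deviations above the mean, so nothing is flagged. A different reference mean or standard deviation can change the answer.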

Practice

Start working on:

● Section 2.4 page 93: #’s: 13, 15, 18, 21-24, 29, 30 and 33
Find: mean, st. dev, range

Homework: If you do not finish (due next class)

Summary statistics

The first quartile, Q1:

● The median of the half of the ordered data set from the minimum to the
position of the median
Example: 4, 7, 8, 8, 11, 13, 15, 19, 21
Median = 11
Q1 = median of 4, 7, 8, 8 = 7.5
Summary statistics

The third quartile, Q3:

● The median of the half of the ordered data set from the position of the
median to the maximum

Example: 4, 7, 8, 8, 11, 13, 15, 19, 21
Q3 = median of 13, 15, 19, 21 = 17
Summary statistics

Find Q1, median (Q2), and Q3 from the data set: (note it’s in order)

1, 2, 5, 6, 7, 9, 12, 15, 18, 19, 27
Q1 = 5, median = 9, Q3 = 18
Summary statistics
Interquartile range (IQR): tells us the spread of the middle half of the data

IQR = Q3 − Q1

Example: 18 − 5 = 13
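A minimal Python sketch of the quartile definitions used in these notes: Q1 and Q3 are the medians of the lower and upper halves, leaving out the middle entry when the count is odd. Other software may use slightly different quartile rules, so `quartiles` here is an illustration of this method only.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, median, Q3) using the median-of-halves method."""
    ordered = sorted(data)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # With an odd count, skip the middle entry itself.
    upper = ordered[half + 1:] if n % 2 else ordered[half:]
    return median(lower), median(ordered), median(upper)

q1, q2, q3 = quartiles([1, 2, 5, 6, 7, 9, 12, 15, 18, 19, 27])
print(q1, q2, q3, "IQR =", q3 - q1)
```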
Summary statistics

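Outliers:
Rule: a value is an outlier if it is below Q1 − 1.5(IQR) or above Q3 + 1.5(IQR)

Example: 1, 4, 7, 9, 11, 12, 13, 17, 22, 30
Q1 = 7, Q3 = 17
IQR = 17 − 7 = 10
Lower fence: 7 − 1.5(10) = −8
Upper fence: 17 + 15 = 32
No outliers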
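The 1.5 × IQR rule can be sketched in Python as below, taking Q1 and Q3 as inputs. The data list assumes the first entry of the example is 1, and the function names are illustrative.

```python
def outlier_fences(q1, q3):
    """Return the (lower, upper) fences: Q1 - 1.5*IQR and Q3 + 1.5*IQR."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

def find_outliers(data, q1, q3):
    """Return the entries that fall outside the fences."""
    low, high = outlier_fences(q1, q3)
    return [x for x in data if x < low or x > high]

data = [1, 4, 7, 9, 11, 12, 13, 17, 22, 30]   # first entry assumed to be 1
fences = outlier_fences(7, 17)
outliers = find_outliers(data, 7, 17)
print(fences, outliers)
```

Every entry lies between −8 and 32, so the list of outliers is empty.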
Summary statistics:

Five Number Summary:

Min
Q1
Med
Q3
Max
Summary statistics display:
Boxplot: a plot of the five number summary (min, Q1, Med, Q3, max)

If there is an outlier, mark it separately and end the whisker at the next biggest value
Summary statistics display:
Comparing boxplots: SCUFS
● Compare the medians
● Compare the skew: left skew, right skew
z-scores: Pre-calc 0.6, Stats 0.5
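Fractiles: used to measure position

Quartiles: divide the data into 4 equal parts

Deciles: divide the data into 10 equal parts

Percentiles: divide the data into 100 equal parts

Percentile: the percent of values less than $x$

Percentile of $x = \frac{\text{number of data entries less than } x}{\text{total number of data entries}} \cdot 100$

80th percentile: 80% of people fell below you

50th percentile: 50% fell below you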
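A short Python sketch of finding the percentile of a value, assuming the definition "percent of values less than x" from these notes. The data set is reused from the quartile example, and `percentile_of` is an illustrative name.

```python
def percentile_of(x, data):
    """Percent of data entries that are strictly less than x."""
    below = sum(1 for value in data if value < x)
    return below / len(data) * 100

data = [1, 2, 5, 6, 7, 9, 12, 15, 18, 19, 27]
result = percentile_of(15, data)
print(round(result))
```

Seven of the eleven entries are below 15, so 15 sits at about the 64th percentile.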
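Standardized scores

Standardized z-score: measures how many standard deviations a data value is from the mean
Positive z: above the mean. Negative z: below the mean.
$z = \frac{\text{value} - \text{mean}}{\text{st. dev}}$
Population: $z = \frac{x - \mu}{\sigma}$
Sample: $z = \frac{x - \bar{x}}{s}$
Outliers: more than 2 st. dev from the mean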
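Using z-scores to compare data: $z = \frac{x - \mu}{\sigma}$
The scores for a pre-calc test are normally distributed with a mean of 81.5 and
standard deviation of 4.7. Stats test scores are normally distributed with a mean of 79.9
and standard deviation of 9.3.

You score an 84.5 in pre-calc and your friend in stats scores an 85. Who did better?

Pre-calc: $z = \frac{84.5 - 81.5}{4.7} \approx 0.64$
Stats: $z = \frac{85 - 79.9}{9.3} \approx 0.55$
The pre-calc score is further above its mean, so you did better.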
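The comparison can be checked with a few lines of Python. The helper name `z_score` is an illustrative choice; the numbers come from the problem statement.

```python
def z_score(value, mean, std_dev):
    """Number of standard deviations a value lies from the mean."""
    return (value - mean) / std_dev

precalc_z = z_score(84.5, 81.5, 4.7)
stats_z = z_score(85, 79.9, 9.3)

print(round(precalc_z, 2), round(stats_z, 2))
print("pre-calc" if precalc_z > stats_z else "stats", "did better")
```

The raw stats score is higher, but the pre-calc score is further above its own class mean once the spread is taken into account.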
Normal distribution: symmetric
Area under the curve: total area = 100%

[Sketch: bell curve with axis marks $-3\sigma$, $-2\sigma$, $-\sigma$, $\mu$, $\sigma$, $2\sigma$, $3\sigma$]
Label the following normal distribution given the mean and standard
deviation:

80 90 100 110 120 130 140

Empirical rule (total area = 100%):
Within 1 standard deviation of the mean: about 68% of the data
Within 2 standard deviations of the mean: about 95% of the data
Within 3 standard deviations of the mean: about 99.7% of the data
Normal distribution - use

What percent of adults have a systolic blood pressure below 100 mmHg?

(100% − 68%) / 2 = 32% / 2 = 16%

What percent of adults have a systolic blood pressure above 120 mmHg?

100% − 84% = 16%

What percent of adults have a systolic blood pressure between 90 and 120
mmHg?

100% − 16% − 2.5% = 81.5%
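A Python sketch of the empirical-rule arithmetic for these three questions. It assumes a mean of 110 mmHg and a standard deviation of 10 mmHg, read off the labeled axis (80 to 140 in steps of 10), and it only handles values that land a whole number of standard deviations from the mean. All names are illustrative.

```python
WITHIN = {1: 68.0, 2: 95.0, 3: 99.7}   # empirical rule percentages

def whole_z(value, mean, std_dev):
    """Return the z-score as an integer, or fail if it is not a whole step."""
    z = (value - mean) / std_dev
    if z != int(z) or abs(z) > 3:
        raise ValueError("this sketch only handles z = -3..3 in whole steps")
    return int(z)

def percent_below(z):
    """Percent of data below the mean plus z standard deviations."""
    if z == 0:
        return 50.0
    tail = (100.0 - WITHIN[abs(z)]) / 2   # area in one tail
    return tail if z < 0 else 100.0 - tail

mean, std_dev = 110, 10   # assumed from the labeled axis
below_100 = percent_below(whole_z(100, mean, std_dev))
above_120 = 100.0 - percent_below(whole_z(120, mean, std_dev))
between_90_120 = (percent_below(whole_z(120, mean, std_dev))
                  - percent_below(whole_z(90, mean, std_dev)))
print(below_100, above_120, between_90_120)
```

These are empirical-rule estimates. An exact normal calculation gives slightly different values, for example about 15.9% below 100.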
